Data Storage Systems

Undergraduate course, Friedrich-Schiller University Jena, 2021

Objectives:

This course is primarily designed for undergraduate students, who already completed introductionary courses in database & distributed systems, algorithms & data structures, and operating systems. The course puts a strong emphasis on a systemic view of data storage systems, ranging from the physical inner workings of modern data storage devices, such as HDDs and SSDs, to full-fledged, distributed, cloud-based exascale storage systems. The students will gradually gain a deeper understanding of fundamental concepts and algorithms of data storage systems, including replication & crash recovery, data deduplication, and storage tiering & caching, but also insights into systematic I/O performance analysis & tuning on the device and the file system level. In the second half of the course, the fundamental concepts of file systems and large-scale distributed storage systems will be discussed. Finally, we conclude the course with an outlook into active areas of research and novel storage technologies that will come up (or are already on the market) and will have a quite dramatic impact on any data-intensive application having to deal with large data volumes.

Prerequisites:

An undergraduate-level understanding of maths, programming, data structures & algorithms, operating systems, and distributed systems is assumed.

Tentative Course Schedule:

 Date & TimeTopic(s)SlidesSupplementary Materials
113-04-2021, 16:00 - 18:00Introduction Data Storage Systemsslides 
220-04-2021, 16:00 - 18:00Hardware Devices & Firmwareslides 
327-04-2021, 16:00 - 18:00I/O Protocols & Interfacesslides 
404-05-2021, 16:00 - 18:00Linux I/O Subsystemslides 
511-05-2021, 16:00 - 18:00File Systemsslides 
618-05-2021, 16:00 - 18:00Benchmarking I/Oslides 
725-05-2021, 16:00 - 18:00I/O Performance Enhancementsslides 
801-06-2021, 16:00 - 18:00Replication & Crash Recoveryslides 
908-06-2021, 16:00 - 18:00Storage Tiering & Cachingslides 
1015-06-2021, 16:00 - 18:00Data Deduplicationslides 
1122-06-2021, 16:00 - 18:00Distributed File Systemsslides 
1229-06-2021, 16:00 - 18:00KV-Stores & Object Storageslides 
1306-07-2021, 16:00 - 18:00Cloud Storageslides 
1413-07-2021, 16:00 - 18:00Recent Trends in the Storage Landscapeslides