Parallel Databases
Explores parallel and distributed databases, covering architectures, query optimization, data storage, and distributed transactions.

Biological Contaminants: Pathogens
Covers the definition of biological contaminants and the transmission of pathogens, including emerging infectious diseases and antibiotic resistance.

Relational Query Optimization
Covers relational query optimization, including logical and physical query plans, cost estimation, equivalences, and the System R strategy.

Data Wrangling with Hadoop
Covers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.

Spark Data Frames
Covers Spark Data Frames, distributed collections of data organized into named columns, and the benefits of using them over RDDs.

Learning-aided Program Reasoning
Explores bug-finding, verification, and the use of learning-aided approaches in program reasoning, showcasing examples like the Heartbleed bug and differential Bayesian reasoning.

Query Optimization
Covers query optimization, from logical to physical plans, including query transformation and cost estimation.

Optimizing Recursive Queries
Explores optimizing recursive queries in database systems using Datalog and semirings, discussing the challenges and solutions in data analytics.