Data Wrangling with Hadoop
Covers data wrangling techniques using Hadoop, focusing on row-oriented versus column-oriented databases, popular storage formats, and HBase-Hive integration.
General Introduction to Big Data
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.
Handling Data: Intro to Pandas
Introduces the fundamentals of handling data, emphasizing the importance of Pandas and data modeling for effective analysis.
Introduction to Data Science
Introduces the basics of data science, covering decision trees, machine learning advancements, and deep reinforcement learning.