Introduction to Data Science
Introduces the basics of data science, covering decision trees, machine learning advancements, and deep reinforcement learning.

Data Wrangling with Hadoop
Covers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.

Machine Learning Basics
Introduces the basics of machine learning, covering supervised and unsupervised learning, linear regression, and data understanding.

General Introduction to Big Data
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.

Decision Trees: Classification
Explores decision trees for classification, entropy, information gain, one-hot encoding, hyperparameter optimization, and random forests.