Covers vectorisation, functions, and flow control in Matlab, emphasizing the importance of avoiding global variables and providing examples of simple plots and debugging techniques.
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.