Data Wrangling with HadoopCovers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.
Data Issues in ResearchExplores challenges in data assumptions, biases, and more in research, including incomplete write-ups and frustrations of newcomers.
Water Consumption in GenevaExplores water consumption data in Geneva, including charts on consumption and losses, available datasets, and data processing phases.
Handling Data: Intro to PandasIntroduces the fundamentals of handling data, emphasizing the importance of Pandas and data modeling for effective analysis.
Digital History and Digitized PressDelves into the 'digital turn' in history, examining historical research using digitized newspapers and exploring text reuse, word embeddings, and data visualization.