Delves into the 'digital turn' in history, examining historical research using digitized newspapers and exploring text reuse, word embeddings, and data visualization.
Delves into the intersection of physics and data in machine learning models, covering topics like atomic cluster expansion force fields and unsupervised learning.
Discusses advanced Spark optimization techniques for managing big data efficiently, focusing on parallelization, shuffle operations, and memory management.
Explores text mining of long-tail data in neuroscience and brain connectivity, including named entity recognition, protein concentration mining, and comparison of connectivity matrices.