Lecture

Collaborative Data Science: Tools and Techniques

Description

This lecture provides an introduction to collaborative data science, focusing on essential tools such as Git, Docker, and package managers like Mamba. The instructor emphasizes the importance of forming groups for collaborative projects and outlines the agenda for the course, which includes a graded assignment. The lecture covers the basics of Git, including version control, branching, and merging, as well as the significance of MLOps in streamlining machine learning workflows. Docker is introduced as a means to create isolated and portable runtime environments, allowing for consistent execution of code across different platforms. The instructor also discusses the use of Jupyter notebooks for data analysis and visualization, particularly in the context of the Carbosense project, which involves CO₂ sensor data from Switzerland. The session concludes with practical exercises to reinforce the concepts learned, ensuring students are prepared for upcoming assignments and collaborative work.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.