Skip to main content
Graph
Search
fr
|
en
Switch to dark mode
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Data Wrangling with Hadoop: Storage Formats and Hive
Graph Chatbot
Related lectures (32)
Previous
Page 2 of 4
Next
Data Wrangling Techniques: HBase and Hive Integration
Covers data wrangling techniques using HBase and Hive, focusing on integration and practical applications.
Data formats and data wrangling with Hadoop
Explores Apache Hive for data warehousing, data formats, and partitioning, with practical exercises in querying and connecting to Hive.
Handling Data: Intro to Pandas
Introduces the fundamentals of handling data, emphasizing the importance of Pandas and data modeling for effective analysis.
Data Wrangling with Hadoop: Advanced Techniques
Covers advanced data wrangling techniques using Hadoop, focusing on Hive and HBase integration.
Elements of Collaborative Data Science
Introduces collaborative data science tools like Jupyter notebooks, Docker, and Git, emphasizing data versioning and containerization.
Analytics on Data at Rest and Data in Motion
Explores combining data at rest with data in motion, emphasizing the Lambda architecture complexities and quality assessment of streams and batches.
Data Modeling: Concepts and Applications
Explores data modeling concepts, SQL implementations, and practical applications in handling missing data.
Advanced Spark Optimization Techniques: Managing Big Data
Discusses advanced Spark optimization techniques for managing big data efficiently, focusing on parallelization, shuffle operations, and memory management.
Gitlab Agent for Kubernetes (`agentk`)
Covers the setup of a Gitlab agent for Kubernetes, focusing on installation, version control, and troubleshooting.
Collaborative Data Science: Tools and Techniques
Introduces collaborative data science tools like Git and Docker, emphasizing teamwork and practical exercises for effective learning.