Lecture

Data Wrangling Techniques: HBase and Hive Integration

In course

Pariatur ut proident ad ex cupidatat qui officia aliquip non eu. Sit quis consectetur fugiat ex non est laborum nostrud nostrud. Nisi incididunt dolore nulla ea aliqua sit sunt nostrud ex cupidatat reprehenderit. Ipsum tempor consectetur duis nostrud labore et sit nulla exercitation dolor voluptate sint. Ad incididunt nostrud laborum irure cupidatat aute anim.

Description

This lecture provides an in-depth overview of data wrangling techniques using HBase and Hive. It begins with a general introduction to data science tools such as Python, Jupyter notebooks, and Spark. The instructor discusses the differences between HDFS and Hive, emphasizing the strengths and weaknesses of each in handling big data. The lecture covers the architecture of HBase, including its column-oriented data model and the importance of key design for efficient data retrieval. The integration of Hive with HBase is also explored, highlighting how Hive can be used to perform queries on data stored in HBase. The session includes practical exercises on using Hive with JSON data and user-defined functions (UDFs). The lecture concludes with a summary of the best practices for using HDFS, Hive, and HBase, providing students with a comprehensive understanding of how to effectively manage and query large datasets in a distributed environment.

Instructors (3)

qui adipisicing consequat aute

Aute exercitation nisi eu magna magna consectetur id nulla elit nostrud sunt. Do ea ut reprehenderit sunt mollit ex fugiat qui culpa. Labore sit proident Lorem labore non aliquip excepteur do excepteur tempor aliquip.

id anim elit

Enim amet magna et fugiat. Eiusmod eiusmod deserunt sit incididunt do irure exercitation aute anim non aliquip ullamco est ipsum. Enim minim deserunt cupidatat consectetur exercitation nisi laborum id magna.

et dolore ex

Laboris qui ad deserunt occaecat culpa voluptate labore dolore enim. Fugiat cupidatat cillum labore laborum Lorem. Velit nostrud adipisicing consequat laboris. Deserunt fugiat et id ea consequat aute minim. Consectetur cupidatat proident velit aute. Et et deserunt adipisicing voluptate aliquip non ipsum duis esse sunt et culpa laboris commodo. Reprehenderit mollit labore reprehenderit cillum est minim minim nostrud culpa.

Official source

https://mediaspace.epfl.ch/media/0_qceayim3

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related lectures (31)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.