This lecture covers the types of machine learning workloads, including inference and training, and the computational demands of deep neural networks (DNNs). It discusses the major types of DNN layers, such as convolutional and fully connected layers, and the computational differences between them: convolutional layers reuse each weight across many spatial positions and tend to be compute-bound, while fully connected layers use each weight only once per input and tend to be memory-bandwidth-bound. The lecture then explores systolic arrays for DNN acceleration, focusing on grids of spatially distributed processing elements whose core operation is matrix-matrix multiplication. It also examines why CPUs and GPUs are inefficient for DNNs, motivating specialized accelerators such as TPUs. The instructor emphasizes the cost of data movement in DNNs and how systems like the TPU exploit the algorithms' tolerance of low-precision arithmetic to achieve high performance per watt.
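To make the convolutional-versus-fully-connected contrast concrete, here is a back-of-the-envelope Python sketch comparing arithmetic intensity (multiply-accumulates per weight byte) for one layer of each type. The layer shapes (a 56×56×64 3×3 convolution and a 4096×4096 fully connected layer) and the helper names are illustrative assumptions, not taken from the lecture; `bytes_per_weight=1` corresponds to 8-bit quantized weights, the low-precision format alluded to above.

```python
# Back-of-the-envelope arithmetic intensity (MACs per weight byte) for one
# convolutional layer versus one fully connected layer. The shapes below are
# made-up examples, not taken from the lecture.

def conv_stats(h, w, cin, cout, k, bytes_per_weight=1):
    macs = h * w * cout * cin * k * k          # one MAC per output element per filter tap
    weight_bytes = cout * cin * k * k * bytes_per_weight
    return macs, macs / weight_bytes

def fc_stats(nin, nout, bytes_per_weight=1):
    macs = nin * nout                          # each weight is used exactly once
    weight_bytes = nin * nout * bytes_per_weight
    return macs, macs / weight_bytes

conv_macs, conv_ai = conv_stats(h=56, w=56, cin=64, cout=64, k=3)
fc_macs, fc_ai = fc_stats(nin=4096, nout=4096)
print(f"conv: {conv_macs:.2e} MACs, {conv_ai:.0f} MACs per weight byte")
print(f"fc:   {fc_macs:.2e} MACs, {fc_ai:.0f} MACs per weight byte")
```

With these shapes, every convolutional weight is reused at all 3,136 output positions, while each fully connected weight is touched exactly once per input, which is why fully connected layers are typically limited by weight bandwidth rather than by arithmetic throughput.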
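And here is a minimal, cycle-level Python sketch of the systolic dataflow itself, assuming a weight-stationary organization (the style used in the TPU's matrix unit): weights stay pinned in the PE grid, activations stream in from the left with a one-cycle skew per row, and partial sums flow downward. The function name and the register model are illustrative, not from the lecture.

```python
def systolic_matmul(A, B):
    """Simulate C = A @ B on a K x N grid of PEs; PE(k, n) holds weight B[k][n]."""
    M, K, N = len(A), len(B), len(B[0])
    assert all(len(row) == K for row in A)

    act = [[0.0] * N for _ in range(K)]    # activation register in each PE
    psum = [[0.0] * N for _ in range(K)]   # partial-sum register in each PE
    C = [[0.0] * N for _ in range(M)]

    # Inputs are skewed in time; the pipeline fills and drains in M+K+N-2 cycles.
    for t in range(M + K + N - 2):
        new_act = [[0.0] * N for _ in range(K)]
        new_psum = [[0.0] * N for _ in range(K)]
        for k in range(K):
            for n in range(N):
                # Activation arrives from the left neighbor, or from the
                # skewed input stream at the array's left edge.
                if n == 0:
                    m = t - k
                    a_in = A[m][k] if 0 <= m < M else 0.0
                else:
                    a_in = act[k][n - 1]
                # Partial sum arrives from the PE above (zero at the top row).
                p_in = psum[k - 1][n] if k > 0 else 0.0
                new_act[k][n] = a_in                    # pass activation rightward
                new_psum[k][n] = p_in + a_in * B[k][n]  # MAC with stationary weight
        act, psum = new_act, new_psum
        # Finished dot products exit the bottom row, one per column per cycle.
        for n in range(N):
            m = t - (K - 1) - n
            if 0 <= m < M:
                C[m][n] = psum[K - 1][n]
    return C

# Example: (3x2) @ (2x3)
A = [[1, 2], [3, 4], [5, 6]]
B = [[7, 8, 9], [10, 11, 12]]
print(systolic_matmul(A, B))
# [[27.0, 30.0, 33.0], [61.0, 68.0, 75.0], [95.0, 106.0, 117.0]]
```

The point of the skewed schedule is that each operand is fetched once and then reused as it marches across the array, so memory-bandwidth demand stays flat as the grid grows, which mirrors the data-movement argument the lecture makes for TPU-style accelerators.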