This lecture covers second-order model compression for massive deep neural networks, focusing on models with billions of parameters such as OpenAI's GPT-3. It discusses the challenges of running such massive models, the concept of model compression, and practical examples of compressing models so they fit on a single GPU. Various pruning techniques and their impact on model accuracy are explored, and the M-FAC pruning approach is introduced. The lecture concludes with insights on compressing GPT models by up to 10x with minimal accuracy loss and potential performance gains.
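To make the pruning idea concrete, here is a minimal sketch of one-shot magnitude pruning, the simple baseline against which second-order methods like M-FAC are typically compared. This is an illustrative example, not the lecture's M-FAC algorithm (which uses second-order curvature information to decide which weights to remove); the function name and NumPy-based setup are my own assumptions.

```python
import numpy as np

def magnitude_prune(weights, sparsity):
    """Baseline one-shot pruning: zero out the smallest-magnitude
    fraction of weights. M-FAC instead ranks weights using
    second-order (Hessian-based) information, which this sketch
    does NOT implement."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)  # number of weights to remove
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value becomes the pruning threshold
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask

# Example: prune 90% of a random weight matrix
rng = np.random.default_rng(0)
w = rng.standard_normal((512, 512))
pruned = magnitude_prune(w, 0.9)
print(f"sparsity: {(pruned == 0).mean():.2f}")  # ~0.90
```

In practice, one-shot magnitude pruning at high sparsity degrades accuracy noticeably, which is exactly the gap that second-order approaches aim to close.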