Lecture

Accelerating AI/ML: Specialization and Efficiency

Description

This lecture by the instructor covers the topic of accelerating AI/ML models by striking a balance between generality and specialization. It delves into the challenges of designing custom hardware, the importance of domain-specific full stacks, and the role of hardware acceleration in bridging the gap between compute-intensive kernels and applications. The lecture also explores the concept of domain acceleration across the system stack, the use of accelerators within database systems, and the significance of efficient distributed execution. Additionally, it discusses the design of adaptable accelerators, the impact of memory layout on execution efficiency, and the future directions in architecture and system solutions for large AI/ML models.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.