Lecture

Vision-Language-Action Models: Training and Applications

Description

This lecture explores the training and applications of Vision-Language-Action models, focusing on large language models and their integration with robotic control. It covers language-to-reward methods for robotic skill synthesis, the use of VLMs as robot policies, and the transfer of web knowledge to robotic control. Experimental results are discussed, including emergent skills, quantitative evaluations, and comparisons of different models on language-based tasks. The lecture concludes with insights on representing actions in VLMs and on future research directions in this field.
