Artificial Neural Networks (ANNs) are typically trained via the back-propagation (BP) algorithm. This approach has been extremely successful: current models like GPT-3 have O(10^11) parameters, are trained on O(10^11) words, and produce awe-inspiring results. However, there are good reasons to look for alternative training methods. With current algorithms and hardware, sometimes only half of the available computing power is actually used, due to a complicated interplay between the size of the ANN, the available memory, the throughput limitations of interconnects, the architecture of the network of computers, and the training algorithm. Training a model like GPT-3 takes months and costs millions. A different training paradigm, one that makes clever use of specialized hardware, might train large ANNs more efficiently.
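For reference, a minimal sketch of the standard BP training loop the text refers to: a tiny two-layer network trained on XOR, with gradients propagated backwards layer by layer. All names, sizes, and hyperparameters here are illustrative assumptions, not details of this project.

```python
# Minimal back-propagation (BP) sketch: a 2-4-1 sigmoid network on XOR.
# Sizes, learning rate, and step count are assumptions for illustration.
import numpy as np

rng = np.random.default_rng(0)

# Toy data: learn y = XOR(x1, x2).
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

# Parameters of an assumed 2-4-1 network.
W1 = rng.normal(size=(2, 4)); b1 = np.zeros(4)
W2 = rng.normal(size=(4, 1)); b2 = np.zeros(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for step in range(5000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)      # hidden activations
    p = sigmoid(h @ W2 + b2)      # output predictions

    # Backward pass: gradients of the squared error, propagated
    # from the output back to the input (the "back" in BP).
    dp = (p - y) * p * (1 - p)    # dL/d(output pre-activation)
    dW2 = h.T @ dp
    db2 = dp.sum(axis=0)
    dh = (dp @ W2.T) * h * (1 - h)  # dL/d(hidden pre-activation)
    dW1 = X.T @ dh
    db1 = dh.sum(axis=0)

    # Gradient-descent update.
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

print(np.round(p, 2))  # should approach [[0], [1], [1], [0]]
```

Note that every update requires a full forward pass followed by a full backward pass through all layers; it is this sequential, memory-hungry structure that interacts badly with the hardware constraints described above.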
Demetri Psaltis, Navid Borhani, Eirini Kakkava, Babak Rahmani, Ugur Tegin