Publication

Unveiling the complexity of learning and decision-making

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Temporal difference learning

Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods. While Monte Carlo methods only adjust their estimates once the final outcome is known, TD methods adjust predictions to match later, more accurate, predictions about the future before the final outcome is known.

Group decision-making

Group decision-making (also known as collaborative decision-making or collective decision-making) is a situation faced when individuals collectively make a choice from the alternatives before them. The decision is then no longer attributable to any single individual who is a member of the group. This is because all the individuals and social group processes such as social influence contribute to the outcome. The decisions made by groups are often different from those made by individuals.

Observational learning

Observational learning is learning that occurs through observing the behavior of others. It is a form of social learning which takes various forms, based on various processes. In humans, this form of learning seems to not need reinforcement to occur, but instead, requires a social model such as a parent, sibling, friend, or teacher with surroundings. Particularly in childhood, a model is someone of authority or higher status in an environment.

Convolutional neural network

Convolutional neural network (CNN) is a regularized type of feed-forward neural network that learns feature engineering by itself via filters (or kernel) optimization. Vanishing gradients and exploding gradients, seen during backpropagation in earlier neural networks, are prevented by using regularized weights over fewer connections. For example, for each neuron in the fully-connected layer 10,000 weights would be required for processing an image sized 100 × 100 pixels.

L-DOPA

DISPLAYTITLE:L-DOPA DOPA, also known as levodopa and -3,4-dihydroxyphenylalanine, is an amino acid that is made and used as part of the normal biology of some plants and animals, including humans. Humans, as well as a portion of the other animals that utilize -DOPA, make it via biosynthesis from the amino acid -tyrosine. -DOPA is the precursor to the neurotransmitters dopamine, norepinephrine (noradrenaline), and epinephrine (adrenaline), which are collectively known as catecholamines.

Neurofeedback

Neurofeedback is a type of biofeedback that focuses on the neuronal activity of the brain. The training method is based on reward learning (operant conditioning) where a real-time feedback provided to the trainee is supposed to reinforce desired brain activity or inhibit unfavorable activity patterns. Different mental states (for example, concentration, relaxation, creativity, distractibility, rumination, etc.) are associated with different brain activities or brain states.

Behavioral addiction

Behavioral addiction is a form of addiction that involves a compulsion to engage in a rewarding non-substance-related behavior – sometimes called a natural reward – despite any negative consequences to the person's physical, mental, social or financial well-being. Addiction canonically refers to substance abuse; however, the term's connotation has been expanded to include behaviors that may lead to a reward (such as gambling, eating, or shopping) since the 1990s.

Spiking neural network

Artificial neural network Spiking neural networks (SNNs) are artificial neural networks that more closely mimic natural neural networks. In addition to neuronal and synaptic state, SNNs incorporate the concept of time into their operating model. The idea is that neurons in the SNN do not transmit information at each propagation cycle (as it happens with typical multi-layer perceptron networks), but rather transmit information only when a membrane potential—an intrinsic quality of the neuron related to its membrane electrical charge—reaches a specific value, called the threshold.

Gamma wave

A gamma wave or gamma rhythm is a pattern of neural oscillation in humans with a frequency between 25 and 140 Hz, the 40 Hz point being of particular interest. Gamma rhythms are correlated with large scale brain network activity and cognitive phenomena such as working memory, attention, and perceptual grouping, and can be increased in amplitude via meditation or neurostimulation. Altered gamma activity has been observed in many mood and cognitive disorders such as Alzheimer's disease, epilepsy, and schizophrenia.

Event-related potential

An event-related potential (ERP) is the measured brain response that is the direct result of a specific sensory, cognitive, or motor event. More formally, it is any stereotyped electrophysiological response to a stimulus. The study of the brain in this way provides a noninvasive means of evaluating brain functioning. ERPs are measured by means of electroencephalography (EEG). The magnetoencephalography (MEG) equivalent of ERP is the ERF, or event-related field. Evoked potentials and induced potentials are subtypes of ERPs.