Multi-agent reinforcement learning
Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning that studies the behavior of multiple learning agents coexisting in a shared environment. Each agent is motivated by its own rewards and takes actions to advance its own interests; in some environments these interests are opposed to those of other agents, resulting in complex group dynamics. Multi-agent reinforcement learning is closely related to game theory, especially repeated games, as well as to multi-agent systems.
Q-learning
Q-learning is a model-free reinforcement learning algorithm that learns the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations. For any finite Markov decision process (FMDP), Q-learning finds an optimal policy in the sense of maximizing the expected value of the total reward over all successive steps, starting from the current state.
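As a rough illustration of the model-free update described above, the following minimal sketch runs tabular Q-learning on a toy corridor. The environment (five states, a 10% chance that a move is reversed, reward 1 on reaching the last state), the hyperparameters, and the names `step`, `train`, and `greedy_policy` are all assumptions made for this example and do not come from the text.

```python
import random

N_STATES = 5        # hypothetical toy corridor: states 0..4, state 4 is terminal
ACTIONS = (-1, +1)  # action index 0 = step left, index 1 = step right


def step(state, action_index, rng):
    """Toy stochastic environment: the chosen move is reversed with probability 0.1."""
    move = ACTIONS[action_index]
    if rng.random() < 0.1:
        move = -move
    next_state = min(max(state + move, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (assumed hyperparameters)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both estimates are tied.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action, rng)
            # The update uses only the sampled transition, never a transition model.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-valued action index for each non-terminal state."""
    return [max(range(2), key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` returns one action index per non-terminal state. In this toy setting the learned policy would be expected to prefer stepping right everywhere, since that is the shortest route to the reward.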
Linear subspace
In mathematics, and more specifically in linear algebra, a linear subspace or vector subspace is a vector space that is a subset of some larger vector space. A linear subspace is usually simply called a subspace when the context serves to distinguish it from other types of subspaces. If V is a vector space over a field K and if W is a subset of V, then W is a linear subspace of V if under the operations of V, W is a vector space over K.
Convolutional neural network
A convolutional neural network (CNN) is a regularized type of feed-forward neural network that learns features by itself via filter (or kernel) optimization. Vanishing gradients and exploding gradients, seen during backpropagation in earlier neural networks, are mitigated by using regularized weights over fewer connections. For example, each neuron in a fully connected layer would require 10,000 weights to process an image of 100 × 100 pixels.
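The weight count in the example above can be checked with a short calculation. The sketch below also compares it with a shared convolutional filter. The 5 × 5 kernel size is an assumption chosen for illustration and is not stated in the text, and both helper functions are hypothetical names that ignore bias terms.

```python
def dense_weights_per_neuron(height, width, channels=1):
    """Weights one fully connected neuron needs to see every input pixel."""
    return height * width * channels


def conv_weights_per_filter(kernel_h, kernel_w, in_channels=1):
    """Weights in one convolutional filter, shared across all image positions."""
    return kernel_h * kernel_w * in_channels


# A 100 x 100 single-channel image, as in the text.
dense = dense_weights_per_neuron(100, 100)
# Assumed 5 x 5 kernel, for comparison only.
conv = conv_weights_per_filter(5, 5)
print(f"dense neuron: {dense} weights, 5x5 filter: {conv} weights")
```

Because a filter's weights are reused at every position, its parameter count depends on the kernel size rather than on the image size.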
Direct air capture
Direct air capture (DAC) is the use of chemical or physical processes to extract carbon dioxide directly from the ambient air. If the extracted carbon dioxide is then sequestered in safe long-term storage, a process called direct air carbon capture and sequestration (DACCS), the overall process achieves carbon dioxide removal and is a "negative emissions technology" (NET). As of 2022, DAC has yet to become profitable because the cost of using DAC to sequester carbon dioxide is several times the carbon price.
Complemented subspace
In the branch of mathematics called functional analysis, a complemented subspace of a topological vector space X is a vector subspace M for which there exists some other vector subspace N of X, called its (topological) complement in X, such that X is the direct sum M ⊕ N in the category of topological vector spaces. Formally, topological direct sums strengthen the algebraic direct sum by requiring certain maps to be continuous; the result retains many nice properties of the direct sum operation in finite-dimensional vector spaces.
Self-play
Self-play is a technique for improving the performance of reinforcement learning agents. Intuitively, agents learn to improve their performance by playing "against themselves". In multi-agent reinforcement learning experiments, researchers try to optimize the performance of a learning agent on a given task, in cooperation or competition with one or more other agents. These agents learn by trial and error, and researchers may choose to have the learning algorithm play the role of two or more of the different agents.
Inverse problem
An inverse problem in science is the process of calculating from a set of observations the causal factors that produced them: for example, calculating an image in X-ray computed tomography, source reconstruction in acoustics, or calculating the density of the Earth from measurements of its gravity field. It is called an inverse problem because it starts with the effects and then calculates the causes. It is the inverse of a forward problem, which starts with the causes and then calculates the effects.
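The forward/inverse distinction can be sketched with a deliberately simple toy model. Here the "causes" are the slope and intercept of a line and the "effects" are noise-free samples of that line. The model and the function names `forward` and `inverse` are assumptions for illustration only. Real inverse problems such as tomography are far larger and typically ill-posed.

```python
def forward(slope, intercept, xs):
    """Forward problem: from the causes (slope, intercept) compute the observations."""
    return [slope * x + intercept for x in xs]


def inverse(xs, ys):
    """Inverse problem: estimate (slope, intercept) from observations by least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Generate observations from known causes, then try to recover the causes.
xs = [0.0, 1.0, 2.0, 3.0]
ys = forward(2.0, -1.0, xs)
estimated_slope, estimated_intercept = inverse(xs, ys)
```

With noise-free data the estimate matches the original parameters up to floating-point error. With noisy or incomplete observations the recovered causes are only approximate.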
Electrocatalyst
An electrocatalyst is a catalyst that participates in electrochemical reactions. Electrocatalysts are a specific form of catalyst that functions at electrode surfaces or, most commonly, may be the electrode surface itself. An electrocatalyst can be heterogeneous, such as a platinized electrode. Homogeneous electrocatalysts, which are soluble, assist in transferring electrons between the electrode and reactants, and/or facilitate an intermediate chemical transformation described by an overall half reaction.
Pressure swing adsorption
Pressure swing adsorption (PSA) is a technique used to separate some gas species from a mixture of gases (typically air) under pressure according to the species' molecular characteristics and affinity for an adsorbent material. It operates at near-ambient temperature and differs significantly from the cryogenic distillation commonly used to separate gases. Selective adsorbent materials (e.g., zeolites, also known as molecular sieves, and activated carbon) are used as trapping material, preferentially adsorbing the target gas species at high pressure.