Q-learningQ-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations. For any finite Markov decision process (FMDP), Q-learning finds an optimal policy in the sense of maximizing the expected value of the total reward over any and all successive steps, starting from the current state.
Nonlinear systemIn mathematics and science, a nonlinear system (or a non-linear system) is a system in which the change of the output is not proportional to the change of the input. Nonlinear problems are of interest to engineers, biologists, physicists, mathematicians, and many other scientists since most systems are inherently nonlinear in nature. Nonlinear dynamical systems, describing changes in variables over time, may appear chaotic, unpredictable, or counterintuitive, contrasting with much simpler linear systems.
Torsades de pointesTorsades de pointes, torsade de pointes or torsades des pointes (TdP) (tɔːˌsɑːd_də_ˈpwãt, tɔʁsad də pwɛ̃t̪, translated as "twisting of peaks") is a specific type of abnormal heart rhythm that can lead to sudden cardiac death. It is a polymorphic ventricular tachycardia that exhibits distinct characteristics on the electrocardiogram (ECG). It was described by French physician François Dessertenne in 1966. Prolongation of the QT interval can increase a person's risk of developing this abnormal heart rhythm, occurring in between 1% and 10% of patients who receive QT-prolonging antiarrhythmic drugs.