Language modelA language model is a probabilistic model of a natural language that can generate probabilities of a series of words, based on text corpora in one or multiple languages it was trained on. Large language models, as their most advanced form, are a combination of feedforward neural networks and transformers. They have superseded recurrent neural network-based models, which had previously superseded the pure statistical models, such as word n-gram language model.
Natural language generationNatural language generation (NLG) is a software process that produces natural language output. A widely-cited survey of NLG methods describes NLG as "the subfield of artificial intelligence and computational linguistics that is concerned with the construction of computer systems than can produce understandable texts in English or other human languages from some underlying non-linguistic representation of information". While it is widely agreed that the output of any NLG process is text, there is some disagreement about whether the inputs of an NLG system need to be non-linguistic.
Focus (linguistics)In linguistics, focus (abbreviated ) is a grammatical category that conveys which part of the sentence contributes new, non-derivable, or contrastive information. In the English sentence "Mary only insulted BILL", focus is expressed prosodically by a pitch accent on "Bill" which identifies him as the only person Mary insulted. By contrast, in the sentence "Mary only INSULTED Bill", the verb "insult" is focused and thus expresses that Mary performed no other actions towards Bill.
Minimal pairIn phonology, minimal pairs are pairs of words or phrases in a particular language, spoken or signed, that differ in only one phonological element, such as a phoneme, toneme or chroneme, and have distinct meanings. They are used to demonstrate that two phones represent two separate phonemes in the language. Many phonologists in the middle part of the 20th century had a strong interest in developing techniques for discovering the phonemes of unknown languages, and in some cases, they set up writing systems for the languages.
History of natural language processingThe history of natural language processing describes the advances of natural language processing (Outline of natural language processing). There is some overlap with the history of machine translation, the history of speech recognition, and the history of artificial intelligence. The history of machine translation dates back to the seventeenth century, when philosophers such as Leibniz and Descartes put forward proposals for codes which would relate words between languages.
Logic translationLogic translation is the process of representing a text in the formal language of a logical system. If the original text is formulated in ordinary language then the term natural language formalization is often used. An example is the translation of the English sentence "some men are bald" into first-order logic as . The purpose is to reveal the logical structure of arguments. This makes it possible to use the precise rules of formal logic to assess whether these arguments are correct.
Language familyA language family is a group of languages related through descent from a common ancestral language or parental language, called the proto-language of that family. The term "family" reflects the tree model of language origination in historical linguistics, which makes use of a metaphor comparing languages to people in a biological family tree, or in a subsequent modification, to species in a phylogenetic tree of evolutionary taxonomy. Linguists therefore describe the daughter languages within a language family as being genetically related.
Linguistic competenceIn linguistics, linguistic competence is the system of unconscious knowledge that one knows when they know a language. It is distinguished from linguistic performance, which includes all other factors that allow one to use one's language in practice. In approaches to linguistics which adopt this distinction, competence would normally be considered responsible for the fact that "I like ice cream" is a possible sentence of English, the particular proposition that it denotes, and the particular sequence of phones that it consists of.
Manx languageManx (Gaelg or Gailck, pronounced ɡilɡ, geːlɡ or gilk), also known as Manx Gaelic, is a Gaelic language of the insular Celtic branch of the Celtic language family, itself a branch of the Indo-European language family. Manx is the historical language of the Manx people. Although only few children native to the Isle of Man speak Manx as a first language, there has been a steady increase in the number of speakers since the death of Ned Maddrell in 1974. He was considered to be the last speaker to grow up in a Manx-speaking community environment.
Critical discourse analysisCritical discourse analysis (CDA) is an interdisciplinary approach to the study of discourse that views language as a form of social practice. CDA combines critique of discourse and explanation of how it figures within and contributes to the existing social reality, as a basis for action to change that existing reality in particular respects. Scholars working in the tradition of CDA generally argue that (non-linguistic) social practice and linguistic practice constitute one another and focus on investigating how societal power relations are established and reinforced through language use.