Data science: Data science is an interdisciplinary academic field that uses statistics, scientific computing, scientific methods, processes, algorithms and systems to extract or extrapolate knowledge and insights from noisy, structured, and unstructured data. Data science also integrates domain knowledge from the underlying application domain (e.g., natural sciences, information technology, and medicine). Data science is multifaceted and can be described as a science, a research paradigm, a research method, a discipline, a workflow, and a profession.
Transliteration: Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus trans- + liter-) in predictable ways, such as Greek ⟨α⟩ → ⟨a⟩, Cyrillic ⟨д⟩ → ⟨d⟩, Greek ⟨χ⟩ → the digraph ⟨ch⟩, Armenian ⟨ն⟩ → ⟨n⟩ or Latin ⟨æ⟩ → ⟨ae⟩. For instance, for the Modern Greek term "Ελληνική Δημοκρατία", which is usually translated as "Hellenic Republic", the usual transliteration to Latin script is ⟨Ellēnikḗ Dēmokratía⟩, and the name for Russia in Cyrillic script, "Россия", is usually transliterated as ⟨Rossiya⟩.
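The letter-for-letter substitution described above can be sketched as a table lookup. The snippet below is only an illustration: the mapping table `CYRILLIC_TO_LATIN` and the function `transliterate` are hypothetical names, and the table covers just a handful of letters rather than following any published romanization standard.

```python
# Hypothetical, deliberately tiny mapping table (an assumption for this sketch;
# it does not implement any official romanization standard).
CYRILLIC_TO_LATIN = {
    "Р": "R",
    "р": "r",
    "о": "o",
    "с": "s",
    "и": "i",
    "я": "ya",  # one source letter may map to a digraph
    "д": "d",
    "а": "a",
}


def transliterate(text, table):
    """Replace each character found in the table; leave all others unchanged."""
    return "".join(table.get(ch, ch) for ch in text)


print(transliterate("Россия", CYRILLIC_TO_LATIN))
```

Characters missing from the table pass through untouched, so punctuation and unmapped letters survive. A practical system would need a complete table and usually context-sensitive rules as well.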
Russian formalism: Russian formalism was a school of literary theory in Russia from the 1910s to the 1930s. It includes the work of a number of highly influential Russian and Soviet scholars, such as Viktor Shklovsky, Yuri Tynianov, Vladimir Propp, Boris Eichenbaum, Roman Jakobson, Boris Tomashevsky, and Grigory Gukovsky, who revolutionised literary criticism between 1914 and the 1930s by establishing the specificity and autonomy of poetic language and literature.
Minimalist program: In linguistics, the minimalist program is a major line of inquiry that has been developing inside generative grammar since the early 1990s, starting with a 1993 paper by Noam Chomsky. Following Imre Lakatos's distinction, Chomsky presents minimalism as a program, understood as a mode of inquiry that provides a conceptual framework which guides the development of linguistic theory. As such, it is characterized by a broad and diverse range of research directions.
Head-driven phrase structure grammar: Head-driven phrase structure grammar (HPSG) is a highly lexicalized, constraint-based grammar developed by Carl Pollard and Ivan Sag. It is a type of phrase structure grammar, as opposed to a dependency grammar, and it is the immediate successor to generalized phrase structure grammar. HPSG draws from other fields such as computer science (data type theory and knowledge representation) and uses Ferdinand de Saussure's notion of the sign. It uses a uniform formalism and is organized in a modular way, which makes it attractive for natural language processing.
Prague linguistic circle: The Prague school or Prague linguistic circle is a language and literature society. It started in 1926 as a group of linguists, philologists and literary critics in Prague. Its proponents developed methods of structuralist literary analysis and a theory of the standard language and of language cultivation from 1928 to 1939. The linguistic circle was founded in the Café Derby in Prague, which is also where meetings took place during its first years. The Prague School has had a significant continuing influence on linguistics and semiotics.
Stemming: In linguistic morphology and information retrieval, stemming is the process of reducing inflected (or sometimes derived) words to their word stem, base or root form, generally a written word form. The stem need not be identical to the morphological root of the word; it is usually sufficient that related words map to the same stem, even if this stem is not in itself a valid root. Algorithms for stemming have been studied in computer science since the 1960s.
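The point that related words only need to share a stem, even one that is not a valid root, can be illustrated with a naive suffix-stripping sketch. This is not any particular published stemming algorithm: the suffix list `SUFFIXES`, the function `naive_stem`, and the `min_stem_len` threshold are all assumptions made for the illustration.

```python
# Hypothetical suffix list, checked in order (an assumption for this sketch).
SUFFIXES = ("ing", "ed", "es", "s")


def naive_stem(word, min_stem_len=3):
    """Strip the first matching suffix, keeping at least min_stem_len characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    return word


# Related forms collapse to one stem, even though "argu" is not a word.
for w in ("argues", "argued", "arguing"):
    print(w, "->", naive_stem(w))
```

A rule list this small misses many cases (for example, it leaves "argue" and "connection" unchanged), which is why practical stemmers use larger, ordered rule sets with extra conditions.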