Natural language processingNatural language processing (NLP) is an interdisciplinary subfield of linguistics and computer science. It is primarily concerned with processing natural language datasets, such as text corpora or speech corpora, using either rule-based or probabilistic (i.e. statistical and, most recently, neural network-based) machine learning approaches. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them.
Speech communityA speech community is a group of people who share a set of linguistic norms and expectations regarding the use of language. It is a concept mostly associated with sociolinguistics and anthropological linguistics. Exactly how to define speech community is debated in the literature. Definitions of speech community tend to involve varying degrees of emphasis on the following: Shared community membership Shared linguistic communication A typical speech community can be a small town, but sociolinguists such as William Labov claim that a large metropolitan area, for example New York City, can also be considered one single speech community.
Speaker recognitionSpeaker recognition is the identification of a person from characteristics of voices. It is used to answer the question "Who is speaking?" The term voice recognition can refer to speaker recognition or speech recognition. Speaker verification (also called speaker authentication) contrasts with identification, and speaker recognition differs from speaker diarisation (recognizing when the same speaker is speaking).
Language convergenceLanguage convergence is a type of linguistic change in which languages come to resemble one another structurally as a result of prolonged language contact and mutual interference, regardless of whether those languages belong to the same language family, i.e. stem from a common genealogical proto-language. In contrast to other contact-induced language changes like creolization or the formation of mixed languages, convergence refers to a mutual process that results in changes in all the languages involved.
TranslanguagingTranslanguaging is a term that can refer to different aspects of multilingualism. It can describe the way bilinguals and multilinguals use their linguistic resources to make sense of and interact with the world around them. It can also refer to a pedagogical approach that utilizes more than one language within a classroom lesson. The term "translanguaging" was coined in the 1980s by Cen Williams (applied in Welsh as trawsieithu) in his unpublished thesis titled “An Evaluation of Teaching and Learning Methods in the Context of Bilingual Secondary Education.
SpeechSpeech is a human vocal communication using language. Each language uses phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if they are the same word, e.g., "role" or "hotel"), and using those words in their semantic character as words in the lexicon of a language according to the syntactic constraints that govern lexical words' function in a sentence. In speaking, speakers perform many different intentional speech acts, e.
Grammatical genderIn linguistics, a grammatical gender system is a specific form of a noun class system, where nouns are assigned to gender categories that are often not related to the real-world qualities of the entities denoted by those nouns. In languages with grammatical gender, most or all nouns inherently carry one value of the called gender; the values present in a given language (of which there are usually two or three) are called the genders of that language.
Pattern recognitionPattern recognition is the automated recognition of patterns and regularities in data. While similar, pattern recognition (PR) is not to be confused with pattern machines (PM) which may possess (PR) capabilities but their primary function is to distinguish and create emergent pattern. PR has applications in statistical data analysis, signal processing, , information retrieval, bioinformatics, data compression, computer graphics and machine learning.
History of natural language processingThe history of natural language processing describes the advances of natural language processing (Outline of natural language processing). There is some overlap with the history of machine translation, the history of speech recognition, and the history of artificial intelligence. The history of machine translation dates back to the seventeenth century, when philosophers such as Leibniz and Descartes put forward proposals for codes which would relate words between languages.
Sumerian languageSumerian (Cuneiform: "native tongue") is the language of ancient Sumer. It is one of the oldest attested languages, dating back to at least 2900 BC. It is accepted to be a local language isolate and to have been spoken in ancient Mesopotamia, in the area that is modern-day Iraq. Akkadian, a Semitic language, gradually replaced Sumerian as a spoken language in the area 2000 BC (the exact date is debated), but Sumerian continued to be used as a sacred, ceremonial, literary and scientific language in Akkadian-speaking Mesopotamian states such as Assyria and Babylonia until the 1st century AD.