Natural language processingNatural language processing (NLP) is an interdisciplinary subfield of linguistics and computer science. It is primarily concerned with processing natural language datasets, such as text corpora or speech corpora, using either rule-based or probabilistic (i.e. statistical and, most recently, neural network-based) machine learning approaches. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them.
Merriam-WebsterMerriam-Webster, Incorporated is an American company that publishes reference books and is especially known for its dictionaries. It is the oldest dictionary publisher in the United States. In 1831, George and Charles Merriam founded the company as G & C Merriam Co. in Springfield, Massachusetts. In 1843, after Noah Webster died, the company bought the rights to An American Dictionary of the English Language from Webster's estate. All Merriam-Webster dictionaries trace their lineage to this source.
Lexical Markup FrameworkLanguage resource management Lexical markup framework (LMF; ISO 24613:2008), is the International Organization for Standardization ISO/TC37 standard for natural language processing (NLP) and machine-readable dictionary (MRD) lexicons. The scope is standardization of principles and methods relating to language resources in the contexts of multilingual communication. The goals of LMF are to provide a common model for the creation and use of lexical resources, to manage the exchange of data between and among these resources, and to enable the merging of large number of individual electronic resources to form extensive global electronic resources.
LexicologyLexicology is the branch of linguistics that analyzes the lexicon of a specific language. A word is the smallest meaningful unit of a language that can stand on its own, and is made up of small components called morphemes and even smaller elements known as phonemes, or distinguishing sounds. Lexicology examines every feature of a word – including formation, spelling, origin, usage, and definition. Lexicology also considers the relationships that exist between words.
Bilingual dictionaryA bilingual dictionary or translation dictionary is a specialized dictionary used to translate words or phrases from one language to another. Bilingual dictionaries can be unidirectional, meaning that they list the meanings of words of one language in another, or can be bidirectional, allowing translation to and from both languages. Bidirectional bilingual dictionaries usually consist of two sections, each listing words and phrases of one language alphabetically along with their translation.
Usage (language)The usage of a language is the ways in which its written and spoken variations are routinely employed by its speakers; that is, it refers to "the collective habits of a language's native speakers", as opposed to idealized models of how a language works or (should work) in the abstract. For instance, Fowler characterized usage as "the way in which a word or phrase is normally and correctly used" and as the "points of grammar, syntax, style, and the choice of words.
Corpus linguisticsCorpus linguistics is the study of a language as that language is expressed in its text corpus (plural corpora), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules which govern that language.
Foreign-language writing aidA foreign language writing aid is a computer program or any other instrument that assists a non-native language user (also referred to as a foreign language learner) in writing decently in their target language. Assistive operations can be classified into two categories: on-the-fly prompts and post-writing checks. Assisted aspects of writing include: lexical, syntactic (syntactic and semantic roles of a word's frame), lexical semantic (context/collocation-influenced word choice and user-intention-driven synonym choice) and idiomatic expression transfer, etc.
Specialized dictionaryA specialized dictionary is a dictionary that covers a relatively restricted set of phenomena. The definitive book on the subject (Cowie 2009) includes chapters on some of the dictionaries included below: synonyms pronunciations names (place names and personal names) phrases and idioms dialect terms slang quotations etymologies rhymes lyrics Dictionaries of idioms and slang are common in most cultures.
DefinitionA definition is a statement of the meaning of a term (a word, phrase, or other set of symbols). Definitions can be classified into two large categories: intensional definitions (which try to give the sense of a term), and extensional definitions (which try to list the objects that a term describes). Another important category of definitions is the class of ostensive definitions, which convey the meaning of a term by pointing out examples. A term may have many different senses and multiple meanings, and thus require multiple definitions.