Orthographic depth
The orthographic depth of an alphabetic orthography indicates the degree to which a written language deviates from simple one-to-one letter–phoneme correspondence. It depends on how easy it is to predict the pronunciation of a word based on its spelling: words in shallow orthographies are easy to pronounce from their written form, while words in deep orthographies are difficult to pronounce from how they are written. In shallow orthographies, the spelling-sound correspondence is direct: from the rules of pronunciation, one is able to pronounce the word correctly.
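The idea of deviation from one-to-one correspondence can be illustrated with a rough sketch. The function name, the toy data, and the metric itself (mean number of distinct pronunciations per grapheme) are assumptions made for illustration; this is not an established measure of orthographic depth.

```python
from collections import defaultdict


def mean_pronunciations_per_grapheme(pairs):
    """Average number of distinct phonemes observed for each grapheme.

    `pairs` is an iterable of (grapheme, phoneme) correspondences.
    A value of 1.0 means every grapheme maps to exactly one phoneme.
    """
    phonemes_by_grapheme = defaultdict(set)
    for grapheme, phoneme in pairs:
        phonemes_by_grapheme[grapheme].add(phoneme)
    total = sum(len(p) for p in phonemes_by_grapheme.values())
    return total / len(phonemes_by_grapheme)


# Toy data, invented for illustration only.
shallow_pairs = [("a", "a"), ("t", "t"), ("k", "k"), ("a", "a")]
# English "ough" as in "tough", "though", "through".
deep_pairs = [("ough", "ʌf"), ("ough", "oʊ"), ("ough", "uː"), ("t", "t")]

shallow_score = mean_pronunciations_per_grapheme(shallow_pairs)
deep_score = mean_pronunciations_per_grapheme(deep_pairs)
```

Under this toy measure a perfectly shallow sample scores 1.0, and the score grows as graphemes acquire more competing pronunciations.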
Orthography
An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word boundaries, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and most of these systems have undergone substantial standardization, thus exhibiting less dialect variation than the spoken language. These processes can fossilize pronunciation patterns that are no longer routinely observed in speech (e.g.
Foreign language
A foreign language is a language that is not an official language of, nor typically spoken in, a specific country. Native speakers from that country usually need to acquire it through conscious learning, such as through language lessons at school, self-teaching, or attending language courses. A foreign language might be learned as a second language; however, there is a distinction between the two terms. A second language refers to a language that plays a significant role in the region where the speaker lives, whether for communication, education, business, or governance.
English as a second or foreign language
English as a second or foreign language is the use of English by speakers with different native languages. Language education for people learning English may be known as English as a foreign language (EFL), English as a second language (ESL), English for speakers of other languages (ESOL), English as an additional language (EAL), or English as a New Language (ENL). The teaching of English in these settings is referred to as teaching English as a foreign language (TEFL), teaching English as a second language (TESL), or teaching English to speakers of other languages (TESOL).
Second-language acquisition
Second-language acquisition (SLA), sometimes called second-language learning or L2 (language 2) acquisition, is the process by which people learn a second language. Second-language acquisition is also the scientific discipline devoted to studying that process. The field of second-language acquisition is regarded by some, but not all, as a sub-discipline of applied linguistics; it also receives research attention from a variety of other disciplines, such as psychology and education.
Word-sense disambiguation
Word-sense disambiguation (WSD) is the process of identifying which sense of a word is meant in a sentence or other segment of context. In human language processing and cognition, it is usually subconscious and automatic, but it can come to conscious attention when ambiguity impairs clarity of communication, given the pervasive polysemy in natural language. In computational linguistics, it is an open problem that affects other language-processing tasks, such as discourse analysis, improving the relevance of search engines, anaphora resolution, coherence, and inference.
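One classic baseline for the computational task is the simplified Lesk approach, which picks the sense whose dictionary gloss shares the most words with the surrounding context. The sketch below is a minimal version of that idea; the sense labels, glosses, and stopword list are invented for illustration and are not taken from any real dictionary.

```python
def simplified_lesk(context_sentence, sense_glosses, stopwords=frozenset()):
    """Return the sense whose gloss overlaps most with the context words.

    `sense_glosses` maps a sense label to a gloss string. Ties are
    resolved in favour of the sense listed first.
    """
    context = set(context_sentence.lower().split()) - stopwords
    best_sense, best_overlap = None, -1
    for sense, gloss in sense_glosses.items():
        gloss_words = set(gloss.lower().split()) - stopwords
        overlap = len(context & gloss_words)
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense


# Hypothetical glosses for the ambiguous word "bank".
BANK_GLOSSES = {
    "financial_institution": "an institution that accepts deposits and lends money",
    "river_side": "sloping land beside a body of water such as a river",
}
STOPWORDS = frozenset(
    {"a", "an", "the", "that", "and", "of", "as", "such", "at", "from"}
)

sense_money = simplified_lesk(
    "she deposited money at the bank", BANK_GLOSSES, STOPWORDS
)
sense_river = simplified_lesk(
    "they fished from the bank of the river", BANK_GLOSSES, STOPWORDS
)
```

Word overlap of this kind is brittle: it ignores inflection ("deposited" does not match "deposits") and falls back to the first listed sense when no words match, which is part of why the task remains open.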
Optical character recognition
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo (for example, the text on signs and billboards in a landscape photo) or from subtitle text superimposed on an image (for example, from a television broadcast).
WordNet
WordNet is a lexical database that links words through semantic relations, including synonyms, hyponyms, and meronyms. The synonyms are grouped into synsets with short definitions and usage examples. It can thus be seen as a combination and extension of a dictionary and thesaurus. While it is accessible to human users via a web browser, its primary use is in automatic text analysis and artificial intelligence applications.
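The synset structure described above can be sketched as a small data model. The class, field names, identifiers such as `car.n`, and the entries themselves are assumptions made for illustration; they do not reproduce WordNet's actual file format, contents, or any library's API.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """A group of synonymous lemmas with a definition and relations."""
    name: str
    lemmas: list
    definition: str
    examples: list = field(default_factory=list)
    hyponyms: list = field(default_factory=list)  # names of more specific synsets
    meronyms: list = field(default_factory=list)  # names of synsets denoting parts


def synonyms(word, synsets):
    """Collect every other lemma that shares a synset with `word`."""
    found = set()
    for synset in synsets.values():
        if word in synset.lemmas:
            found.update(lemma for lemma in synset.lemmas if lemma != word)
    return found


# Toy entries, invented for illustration.
TOY_SYNSETS = {
    "car.n": Synset(
        name="car.n",
        lemmas=["car", "automobile"],
        definition="a motor vehicle with four wheels",
        examples=["he needs a car to get to work"],
        hyponyms=["taxi.n"],
        meronyms=["wheel.n"],
    ),
    "taxi.n": Synset(
        name="taxi.n",
        lemmas=["taxi", "cab"],
        definition="a car that carries passengers for a fare",
    ),
    "wheel.n": Synset(
        name="wheel.n",
        lemmas=["wheel"],
        definition="a circular frame that turns on an axle",
    ),
}

car_synonyms = synonyms("car", TOY_SYNSETS)
car_kinds = TOY_SYNSETS["car.n"].hyponyms
```

Storing relations as synset names rather than nested objects keeps the toy graph easy to serialize and avoids circular references.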
Emotion recognition
Emotion recognition is the process of identifying human emotion. People vary widely in their accuracy at recognizing the emotions of others. Use of technology to help people with emotion recognition is a relatively nascent research area. Generally, the technology works best if it uses multiple modalities in context. To date, most work has been conducted on automating the recognition of facial expressions from video, spoken expressions from audio, written expressions from text, and physiology as measured by wearables.
Proto-Semitic language
Proto-Semitic is the hypothetical reconstructed proto-language ancestral to the Semitic languages. There is no consensus regarding the location of the Proto-Semitic Urheimat: scholars hypothesize that it may have originated in the Levant, the Sahara, the Horn of Africa, the Arabian Peninsula, or northern Africa. Nowadays the likeliest place of origin is North Africa. The Semitic language family is considered part of the broader macro-family of Afroasiatic languages.