Arabic
Arabic (اَلْعَرَبِيَّةُ, pronounced [al ʕaraˈbijːa]; عَرَبِيّ, pronounced [ˈʕarabiː] or [ʕaraˈbij]) is a Semitic language spoken primarily across the Arab world. Having emerged in the 1st century, it is named after the Arab people; the term "Arab" was initially used to describe those living in the Arabian Peninsula, as perceived by geographers from ancient Greece. Since the 7th century, Arabic has been characterized by diglossia, with an opposition between a standard prestige language, i.e. ...
WordNet
WordNet is a lexical database that links words through semantic relations such as synonymy, hyponymy, and meronymy. Synonyms are grouped into synsets, each with a short definition and usage examples. It can thus be seen as a combination and extension of a dictionary and a thesaurus. While it is accessible to human users via a web browser, its primary use is in automatic text analysis and artificial intelligence applications.
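The synset idea can be illustrated with a small sketch. This is a toy model under stated assumptions, not the WordNet API or its data: the `Synset` class, the `SYNSETS` table, and the helper functions are hypothetical names invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """A group of synonymous lemmas sharing one definition (toy model)."""
    name: str
    lemmas: list[str]
    definition: str
    examples: list[str] = field(default_factory=list)
    hypernyms: list[str] = field(default_factory=list)  # names of more general synsets


# Hypothetical miniature database; the entries are made up, not WordNet data.
SYNSETS = {
    "vehicle": Synset("vehicle", ["vehicle"],
                      "a conveyance that transports people or objects"),
    "car": Synset("car", ["car", "auto", "automobile"],
                  "a motor vehicle with four wheels",
                  examples=["he needs a car to get to work"],
                  hypernyms=["vehicle"]),
}


def synonyms(word: str) -> set[str]:
    """Collect every other lemma that shares a synset with `word`."""
    found: set[str] = set()
    for synset in SYNSETS.values():
        if word in synset.lemmas:
            found.update(synset.lemmas)
    found.discard(word)
    return found


def is_hyponym_of(specific: str, general: str) -> bool:
    """Follow hypernym links upward from `specific`, looking for `general`."""
    stack = list(SYNSETS[specific].hypernyms)
    while stack:
        current = stack.pop()
        if current == general:
            return True
        stack.extend(SYNSETS[current].hypernyms)
    return False
```

Here `synonyms("auto")` returns the other lemmas of the "car" synset, and `is_hyponym_of("car", "vehicle")` holds because the "car" synset points to "vehicle" as its more general concept.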
Varieties of Arabic
The varieties (or dialects or vernacular languages) of Arabic, a Semitic language within the Afroasiatic family originating in the Arabian Peninsula, are the linguistic systems that Arabic speakers speak natively. There are considerable variations from region to region, with degrees of mutual intelligibility that are often related to geographical distance and some that are mutually unintelligible. Many aspects of the variability attested to in these modern variants can be found in the ancient Arabic dialects in the peninsula.
BabelNet
BabelNet is a multilingual lexicalized semantic network and ontology developed at the NLP group of the Sapienza University of Rome. BabelNet was automatically created by linking Wikipedia to the most popular computational lexicon of the English language, WordNet. The integration is done using an automatic mapping, and lexical gaps in resource-poor languages are filled using statistical machine translation. The result is an encyclopedic dictionary that provides concepts and named entities lexicalized in many languages and connected by a large number of semantic relations.
Yemeni Arabic
Yemeni Arabic is a cluster of varieties of Arabic spoken in Yemen and southwestern Saudi Arabia. It is generally considered a very conservative dialect cluster, having many classical features not found across most of the Arabic-speaking world. Yemeni Arabic can be divided roughly into several main dialect groups, each with its own distinctive vocabulary and phonology. The four most important groups are San'ani in the North and Center and Hadhrami in the East, where ق is pronounced [g] and ج is [d͡ʒ] or [ɟ] (except in coastal Hadhrami, where ج is [j]), and Ta'izzi-Adeni in the South and Tihami in the West, where ق is [q] and ج is [g].
Semantic lexicon
A semantic lexicon is a digital dictionary of words labeled with semantic classes, so that associations can be drawn between words that have not previously been encountered. Semantic lexicons are built upon semantic networks, which represent the semantic relations between words. The difference between a semantic lexicon and a semantic network is that a semantic lexicon has a definition, or "gloss", for each word. Semantic lexicons are made up of lexical entries. These entries are not orthographic but semantic, which eliminates issues of homonymy and polysemy.
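A minimal sketch of what "semantic rather than orthographic" entries might look like: each entry is keyed by a sense, so one spelling with two meanings becomes two entries, each with its own semantic class and gloss. The `LexicalEntry` class, the `LEXICON` contents, and the helper names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LexicalEntry:
    sense_id: str        # the key identifies a sense, not a spelling
    word: str
    semantic_class: str
    gloss: str


# Hypothetical toy lexicon; classes and glosses are made up for this example.
LEXICON = [
    LexicalEntry("bank.1", "bank", "institution",
                 "a business that holds and lends money"),
    LexicalEntry("bank.2", "bank", "landform",
                 "sloping land beside a river"),
    LexicalEntry("shore.1", "shore", "landform",
                 "land along the edge of a body of water"),
]


def senses(word: str) -> list[LexicalEntry]:
    """Return every sense entry recorded for a given spelling."""
    return [entry for entry in LEXICON if entry.word == word]


def related_by_class(sense_id: str) -> list[str]:
    """List the other senses that carry the same semantic class."""
    target = next(entry for entry in LEXICON if entry.sense_id == sense_id)
    return sorted(
        entry.sense_id
        for entry in LEXICON
        if entry.semantic_class == target.semantic_class
        and entry.sense_id != sense_id
    )
```

Because the two senses of "bank" are separate entries, the river sense is associated with "shore" through the shared class, while the financial sense is not.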
Tunisian Arabic
Tunisian Arabic, or simply Tunisian, is a set of dialects of Maghrebi Arabic spoken in Tunisia. It is known among its over 11 million speakers as Tounsi [ˈtunsi] ("Tunisian") or Derja ("Everyday Language") to distinguish it from Modern Standard Arabic, the official language of Tunisia. Tunisian Arabic is most similar to eastern Algerian Arabic and western Libyan Arabic. As part of a dialect continuum, Tunisian merges into Algerian Arabic and Libyan Arabic at the borders of the country.
Arabic phonology
While many languages have numerous dialects that differ in phonology, the contemporary spoken Arabic language is more properly described as a continuum of varieties. This article deals primarily with Modern Standard Arabic (MSA), which is the standard variety shared by educated speakers throughout Arabic-speaking regions. MSA is used in writing in formal print media and orally in newscasts, speeches and formal declarations of numerous types. Modern Standard Arabic has 28 consonant phonemes and 6 vowel phonemes, whereas most modern dialects have 8 or 10 vowels.
Translation
Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between translating (a written text) and interpreting (oral or signed communication between users of different languages); under this distinction, translation can begin only after the appearance of writing within a language community.
Lexical Markup Framework
Language resource management: Lexical markup framework (LMF; ISO 24613:2008) is the International Organization for Standardization ISO/TC37 standard for natural language processing (NLP) and machine-readable dictionary (MRD) lexicons. The scope is standardization of principles and methods relating to language resources in the contexts of multilingual communication. The goals of LMF are to provide a common model for the creation and use of lexical resources, to manage the exchange of data between and among these resources, and to enable the merging of a large number of individual electronic resources to form extensive global electronic resources.
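The merging goal can be illustrated with a rough sketch: once separate lexicons follow one shared model, combining them reduces to grouping entries by a common key. The `Entry` class and `merge_lexicons` function below are hypothetical simplifications and do not reproduce the actual class model defined in ISO 24613.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    """A simplified lexical entry shared by all input lexicons (assumed model)."""
    lemma: str
    part_of_speech: str
    senses: set[str] = field(default_factory=set)  # glosses kept as plain strings


def merge_lexicons(*lexicons: list[Entry]) -> dict[tuple[str, str], Entry]:
    """Merge entries sharing (lemma, part of speech) and union their senses."""
    merged: dict[tuple[str, str], Entry] = {}
    for lexicon in lexicons:
        for entry in lexicon:
            key = (entry.lemma, entry.part_of_speech)
            if key not in merged:
                # Copy the sense set so the input lexicons are left untouched.
                merged[key] = Entry(entry.lemma, entry.part_of_speech,
                                    set(entry.senses))
            else:
                merged[key].senses |= entry.senses
    return merged


# Two small hypothetical resources that overlap on the noun "book".
lexicon_a = [Entry("book", "noun", {"a written work"})]
lexicon_b = [
    Entry("book", "noun", {"a bound set of pages"}),
    Entry("book", "verb", {"to reserve"}),
]
combined = merge_lexicons(lexicon_a, lexicon_b)
```

In this sketch `combined` holds two entries: the noun "book" with both glosses, and the verb "book". A real merge would also need to reconcile differing sense inventories, which this example deliberately ignores.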