Transliteration
Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus trans- + liter-) in predictable ways, such as Greek → , Cyrillic → , Greek → the digraph , Armenian → or Latin → . For instance, for the Modern Greek term "Ελληνική Δημοκρατία", which is usually translated as "Hellenic Republic", the usual transliteration to Latin script is , and the name for Russia in Cyrillic script, "Россия", is usually transliterated as .
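The letter-for-letter swapping described above can be sketched as a simple table lookup. This is a minimal illustration only: the table `CYRILLIC_TO_LATIN` and the function `transliterate` are hypothetical names, and the handful of mappings shown is an assumed subset chosen for the example, not a complete or standardized romanization scheme.

```python
# Hypothetical, deliberately tiny Cyrillic-to-Latin table (lowercase keys only).
# A real scheme would cover the whole alphabet and context-dependent rules.
CYRILLIC_TO_LATIN = {
    "р": "r", "о": "o", "с": "s", "и": "i", "я": "ya",
    "м": "m", "к": "k", "в": "v", "а": "a",
}


def transliterate(text: str, table: dict[str, str] = CYRILLIC_TO_LATIN) -> str:
    """Replace each character by its table entry, preserving capitalization."""
    out = []
    for ch in text:
        latin = table.get(ch.lower())
        if latin is None:
            # Characters outside the table are passed through unchanged.
            out.append(ch)
        elif ch.isupper():
            out.append(latin.capitalize())
        else:
            out.append(latin)
    return "".join(out)


print(transliterate("Россия"))  # prints "Rossiya"
```

Because the mapping is applied one character at a time, a single source letter may yield a digraph (here "я" becomes "ya"), while characters that are not in the table are left as they are.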
Machine translation
Machine translation is the use of either rule-based or probabilistic (i.e. statistical and, most recently, neural network-based) machine learning approaches to translate text or speech from one language to another, including the contextual, idiomatic and pragmatic nuances of both languages. The origins of machine translation can be traced back to the work of Al-Kindi, a ninth-century Arabic cryptographer who developed techniques for systemic language translation, including cryptanalysis, frequency analysis, and probability and statistics, which are used in modern machine translation.
Arabic alphabet
The Arabic alphabet (الْأَبْجَدِيَّة الْعَرَبِيَّة, [ʔælʔæbʒædijːæ-lʕɑrɑbijːæ], or الْحُرُوف الْعَرَبِيَّة), or Arabic abjad, is the Arabic script as it is codified for writing Arabic. It is written from right to left in a cursive style and includes 28 letters. Most letters have contextual letterforms. The Arabic alphabet is classed as an abjad, a script that writes only consonants, but it is now considered an "impure abjad": as with other impure abjads, such as the Hebrew alphabet, scribes later devised means of indicating vowel sounds with separate vowel diacritics.
Google Translate
Google Translate is a multilingual neural machine translation service developed by Google to translate text, documents and websites from one language into another. It offers a website interface, a mobile app for Android and iOS, as well as an API that helps developers build browser extensions and software applications. As of 2022, Google Translate supports languages at various levels; it claimed over 500 million total users, with more than 100 billion words translated daily, after the company stated in May 2013 that it served over 200 million people daily.
Computer-assisted translation
Computer-aided translation (CAT), also referred to as computer-assisted translation or computer-aided human translation (CAHT), is the use of software to assist a human translator in the translation process. The translation is created by a human, and certain aspects of the process are facilitated by software; this is in contrast with machine translation (MT), in which the translation is created by a computer, optionally with some human intervention (e.g. pre-editing and post-editing).
Lebanese Arabic
Lebanese Arabic (عَرَبِيّ لُبْنَانِيّ; autonym: [ˈʕaɾabe lɪbˈneːne]), or simply Lebanese (لُبْنَانِيّ; autonym: [lɪbˈneːne]), is a variety of North Levantine Arabic indigenous to and spoken primarily in Lebanon. It has significant linguistic influences borrowed from other Middle Eastern and European languages and is in some ways distinct from other varieties of Arabic. Because of multilingualism and pervasive diglossia among Lebanese people (a majority are bilingual or trilingual), it is common for speakers to code-switch between, or mix, Lebanese Arabic, French, and English in their daily speech.
Scientific transliteration of Cyrillic
Scientific transliteration, variously called academic, linguistic, international, or scholarly transliteration, is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). This system is most often seen in linguistics publications on Slavic languages. Scientific transliteration of Cyrillic into Latin was first introduced in 1898 as part of the standardization process for the Preußische Instruktionen (PI) in 1899.
Statistical machine translation
Statistical machine translation (SMT) was a machine translation approach that superseded the previous rule-based approach, which required an explicit description of each and every linguistic rule; this was costly and often did not generalize to other languages. Since 2003, the statistical approach itself has been gradually superseded by the deep learning-based neural network approach. The first ideas of statistical machine translation were introduced by Warren Weaver in 1949, including the ideas of applying Claude Shannon's information theory.
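One common way to express the information-theoretic idea mentioned above is the noisy-channel formulation, sketched below. The symbols are notation assumed for this illustration and do not come from the text: f is the source sentence, e a candidate translation, P(e) a language model, and P(f | e) a translation model.

```latex
\hat{e} \;=\; \arg\max_{e} P(e \mid f)
        \;=\; \arg\max_{e} \frac{P(f \mid e)\, P(e)}{P(f)}
        \;=\; \arg\max_{e} P(f \mid e)\, P(e)
```

The denominator P(f) can be dropped because it does not depend on the candidate e, so the search only needs the two models estimated from data.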
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation of speech sounds in written form. The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators, and translators.
Moroccan Arabic
Moroccan Arabic (العربية المغربية الدارجة, "Moroccan vernacular Arabic"), also known as Darija (الدارجة), is the dialectal, vernacular form or forms of Arabic spoken in Morocco. It is part of the Maghrebi Arabic dialect continuum and as such is mutually intelligible to some extent with Algerian Arabic and, to a lesser extent, with Tunisian Arabic. It is spoken by 90.9% of the population of Morocco.