Speech recognitionSpeech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.
Phonemic orthographyA phonemic orthography is an orthography (system for writing a language) in which the graphemes (written symbols) correspond to the phonemes (significant spoken sounds) of the language. Natural languages rarely have perfectly phonemic orthographies; a high degree of grapheme–phoneme correspondence can be expected in orthographies based on alphabetic writing systems, but they differ in how complete this correspondence is.
PhonemeIn phonology and linguistics, a phoneme (ˈfoʊniːm) is a unit of phone that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west of England, the sound patterns sɪn (sin) and sɪŋ (sing) are two separate words that are distinguished by the substitution of one phoneme, /n/, for another phoneme, /ŋ/. Two words like this that differ in meaning through the contrast of a single phoneme form a minimal pair.
ReadingReading is the process of taking in the sense or meaning of letters, symbols, etc., especially by sight or touch. For educators and researchers, reading is a multifaceted process involving such areas as word recognition, orthography (spelling), alphabetics, phonics, phonemic awareness, vocabulary, comprehension, fluency, and motivation. Other types of reading and writing, such as pictograms (e.g., a hazard symbol and an emoji), are not based on speech-based writing systems.
GraphemeIn linguistics, a grapheme is the smallest functional unit of a writing system. The word grapheme is derived and the suffix -eme by analogy with phoneme and other names of emic units. The study of graphemes is called graphemics. The concept of graphemes is abstract and similar to the notion in computing of a character. By comparison, a specific shape that represents any particular grapheme in a given typeface is called a glyph. There are two main opposing grapheme concepts.
Bengali languageBengali (bɛnˈɡɔːli ), generally known by its endonym Bangla (বাংলা, ˈbaŋla), is an Indo-Aryan language native to the Bengal region of South Asia. With approximately 300 million native speakers and another 50 million as second language speakers, Bengali is the sixth most spoken native language and the seventh most spoken language by the total number of speakers in the world. Bengali is the fifth most spoken Indo-European language. Bengali is the official, national, and most widely spoken language of Bangladesh, with 98% of Bangladeshis using Bengali as their first language.
Writing systemA writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form of information storage and transfer. Writing systems require shared understanding between writers and readers of the meaning behind the sets of characters that make up a script.
Speech synthesisSpeech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech. The reverse process is speech recognition. Synthesized speech can be created by concatenating pieces of recorded speech that are stored in a database.
GlyphA glyph (ɡlɪf) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A grapheme, or part of a grapheme (such as a diacritic), or sometimes several graphemes in combination (a composed glyph) can be represented by a glyph. In most languages written in any variety of the Latin alphabet except English, the use of diacritics to signify a sound mutation is common.
LogogramIn a written language, a logogram, logograph, or lexigraph (from Greek logo, "word", and gramma "that which is drawn or written") is a written character that represents a word or morpheme. Chinese characters (pronounced Hànzì in Mandarin Chinese, Kanji in Japanese, Hanja in Korean, Hán tự in Vietnamese and Sawgun in Standard Zhuang) are generally logograms, as are many hieroglyphic and cuneiform characters. The use of logograms in writing is called logography, and a writing system that is based on logograms is called a logography or logographic system.