Speech recognitionSpeech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.
International Phonetic AlphabetThe International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation of speech sounds in written form. The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators, and translators.
SyllableA syllable is a unit of organization for a sequence of speech sounds typically made up of a syllable nucleus (most often a vowel) with optional initial and final margins (typically, consonants). Syllables are often considered the phonological "building blocks" of words. They can influence the rhythm of a language, its prosody, its poetic metre and its stress patterns. Speech can usually be divided up into a whole number of syllables: for example, the word ignite is made of two syllables: ig and nite.
Phonemic orthographyA phonemic orthography is an orthography (system for writing a language) in which the graphemes (written symbols) correspond to the phonemes (significant spoken sounds) of the language. Natural languages rarely have perfectly phonemic orthographies; a high degree of grapheme–phoneme correspondence can be expected in orthographies based on alphabetic writing systems, but they differ in how complete this correspondence is.
Syllable weightIn linguistics, syllable weight is the concept that syllables pattern together according to the number and/or duration of segments in the rime. In classical Indo-European verse, as developed in Greek, Sanskrit, and Latin, distinctions of syllable weight were fundamental to the meter of the line. Mora (linguistics) A heavy syllable is a syllable with a branching nucleus or a branching rime, although not all such syllables are heavy in every language.
Length (phonetics)In phonetics, length or quantity is a feature of sounds that have distinctively extended duration compared with other sounds. There are long vowels as well as long consonants (the latter are often called geminates). Many languages do not have distinctive length. Among the languages that have distinctive length, there are only a few that have both distinctive vowel length and distinctive consonant length. It is more common that there is only one or that they depend on each other.
PhoneticsPhonetics is a branch of linguistics that studies how humans produce and perceive sounds, or in the case of sign languages, the equivalent aspects of sign. Linguists who specialize in studying the physical properties of speech are phoneticians. The field of phonetics is traditionally divided into three sub-disciplines based on the research questions involved such as how humans plan and execute movements to produce speech (articulatory phonetics), how various movements affect the properties of the resulting sound (acoustic phonetics), or how humans convert sound waves to linguistic information (auditory phonetics).
ConsonantIn articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are [p] and [b], pronounced with the lips; [t] and [d], pronounced with the front of the tongue; [k] and [g], pronounced with the back of the tongue; [h], pronounced in the throat; [f], [v], and [s], pronounced by forcing air through a narrow channel (fricatives); and [m] and [n], which have air flowing through the nose (nasals). Contrasting with consonants are vowels.
TelephoneA telephone is a telecommunications device that permits two or more users to conduct a conversation when they are too far apart to be easily heard directly. A telephone converts sound, typically and most efficiently the human voice, into electronic signals that are transmitted via cables and other communication channels to another telephone which reproduces the sound to the receiving user. The term is derived from τῆλε (tēle, far) and φωνή (phōnē, voice), together meaning distant voice.
PhonotacticsPhonotactics (from Ancient Greek phōnḗ "voice, sound" and taktikós "having to do with arranging") is a branch of phonology that deals with restrictions in a language on the permissible combinations of phonemes. Phonotactics defines permissible syllable structure, consonant clusters and vowel sequences by means of phonotactic constraints. Phonotactic constraints are highly language-specific. For example, in Japanese, consonant clusters like /st/ do not occur.