UTF-8UTF-8 is a variable-length character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode (or Universal Coded Character Set) Transformation Format - 8-bit. UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units. Code points with lower numerical values, which tend to occur more frequently, are encoded using fewer bytes.
Character (computing)In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language. Examples of characters include letters, numerical digits, common punctuation marks (such as "." or "-"), and whitespace. The concept also includes control characters, which do not correspond to visible symbols but rather to instructions to format or process the text.
PunctuationPunctuation marks are marks indicating how a piece of written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from 9th century BC, consisting of points between the words and horizontal strokes between sections. The alphabet-based writing begun with no spaces, no capitalization, no vowels (see abjad), and with only a few punctuation marks, as it was mostly aimed at recording business transactions.
BrailleBraille (breɪl , bʁɑj) is a tactile writing system used by people who are visually impaired, including people who are blind, deafblind or who have low vision. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone devices. Braille can be written using a slate and stylus, a braille writer, an electronic braille notetaker or with the use of a computer connected to a braille embosser. Braille is named after its creator, Louis Braille, a Frenchman who lost his sight as a result of a childhood accident.
Persian alphabetThe Persian alphabet (Alefbâye Fârsi), also known as the Perso-Arabic script, is the right-to-left alphabet used for the Persian language. It is a variation of the Arabic alphabet with four additional letters added: پ چ ژ گ. It was the basis of many Arabic-based scripts used in Central and South Asia. It is used for the Iranian and Dari standard varieties of Persian; and is one of two official writing systems for the Persian language, alongside the Cyrillic-based Tajik alphabet.
EBCDICExtended Binary Coded Decimal Interchange Code (EBCDIC; ˈɛbsᵻdɪk) is an eight-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding six-bit binary-coded decimal code used with most of IBM's computer peripherals of the late 1950s and early 1960s. It is supported by various non-IBM platforms, such as Fujitsu-Siemens' BS2000/OSD, OS-IV, MSP, and MSP-EX, the SDS Sigma series, Unisys VS/9, Unisys MCP and ICL VME.