Catégorie

Traitement automatique du langage naturel

Concepts associés (115)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Relationship extraction

A relationship extraction task requires the detection and classification of semantic relationship mentions within a set of artifacts, typically from text or XML documents. The task is very similar to that of information extraction (IE), but IE additionally requires the removal of repeated relations (disambiguation) and generally refers to the extraction of many different relationships. The concept of relationship extraction was first introduced during the 7th Message Understanding Conference in 1998.

Genus–differentia definition

A genus–differentia definition is a type of intensional definition, and it is composed of two parts: a genus (or family): An existing definition that serves as a portion of the new definition; all definitions with the same genus are considered members of that genus. the differentia: The portion of the definition that is not provided by the genus. For example, consider these two definitions: a triangle: A plane figure that has 3 straight bounding sides. a quadrilateral: A plane figure that has 4 straight bounding sides.

Stochastic grammar

A stochastic grammar (statistical grammar) is a grammar framework with a probabilistic notion of grammaticality: Stochastic context-free grammar Statistical parsing Data-oriented parsing Hidden Markov model Estimation theory The grammar is realized as a language model. Allowed sentences are stored in a database together with the frequency how common a sentence is. Statistical natural language processing uses stochastic, probabilistic and statistical methods, especially to resolve difficulties that arise because longer sentences are highly ambiguous when processed with realistic grammars, yielding thousands or millions of possible analyses.

Coréférence

En linguistique, la coréférence est le phénomène qui consiste pour plusieurs syntagmes nominaux (SN) différents contenus dans une phrase ou dans un discours, à désigner la même entité. Par exemple une personne, un lieu, un événement, ou encore une date. Dans la terminologie linguistique, on dit qu'une coréférence est reliée à son antécédent. Pour que les syntagmes se coréférent, les deux expressions doivent porter les mêmes trait-φ. C'est-à-dire qu'ils doivent être en accord en genre, en nombre et en personne.

Natural Language Toolkit

The Natural Language Toolkit, or more commonly NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing (NLP) for English written in the Python programming language. It supports classification, tokenization, stemming, tagging, parsing, and semantic reasoning functionalities. It was developed by Steven Bird and Edward Loper in the Department of Computer and Information Science at the University of Pennsylvania. NLTK includes graphical demonstrations and sample data.

Perfection

La perfection est l'atteinte d'un stade suprême ou idéal pour un être ou un objet. La perfection réunit toutes les qualités et n'a pas de défaut. Platon traite de la perfection avec une perspective métaphysique. Il soutient que toutes les perfections dans les êtres créés n'ont pu être mises en eux par le Créateur, c'est-à-dire le Dieu le plus parfait qui soit. Cette pensée concorde avec sa conception de la dégradation et de la dégénérescence : plus quelque chose évolue, plus elle s'éloigne du modèle, qui est seul à être parfait.

Lexical semantics

Lexical semantics (also known as lexicosemantics), as a subfield of linguistic semantics, is the study of word meanings. It includes the study of how words structure their meaning, how they act in grammar and compositionality, and the relationships between the distinct senses and uses of a word. The units of analysis in lexical semantics are lexical units which include not only words but also sub-words or sub-units such as affixes and even compound words and phrases. Lexical units include the catalogue of words in a language, the lexicon.

Traduction littérale

La traduction littérale, la traduction directe ou la traduction mot à mot est la traduction d'un texte d'une langue à une autre, un mot à la fois . Dans le cas des textes sacrés, il y a eu des tentatives de traductions littérales par souci d'authenticité et de fidélité aux textes originaux. C'est par exemple le cas de la traduction littérale de la Bible vers l'anglais à partir des textes originaux hébreux et grecs publiée en 1862 par le bibliste autodidacte écossais (1822–1888).

Oxford English Dictionary

L’Oxford English Dictionary (OED) est un dictionnaire de référence pour la langue anglaise. Il est publié par l'Oxford University Press et contient des mots venant du Royaume-Uni et des diverses régions du monde anglophone : Amérique du Nord, Afrique du Sud, Australie, Nouvelle-Zélande, Caraïbes. Il indique l'histoire, la prononciation et la définition des mots. Comme pour d'autres grands dictionnaires, le travail de rédaction s'est étalé sur plusieurs décennies. La première édition complète, comprenant vingt tomes, est publiée en 1928.

Rocchio algorithm

The Rocchio algorithm is based on a method of relevance feedback found in information retrieval systems which stemmed from the SMART Information Retrieval System developed between 1960 and 1964. Like many other retrieval systems, the Rocchio algorithm was developed using the vector space model. Its underlying assumption is that most users have a general conception of which documents should be denoted as relevant or irrelevant.