Concept

Evaluation of binary classifiers

The evaluation of binary classifiers compares two methods of assigning a binary attribute, one of which is usually a standard method and the other is being investigated. There are many metrics that can be used to measure the performance of a classifier or predictor; different fields have different preferences for specific metrics due to different goals. For example, in medicine sensitivity and specificity are often used, while in computer science precision and recall are preferred. An important distinction is between metrics that are independent on the prevalence (how often each category occurs in the population), and metrics that depend on the prevalence – both types are useful, but they have very different properties. Probabilistic classification models go beyond providing binary outputs and instead produce probability scores for each class. These models are designed to assess the likelihood or probability of an instance belonging to different classes. In the context of evaluating probabilistic classifiers, alternative evaluation metrics have been developed to properly assess the performance of these models. These metrics take into account the probabilistic nature of the classifier's output and provide a more comprehensive assessment of its effectiveness in assigning accurate probabilities to different classes. These evaluation metrics aim to capture the degree of calibration, discrimination, and overall accuracy of the probabilistic classifier's predictions. Confusion matrix Given a data set, a classification (the output of a classifier on that set) gives two numbers: the number of positives and the number of negatives, which add up to the total size of the set. To evaluate a classifier, one compares its output to another reference classification – ideally a perfect classification, but in practice the output of another gold standard test – and cross tabulates the data into a 2×2 contingency table, comparing the two classifications. One then evaluates the classifier relative to the gold standard by computing summary statistics of these 4 numbers.

Source officielle

https://en.wikipedia.org/wiki/Evaluation_of_binary_classifiers

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Cours associés (3)

MATH-412: Statistical machine learning

A course on statistical machine learning for supervised and unsupervised learning

CS-423: Distributed information systems

This course introduces the foundations of information retrieval, data mining and knowledge bases, which constitute the foundations of today's Web-based distributed information systems.

CS-233(a): Introduction to machine learning (BA3)

Machine learning and data analysis are becoming increasingly central in many sciences and applications. In this course, fundamental principles and methods of machine learning will be introduced, analy

Concepts associés (2)

F-score

In statistical analysis of binary classification, the F-score or F-measure is a measure of a test's accuracy. It is calculated from the precision and recall of the test, where the precision is the number of true positive results divided by the number of all positive results, including those not identified correctly, and the recall is the number of true positive results divided by the number of all samples that should have been identified as positive.

Précision et rappel

vignette|350px|Précision et rappel (« recall »). La précision compte la proportion d'items pertinents parmi les items sélectionnés alors que le rappel compte la proportion d'items pertinents sélectionnés parmi tous les items pertinents sélectionnables. Dans les domaines de la reconnaissance de formes, de la recherche d'information et de la classification automatique, la précision (ou valeur prédictive positive) est la proportion des items pertinents parmi l'ensemble des items proposés ; le rappel (ou sensibilité) est la proportion des items pertinents proposés parmi l'ensemble des items pertinents.

Evaluation of binary classifiers - Wikipedia

Source officielle

https://en.wikipedia.org/wiki/Evaluation_of_binary_classifiers

À propos de ce résultat

Cours associés (3)

MATH-412: Statistical machine learning

A course on statistical machine learning for supervised and unsupervised learning

CS-423: Distributed information systems

This course introduces the foundations of information retrieval, data mining and knowledge bases, which constitute the foundations of today's Web-based distributed information systems.

CS-233(a): Introduction to machine learning (BA3)

Séances de cours associées (14)

Évaluation des classificateurs binaires

Discute de l'évaluation des classificateurs binaires, y compris le rappel, la sensibilité, la spécificité, les courbes ROC et les mesures de performance.

Sans titre

Régression linéaire généralisée : classification

Explorer la régression linéaire généralisée, la classification, les matrices de confusion, les courbes ROC et le bruit dans les données.

Afficher plus

Publications associées (12)

Computation of sensitivity coefficients in fixed source simulations with SERPENT2

Andreas Pautz, Mathieu Hursin

Within the scope of the implementation of a nuclear data pipeline aiming at producing the best possible evaluated nuclear data files, a major point is the production of relevant sensitivity coefficients when including integral benchmark information. Thanks ...

Elsevier Science Sa2024

Model-Class Selection Using Clustering and Classification for Structural Identification and Prediction

Ian Smith, Sai Ganesh Sarvotham Pai, Masoud Sanayei

Structural identification using physics-based models and subsequent prediction have much potential to enhance civil infrastructure asset-management decision-making. Interpreting monitoring information in the presence of multiple uncertainty sources and sys ...

2021

Evaluating salient object detection in natural images with multiple objects having multi-level saliency

Sabine Süsstrunk, Gökhan Yildirim

Salient object detection is evaluated using binary ground truth (GT) with the labels being salient object class and background. In this study, the authors corroborate based on three subjective experiments on a novel image dataset that objects in natural im ...

2020

Afficher plus

Concepts associés (2)

F-score

Précision et rappel