Publication

Debunking Misinformation on the Web: Detection, Validation, and Visualisation

Thành Tâm Nguyên
2019
Thèse EPFL
Résumé

Our modern society is struggling with an unprecedented amount of online misinformation, which does harm to democracy, economics, and cybersecurity. Journalism and politics have been impacted by misinformation on a global scale, with weakened public trust in governments seen during the Brexit referendum and viral fake election stories outperforming genuine news on social media during the 2016 U.S. presidential election campaign. Online misinformation also single-handedly caused $136.5 billion in losses in the stock market value through a single tweet about explosions in the White House. Such attacks are even driven by the advances of modern artificial intelligence (AI) these days and pose a new and ever-evolving cyber threat operating at the information level, which is far more advanced than traditional cybersecurity attacks at the hardware and software levels.

Research in this area is still in its infancy but demonstrates that debunking misinformation on the Web is a formidable challenge. This is due to several reasons. First, the open nature of social platforms such as Facebook and Twitter allows users to freely produce and propagate any content without authentication, and this has been exploited to spread hundreds of thousands of fake news at a rate of more than three million social posts per minute. Second, those responsible for the spread of misinformation harvest the power of AI attacking models to mix and disguise falsehoods with common news. Methods of camouflage are used to cover digital footprints through synthesizing millions of fake accounts and appearing to participate in normal social interactions with other users. Third, innocent users, without proper alerts from algorithmic models, can accidentally spread misinformation in an exponential wave of shares, posts, and articles. The misinformation wave is often only detected when already beyond control and consequently can cause large-scale effects in a very short time.

The overarching goal of this thesis is to help media organizations, governments, the public, and academia build a misinformation debunking framework, where algorithmic models and human validators are seamlessly and cost-effectively integrated to prevent the damage of misinformation from occurring. This thesis investigates three important components of such a framework. 1) Detection: Early detection can potentially prevent the spread of misinformation from occurring by flagging suspicious news for human attention; however it remains, to date, an unsolved challenge. 2)Validation: Learning a good detection model already requires a lot of training data, and yet it can be outdated swiftly with new social trends. A promising approach is to use human experts to validate the detection results, helping algorithmic models to train themselves to become smarter and adaptive to new traits of misinformation. 3) Visualisation: Disseminating the debunking reports is an important step to raise public awareness against falsehood contents and educate Web users. However, human users can be easily overwhelmed by the high volume of Web data, as the level of redundancy increases and the value density decreases.

In summary, this thesis proposed key components of building a misinformation debunking framework. The proposed techniques improve upon the state-of-the-art in a variety of misinformation domains, including rumours, Web claims, and social streams.

À propos de ce résultat
Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.
Concepts associés (50)
Désinformation sur la pandémie de Covid-19
Des campagnes de désinformation sur la pandémie de Covid-19 font suite au déclenchement de l'épidémie de maladie à coronavirus 2019 (Covid-19) causée par le SARS-CoV-2. Un très grand nombre de théories du complot, infox et cas de désinformation ont été relevés, amenant l'Organisation mondiale de la santé à parler d'infodémie.
Infox
vignette|Manifestation aux États-Unis en 2017 contre la prolifération des infox. Les infox, fausses nouvelles, fausses informations, informations fallacieuses, canards, fake news (), sont des nouvelles mensongères diffusées dans le but de manipuler ou de tromper le public. Les articles contenant de fausses nouvelles emploient souvent des titres accrocheurs ou des informations entièrement fabriquées en vue d'augmenter le nombre de lecteurs et de partages en ligne.
Misinformation
Misinformation is incorrect or misleading information. It differs from disinformation, which is deliberately deceptive and propagated information. Rumors are information not attributed to any particular source, and so are unreliable and often unverified, but can turn out to be either true or false. However, definitions of the terms might vary between cultural contexts. Even if later retracted, misinformation can continue to influence actions and memory.
Afficher plus
Publications associées (50)

Content Moderation in Online Platforms

Manoel Horta Ribeiro

A critical role of online platforms like Facebook, Wikipedia, YouTube, Amazon, Doordash, and Tinder is to moderate content. Interventions like banning users or deleting comments are carried out thousands of times daily and can potentially improve our onlin ...
EPFL2024

Where Did the News Come From? Detection of News Agency Releases in Historical Newspapers

Lea Marxen

Since their beginnings in the 1830s and 1840s, news agencies have played an important role in the national and international news market, aiming to deliver news as fast and as reliable as possible. While we know that newspapers have been using agency conte ...
2023

Combating Online Scientific Misinformation

Panagiotis Smeros

The drastic shift towards digital communication in our mediasphere has caused a profound change in the production and consumption of information, which in turn has substantial implications on the social and political landscape. Misinformation, as a side ef ...
EPFL2022
Afficher plus
MOOCs associés (1)
Enjeux Mondiaux - Communication
The Communication A module of the course on Global Issues tackles challenges related to instantaneous communication and social media. The interdisciplinary approach implemented integrates SHS and engi

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.