Publication

Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

Boi Faltings, Diego Matteo Antognini
2019
Conference paper

Abstract

Linking facts across documents is a challenging task, as the language used to express the same information in a sentence can vary significantly, which complicates the task of multi-document summarization. Consequently, existing approaches heavily rely on hand-crafted features, which are domain-dependent and hard to craft, or additional annotated data, which is costly to gather. To overcome these limitations, we present a novel method, which makes use of two types of sentence embeddings: universal embeddings, which are trained on a large unrelated corpus, and domain-specific embeddings, which are learned during training. To this end, we develop SemSentSum, a fully data-driven model able to leverage both types of sentence embeddings by building a sentence semantic relation graph. SemSentSum achieves competitive results on two types of summary, consisting of 665 bytes and 100 words. Unlike other state-of-the-art models, neither hand-crafted features nor additional annotated data are necessary, and the method is easily adaptable for other tasks. To our knowledge, we are the first to use multiple sentence embeddings for the task of multi-document summarization.
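The abstract does not spell out how the sentence semantic relation graph is built, so the following is only a minimal sketch of one common way to do it: connect two sentences when the cosine similarity of their embeddings reaches a threshold. The function names, the toy 2-dimensional vectors, and the threshold value of 0.5 are assumptions made for illustration and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def build_relation_graph(embeddings, threshold=0.5):
    """Return a symmetric weighted adjacency matrix over sentences.

    An edge (i, j) keeps its cosine similarity as weight when that
    similarity is at least `threshold`; otherwise the weight is 0.0.
    Both the rule and the default threshold are illustrative assumptions.
    """
    n = len(embeddings)
    adjacency = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            sim = cosine(embeddings[i], embeddings[j])
            if sim >= threshold:
                adjacency[i][j] = sim
                adjacency[j][i] = sim
    return adjacency


# Toy stand-ins for sentence embeddings: the first two point in nearly the
# same direction, the third is almost orthogonal to both.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
graph = build_relation_graph(toy_embeddings, threshold=0.5)
```

In this toy input, only the first two sentences end up connected; the third stays isolated because its similarity to the others falls below the assumed threshold. In a real pipeline the vectors would come from a pretrained sentence encoder, and a downstream model would consume the graph together with learned, domain-specific sentence representations.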

Official source

https://infoscience.epfl.ch/record/300304?ln=en



Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework

Class Specific Feature Disentanglement and Text Embeddings for Multi-label Generalized Zero Shot CXR Classification

Learning computationally efficient static word and sentence representations
