Maxime Jean Julien Peyrard
In summarization, automatic evaluation metrics are usually compared based on their ability to correlate with human judgments. Unfortunately, the few existing human judgment datasets have been created as by-products of the manual evaluations performed durin ...
ASSOC COMPUTATIONAL LINGUISTICS-ACL2019