Automated Fact-checking for Wikipedia

Date

2021

Authors

Trokhymovych, Mykola

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The incoming flow of information is continuously increasing along with the disinformation piece that can harm society. Filtering unreliable content helps keep Wikipedia as free as possible of disinformation, making it one of the most significant reliable information sources. Consequently, Wikipedia’s knowledge base is widely used for facts verification academic research. The main goal of our work is to transform recent academic achievements into a practical open-source Wikipedia-based fact-checking application that is both accurate and efficient. We review the primary NLI related datasets and study their relevant limitations. As a result, we propose the data filtering method that improves the model’s performance and generalization. We show that transfer learning for NLI models are not working well, and complete model training is needed to achieve the best result on a specific dataset. We come up with an unsupervised fine-tuning of the Masked Language model on field-specific texts for model domain adaptation. Finally, we present the new fact-checking system WikiCheck API that automatically performs a facts validation process based on the Wikipedia knowledge base. It is comparable to SOTA solutions in terms of accuracy and can be used on low memory CPU instances.

Description

Keywords

Natural Language Inference, Automated Fact-Checking, WikiCheck

Citation

Trokhymovych, Mykola. Automated Fact-checking for Wikipedia / Mykola Trokhymovych; Supervisor: Diego Saez-Trumper; Ukrainian Catholic University, Department of Computer Sciences. – Lviv : [s.n.], 2021. – 52 p.: ill.

Collections

Endorsement

Review

Supplemented By

Referenced By