Language-Agnostic detection of Current Events across Wikipedia

Date

2023

Authors

Antypova, Alisa

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Currently, English Wikipedia alone includes over 6,660,000 articles and it averages 550 new articles per day. To assist the readers in identifying pages that cover recent noteworthy occurrences the Current Events portal was implemented. However, this portal is maintained manually with notable quality differences across languages. The main goal of this work is to establish the task of supervised event detection in Wikipedia and propose a language-agnostic solution to address this problem. This is an important milestone towards improving the quality of the Current Events Portal for the languages with not many active editor communities. In this work, we reviewed existing research on this topic, and by combining and enriching those existing solutions, we proposed a current event detection dataset based on the Current Events Portal updates and Wikipedia pages’ features. Also, we developed a language-agnostic event detection model and reported its performance in English, German, and Polish languages, showing that is possible to automatize this task. The outcome of this work can be used to assist Wikipedia editors to keep the Current Event portal updated, saving time and the human effort used on this task.

Description

Keywords

Citation

Antypova Alisa. Language-Agnostic detection of Current Events across Wikipedia. Master Thesis. Ukrainian Catholic University, Faculty of Applied Sciences, Department of Computer Sciences. Lviv 2023, 46 p.

Collections

Endorsement

Review

Supplemented By

Referenced By