
Cross-lingual Extended Named Entity Classification of Wikipedia Articles

April 16, 2024



Authors: The Viet Bui, Phuong Le-Hong

Comments: Accepted to NTCIR-15

Subjects: Computation and Language (cs.CL)

Abstract: Our team participated in the SHINRA2020-ML subtask of the NTCIR-15 SHINRA task. This paper describes our approach to the problem and discusses the official results. Our method focuses on learning cross-lingual representations at both the word level and the document level for page classification. We propose a three-stage approach consisting of multilingual model pre-training, monolingual model fine-tuning, and cross-lingual voting. Our system achieves the best scores for 25 out of 30 languages, and its accuracy gaps to the best-performing systems for the other five languages are relatively small.
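
To make the third stage more concrete, here is a minimal sketch of what a cross-lingual voting step could look like. The abstract does not specify the voting rule, so this example assumes soft voting: each monolingual classifier produces label probabilities for its language's version of the same Wikipedia entity, and the probabilities are summed across languages. The function name `cross_lingual_vote`, the input layout, and the labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def cross_lingual_vote(per_language_scores):
    """Pick one label for an entity by summing label probabilities
    across languages (soft voting).

    per_language_scores maps a language code to a dict of
    {label: probability} predicted for that language's page.
    This is an assumed voting rule, not the paper's exact scheme.
    """
    totals = defaultdict(float)
    for scores in per_language_scores.values():
        for label, probability in scores.items():
            totals[label] += probability

    if not totals:
        raise ValueError("no predictions to vote on")

    # Iterate labels in sorted order so ties resolve alphabetically.
    return max(sorted(totals), key=totals.get)


# Hypothetical predictions for one entity in three languages.
example = {
    "en": {"Person": 0.6, "City": 0.4},
    "vi": {"Person": 0.3, "City": 0.7},
    "ja": {"Person": 0.2, "City": 0.8},
}
winner = cross_lingual_vote(example)
```

In this example the English classifier alone would pick "Person", but the summed scores across the three languages favor "City", which illustrates how evidence from other language editions can override a single weak prediction.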

Published: October 7, 2020

PDF: https://arxiv.org/pdf/2010.03424.pdf
