TY - JOUR
T1 - URJC-Team at PoliticEs 2022
T2 - 2022 Iberian Languages Evaluation Forum, IberLEF 2022
AU - Rodríguez-García, Miguel Ángel
AU - Herranz, Soto Montalvo
AU - Unanue, Raquel Martínez
N1 - Publisher Copyright:
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
PY - 2022
Y1 - 2022
N2 - Different works have demonstrated the relationship between personality traits and political ideology and how they influence our daily lives. The challenge proposed in the IberLEF 2022 Task, PoliticEs, consists of extracting political ideology traits from the text by utilising Natural Language Processing (NLP) techiques. This paper describes the participation of the URJC-Team in such task. In particular, the achievement of extracting political ideology features walks through identifying the gender, the profession, and the political spectrum from a binary and multi-class perspective. In this work, we proposed two Machine Learning models to address the binary and multiclass classification problems, a Linear Support Vector Machine and Logistic Regression. The utilized dataset comprises hundreds of tweets that are cleaned and processed to generate various representations that serve as an input for the system. Between the different proposed subtasks, the proposed classification method has obtained competitive results for the binary ideology classification task, reaching 0.81. The proposal has great room for improvement, and we have planned the following steps for it.
AB - Different works have demonstrated the relationship between personality traits and political ideology and how they influence our daily lives. The challenge proposed in the IberLEF 2022 Task, PoliticEs, consists of extracting political ideology traits from the text by utilising Natural Language Processing (NLP) techiques. This paper describes the participation of the URJC-Team in such task. In particular, the achievement of extracting political ideology features walks through identifying the gender, the profession, and the political spectrum from a binary and multi-class perspective. In this work, we proposed two Machine Learning models to address the binary and multiclass classification problems, a Linear Support Vector Machine and Logistic Regression. The utilized dataset comprises hundreds of tweets that are cleaned and processed to generate various representations that serve as an input for the system. Between the different proposed subtasks, the proposed classification method has obtained competitive results for the binary ideology classification task, reaching 0.81. The proposal has great room for improvement, and we have planned the following steps for it.
KW - Author Profiling
KW - Machine Learning
KW - Natural Language Processing
KW - Political Ideology Classification
KW - Social Media
UR - https://www.scopus.com/pages/publications/85137324226
U2 - 100000/000
DO - 100000/000
M3 - Conference article
AN - SCOPUS:85137324226
SN - 1613-0073
VL - 3202
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
Y2 - 20 September 2022
ER -