TY - JOUR
T1 - A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
T2 - definitions, literature review, trends and challenges
AU - Torregrosa, Javier
AU - Bello-Orgaz, Gema
AU - Martínez-Cámara, Eugenio
AU - Ser, Javier Del
AU - Camacho, David
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12/12
Y1 - 2022/12/12
N2 - Extremism has grown as a global problem for society in recent years, especially after the apparition of movements such as jihadism. This and other extremist groups have taken advantage of different approaches, such as the use of Social Media, to spread their ideology, promote their acts and recruit followers. The extremist discourse, therefore, is reflected on the language used by these groups. Natural language processing (NLP) provides a way of detecting this type of content, and several authors make use of it to describe and discriminate the discourse held by these groups, with the final objective of detecting and preventing its spread. Following this approach, this survey aims to review the contributions of NLP to the field of extremism research, providing the reader with a comprehensive picture of the state of the art of this research area. The content includes a first conceptualization of the term extremism, the elements that compose an extremist discourse and the differences with other terms. After that, a review description and comparison of the frequently used NLP techniques is presented, including how they were applied, the insights they provided, the most frequently used NLP software tools, descriptive and classification applications, and the availability of datasets and data sources for research. Finally, research questions are approached and answered with highlights from the review, while future trends, challenges and directions derived from these highlights are suggested towards stimulating further research in this exciting research area.
AB - Extremism has grown as a global problem for society in recent years, especially after the apparition of movements such as jihadism. This and other extremist groups have taken advantage of different approaches, such as the use of Social Media, to spread their ideology, promote their acts and recruit followers. The extremist discourse, therefore, is reflected on the language used by these groups. Natural language processing (NLP) provides a way of detecting this type of content, and several authors make use of it to describe and discriminate the discourse held by these groups, with the final objective of detecting and preventing its spread. Following this approach, this survey aims to review the contributions of NLP to the field of extremism research, providing the reader with a comprehensive picture of the state of the art of this research area. The content includes a first conceptualization of the term extremism, the elements that compose an extremist discourse and the differences with other terms. After that, a review description and comparison of the frequently used NLP techniques is presented, including how they were applied, the insights they provided, the most frequently used NLP software tools, descriptive and classification applications, and the availability of datasets and data sources for research. Finally, research questions are approached and answered with highlights from the review, while future trends, challenges and directions derived from these highlights are suggested towards stimulating further research in this exciting research area.
KW - Natural language processing
KW - Radicalization
KW - Extremism
KW - Machine learning
KW - Deep learning
KW - Natural language processing
KW - Radicalization
KW - Extremism
KW - Machine learning
KW - Deep learning
UR - http://www.scopus.com/inward/record.url?scp=85122834709&partnerID=8YFLogxK
U2 - 10.1007/s12652-021-03658-z
DO - 10.1007/s12652-021-03658-z
M3 - Article
SN - 1868-5137
VL - unknown
SP - 9869
EP - 9905
JO - Journal of Ambient Intelligence and Humanized Computing
JF - Journal of Ambient Intelligence and Humanized Computing
IS - 8
ER -