
Bilingual Hate Speech Detection on Social Media: Amharic and Afaan Oromo

Abstract: Owing to significant increases in internet penetration and advances in smartphone technology over the past couple of decades, many people have started using social media as a communication platform. Social media has become one of the most significant components of everyday communication and offers several benefits. However, the technology also poses a number of threats and challenges, such as hate speech, disinformation, and fake news. Social media platforms are frequently accused of not doing enough to thwart hate speech on their platforms, and hate speech detection is one way to address this. People in bilingual and multinational societies commonly code-mix in both spoken and written communication. Among them, speakers of Amharic and Afaan Oromo frequently mix the two languages when conversing and posting on social media. Most previous studies concentrated on identifying hate speech either in technologically favoured languages or in a single Ethiopian language; however, the prevalence of bilingual communication on social media hampers the detection of hate speech that spreads there. In this work, bilingual hate speech detection for Amharic and Afaan Oromo was conducted using four deep learning classifiers (CNN, BiLSTM, CNN-BiLSTM, and BiGRU) and three feature extraction techniques (Keras word embedding, word2vec, and FastText). In the experiments, BiLSTM with FastText feature extraction outperformed the other combinations, achieving 78.05% accuracy for bilingual Amharic-Afaan Oromo hate speech detection. FastText feature extraction overcomes the out-of-vocabulary (OOV) problem. Furthermore, we are working towards including other linguistic features of the languages to detect hate speech, and towards making the resource available to facilitate further research on bilingual hate speech detection for other under-resourced Ethiopian languages.
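The abstract's remark that FastText overcomes the OOV problem rests on subword modelling: a word's vector is composed from its character n-grams, so an unseen word still gets a vector from pieces it shares with seen words. The following is a minimal pure-Python sketch of that idea, not the authors' pipeline. The function names, the 8-dimensional vectors, and the hash-derived stand-in for trained n-gram embeddings are all assumptions made for illustration, and the two sample strings are arbitrary.

```python
import hashlib


def char_ngrams(word, n_min=3, n_max=6):
    """Return character n-grams of `word`, wrapped in '<' and '>' boundary markers."""
    marked = f"<{word}>"
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(marked) - n + 1):
            grams.append(marked[i:i + n])
    # Very short inputs may yield no n-grams; fall back to the marked word itself.
    return grams or [marked]


def ngram_vector(gram, dim=8):
    """Hypothetical stand-in for a trained n-gram embedding.

    A real model learns these vectors; here each one is derived
    deterministically from a hash so the sketch needs no training data.
    """
    digest = hashlib.sha256(gram.encode("utf-8")).digest()
    return [(byte / 255.0) * 2.0 - 1.0 for byte in digest[:dim]]


def word_vector(word, dim=8):
    """Average the n-gram vectors, so any string (seen or unseen) gets a vector."""
    vectors = [ngram_vector(g, dim) for g in char_ngrams(word)]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


if __name__ == "__main__":
    # Two arbitrary spellings that overlap in many n-grams.
    shared = set(char_ngrams("Oromo")) & set(char_ngrams("Oromoo"))
    print(sorted(shared))
    print(word_vector("Oromoo"))
```

Because the two sample strings share several n-grams, their averaged vectors draw on common components. This is the property that lets a subword model place an unseen spelling near related seen words, whereas a plain word-level lookup table would have no entry for it at all.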

Related Results

Afaan Oromo Multi-Label News Text Classification Using Deep Learning Approach
Abstract Classification is a technique for categorizing textual data into a form of predefined categories. Due to its major consequences in regard to critical tasks such as...
Targets, themes, types, and forms of hate speech, then and now (Vihapuheen kohteet ja teemat sekä lajit ja muodot ennen ja nyt)
This article analyzes the nature of hate speech and its forms of discourse in the 1930s and the 2000s. The aim was to look for the similarities and differences that the two different eras...
Generational Wisdom: Lesson from the Oromo People
This review explores the foundational elements of Oromo generational wisdom, focusing on how their rich cultural heritage, particularly the Gadaa system, is passed down through gen...
From Hate Crime to Disability Hate Crime
This chapter traces the journey from hate crime to Disability Hate Crime through an analysis of the relevant literature including policy related documents which construct and refer...
A Criminological Study of Hate Speech by the Fufufafa Account and the Application of Criminal Law (Kajian Kriminologi Tindakan Hate Speech Akun Fufufafa dan Penerapan Hukum Pidana)
Abstract. The advancement of information and communication technology has given rise to the cyber era, transforming the way society interacts, including how individuals express the...
Modeling and Analysis of Hate speech Propagation in a Community using Fractional Order Derivatives
Abstract The propagation of hate speech directed toward local public sector administrations in a community has become an issue of great concern. Hate speech not only underm...
Countering hate speech: modeling user-generated web content using natural language processing
Social media is considered a particularly conducive arena for hate speech. Counter speech, which is a "direct response that counters hate speech" is a remedy to address hate speech...
Amharic Adhoc Information Retrieval System Based on Morphological Features
Information retrieval (IR) is one of the most important research and development areas due to the explosion of digital data and the need of accessing relevant information from huge...
