Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach

Cuzzocrea, Alfredo; Akter, Mst Shapna; Shahriar, Hossain; García Bringas, Pablo

dc.contributor.author	Cuzzocrea, Alfredo
dc.contributor.author	Akter, Mst Shapna
dc.contributor.author	Shahriar, Hossain
dc.contributor.author	García Bringas, Pablo
dc.date.accessioned	2025-03-14T11:11:08Z
dc.date.available	2025-03-14T11:11:08Z
dc.date.issued	2025-02
dc.date.updated	2025-03-14T11:11:08Z
dc.description.abstract	The plague of cyberbullying on social media exerts a dangerous influence on human lives. As online social networks continue to expand daily, the proliferation of hate speech is also growing, and distressing content is often implicated in the onset of depression and suicide-related behaviors. In this paper, we propose an innovative framework, named the trustable LSTM-autoencoder network (TLA-NET), designed to detect cyberbullying on social media by employing synthetic data. We introduce a state-of-the-art method for the automatic production of translated data, aimed at tackling data availability issues: several languages, including Hindi and Bangla, continue to face research limitations due to the absence of adequate datasets. Using TLA-NET and traditional models, such as long short-term memory (LSTM), bidirectional long short-term memory (BiLSTM), the LSTM-autoencoder, Word2vec, bidirectional encoder representations from transformers (BERT), and the Generative Pre-trained Transformer 2 (GPT-2), we experimentally identify aggressive comments in datasets in Hindi, Bangla, and English. We assess the performance of the models with the F1-score, accuracy, precision, and recall. Our model performs strongly across all the datasets, achieving 99% accuracy and positioning itself as a frontrunner when compared to previous works that make use of the dataset featured in this research.	en
dc.description.sponsorship	This work was partially supported by project SERICS (PE00000014) under the MUR National Recovery and Resilience Plan, funded by the European Union - NextGenerationEU.	en
dc.identifier.citation	Cuzzocrea, A., Akter, M. S., Shahriar, H., & García Bringas, P. (2025). Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach. Future Internet, 17(2). https://doi.org/10.3390/FI17020084
dc.identifier.doi	10.3390/FI17020084
dc.identifier.eissn	1999-5903
dc.identifier.uri	http://hdl.handle.net/20.500.14454/2534
dc.language.iso	eng
dc.publisher	Multidisciplinary Digital Publishing Institute (MDPI)
dc.rights	© 2025 by the authors
dc.subject.other	Cyber-bullying
dc.subject.other	Deep learning
dc.subject.other	Natural language processing
dc.subject.other	Neural networks
dc.title	Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach	en
dc.type	journal article
dcterms.accessRights	open access
oaire.citation.issue	2
oaire.citation.title	Future Internet
oaire.citation.volume	17
oaire.licenseCondition	https://creativecommons.org/licenses/by/4.0/
oaire.version	VoR

Files

Original bundle

Name: cuzzocrea_cyberbullying_2025.pdf
Size: 2.61 MB
Format: Adobe Portable Document Format

Collections

Articles