Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach
dc.contributor.author | Cuzzocrea, Alfredo | |
dc.contributor.author | Akter, Mst Shapna | |
dc.contributor.author | Shahriar, Hossain | |
dc.contributor.author | García Bringas, Pablo | |
dc.date.accessioned | 2025-03-14T11:11:08Z | |
dc.date.available | 2025-03-14T11:11:08Z | |
dc.date.issued | 2025-02 | |
dc.date.updated | 2025-03-14T11:11:08Z | |
dc.description.abstract | The plague of cyberbullying on social media exerts a dangerous influence on human lives. As online social networks continue to expand daily, the proliferation of hate speech is also growing. Consequently, distressing content is often implicated in the onset of depression and suicide-related behaviors. In this paper, we propose an innovative framework, named the trustable LSTM-autoencoder network (TLA-NET), which is designed for the detection of cyberbullying on social media by employing synthetic data. We introduce a state-of-the-art method for the automatic production of translated data, which is aimed at tackling data availability issues. Several languages, including Hindi and Bangla, continue to face research limitations due to the absence of adequate datasets. By employing TLA-NET and traditional models, such as long short-term memory (LSTM), bidirectional long short-term memory (BiLSTM), the LSTM-autoencoder, Word2vec, bidirectional encoder representations from transformers (BERT), and the Generative Pre-trained Transformer 2 (GPT-2), we perform the experimental identification of aggressive comments in datasets in Hindi, Bangla, and English. We assess the performance of the models with evaluation metrics that include the F1-score, accuracy, precision, and recall. Our model demonstrates outstanding performance across all the datasets, achieving 99% accuracy and positioning itself as a frontrunner when compared to previous works that make use of the dataset featured in this research. | en |
dc.description.sponsorship | This work was partially supported by project SERICS (PE00000014) under the MUR National Recovery and Resilience Plan funded by the European Union - NextGenerationEU | en |
dc.identifier.citation | Cuzzocrea, A., Akter, M. S., Shahriar, H., & Garcia Bringas, P. (2025). Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach. Future Internet, 17(2). https://doi.org/10.3390/FI17020084 | |
dc.identifier.doi | 10.3390/FI17020084 | |
dc.identifier.eissn | 1999-5903 | |
dc.identifier.uri | http://hdl.handle.net/20.500.14454/2534 | |
dc.language.iso | eng | |
dc.publisher | Multidisciplinary Digital Publishing Institute (MDPI) | |
dc.rights | © 2025 by the authors | |
dc.subject.other | Cyber-bullying | |
dc.subject.other | Deep learning | |
dc.subject.other | Natural language processing | |
dc.subject.other | Neural networks | |
dc.title | Cyberbullying detection, prevention, and analysis on social media via trustable LSTM-autoencoder networks over synthetic data: The TLA-NET approach | en |
dc.type | journal article | |
dcterms.accessRights | open access | |
oaire.citation.issue | 2 | |
oaire.citation.title | Future Internet | |
oaire.citation.volume | 17 | |
oaire.licenseCondition | https://creativecommons.org/licenses/by/4.0/ | |
oaire.version | VoR |
Files
Original bundle
- Name:
- cuzzocrea_cyberbullying_2025.pdf
- Size:
- 2.61 MB
- Format:
- Adobe Portable Document Format