USING FEATURE ENGINEERING IN MACHINE LEARNING MODELS FOR FAKE NEWS DETECTION
Abstract
In this study, the analytical system for processing Ukrainian and Russian texts and automatically detecting fake news was developed. The effectiveness of text message classification using the naive Bayes classifier, support vector machine, k-nearest neighbors, random forest and logistic regression methods was studied. It has been established that adding to the feature vector the number of positive and negative words, the text tone, and the aggression presence makes it possible to increase the accuracy of detecting fake news for developed machine learning models. The methods of support vector machine and logistic regression demonstrate the highest effectiveness of text message classification.
Keywords: computer text analysis, fake detection, machine learning, feature engineering.
Full Text:
PDFReferences
- Zhang X., Ghorbani A.A. An overview of online fake news: Characterization, detection, and discussion // Inf. Process. Manag. – 2020. – Vol. 57, no. 2. – P. 1–26.
- Aimeur E., Amri S., Brassard G. Fake news, disinformation and misinformation in social media: a review // Social Network Analysis and Mining. – 2023. – Vol. 13. – 30. https://doi.org/10.1007/s13278-023-01028-5.
- Rubin V. On deception and deception detection: Content analysis of computer-mediated stated beliefs // Proceedings of the American Society for Information Science and Technology. - 2010. https://doi.org/10.1002/meet.14504701124
- Zhou Z., Guan H., Bhat M.M., Hsu J. Fake News Detection via NLP is Vulnerable to Adversarial Attacks // 11th International Conference on Agents and Artificial Intelligence. – 2019. https://doi.org/10.5220/0007566307940800
- Villela H.F., Correa F., Ribeiro J.S. de A.N., Rabelo A., Carvalho D.B.F. Fake news detection: a systematic literature review of machine learning algorithms and datasets // Journal on Interactive Systems. - 2023. - Vol. 14. - P. 47 - 58. https://doi.org/10.5753/jis.2023.3020.
- Khanam Z., Alwasel B.N., Sirafi H., Rashid M. Fake News Detection Using Machine Learning Approaches // IOP Conf. Series: Materials Science and Engineering. – 2021. – Vol. 1099. – 012040. https://doi.org/10.1088/1757-899X/1099/1/012040.
- Umer M., Imtiaz Z., Ullah S., Mehmood A., Choi G.S., On B.W. Fake news stance detection using deep learning architecture (CNN-LSTM) // IEEE Access. – 2020. – Vol. 8. – P. 156695–156706. https://doi.org/10.1109/ACCESS.2020.3019735.
- Zhang G., Giachanou A., Rosso P. SceneFND: Multimodal fake news detection by modelling scene context information // Journal of Information Science. – 2022. – P. 1–13. https://doi.org/10.1177/01655515221087683
- Cao J., Qi P., Sheng Q., Yang T., Guo J., Li J. Exploring the role of visual content in fake news detection // In book: Disinformation, Misinformation, and Fake News in Social Media. – 2020. – P. 141–161. https://doi.org/10.1007/978-3-030-42699-6_8
- Song C., Ning N., Zhang Y., Wu B. A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks // Information Processing and Management. – 2021. – Vol. 58. – P. 1–14. https://doi.org/10.1016/j.ipm.2020.102437
- Prytula M., Olenych I. Detection of aggressive rhetoric in text using machine learning algorithms // Electronics and information technologies. – 2023. – Issue 22. – P. 34–45. https://doi.org/10.30970/eli.22.4.
- Thelwall M., Buckley K., Paltoglou G., Kappas A., Cai D. Sentiment strength detection in short informal text // Journal of the American Society for Information Science and Technology. – 2010. – No. 61. – P. 2544–2558.
- Robertson S. Understanding Inverse Document Frequency: On Theoretical Arguments for IDF // Journal of Documentation. – 2004. – Vol. 60, No. 5. – P. 503–520.
- Ukrainian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/lang-uk/tone-dict-uk/blob/master/tone-dict-uk.tsv.
- Ukrainian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/lang-uk/tone-dict-uk/blob/master/tone-dict-uk-manual.tsv.
- Russian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/dkulagin/kartaslov.
- Vijayarani S., Nithya M.N. Efficient machine learning classifiers for automatic information classification // Int. J. Mod. Trends Eng. Res. – 2015. – Vol. 2. – P. 685–694.
DOI: http://dx.doi.org/10.30970/eli.24.5
Refbacks
- There are currently no refbacks.