USING FEATURE ENGINEERING IN MACHINE LEARNING MODELS FOR FAKE NEWS DETECTION

Igor Olenych; M. Prytula; Ya. Boyko; O. Sinkevych; O. Khamar

doi:10.30970/eli.24.5

USING FEATURE ENGINEERING IN MACHINE LEARNING MODELS FOR FAKE NEWS DETECTION

Igor Olenych, M. Prytula, Ya. Boyko, O. Sinkevych, O. Khamar

Abstract

In this study, the analytical system for processing Ukrainian and Russian texts and automatically detecting fake news was developed. The effectiveness of text message classification using the naive Bayes classifier, support vector machine, k-nearest neighbors, random forest and logistic regression methods was studied. It has been established that adding to the feature vector the number of positive and negative words, the text tone, and the aggression presence makes it possible to increase the accuracy of detecting fake news for developed machine learning models. The methods of support vector machine and logistic regression demonstrate the highest effectiveness of text message classification.

Keywords: computer text analysis, fake detection, machine learning, feature engineering.

Full Text:

PDF

References

Zhang X., Ghorbani A.A. An overview of online fake news: Characterization, detection, and discussion // Inf. Process. Manag. – 2020. – Vol. 57, no. 2. – P. 1–26.
Aimeur E., Amri S., Brassard G. Fake news, disinformation and misinformation in social media: a review // Social Network Analysis and Mining. – 2023. – Vol. 13. – 30. https://doi.org/10.1007/s13278-023-01028-5.
Rubin V. On deception and deception detection: Content analysis of computer-mediated stated beliefs // Proceedings of the American Society for Information Science and Technology. - 2010. https://doi.org/10.1002/meet.14504701124
Zhou Z., Guan H., Bhat M.M., Hsu J. Fake News Detection via NLP is Vulnerable to Adversarial Attacks // 11^th International Conference on Agents and Artificial Intelligence. – 2019. https://doi.org/10.5220/0007566307940800
Villela H.F., Correa F., Ribeiro J.S. de A.N., Rabelo A., Carvalho D.B.F. Fake news detection: a systematic literature review of machine learning algorithms and datasets // Journal on Interactive Systems. - 2023. - Vol. 14. - P. 47 - 58. https://doi.org/10.5753/jis.2023.3020.
Khanam Z., Alwasel B.N., Sirafi H., Rashid M. Fake News Detection Using Machine Learning Approaches // IOP Conf. Series: Materials Science and Engineering. – 2021. – Vol. 1099. – 012040. https://doi.org/10.1088/1757-899X/1099/1/012040.
Umer M., Imtiaz Z., Ullah S., Mehmood A., Choi G.S., On B.W. Fake news stance detection using deep learning architecture (CNN-LSTM) // IEEE Access. – 2020. – Vol. 8. – P. 156695–156706. https://doi.org/10.1109/ACCESS.2020.3019735.
Zhang G., Giachanou A., Rosso P. SceneFND: Multimodal fake news detection by modelling scene context information // Journal of Information Science. – 2022. – P. 1–13. https://doi.org/10.1177/01655515221087683
Cao J., Qi P., Sheng Q., Yang T., Guo J., Li J. Exploring the role of visual content in fake news detection // In book: Disinformation, Misinformation, and Fake News in Social Media. – 2020. – P. 141–161. https://doi.org/10.1007/978-3-030-42699-6_8
Song C., Ning N., Zhang Y., Wu B. A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks // Information Processing and Management. – 2021. – Vol. 58. – P. 1–14. https://doi.org/10.1016/j.ipm.2020.102437
Prytula M., Olenych I. Detection of aggressive rhetoric in text using machine learning algorithms // Electronics and information technologies. – 2023. – Issue 22. – P. 34–45. https://doi.org/10.30970/eli.22.4.
Thelwall M., Buckley K., Paltoglou G., Kappas A., Cai D. Sentiment strength detection in short informal text // Journal of the American Society for Information Science and Technology. – 2010. – No. 61. – P. 2544–2558.
Robertson S. Understanding Inverse Document Frequency: On Theoretical Arguments for IDF // Journal of Documentation. – 2004. – Vol. 60, No. 5. – P. 503–520.
Ukrainian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/lang-uk/tone-dict-uk/blob/master/tone-dict-uk.tsv.
Ukrainian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/lang-uk/tone-dict-uk/blob/master/tone-dict-uk-manual.tsv.
Russian tonal dictionary [Electronic resource]. - Mode of access: https://github.com/dkulagin/kartaslov.
Vijayarani S., Nithya M.N. Efficient machine learning classifiers for automatic information classification // Int. J. Mod. Trends Eng. Res. – 2015. – Vol. 2. – P. 685–694.

DOI: http://dx.doi.org/10.30970/eli.24.5

Refbacks

There are currently no refbacks.

Username
Password
Remember me

Electronics and information technologies / Електроніка та інформаційні технології

USING FEATURE ENGINEERING IN MACHINE LEARNING MODELS FOR FAKE NEWS DETECTION

Abstract

Full Text:

References

Refbacks