Features Extraction Based on Probability Weighting for Fake News Classification on Social Media

Original Research Article | Sep 16, 2022 | Vol. 23 No. 2 (2023) | DOI: 10.55003/cast.2022.02.23.014

Abstract

Fake news is a major problem worldwide, especially on social media. Most people spend a great deal of time on social media every day, and users can easily receive fake news without realizing it. For this reason, we developed a machine learning tool for detecting fake news that uses several algorithms: Decision Tree, K-Nearest Neighbor, and Naïve Bayes. In the baseline experiments, a single machine learning technique was selected to classify the data by finding the model set; the performance of that set describes the model's classification and the inconsistency of the solution at each iteration. This study proposes a model that uses probability weighting in the feature extraction process for data classification. The idea is to enhance the features with probability weights that converge toward the class labels. The work was implemented on top of traditional Count Vectorizer and TF-IDF Vectorizer features for sentiment analysis, combined with the probability weighting features for fake news articles. The experimental results show that the proposed model with probability weighting features achieved the best accuracy, and they reveal the impact of these features on the classifier models. The probability weighting features improved the overall performance of Decision Tree, K-Nearest Neighbor, and Naïve Bayes on various datasets. The proposed model also achieved the highest precision, recall, F1-measure, AUC, and accuracy, both overall and within each class.
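The abstract does not spell out how the probability weighting features are computed, so the following is only a minimal sketch of one plausible reading: bag-of-words counts are extended with class probabilities estimated by a small multinomial Naïve Bayes model. The function names, the toy headlines, and the use of Laplace smoothing are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    # Whitespace tokenizer; a real pipeline would also strip punctuation.
    return text.lower().split()


def fit_naive_bayes(docs, labels):
    """Fit a multinomial Naive Bayes model with Laplace (add-one) smoothing."""
    vocab = sorted({tok for doc in docs for tok in tokenize(doc)})
    classes = sorted(set(labels))
    class_counts = Counter(labels)
    token_counts = {c: Counter() for c in classes}
    for doc, label in zip(docs, labels):
        token_counts[label].update(tokenize(doc))
    log_prior = {c: math.log(class_counts[c] / len(docs)) for c in classes}
    log_likelihood = {}
    for c in classes:
        total = sum(token_counts[c].values()) + len(vocab)
        log_likelihood[c] = {
            t: math.log((token_counts[c][t] + 1) / total) for t in vocab
        }
    return vocab, classes, log_prior, log_likelihood


def class_probabilities(doc, model):
    """Return P(class | doc) for each class, in sorted class order."""
    _, classes, log_prior, log_likelihood = model
    scores = []
    for c in classes:
        score = log_prior[c]
        for tok in tokenize(doc):
            if tok in log_likelihood[c]:  # skip out-of-vocabulary tokens
                score += log_likelihood[c][tok]
        scores.append(score)
    top = max(scores)  # subtract the max for numerical stability
    exp_scores = [math.exp(s - top) for s in scores]
    total = sum(exp_scores)
    return [e / total for e in exp_scores]


def extract_features(doc, model):
    """Count vector over the vocabulary, extended with class probabilities."""
    vocab = model[0]
    counts = Counter(tokenize(doc))
    count_vector = [counts[t] for t in vocab]
    return count_vector + class_probabilities(doc, model)


# Hypothetical toy headlines, invented for this sketch.
train_docs = [
    "shocking miracle cure doctors hate",
    "celebrity secret shocking scandal exposed",
    "council approves annual city budget",
    "university publishes annual research report",
]
train_labels = ["fake", "fake", "real", "real"]

model = fit_naive_bayes(train_docs, train_labels)
features = extract_features("shocking secret cure", model)
print(len(features), features[-2:])  # last two entries: P(fake), P(real)
```

The extended vectors could then be passed to any downstream classifier, such as Decision Tree or K-Nearest Neighbor. If the probabilities for training documents come from a model fitted on those same documents, the labels leak into the features, so in practice they would more safely be estimated out-of-fold.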

Keywords: fake news; machine learning; sentiment analysis; probability weighting; data visualization

*Corresponding author: E-mail: wararat@kku.ac.th

Author Information

Sherly Valentina

Data Science and Artificial Intelligence Program, College of Computing, Khon Kaen University, Khon Kaen, Thailand

Wararat Songpan*

Data Science and Artificial Intelligence Program, College of Computing, Khon Kaen University, Khon Kaen, Thailand
