Skip to content
2000
Volume 19, Issue 8
  • ISSN: 1872-2121
  • E-ISSN: 2212-4047

Abstract

Advanced technologies on the internet create an environment for information exchange among communities. However, some individuals exploit these environments to spread false news. False News, or Fake News (FN), refers to misleading information deliberately crafted to harm the reputation of individuals, products, or services. Identifying FN is a challenging issue for the research community. Many researchers have proposed approaches for FN detection using Machine Learning (ML) and Natural Language Processing (NLP) techniques. In this patent article, we propose a combined approach for FN detection, leveraging both ML and NLP techniques. We first extract all terms from the dataset after applying appropriate preprocessing techniques. A Feature Selection Algorithm (FSA) is then employed to identify the most important features based on their scores. These selected features are used to represent the dataset documents as vectors. The term weight measure determines the significance of each term in the vector representation. These document vectors are combined with vector representations obtained through an NLP technique. Specifically, we use the Bidirectional Encoder Representations from Transformers (BERT) model to represent the document vectors. The BERT small case model is employed to generate features, which are then used to create the document vectors. The combined vector, comprising ML-based document vector representations and NLP-based vector representations, is fed into various ML algorithms. These algorithms are used to build a model for classification. Our combined approach for FN detection achieved the highest accuracy of 96.72% using the Random Forest algorithm, with document vectors that included content-based features of size 4000 concatenated with outputs from the 9th to 12th BERT encoder layers.

Loading

Article metrics loading...

/content/journals/eng/10.2174/0118722121300281240823174052
2024-10-02
2025-12-13
Loading full text...

Full text loading...

References

  1. TolmieP. ProcterR. RandallD.W. RouncefieldM. BurgerC. Wong Sak HoiG. ZubiagaA. LiakataM. Supporting the use of user generated content in journalistic practice.Proceedings of the 2017 chi conference on human factors in computing systems, Denver, Colorado, USA, 02 May 2017, pp. 3632–3644.10.1145/3025453.3025892
    [Google Scholar]
  2. LazerD.M.J. BaumM.A. BenklerY. BerinskyA.J. GreenhillK.M. MenczerF. MetzgerM.J. NyhanB. PennycookG. RothschildD. SchudsonM. SlomanS.A. SunsteinC.R. ThorsonE.A. WattsD.J. ZittrainJ.L. The science of fake news.Science201835963801094109610.1126/science.aao299829590025
    [Google Scholar]
  3. ZannettouS. SirivianosM. BlackburnJ. KourtellisN. The web of false information: Rumors, fake news, hoaxes, clickbait, and various other shenanigans.ACM J. Data Inf. Qual.201911313710.1145/3309699
    [Google Scholar]
  4. WaweruJ. “Understanding Fake News”. Article in International Journal of Scientific and Research Publications.IJSRP2019918505
    [Google Scholar]
  5. KoohikamaliM. SidorovaA. Information re-sharing on social network sites in the age of fake news.Inf. Sci.20172021523510.28945/3871
    [Google Scholar]
  6. ZechS.T. GabbayM. Social network analysis in the study of terrorism and insurgency: From organization to politics.Int. Stud. Rev.201618221424310.1093/isr/viv011
    [Google Scholar]
  7. ChettyN. AlathurS. Hate speech review in the context of online social networks.Aggress. Violent. Behav.20184010811810.1016/j.avb.2018.05.003
    [Google Scholar]
  8. Tirupathi KumarB. Vishnu VardhanB. A review on fake news spreaders detection.Gis Science Journal202185388402
    [Google Scholar]
  9. AlamS. RavshanbekovA. Sieving Fake News From Genuine: A Synopsis.arXiv preprint2019
    [Google Scholar]
  10. ManzoorS.I. SinglaJ. 2019Fake News Detection Using Machine Learning approaches: A systematic Review.In 2019 3rd International Conference on Trends in Electronics and InformaticsTirunelveli, India, 23-25 April 2019, pp. 230-23410.1109/ICOEI.2019.8862770
    [Google Scholar]
  11. TraylorT. StraubJ. SnellN. 2019Classifying fake news articles using natural language processing to identify in-article attribution as a supervised learning estimator.In 2019 IEEE 13th International Conference on Semantic Computing, Newport Beach, CA, USA, 30 January 2019 - 01 February 2019, pp. 445-449.10.1109/ICOSC.2019.8665593
    [Google Scholar]
  12. GelfertA. Fake news: A definition.Informal Log.20183818411710.22329/il.v38i1.5068
    [Google Scholar]
  13. KaurS. KumarP. KumaraguruP. Automating fake news detection system using multi-level voting model.Soft Comput.2019121
    [Google Scholar]
  14. WaikhomL. GoswamiR.S. Fake News Detection Using Machine Learning.SSRN20193462938
    [Google Scholar]
  15. Altunbey OzbayF. AlatasB. A Novel Approach for Detection of Fake News on Social Media Using Metaheuristic Optimization Algorithms.Elektron. Elektrotech.2019254626710.5755/j01.eie.25.4.23972
    [Google Scholar]
  16. FaustiniP. CovõesT.F. Fake News Detection Using One-Class Classification.In 2019 8th Brazilian Conference on Intelligent Systems, Salvador, Brazil, 15-18 October 2019, pp. 592-597.201910.1109/BRACIS.2019.00109
    [Google Scholar]
  17. AhmedH. TraoreI. SaadS. 2017Detection of online fake news using N- gram analysis and machine learning techniques.International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments127138SpringerCham10.1007/978‑3‑319‑69155‑8_9
    [Google Scholar]
  18. LiQ. HuQ. LuY. YangY. ChengJ. Multi-level word features based on CNN for fake news detection in cultural communication.Pers. Ubiquitous Comput.2019114
    [Google Scholar]
  19. GiachanouA. RossoP. CrestaniF. 2019Leveraging emotional signals for credibility detection.Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information RetrievalParis, France, 18 July 2019, 877–880.877880
    [Google Scholar]
  20. GranikM. MesyuraV. 2017Fake news detection using naive Bayes classifier.2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)900903IEEE.10.1109/UKRCON.2017.8100379
    [Google Scholar]
  21. BhattacharjeeS.D. TalukderA. BalantrapuB.V. 2017Active learning based news veracity detection with feature weighting and deep-shallow fusion.In 2017 IEEE International Conference on Big Data, Boston, MA, USA, 11-14 December 2017, pp. 556-565.10.1109/BigData.2017.8257971
    [Google Scholar]
  22. JanzeC. RisiusM. Automatic Detection of Fake News on Social Media Platforms.In PACIS2017261
    [Google Scholar]
  23. KimK.H. JeongC.S. Fake News Detection System using Article Abstraction.In 2019 16th International Joint Conference on Computer Science and Software Engineering, Chonburi, Thailand, 10-12 July 2019, pp. 209-212.10.1109/JCSSE.2019.8864154
    [Google Scholar]
  24. ZhouX. ZafaraniR. Network-based Fake News Detection.SIGKDD Explor.2019212486010.1145/3373464.3373473
    [Google Scholar]
  25. KumarS. SinghT.D. Fake news detection on Hindi news dataset.Glob. Transit. Proc.20223128929710.1016/j.gltp.2022.03.014
    [Google Scholar]
  26. KhanJunaed Younus KhondakerMd. Tawkat Islam AfrozSadia UddinGias IqbalAnindya A benchmark study of machine learning models for online fake news detectionMachine Learning with Applications24 March 2021
    [Google Scholar]
  27. RaiN. KumarD. KaushikN. RajC. AliA. Fake News Classification using transformer based enhanced LSTM and BERT.International Journal of Cognitive Computing in Engineering20223March9810510.1016/j.ijcce.2022.03.003
    [Google Scholar]
  28. NasirJamal Abdul KhanOsama Subhani VarlamisIraklis Fake news detection: A hybrid CNN-RNN based deep learning approachInternational Journal of Information Management Data Insights2021110.1016/j.jjimei.2020.100007
    [Google Scholar]
  29. ChauhanTavishee “Optimization and improvement of fake news detection using deep learning approaches for societal benefit”International Journal of Information Management Data Insights20211
    [Google Scholar]
  30. SinghM. Wasim BhattM. “Performance of bernoulli’s naive bayes classifier in the detection of fake news”.Mater. Today Proc.10.1016/j.matpr.2020.10.896
    [Google Scholar]
  31. Y-F.Huang P-H.Chen Fake News Detection Using an Ensemble Learning Model Based on Self-adaptive Harmony Search Algorithms, Expert Systems with Applications (2020)10.1016/j.eswa.2020.113584
    [Google Scholar]
  32. KaliyarR.K. GoswamiA. NarangP. SinhaS. FNDNet – A deep convolutional neural network for fake news detection.Cogn. Syst. Res.202061324410.1016/j.cogsys.2019.12.005
    [Google Scholar]
  33. ISOT LABAvailable at: https://www.uvic.ca/engineering/ece/isot/datasets/fake-news/index.php
  34. ZhaoZ. LiuH. Spectral feature selection for supervised and unsupervised learningProceedings of the 24th International Conference on Machine Learning20071151115710.1145/1273496.1273641
    [Google Scholar]
  35. LabaniM. MoradiP. AhmadizarF. JaliliM. A novel multivariate filter method for feature selection in text classification problems.Eng. Appl. Artif. Intell.201870253710.1016/j.engappai.2017.12.014
    [Google Scholar]
  36. ChenK. ZhangZ. LongJ. ZhangH. Turning from TF-IDF to TF-IGM for term weighting in text classification.Expert Syst. Appl.20166624526010.1016/j.eswa.2016.09.009
    [Google Scholar]
  37. B. Schölkopf and C. J. Burges, Advances in kernel methods: support vector learning. MIT press, 1999.
  38. HearstM.A. DumaisS.T. OsunaE. PlattJ. ScholkopfB. Support vector machines.IEEE Intell. Syst. Their Appl.1998134182810.1109/5254.708428
    [Google Scholar]
  39. ValyonJ. HorváthG. A weighted generalized ls-SVM.Period. Polytech. Electr. Eng.2003473-4229252
    [Google Scholar]
  40. QuinlanJ.R. Induction of decision trees.Mach. Learn.1986118110610.1007/BF00116251
    [Google Scholar]
  41. KassG.V. An exploratory technique for investigating large quantities of categorical data.Appl. Stat.198029211912710.2307/2986296
    [Google Scholar]
  42. BreimanL. FriedmanJ. StoneC.J. OlshenR.A. Classification and regression trees.CRC press1984
    [Google Scholar]
  43. QuinlanJ.R. C4. 5: programs for machine learning.Elsevier2014
    [Google Scholar]
  44. LamL. SuenS.Y. Application of majority voting to pattern recognition: an analysis of its behavior and performance.IEEE Trans. Syst. Man Cybern. A Syst. Hum.199727555356810.1109/3468.618255
    [Google Scholar]
  45. FarhangiV. MoradiM.J. DaneshvarK. HajilooH. Application of artificial intelligence in predicting the residual mechanical properties of fiber reinforced concrete (FRC) after high temperaturesConstruction and Building Materials411134609202410.1016/j.conbuildmat.2023.134609
    [Google Scholar]
  46. ChangJ.D.a. Open Sourcing BERT: State-of-the-Art Pretraining for Natural Language Processing.Google AI Blog2019
    [Google Scholar]
  47. DevlinJ. ChangM-W. LeeK. ToutanovaK. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.preprint arXiv2018
    [Google Scholar]
  48. KavuriK. KavithaM. A Word Embeddings based Approach for Author Profiling: Gender and Age Prediction.Int. J. Recent Innov. Trends Comput. Commun.2023117s23925010.17762/ijritcc.v11i7s.6996
    [Google Scholar]
  49. KahlootK.M. EklerP. Algorithmic splitting: A method for dataset preparation.IEEE Access2021912522912523710.1109/ACCESS.2021.3110745
    [Google Scholar]
/content/journals/eng/10.2174/0118722121300281240823174052
Loading
/content/journals/eng/10.2174/0118722121300281240823174052
Loading

Data & Media loading...


  • Article Type:
    Review Article
Keyword(s): BERT; encoder layers; feature selection technique; FN; FN detection; term weight measure
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test