Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
86 tokens/sec
GPT-4o
11 tokens/sec
Gemini 2.5 Pro Pro
53 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
3 tokens/sec
DeepSeek R1 via Azure Pro
33 tokens/sec
2000 character limit reached

A Multi-Embedding Convergence Network on Siamese Architecture for Fake Reviews (2401.05995v1)

Published 11 Jan 2024 in cs.MM

Abstract: In this new digital era, accessibility to real-world events is moving towards web-based modules. This is mostly visible on e-commerce websites where there is limited availability of physical verification. With this unforeseen development, we depend on the verification in the virtual world to influence our decisions. One of the decision making process is deeply based on review reading. Reviews play an important part in this transactional process. And seeking a real review can be very tenuous work for the user. On the other hand, fake review heavily impacts these transaction records of a product. The article presents an implementation of a Siamese network for detecting fake reviews. The fake reviews dataset, consisting of 40K reviews, preprocessed with different techniques. The cleaned data is passed through embeddings generated by MiniLM BERT for contextual relationship and Word2Vec for semantic relationship to form vectors. Further, the embeddings are trained in a Siamese network with LSTM layers connected to fuzzy logic for decision-making. The results show that fake reviews can be detected with high accuracy on a siamese network for prediction and verification.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Opinion mining and sentiment analysis. Foundations and Trends® in information retrieval, 2(1–2):1–135, 2008.
  2. Impact of online consumer reviews on sales: The moderating role of product and consumer characteristics. Journal of marketing, 74(2):133–148, 2010.
  3. Fake reviews detection: A survey. IEEE Access, 9:65771–65802, 2021. doi:10.1109/ACCESS.2021.3075573.
  4. Kurt Knutsson. amazon-industry-giants-team-up-battle-fake-reviews. https://cyberguy.com/scams/amazon-industry-giants-team-up-battle-fake-reviews/. Accessed: September 17, 2023.
  5. Data analytics for the identification of fake reviews using supervised learning. Computers, Materials & Continua, 70(2):3189–3204, 2022.
  6. Intelligent fake reviews detection based on aspect extraction and analysis using deep learning. Neural Computing and Applications, 34(22):20213–20229, 2022.
  7. Haffaz Aladeen. Can machine learning algorithms really stop fake news in its tracks? 2023.
  8. Creating and detecting fake reviews of online products. Journal of Retailing and Consumer Services, 64:102771, 2022.
  9. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.
  10. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4171–4186, 2018.
  11. Long short-term memory. Neural Computation, 9(8):1735–1780, 1997. doi:10.1162/neco.1997.9.8.1735.
  12. Ludmila Kuncheva. Fuzzy classifier design, volume 49. Springer Science & Business Media, 2000.
  13. Natural language processing advancements by deep learning: A survey, 2021.
  14. A review of recurrent neural networks: Lstm cells and network architectures. Neural computation, 31(7):1235–1270, 2019.
  15. A survey on long short-term memory networks for time series prediction. Procedia CIRP, 99:650–655, 2021. ISSN 2212-8271. doi:https://doi.org/10.1016/j.procir.2021.03.088. URL https://www.sciencedirect.com/science/article/pii/S2212827121003796. 14th CIRP Conference on Intelligent Computation in Manufacturing Engineering, 15-17 July 2020.
  16. Empirical studies on the nlp techniques for source code data preprocessing. In Proceedings of the 2014 3rd international workshop on evidential assessment of software technologies, pages 32–39, 2014.
  17. Highly language-independent word lemmatization using a machine-learning classifier. Computación y Sistemas, 24(3):1353–1364, 2020.
  18. A rule based approach to word lemmatization. In Proceedings of IS, volume 3, pages 83–86, 2004.
  19. David A Hull. Stemming algorithms: A case study for detailed evaluation. Journal of the American Society for Information Science, 47(1):70–84, 1996.
  20. The importance of stop word removal on recall values in text categorization. In Proceedings of the International Joint Conference on Neural Networks, 2003., volume 3, pages 1661–1666. IEEE, 2003.
  21. Davide Chicco. Siamese neural networks: An overview. Artificial neural networks, pages 73–94, 2021.
  22. Systematic comparison of vectorization methods in classification context. Applied Sciences, 12(10):5119, 2022.
  23. The use of word2vec model in sentiment analysis: A survey. In Proceedings of the 2019 international conference on artificial intelligence, robotics and control, pages 39–43, 2019.
  24. Xin Rong. word2vec parameter learning explained. arXiv preprint arXiv:1411.2738, 2014.
  25. Contextual and non-contextual word embeddings: an in-depth linguistic investigation. In Proceedings of the 5th Workshop on Representation Learning for NLP, pages 110–119, 2020.
  26. Does bert make any sense? interpretable word sense disambiguation with contextualized embeddings. arXiv preprint arXiv:1909.10430, 2019.
  27. Small and practical bert models for sequence labeling. arXiv preprint arXiv:1909.00100, 2019.
  28. Minilm: Deep self-attention distillation for task-agnostic compression of pre-trained transformers. arXiv e-prints, pages arXiv–2002, 2020.
  29. Siamese multiplicative lstm for semantic text similarity. In Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence, pages 1–5, 2020.
  30. Are you convinced? choosing the more convincing evidence with a siamese network. arXiv preprint arXiv:1907.08971, 2019.
  31. Fuzzy time series prediction method based on fuzzy recurrent neural network. In Neural Information Processing: 13th International Conference, ICONIP 2006, Hong Kong, China, October 3-6, 2006. Proceedings, Part II 13, pages 860–869. Springer, 2006.
  32. Application of fuzzy logic and neural network in crop classification: a review. Aquatic Procedia, 4:1203–1210, 2015.
  33. Performance evaluation of fuzzy classifier systems for multidimensional pattern classification problems. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 29(5):601–618, 1999.
  34. Yelp dataset. https://www.yelp.com/dataset, 2022. Accessed: December 10, 2023.

Summary

We haven't generated a summary for this paper yet.