Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Key-phrase boosted unsupervised summary generation for FinTech organization (2310.10294v1)

Published 16 Oct 2023 in cs.CL and cs.AI

Abstract: With the recent advances in social media, the use of NLP techniques in social media data analysis has become an emerging research direction. Business organizations can particularly benefit from such an analysis of social media discourse, providing an external perspective on consumer behavior. Some of the NLP applications such as intent detection, sentiment classification, text summarization can help FinTech organizations to utilize the social media language data to find useful external insights and can be further utilized for downstream NLP tasks. Particularly, a summary which highlights the intents and sentiments of the users can be very useful for these organizations to get an external perspective. This external perspective can help organizations to better manage their products, offers, promotional campaigns, etc. However, certain challenges, such as a lack of labeled domain-specific datasets impede further exploration of these tasks in the FinTech domain. To overcome these challenges, we design an unsupervised phrase-based summary generation from social media data, using 'Action-Object' pairs (intent phrases). We evaluated the proposed method with other key-phrase based summary generation methods in the direction of contextual information of various Reddit discussion threads, available in the different summaries. We introduce certain "Context Metrics" such as the number of Unique words, Action-Object pairs, and Noun chunks to evaluate the contextual information retrieved from the source text in these phrase-based summaries. We demonstrate that our methods significantly outperform the baseline on these metrics, thus providing a qualitative and quantitative measure of their efficacy. Proposed framework has been leveraged as a web utility portal hosted within Amex.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Samir Abdaljalil and Houda Bouamor. 2021. An exploration of automatic text summarization of financial reports. In Proceedings of the Third Workshop on Financial Technology and Natural Language Processing. 1–7.
  2. The Sentiment Analysis of Fintech Users Using Support Vector Machine and Particle Swarm Optimization Method. In 2019 7th International Conference on Cyber and IT Service Management (CITSM), Vol. 7. IEEE, 1–5.
  3. Tadeusz Caliński and Jerzy Harabasz. 1974. A dendrite method for cluster analysis. Communications in Statistics-theory and Methods 3, 1 (1974), 1–27.
  4. YAKE! Keyword extraction from single documents using multiple local features. Information Sciences 509 (2020), 257–289.
  5. Yake! collection-independent automatic keyword extractor. In European Conference on Information Retrieval. Springer, 806–810.
  6. Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces. https://doi.org/10.48550/ARXIV.1805.10190
  7. David L Davies and Donald W Bouldin. 1979. A cluster separation measure. IEEE transactions on pattern analysis and machine intelligence 2 (1979), 224–227.
  8. Min-Yuh Day and Chia-Chou Lee. 2016. Deep learning for financial sentiment analysis on finance news providers. In 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, 1127–1134.
  9. ProtAugment: Intent detection meta-learning through unsupervised diverse paraphrasing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2454–2466.
  10. Carsten Eickhoff. 2018. Cognitive biases in crowdsourcing. In Proceedings of the eleventh ACM international conference on web search and data mining. 162–170.
  11. Vatcharaporn Esichaikul and Chawisa Phumdontree. 2018. Sentiment analysis of thai financial news. In Proceedings of the 2018 2nd International Conference on Software and e-Business. 39–43.
  12. Enriched Pre-trained Transformers for Joint Slot Filling and Intent Detection. https://doi.org/10.48550/ARXIV.2004.14848
  13. The ATIS Spoken Language Systems Pilot Corpus. In Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24-27,1990. https://aclanthology.org/H90-1021
  14. Matthew Honnibal and Mark Johnson. 2015. An improved non-monotonic transition system for dependency parsing. In Proceedings of the 2015 conference on empirical methods in natural language processing. 1373–1378.
  15. Using sentiment analysis to predict interday Bitcoin price movements. The Journal of Risk Finance (2018).
  16. George Karypis. 2002. CLUTO-a clustering toolkit. Technical Report. MINNESOTA UNIV MINNEAPOLIS DEPT OF COMPUTER SCIENCE.
  17. Moreno La Quatra and Luca Cagliero. 2020. End-to-end training for financial report summarization. In Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation. 118–123.
  18. Open Intent Discovery through Unsupervised Semantic Clustering and Dependency Parsing. arXiv preprint arXiv:2104.12114 (2021).
  19. Unsupervised dialogue intent detection via hierarchical topic model. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019). 932–938.
  20. Peter J Rousseeuw. 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics 20 (1987), 53–65.
  21. Stock Market Sentiment Classification from FinTech News. In 2019 17th International Conference on ICT and Knowledge Engineering (ICT&KE). IEEE, 1–4.
  22. Generalized zero-shot intent detection via commonsense knowledge. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1925–1929.
  23. A survey of joint intent detection and slot filling models in natural language understanding. Comput. Surveys 55, 8 (2022), 1–38.

Summary

We haven't generated a summary for this paper yet.