Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 30 tok/s
Gemini 2.5 Pro 46 tok/s Pro
GPT-5 Medium 18 tok/s Pro
GPT-5 High 12 tok/s Pro
GPT-4o 91 tok/s Pro
Kimi K2 184 tok/s Pro
GPT OSS 120B 462 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

ESG Accountability Made Easy: DocQA at Your Service (2311.18481v1)

Published 30 Nov 2023 in cs.CL, cs.AI, and cs.CV

Abstract: We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via LLMs). Users can explore over 10,000 Environmental, Social, and Governance (ESG) disclosure reports from over 2000 corporations. The Deep Search platform can be accessed at: https://ds4sd.github.io.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness. In 2022 IEEE 15th International Conference on Cloud Computing (CLOUD). IEEE.
  2. Auer, C.; et al. 2023. ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents. In Lecture Notes in Computer Science, 471–482. Springer Nature Switzerland.
  3. Bommasani, R.; et al. 2022. On the Opportunities and Risks of Foundation Models. arXiv:2108.07258.
  4. Breuel, T. M. 2002. Two Geometric Algorithms for Layout Analysis. In Lopresti, D.; Hu, J.; and Kashi, R., eds., Document Analysis Systems V, 188–199. Berlin, Heidelberg: Springer Berlin Heidelberg. ISBN 978-3-540-45869-2.
  5. Brown, T. B.; et al. 2020. Language Models are Few-Shot Learners. arXiv:2005.14165.
  6. Geometric Layout Analysis Techniques for Document Image Understanding: a Review. Technical Report TR9703-09, ITC-irst, Via Sommarive 18, I-38050 Povo, Trento, Italy.
  7. Mining company sustainability reports to aid financial decision-making. In The AAAI-20 Workshop on Knowledge Discovery from Unstructured Data in Financial Services.
  8. Guo, T.; et al. 2020. ESG2Risk: A Deep Learning Framework from ESG News to Stock Volatility Prediction. arXiv:2005.02527.
  9. Huang, Y.; et al. 2022. LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking. Proceedings of the 30th ACM International Conference on Multimedia.
  10. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Advances in Neural Information Processing Systems, volume 33, 9459–9474. Curran Associates, Inc.
  11. Li, J.; et al. 2022. DiT: Self-supervised Pre-training for Document Image Transformer. Proceedings of the 30th ACM International Conference on Multimedia.
  12. DocBank: A Benchmark Dataset for Document Layout Analysis. In Scott, D.; Bel, N.; and Zong, C., eds., Proceedings of the 28th International Conference on Computational Linguistics, 949–960. Barcelona, Spain (Online): International Committee on Computational Linguistics.
  13. Robust PDF Document Conversion using Recurrent Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 35(17): 15137–15145. Number: 17.
  14. Analyzing Sustainability Reports Using Natural Language Processing. In NeurIPS 2020 Workshop on Tackling Climate Change with Machine Learning.
  15. Optimized Table Tokenization for Table Structure Recognition. In Document Analysis and Recognition - ICDAR 2023: 17th International Conference, San José, CA, USA, August 21–26, 2023, Proceedings, Part II, 37–50. Berlin, Heidelberg: Springer-Verlag. ISBN 978-3-031-41678-1.
  16. TableFormer: Table Structure Understanding with Transformers. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 4604–4613. Los Alamitos, CA, USA: IEEE Computer Society.
  17. Pfitzmann, B.; et al. 2022. DocLayNet: A Large Human-Annotated Dataset for Document-Layout Segmentation.
  18. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research, 21(140): 1–67.
  19. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Inui, K.; Jiang, J.; Ng, V.; and Wan, X., eds., Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3982–3992. Hong Kong, China: Association for Computational Linguistics.
  20. Staar, P. W. J.; et al. 2018. Corpus Conversion Service. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM.
  21. UL2: Unifying Language Learning Paradigms. In The Eleventh International Conference on Learning Representations.
  22. Touvron, H.; et al. 2023. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288.
  23. ClimateBERT: A Pretrained Language Model for Climate-Related Text. In Proceedings of AAAI 2022 Fall Symposium: The Role of AI in Responding to Climate Challenges.
  24. Emergent Abilities of Large Language Models. Transactions on Machine Learning Research. Survey Certification.
  25. Zhang, M.; et al. 2023. WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents. ArXiv, abs/2305.06553.
  26. Zhong, X.; et al. 2019. PubLayNet: Largest Dataset Ever for Document Layout Analysis. In 2019 International Conference on Document Analysis and Recognition (ICDAR), 1015–1022.
Citations (2)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-Up Questions

We haven't generated follow-up questions for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com