Key Information Retrieval to Classify the Unstructured Data Content of Preferential Trade Agreements (2401.12520v1)
Abstract: With the rapid proliferation of textual data, predicting long texts has emerged as a significant challenge in the domain of natural language processing. Traditional text prediction methods encounter substantial difficulties when grappling with long texts, primarily due to the presence of redundant and irrelevant information, which impedes the model's capacity to capture pivotal insights from the text. To address this issue, we introduce a novel approach to long-text classification and prediction. Initially, we employ embedding techniques to condense the long texts, aiming to diminish the redundancy therein. Subsequently,the Bidirectional Encoder Representations from Transformers (BERT) embedding method is utilized for text classification training. Experimental outcomes indicate that our method realizes considerable performance enhancements in classifying long texts of Preferential Trade Agreements. Furthermore, the condensation of text through embedding methods not only augments prediction accuracy but also substantially reduces computational complexity. Overall, this paper presents a strategy for long-text prediction, offering a valuable reference for researchers and engineers in the natural language processing sphere.
- N. Limão. Preferential Trade Agreements. In Handbook of Commercial Policy, volume 1, pages 279–367. Elsevier.
- The design of international trade agreements: Introducing a new dataset. 9(3):353–375.
- The Evolution of Deep Trade Agreements. In Handbook of Deep Trade Agreements. The World Bank.
- NSF-Kellogg Institute. Database on Economic Integration Agreements.
- Connecting embeddings for knowledge graph entity typing. arXiv preprint arXiv:2007.10873, 2020.
- Knowledge graph entity typing via learning connecting embeddings. Knowledge-Based Systems, 196:105808, 2020.
- Deltadou: Expert-level doudizhu ai through self-play. In IJCAI, pages 1265–1271, 2019.
- Accel-gcn: High-performance gpu accelerator design for graph convolution networks. In 2023 IEEE/ACM International Conference On Computer Aided Design (ICCAD). IEEE, 2023.
- Juan Ramos et al. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, volume 242, pages 29–48. Citeseer, 2003.
- Bert: Pre-training of deep bidirectional transformers for language understanding. 2018.
- Accel-gcn: High-performance gpu accelerator design for graph convolution networks. arXiv preprint arXiv:2308.11825, 2023.
- Digital-assisted analog in-memory computing with rram devices. In 2023 International VLSI Symposium on Technology, Systems and Applications (VLSI-TSA/VLSI-DAT), pages 1–4. IEEE, 2023.
- Maxk-gnn: Towards theoretical speed limits for accelerating graph neural networks training. arXiv preprint arXiv:2312.08656, 2023.
- Muffin: A framework toward multi-dimension ai fairness by uniting off-the-shelf models. In 2023 60th ACM/IEEE Design Automation Conference (DAC), pages 1–6. IEEE, 2023.
- Is chatgpt good at search? investigating large language models as re-ranking agent. arXiv preprint arXiv:2304.09542, 2023.
- Advanced language model-driven verilog development: Enhancing power, performance, and area optimization in code synthesis. arXiv preprint arXiv:2312.01022, 2023.
- Switchtab: Switched autoencoders are effective tabular learners. arXiv preprint arXiv:2401.02013, 2024.
- Rrnet: Towards relu-reduced neural network for two-party computation based private inference. arXiv preprint arXiv:2302.02292, 2023.
- Aq2pnn: Enabling two-party privacy-preserving deep neural network inference with adaptive quantization. In Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, pages 628–640, 2023.
- Autorep: Automatic relu replacement for fast private network inference. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 5178–5188, 2023.
- Federated contrastive learning for dermatological disease diagnosis via on-device learning. In 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), pages 1–7. IEEE, 2021.
- Lingcn: Structural linearized graph convolutional network for homomorphically encrypted inference. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- Federated contrastive learning for volumetric medical image segmentation. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part III 24, pages 367–377. Springer, 2021.
- Pasnet: Polynomial architecture search framework for two-party computation-based secure neural network deployment. In 2023 60th ACM/IEEE Design Automation Conference (DAC), pages 1–6. IEEE, 2023.
- Jiahui Zhao (20 papers)
- Ziyi Meng (7 papers)
- Stepan Gordeev (2 papers)
- Zijie Pan (14 papers)
- Dongjin Song (42 papers)
- Sandro Steinbach (2 papers)
- Caiwen Ding (98 papers)