A Comprehensive Survey on AI-based Methods for Patents (2404.08668v2)
Abstract: Recent advancements in AI and machine learning have demonstrated transformative capabilities across diverse domains. This progress extends to the field of patent analysis and innovation, where AI-based tools present opportunities to streamline and enhance important tasks in the patent cycle such as classification, retrieval, and valuation prediction. This not only accelerates the efficiency of patent researchers and applicants but also opens new avenues for technological innovation and discovery. Our survey provides a comprehensive summary of recent AI tools in patent analysis from more than 40 papers from 26 venues between 2017 and 2023. Unlike existing surveys, we include methods that work for patent image and text data. Furthermore, we introduce a novel taxonomy for the categorization based on the tasks in the patent life cycle as well as the specifics of the AI methods. This interdisciplinary survey aims to serve as a resource for researchers and practitioners who are working at the intersection of AI and patent analysis as well as the patent offices that are aiming to build efficient patent systems.
- Linguistically informed masking for representation learning in the patent domain. arXiv preprint arXiv:2106.05768, 2021.
- L. Aristodemou. Identifying valuable patents: A deep learning approach. PhD thesis, 2021.
- Scibert: A pretrained language model for scientific text. In EMNLP, 2019.
- Classifying patent applications with ensemble methods. ALTA Workshop, 2018.
- A deep learning based method for extracting semantic information from patent documents. Scientometrics, 125:289–312, 2020.
- Establish a patent risk prediction model for emerging technologies using deep learning and data augmentation. Advanced Engineering Informatics, 52:101509, 2022.
- Learning phrase representations using rnn encoder-decoder for statistical machine translation. ACL, 2014.
- Deep learning for patent landscaping using transformer and graph embedding. Technological Forecasting and Social Change, 175, 2022.
- P. Chung and S. Y. Sohn. Early detection of valuable patents using a deep learning model: Case of semiconductor industry. Technological Forecasting and Social Change, 158:120146, 2020.
- Arcface: Additive angular margin loss for deep face recognition. In CVPR, 2019.
- Bert: Pre-training of deep bidirectional transformers for language understanding. NAACL-HLT, 2019.
- Predicting patent quality based on machine learning approach. IEEE Trans Eng Manag, 2022.
- Eurostat, O. Oslo Manual: Guidelines for Collecting and Interpreting Innovation Data. OECD, Paris, 2005. A joint publication of OECD and Eurostat.
- Classification of visualization types and perspectives in patents. In TPDL, 2023.
- Unveiling the inventive process from patents by extracting problems, solutions and advantages with natural language processing. Expert Systems with Applications, 229:120499, 2023.
- A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks, 18(5-6):602–610, 2005.
- Automated patent classification using word embedding. In ICMLA. IEEE, 2017.
- Opennre: An open and extensible toolkit for neural relation extraction. In EMNLP, 2019.
- Patent image retrieval using cross-entropy-based metric learning. In IW-FCV, 2023.
- K. Higuchi and K. Yanai. Patent image retrieval using transformer-based deep metric learning. WPI, 74:102217, 2023.
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
- Deep learning, text, and patent valuation. Text, and Patent Valuation, 2020.
- Bidirectional lstm-crf models for sequence tagging. arXiv preprint arXiv:1508.01991, 2015.
- An explainable ai (xai) model for text-based patent novelty analysis. Expert Systems with Applications, 2023.
- Deriving design feature vectors for patent images using convolutional neural networks. Journal of Mechanical Design, 143(6):061405, 2021.
- Bag of tricks for efficient text classification. In EACL, 2017.
- An ensemble framework for patent classification. WPI, 75:102233, 2023.
- Automated single-label patent classification using ensemble classifiers. In ICMLC, 2022.
- Patent prior art search using deep learning language model. In IDEAS, 2020.
- X. Krant. Text-based Patent-Quality Prediction Using Multi-Section Attention. PhD thesis, 2023.
- Patents images retrieval and convolutional neural network training dataset quality improvement. In ITSMSSM, 2017.
- A survey on deep learning for patent analysis. WPI, 65:102035, 2021.
- Deeppatent: Large scale patent drawing recognition and retrieval. In WACV, 2022.
- Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- J.-S. Lee. Patent transformer: A framework for personalized patent claim generation. In CEUR Workshop Proceedings, volume 2598. CEUR-WS, 2020.
- J.-S. Lee. Evaluating generative patent language models. WPI, 72, 2023.
- J.-S. Lee and J. Hsiang. Patent claim generation by fine-tuning openai gpt-2. WPI, 62:101983, 2020.
- J.-S. Lee and J. Hsiang. Patent classification grawe-tuning bert language model. WPI, 61:101965, 2020.
- A deep learning-based early patent quality recognition model. In ICNC-FSKD, 2022.
- Patent quality valuation with deep learning models. In DASFAA, 2018.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
- Efficient estimation of word representations in vector space. In ICLR, 2013.
- Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
- A multimodal approach for semantic patent image retrieval. In PatentSemTech, 2021.
- Language models are unsupervised multitask learners. OpenAI blog, 1:9, 2019.
- N. Reimers and I. Gurevych. Sentence-bert: Sentence embeddings using siamese bert-networks. EMNLP, 2019.
- Patentmatch: A dataset for matching patent claims & prior art. In PatentSemTech@SIGIR, 2021.
- J. Risch and R. Krestel. Learning patent speak: Investigating domain-specific word embeddings. In ICDIM, 2018.
- J. Risch and R. Krestel. Domain-specific word embeddings for patent classification. Data Technologies and Applications, 53(1), 2019.
- Patentnet: multi-label classification of patent documents using deep learning based language understanding. Scientometrics, pages 1–25, 2022.
- Artificial intelligence for patent prior art searching. WPI, 2021.
- An lstm approach to patent classification based on fixed hierarchy vectors. In SDM, 2018.
- Enhancing patent retrieval using text and knowledge graph embeddings: a technical note. Journal of Engineering Design, 33:670–683, 2022.
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
- M. Sofean. Deep learning based pipeline with multichannel inputs for patent classification. WPI, 66:102060, 2021.
- Machine learning methods for results merging in patent retrieval. Data Technologies and Applications, 2023.
- Patent value analysis using deep learning models—the case of iot technology mining for the manufacturing industry. IEEE-TEM, 68(5):1334–1346, 2019.
- Xlnet: Generalized autoregressive pretraining for language understanding. NeurIPS, 2019.
- Identifying patent classification codes associated with specific search keywords using machine learning. WPI, 71:102153, 2022.
- Event-based dynamic graph representation learning for patent application trend prediction. IEEE TKDE, 2023.
- Homaira Huda Shomee (1 paper)
- Zhu Wang (72 papers)
- Sathya N. Ravi (21 papers)
- Sourav Medya (36 papers)