Multi-Label Zero-Shot Product Attribute-Value Extraction (2402.08802v1)
Abstract: E-commerce platforms should provide detailed product descriptions (attribute values) for effective product search and recommendation. However, attribute value information is typically not available for new products. To predict unseen attribute values, large quantities of labeled training data are needed to train a traditional supervised learning model. Typically, it is difficult, time-consuming, and costly to manually label large quantities of new product profiles. In this paper, we propose a novel method to efficiently and effectively extract unseen attribute values from new products in the absence of labeled data (zero-shot setting). We propose HyperPAVE, a multi-label zero-shot attribute value extraction model that leverages inductive inference in heterogeneous hypergraphs. In particular, our proposed technique constructs heterogeneous hypergraphs to capture complex higher-order relations (i.e. user behavior information) to learn more accurate feature representations for graph nodes. Furthermore, our proposed HyperPAVE model uses an inductive link prediction mechanism to infer future connections between unseen nodes. This enables HyperPAVE to identify new attribute values without the need for labeled training data. We conduct extensive experiments with ablation studies on different categories of the MAVE dataset. The results demonstrate that our proposed HyperPAVE model significantly outperforms existing classification-based, generation-based LLMs for attribute value extraction in the zero-shot setting.
- Chih-Yao Chen and Cheng-Te Li. 2021. ZS-BERT: Towards Zero-Shot Relation Extraction with Attribute Representation Learning. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 3470–3479. https://doi.org/10.18653/v1/2021.naacl-main.272
- Knowledge-aware Zero-Shot Learning: Survey and Perspective. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, Zhi-Hua Zhou (Ed.). International Joint Conferences on Artificial Intelligence Organization, 4366–4373. https://doi.org/10.24963/ijcai.2021/597 Survey Track.
- Jiayi Chen and Aidong Zhang. 2020. Hgmf: heterogeneous graph-based fusion for multimodal data with incompleteness. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining. 1295–1305.
- Heterogeneous Graph Contrastive Learning for Recommendation. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining (Singapore, Singapore) (WSDM ’23). Association for Computing Machinery, New York, NY, USA, 544–552. https://doi.org/10.1145/3539597.3570484
- Extreme Multi-Label Classification with Label Masking for Product Attribute Value Extraction. In Proceedings of The Fifth Workshop on e-Commerce and NLP (ECNLP 5). 134–140.
- Extreme Multi-Label Classification with Label Masking for Product Attribute Value Extraction. In Proceedings of the Fifth Workshop on e-Commerce and NLP (ECNLP 5). Association for Computational Linguistics, Dublin, Ireland, 134–140. https://doi.org/10.18653/v1/2022.ecnlp-1.16
- Integrating Topology beyond Descriptions for Zero-shot Learning. Pattern Recognition (2023), 109738.
- IHGNN: Interactive Hypergraph Neural Network for Personalized Product Search. In Proceedings of the ACM Web Conference 2022. 256–265.
- RelationPrompt: Leveraging Prompts to Generate Synthetic Data for Zero-Shot Relation Triplet Extraction. In Findings of the Association for Computational Linguistics: ACL 2022. Association for Computational Linguistics, Dublin, Ireland, 45–57. https://doi.org/10.18653/v1/2022.findings-acl.5
- PV2TEA: Patching Visual Modality to Textual-Established Information Extraction. arXiv:2306.01016 [cs.CL]
- AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction. In 2022 IEEE International Conference on Big Data (Big Data). IEEE, 1816–1821.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186. https://doi.org/10.18653/v1/N19-1423
- Be More with Less: Hypergraph Attention Networks for Inductive Text Classification. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 4927–4936. https://doi.org/10.18653/v1/2020.emnlp-main.399
- Heterogeneous hypergraph variational autoencoder for link prediction. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 8 (2021), 4125–4138.
- Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training. arXiv preprint arXiv:2305.07633 (2023).
- HGNN+: General Hypergraph Neural Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 3 (2023), 3181–3199. https://doi.org/10.1109/TPAMI.2022.3182052
- Disentangled ontology embedding for zero-shot learning. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 443–453.
- Benchmarking knowledge-driven zero-shot learning. Journal of Web Semantics 75 (2023), 100757.
- Text Mining for Product Attribute Extraction. SIGKDD Explor. Newsl. 8, 1 (jun 2006), 41–48. https://doi.org/10.1145/1147234.1147241
- D-Extract: Extracting Dimensional Attributes From Product Images. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 3641–3649.
- Knowledge-Enhanced Multi-Label Few-Shot Product Attribute-Value Extraction. arXiv preprint arXiv:2308.08413 (2023).
- Jiaying Gong and Hoda Eldardiry. 2021. Zero-Shot Relation Classification from Side Information. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management (Virtual Event, Queensland, Australia) (CIKM ’21). Association for Computing Machinery, New York, NY, USA, 576–585. https://doi.org/10.1145/3459637.3482403
- Jiaying Gong and Hoda Eldardiry. 2023. Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation. arXiv:2112.04539 [cs.CL]
- Explainable Zero-Shot Learning via Attentive Graph Convolutional Network and Knowledge Graphs. Semant. Web 12, 5 (jan 2021), 741–765. https://doi.org/10.3233/SW-210435
- Pietro Hiram Guzzi and Marinka Zitnik. 2022. Editorial deep learning and graph embeddings for network biology. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19, 2 (2022), 653–654.
- Intra and Inter Domain HyperGraph Convolutional Network for Cross-Domain Recommendation. In Proceedings of the ACM Web Conference 2023. 449–459.
- Heterogeneous graph transformer. In Proceedings of the web conference 2020. 2704–2710.
- ConTextING: Granting Document-Wise Contextual Embeddings to Graph Neural Networks for Inductive Text Classification. In Proceedings of the 29th International Conference on Computational Linguistics. 1163–1168.
- Learning Cross-Task Attribute - Attribute Similarity for Multi-task Attribute-Value Extraction. In Proceedings of the 4th Workshop on e-Commerce and NLP. Association for Computational Linguistics, Online, 79–87. https://doi.org/10.18653/v1/2021.ecnlp-1.10
- Heterogeneous Hypergraph Neural Network for Social Recommendation using Attention Network. ACM Transactions on Recommender Systems (2023).
- Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).
- Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461 (2019).
- Hypergraph transformer neural networks. ACM Transactions on Knowledge Discovery from Data 17, 5 (2023), 1–22.
- HMGCL: Heterogeneous multigraph contrastive learning for LBSN friend recommendation. World Wide Web (2022), 1–24.
- Heterogeneous Hypergraph Neural Network for Friend Recommendation with Human Mobility. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 4209–4213.
- PAM: understanding product images in cross product category attribute extraction. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3262–3270.
- Pre-training to Match for Unified Low-shot Relation Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 5785–5795. https://doi.org/10.18653/v1/2022.acl-long.397
- JDsearch: A Personalized Product Search Dataset with Real Queries and Full Interactions. In Proceedings of the SIGIR 2023. ACM. https://doi.org/10.1145/3539618.3591900
- Hypergraph Variational Autoencoder for Multimodal Semi-supervised Representation Learning. In Artificial Neural Networks and Machine Learning–ICANN 2022: 31st International Conference on Artificial Neural Networks, Bristol, UK, September 6–9, 2022, Proceedings; Part IV. Springer, 395–406.
- Meta-HGT: Metapath-aware HyperGraph Transformer for heterogeneous information network embedding. Neural Networks 157 (2023), 65–76.
- Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization. arXiv:2207.07278 [cs.CV]
- Multimodal Pre-Training with Self-Distillation for Product Understanding in E-Commerce. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 1039–1047.
- Transductive Relation-Propagation Network for Few-shot Learning.. In IJCAI, Vol. 20. 804–810.
- Challenges and limitations of biological network analysis. BioTech 11, 3 (2022), 24.
- Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014).
- Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). 188–197.
- A Review of Generalized Zero-Shot Learning Methods. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 4 (2023), 4051–4070. https://doi.org/10.1109/TPAMI.2022.3191696
- Duangmanee (Pew) Putthividhya and Junling Hu. 2011. Bootstrapped Named Entity Recognition for Product Attribute Extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (Edinburgh, United Kingdom) (EMNLP ’11). Association for Computational Linguistics, USA, 1557–1567.
- Language models are unsupervised multitask learners. OpenAI blog 1, 8 (2019), 9.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research 21, 1 (2020), 5485–5551.
- Hetegcn: heterogeneous graph convolutional networks for text classification. In Proceedings of the 14th ACM international conference on web search and data mining. 860–868.
- Accurate Product Attribute Extraction on the Field. In 2019 IEEE 35th International Conference on Data Engineering (ICDE). 1862–1873. https://doi.org/10.1109/ICDE.2019.00202
- Attribute value generation from product title using language models. In Proceedings of The 4th Workshop on e-Commerce and NLP. 13–17.
- Exploring Generative Models for Joint Attribute Value Extraction from Product Titles. arXiv preprint arXiv:2208.07130 (2022).
- Label Verbalization and Entailment for Effective Zero and Few-Shot Relation Extraction. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 1199–1212. https://doi.org/10.18653/v1/2021.emnlp-main.92
- The graph neural network model. IEEE transactions on neural networks 20, 1 (2008), 61–80.
- Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 227–234.
- A Unified Generative Approach to Product Attribute-Value Identification. arXiv preprint arXiv:2306.05605 (2023).
- Heterogeneous hypergraph embedding for graph classification. In Proceedings of the 14th ACM international conference on web search and data mining. 725–733.
- MPKGAC: Multimodal Product Attribute Completion in E-commerce. In Companion Proceedings of the ACM Web Conference 2023. 336–340.
- Learning to extract attribute value from product via question answering: A multi-task approach. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 47–55.
- SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2022. 263–276.
- Heterogeneous graph attention network. In The world wide web conference. 2022–2032.
- Dual subgraph-based graph neural network for friendship prediction in location-based social networks. ACM Transactions on Knowledge Discovery from Data 17, 3 (2023), 1–28.
- Hypergraphs: Concepts, Applications and Analysis. In 2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP). 1–6. https://doi.org/10.1109/PAAP56126.2022.10010428
- Scalable Attribute-Value Extraction from Semi-Structured Text. In ICDM Workshop on Large-scale Data Mining: Theory and Applications. http://www.computer.org/portal/web/csdl/doi/10.1109/ICDMW.2009.81
- Hypergraph collaborative network on vertices and hyperedges. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 3 (2022), 3245–3258.
- Inductive representation learning on temporal graphs. arXiv preprint arXiv:2002.07962 (2020).
- Scaling up open tagging from tens to thousands: Comprehension empowered attribute value extraction from product title. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 5214–5223.
- Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach. arXiv preprint arXiv:2305.18350 (2023).
- Correlative Preference Transfer with Hierarchical Hypergraph Network for Multi-Domain Recommendation. In Proceedings of the ACM Web Conference 2023. 983–991.
- Hypergcn: A new method for training graph convolutional networks on hypergraphs. Advances in neural information processing systems 32 (2019).
- AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, Online, 4694–4705. https://doi.org/10.18653/v1/2021.acl-long.362
- MAVE: A product dataset for multi-source attribute value extraction. In Proceedings of the fifteenth ACM international conference on web search and data mining. 1256–1265.
- Learnable hypergraph laplacian for hypergraph learning. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4503–4507.
- Hypergraph and uncertain hypergraph representation learning theory and methods. Mathematics 10, 11 (2022), 1921.
- OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision. In Proceedings of the ACM Web Conference 2022 (Virtual Event, Lyon, France) (WWW ’22). Association for Computing Machinery, New York, NY, USA, 3153–3161. https://doi.org/10.1145/3485447.3512035
- Pay attention to implicit attribute values: a multi-modal generative framework for AVE task. In Findings of the Association for Computational Linguistics: ACL 2023. 13139–13151.
- OpenTag: Open Attribute Value Extraction from Product Profiles. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (London, United Kingdom) (KDD ’18). Association for Computing Machinery, New York, NY, USA, 1049–1058. https://doi.org/10.1145/3219819.3219839
- Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 2129–2139. https://doi.org/10.18653/v1/2020.emnlp-main.166
- Jiaying Gong (8 papers)
- Hoda Eldardiry (31 papers)