Making Large Language Models Better Knowledge Miners for Online Marketing with Progressive Prompting Augmentation (2312.05276v2)
Abstract: The rapid development of the mobile economy has fueled online marketing campaigns, whose success hinges on efficiently matching user preferences with suitable campaigns. A well-established Marketing-oriented Knowledge Graph (MoKG) can serve as the critical "bridge" for preference propagation. In this paper, we carefully prompt an LLM with domain-level knowledge so that it acts as a better knowledge miner for MoKG construction. This is non-trivial and faces several issues in real-world marketing scenarios: the uncontrollable relation generation of LLMs, the insufficient prompting ability of a single prompt, and the unaffordable deployment cost of LLMs. To this end, we propose PAIR, a novel Progressive prompting Augmented mIning fRamework for harvesting marketing-oriented knowledge graphs with LLMs. In particular, we reduce pure relation generation to an LLM-based adaptive relation filtering process through a knowledge-empowered prompting technique. Next, we steer LLMs toward entity expansion with progressive prompting augmentation, followed by a reliable aggregation that considers both self-consistency and semantic relatedness. For online serving, we specialize a small, white-box PAIR (i.e., LightPAIR), which is fine-tuned on a high-quality corpus provided by a strong teacher LLM. Extensive experiments and practical applications in audience targeting verify the effectiveness of the proposed (Light)PAIR.
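The aggregation step mentioned in the abstract combines self-consistency with semantic relatedness, but the abstract does not give the scoring rule. The toy sketch below is one possible reading, not the authors' method: it assumes self-consistency is the fraction of prompts that return a candidate entity, uses token-overlap (Jaccard) similarity as a stand-in for a real semantic relatedness model, and mixes the two with a hypothetical weight `alpha`. The names `aggregate`, `jaccard`, `alpha`, and `top_k` are all illustrative.

```python
from collections import Counter


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity; a placeholder for a real relatedness model."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def aggregate(source, candidates_per_prompt, alpha=0.5, top_k=3):
    """Rank candidate entities gathered from several prompts.

    source: the entity being expanded.
    candidates_per_prompt: one list of candidate entities per prompt.
    alpha: assumed weight on self-consistency (1 - alpha goes to relatedness).
    """
    n_prompts = len(candidates_per_prompt)
    votes = Counter()
    for candidates in candidates_per_prompt:
        votes.update(set(candidates))  # count each candidate once per prompt

    scored = []
    for candidate, count in votes.items():
        consistency = count / n_prompts          # agreement across prompts
        relatedness = jaccard(source, candidate)  # closeness to the source
        score = alpha * consistency + (1 - alpha) * relatedness
        scored.append((score, candidate))

    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(candidate, round(score, 3)) for score, candidate in scored[:top_k]]


if __name__ == "__main__":
    outputs = [
        ["trail running shoes", "sports socks"],
        ["trail running shoes", "yoga mat"],
        ["sports socks", "trail running shoes"],
    ]
    for entity, score in aggregate("running shoes", outputs):
        print(entity, score)
```

In this reading, a candidate proposed by every prompt and lexically close to the source entity ranks first, while a candidate proposed by a single prompt ranks last. A production system would presumably replace `jaccard` with an embedding-based similarity.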