MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification (1603.00968v2)

Published 3 Mar 2016 in cs.CL

Abstract: We introduce a novel, simple convolutional neural network (CNN) architecture, the multi-group norm constraint CNN (MGNC-CNN), that capitalizes on multiple sets of word embeddings for sentence classification. MGNC-CNN extracts features from each input embedding set independently and then joins them at the penultimate layer of the network to form a final feature vector. We then adopt a group regularization strategy that differentially penalizes the weights associated with the subcomponents generated from the respective embedding sets. This model is much simpler than comparable alternative architectures and requires substantially less training time. Furthermore, it is flexible in that it does not require the input word embeddings to have the same dimensionality. We show that MGNC-CNN consistently outperforms baseline models.
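
The pipeline the abstract describes (independent feature extraction per embedding set, concatenation at the penultimate layer, and a separate constraint per group of weights) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`conv_max_pool`, `clip_group_norm`, `mgnc_forward`), the ReLU with 1-max pooling, the linear scoring layer, and the choice of a per-group max-norm rescaling as the group regularizer are all assumptions made for the example.

```python
import math


def conv_max_pool(sentence, filt):
    """Slide one filter over a sentence matrix; return the max ReLU activation.

    sentence: list of word vectors (one embedding set), length >= len(filt).
    filt: list of weight rows; its height is the number of words it spans.
    """
    height = len(filt)
    best = 0.0  # ReLU output is never below zero
    for start in range(len(sentence) - height + 1):
        window = sentence[start:start + height]
        act = sum(
            w * x
            for filt_row, word_vec in zip(filt, window)
            for w, x in zip(filt_row, word_vec)
        )
        best = max(best, act)
    return best


def clip_group_norm(weights, max_norm):
    """Rescale one group's weights so their L2 norm is at most max_norm.

    Assumption: the group regularizer is modeled as a max-norm constraint.
    """
    norm = math.sqrt(sum(w * w for w in weights))
    if norm <= max_norm:
        return list(weights)
    scale = max_norm / norm
    return [w * scale for w in weights]


def mgnc_forward(views, filter_banks, out_weights, max_norms):
    """Compute the joined feature vector and a linear score.

    views: one sentence matrix per embedding set (dimensions may differ).
    filter_banks: one list of filters per embedding set.
    out_weights: one list of output weights per embedding set's features.
    max_norms: one norm bound per embedding set (hypothetical hyperparameters).
    """
    features, score = [], 0.0
    for sent, bank, weights, bound in zip(views, filter_banks, out_weights, max_norms):
        group = [conv_max_pool(sent, f) for f in bank]  # independent extraction
        # In training this constraint would normally follow each weight update;
        # it is applied here only to show that each group gets its own bound.
        constrained = clip_group_norm(weights, bound)
        score += sum(w * f for w, f in zip(constrained, group))
        features.extend(group)  # join groups at the penultimate layer
    return features, score
```

Because each embedding set has its own filters, the sets never need to share a dimensionality; only the pooled features, whose count depends on the number of filters, are concatenated.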

Authors (3)
  1. Ye Zhang (137 papers)
  2. Stephen Roller (27 papers)
  3. Byron Wallace (10 papers)
Citations (105)
