Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Joint Model for Multimodal Document Quality Assessment (1901.01010v2)

Published 4 Jan 2019 in cs.CL, cs.AI, and cs.DL

Abstract: The quality of a document is affected by various factors, including grammaticality, readability, stylistics, and expertise depth, making the task of document quality assessment a complex one. In this paper, we explore this task in the context of assessing the quality of Wikipedia articles and academic papers. Observing that the visual rendering of a document can capture implicit quality indicators that are not present in the document text --- such as images, font choices, and visual layout --- we propose a joint model that combines the text content with a visual rendering of the document for document quality assessment. Experimental results over two datasets reveal that textual and visual features are complementary, achieving state-of-the-art results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Aili Shen (6 papers)
  2. Bahar Salehi (2 papers)
  3. Timothy Baldwin (125 papers)
  4. Jianzhong Qi (68 papers)
Citations (24)

Summary

We haven't generated a summary for this paper yet.