Using Web Page Titles to Rediscover Lost Web Pages (1002.2439v1)
Published 11 Feb 2010 in cs.IR
Abstract: Titles are denoted by the TITLE element within a web page. We queried the title against the Yahoo search engine to determine the page's status (found, not found). We conducted several tests based on elements of the title. These tests were used to discern whether we could predict a page's status based on the title. Our results increase our ability to identify bad titles but not our ability to identify good titles.
- Jeffery L. Shipman (1 paper)
- Martin Klein (34 papers)
- Michael L. Nelson (92 papers)
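
The abstract's starting point, reading the TITLE element out of a page, can be sketched with the Python standard library alone. This is an illustrative sketch, not the authors' code: `TitleExtractor` and `extract_title` are hypothetical names, and the step of querying the title against a search engine is left out because the abstract does not describe how it was done.

```python
from html.parser import HTMLParser


class TitleExtractor(HTMLParser):
    """Collects the text of the first TITLE element in an HTML document."""

    def __init__(self):
        super().__init__()
        self._in_title = False   # True while inside <title>...</title>
        self._done = False       # True once the first title has been closed
        self._parts = []         # text chunks seen inside the title

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names, so <TITLE> arrives as "title".
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self._parts.append(data)

    @property
    def title(self):
        # Collapse runs of whitespace so the title is usable as a query string.
        return " ".join("".join(self._parts).split())


def extract_title(html):
    """Return the normalized title text, or an empty string if none exists."""
    parser = TitleExtractor()
    parser.feed(html)
    parser.close()
    return parser.title


if __name__ == "__main__":
    page = "<html><head><TITLE> Lost  Web\n Pages </TITLE></head><body></body></html>"
    print(extract_title(page))
```

The returned string is what would be submitted as a query; an empty result signals a page with no usable title.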