Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Podcast Metadata and Content: Episode Relevance andAttractiveness in Ad Hoc Search (2108.11460v1)

Published 25 Aug 2021 in cs.IR

Abstract: Rapidly growing online podcast archives contain diverse content on a wide range of topics. These archives form an important resource for entertainment and professional use, but their value can only be realized if users can rapidly and reliably locate content of interest. Search for relevant content can be based on metadata provided by content creators, but also on transcripts of the spoken content itself. Excavating relevant content from deep within these audio streams for diverse types of information needs requires varying the approach to systems prototyping. We describe a set of diverse podcast information needs and different approaches to assessing retrieved content for relevance. We use these information needs in an investigation of the utility and effectiveness of these information sources. Based on our analysis, we recommend approaches for indexing and retrieving podcast content for ad hoc search.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Ben Carterette (14 papers)
  2. Rosie Jones (13 papers)
  3. Gareth F. Jones (6 papers)
  4. Maria Eskevich (3 papers)
  5. Sravana Reddy (8 papers)
  6. Ann Clifton (13 papers)
  7. Yongze Yu (8 papers)
  8. Jussi Karlgren (22 papers)
  9. Ian Soboroff (13 papers)
Citations (6)