Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Predicting TED Talk Ratings from Language and Prosody (1906.03940v1)

Published 21 May 2019 in cs.MM and cs.CL

Abstract: We use the largest open repository of public speaking---TED Talks---to predict the ratings of the online viewers. Our dataset contains over 2200 TED Talk transcripts (includes over 200 thousand sentences), audio features and the associated meta information including about 5.5 Million ratings from spontaneous visitors of the website. We propose three neural network architectures and compare with statistical machine learning. Our experiments reveal that it is possible to predict all the 14 different ratings with an average AUC of 0.83 using the transcripts and prosody features only. The dataset and the complete source code is available for further analysis.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Daniel Gildea (28 papers)
  2. M. Ehsan Hoque (4 papers)
  3. Md Kamrul Hassan (2 papers)
  4. Md Iftekhar Tanveer (4 papers)
Citations (2)