
A Causality-Guided Prediction of the TED Talk Ratings from the Speech-Transcripts using Neural Networks (1905.08392v1)

Published 21 May 2019 in cs.LG, cs.CL, and stat.ML

Abstract: Automated prediction of public speaking performance enables novel systems for tutoring public speaking skills. We use the largest open repository, TED Talks, to predict the ratings provided by online viewers. The dataset contains over 2200 talk transcripts and the associated meta information, including over 5.5 million ratings from spontaneous visitors to the website. We carefully removed the bias present in the dataset (e.g., the speakers' reputations, popularity gained by publicity, etc.) by modeling the data-generating process using a causal diagram. We use a word-sequence-based recurrent architecture and a dependency-tree-based recursive architecture as the neural networks for predicting the TED talk ratings. Our neural network models predict the ratings with an average F-score of 0.77, which substantially outperforms the competitive baseline method.
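To make the "average F-score" figure concrete, here is a minimal Python sketch of how such a number could be computed. It assumes each rating category is treated as a separate binary label and that the per-label F-scores are averaged with equal weight; the abstract does not state this, so treat it as an assumption. The function names, the label names `inspiring` and `funny`, and the toy data are all hypothetical and are not results from the paper.

```python
def f_score(y_true, y_pred):
    """F1 score for one binary label (1 = positive, 0 = negative)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def average_f_score(truth_by_label, pred_by_label):
    """Unweighted mean of the per-label F1 scores (assumed averaging scheme)."""
    scores = [
        f_score(truth_by_label[label], pred_by_label[label])
        for label in truth_by_label
    ]
    return sum(scores) / len(scores)


# Toy, made-up data: two hypothetical rating labels over four talks.
truth = {"inspiring": [1, 1, 0, 0], "funny": [1, 0, 1, 0]}
preds = {"inspiring": [1, 0, 1, 0], "funny": [1, 0, 1, 0]}
avg = average_f_score(truth, preds)
```

With this toy data, the `inspiring` label has one true positive, one false positive, and one false negative (F1 of 0.5), while `funny` is predicted perfectly (F1 of 1.0), so the unweighted average is 0.75.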

Authors (4)
  1. Md Kamrul Hasan (71 papers)
  2. Daniel Gildea (28 papers)
  3. M. Ehsan Hoque (4 papers)
  4. Md Iftekhar Tanveer (4 papers)
Citations (5)