
Persistence Homology of TEDtalk: Do Sentence Embeddings Have a Topological Shape? (2103.14131v1)

Published 25 Mar 2021 in cs.LG

Abstract: \emph{Topological data analysis} (TDA) has recently emerged as a technique for extracting meaningful discriminative features from high-dimensional data. In this paper, we investigate whether TDA can improve the classification accuracy of public speaking ratings. We calculated \emph{persistence image vectors} for the sentence embeddings of TED talk data and fed these vectors as additional inputs to our machine learning models. We report a negative result: this topological information does not significantly improve model accuracy, and in some cases makes it slightly worse than the baseline. From our results, we cannot conclude that the topological shape of sentence embeddings helps train a better model for public speaking rating.
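The pipeline the abstract describes (persistent homology of a sentence-embedding point cloud, vectorized as a persistence image and used as a feature vector) can be sketched in plain NumPy/SciPy. This is an illustrative simplification, not the paper's code: it computes only 0-dimensional persistence, using the fact that H0 death times of a Rips filtration are exactly the minimum-spanning-tree edge lengths, and uses a hand-rolled Gaussian-bump persistence image rather than a library implementation. The embedding data is random stand-in data.

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform
from scipy.sparse.csgraph import minimum_spanning_tree

def h0_persistence(points):
    """0-dimensional persistence diagram of a Rips filtration.

    Every H0 class is born at 0 and dies when its connected component
    merges with another, so the death times are exactly the edge
    lengths of the minimum spanning tree of the point cloud.
    """
    dist = squareform(pdist(points))
    mst = minimum_spanning_tree(dist).toarray()
    deaths = np.sort(mst[mst > 0])
    return np.column_stack([np.zeros_like(deaths), deaths])

def persistence_image(diagram, grid=8, sigma=0.1):
    """Vectorize a persistence diagram as a persistence-image feature.

    Each (birth, death) pair is mapped to (birth, persistence), smoothed
    with a Gaussian, weighted by its persistence, and accumulated on a
    fixed grid; the flattened grid is the feature vector.
    """
    births = diagram[:, 0]
    pers = diagram[:, 1] - diagram[:, 0]
    xs = np.linspace(0.0, 1.0, grid)
    ys = np.linspace(0.0, pers.max(), grid)
    img = np.zeros((grid, grid))
    for b, p in zip(births, pers):
        wx = np.exp(-((xs - b) ** 2) / (2 * sigma ** 2))
        wy = np.exp(-((ys - p) ** 2) / (2 * sigma ** 2))
        img += p * np.outer(wy, wx)  # weight each bump by persistence
    return img.ravel()

# Stand-in for sentence embeddings of one talk (40 sentences, dim 16).
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(40, 16))

diagram = h0_persistence(embeddings)
features = persistence_image(diagram)   # extra inputs for a classifier
print(features.shape)
```

In the paper's setup these `features` would be concatenated with the ordinary embedding-based inputs of the rating model; the reported finding is that this concatenation does not measurably help.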

Citations (3)
