Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A comparison of citation-based clustering and topic modeling for science mapping (2309.06160v2)

Published 12 Sep 2023 in cs.DL

Abstract: Understanding the different ways in which different science mapping approaches capture the structure of scientific fields is critical. This paper presents a comparative analysis of two commonly used approaches, topic modeling (TM) and citation-based clustering (CC), to assess their respective strengths, weaknesses, and the characteristics of their results. We compare the two approaches using cluster-to-topic and topic-to-cluster mappings based on science maps of cardiovascular research generated by TM and CC. Our findings reveal that relations between topics and clusters are generally weak, with limited overlap between topics and clusters. Only in a few exceptional cases do more than one-third of the documents in a topic belong to the same cluster, or vice versa. For TM the presence of highly similar topics is a considerable challenge. A strength of TM is its ability to represent societal needs related to cardiovascular disease, potentially offering valuable insights for policymakers. In contrast, CC excels in depicting the intellectual structure of cardiovascular diseases, with a strong capability to reflect scientific micro-communities. This study deepens the understanding of the use of TM and CC for science mapping, providing insights for users on how to apply these approaches based on their needs.

Citations (1)

Summary

We haven't generated a summary for this paper yet.