Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CephGPT-4: An Interactive Multimodal Cephalometric Measurement and Diagnostic System with Visual Large Language Model (2307.07518v1)

Published 1 Jul 2023 in cs.AI, cs.CL, cs.CV, and eess.IV

Abstract: Large-scale multimodal LLMs (LMMs) have achieved remarkable success in general domains. However, the exploration of diagnostic LLMs based on multimodal cephalometric medical data remains limited. In this paper, we propose a novel multimodal cephalometric analysis and diagnostic dialogue model. Firstly, a multimodal orthodontic medical dataset is constructed, comprising cephalometric images and doctor-patient dialogue data, with automatic analysis of cephalometric landmarks using U-net and generation of diagnostic reports. Then, the cephalometric dataset and generated diagnostic reports are separately fine-tuned on Minigpt-4 and VisualGLM. Results demonstrate that the CephGPT-4 model exhibits excellent performance and has the potential to revolutionize orthodontic measurement and diagnostic applications. These innovations hold revolutionary application potential in the field of orthodontics.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Lei Ma (195 papers)
  2. Jincong Han (1 paper)
  3. Zhaoxin Wang (4 papers)
  4. Dian Zhang (2 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.