
Lip Localization and Viseme Classification for Visual Speech Recognition (1301.4558v1)

Published 19 Jan 2013 in cs.CV

Abstract: The need for an automatic lip-reading system is ever increasing. In fact, the extraction and reliable analysis of facial movements now make up an important part of many multimedia systems, such as videoconferencing, low communication systems, and lip-reading systems. In addition, visual information is imperative for people with special needs. We can imagine, for example, a dependent person commanding a machine with an easy lip movement or a simple syllable pronunciation. Moreover, people with hearing problems compensate for their special needs by lip-reading as well as listening to the person with whom they are talking.

Authors (3)
  1. Salah Werda (1 paper)
  2. Walid Mahdi (6 papers)
  3. Abdelmajid ben Hamadou (4 papers)
Citations (52)
