Keyframe Extraction from Utterance Scene and Keyframe-based Word Lip Reading
- Saitoh Takeshi, Graduate School of Computer Science and Systems Engineering, Kyushu Institute of Technology
- Morishita Kazutoshi, Graduate School of Engineering, Tottori University
- Konishi Ryosuke, Graduate School of Engineering, Tottori University
Bibliographic Information
- Other Title
- 発話シーンからのキーフレーム検出とキーフレームに基づく単語読唇
- ハツワ シーン カラ ノ キーフレーム ケンシュツ ト キーフレーム ニ モトズク タンゴドクシン
Abstract
In this paper, we propose a new keyframe-based lip reading method that does not require advance registration of an utterance scene. To extract keyframes, we apply frame subtraction and take as keyframes the frames at which the difference value is a local minimum. We compute thirteen shape features from the five lip regions of each extracted keyframe and then apply discriminant analysis to recognize the mouth shape. From the mouth shape recognition results we generate a code sequence. In addition, we generate candidate code sequences according to several rules. Finally, we apply DP matching between the two kinds of code sequence, the keyframe-based one and the candidates, and select the word with the most similar code sequence as the result. The target vocabulary was 19 Japanese words, and we recorded utterance scenes from four speakers. We carried out three experiments: keyframe extraction, mouth shape recognition, and word recognition. We obtained an average recognition rate of 53.9%. Although there were individual differences, the best speaker reached a recognition rate of 72.1%.
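The two algorithmic steps named in the abstract, keyframe selection at local minima of the frame difference and DP matching of code sequences, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the difference measure (sum of absolute pixel differences), the plain edit-distance cost used for DP matching, and all function names and example mouth-shape codes are assumptions made for illustration only.

```python
from typing import Dict, List, Sequence


def frame_differences(frames: Sequence[Sequence[float]]) -> List[float]:
    """Sum of absolute pixel differences between consecutive frames.

    Assumption: each frame is a flat sequence of grey levels.
    Entry i compares frame i with frame i + 1.
    """
    return [
        sum(abs(a - b) for a, b in zip(prev, curr))
        for prev, curr in zip(frames, frames[1:])
    ]


def local_minima(values: Sequence[float]) -> List[int]:
    """Indices where the difference value is a local minimum.

    A plateau of equal values is reported once, at its first index.
    """
    return [
        i
        for i in range(1, len(values) - 1)
        if values[i] < values[i - 1] and values[i] <= values[i + 1]
    ]


def dp_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Edit distance between two code sequences via dynamic programming.

    Assumption: unit cost for insertion, deletion and substitution;
    the abstract does not state the actual matching cost.
    """
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def best_word(observed: Sequence[str],
              candidates: Dict[str, Sequence[str]]) -> str:
    """Return the word whose candidate code sequence is closest to `observed`."""
    return min(candidates, key=lambda w: dp_distance(observed, candidates[w]))


if __name__ == "__main__":
    diffs = [5, 1, 4, 6, 2, 7]           # hypothetical difference values
    print(local_minima(diffs))            # keyframe positions in the difference signal
    words = {"ai": ["a", "i"], "ue": ["u", "e"]}  # hypothetical code sequences
    print(best_word(["a", "i"], words))
```

In this sketch a keyframe index refers to a position in the difference signal, so mapping it back to a video frame (frame i or frame i + 1) is a design choice the abstract leaves open.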
Journal
- IEEJ Transactions on Electronics, Information and Systems 131 (2), 418-424, 2011
- The Institute of Electrical Engineers of Japan
Details
- CRID: 1390001204608943232
- NII Article ID: 10027804091
- NII Book ID: AN10065950
- ISSN: 13488155, 03854221
- NDL BIB ID: 10952974
- Text Lang: ja
- Data Source: JaLC, NDL, Crossref, CiNii Articles, KAKEN
- Abstract License Flag: Disallowed