Logo image
Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptationI
Journal article   Open access   Peer reviewed

Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptationI

Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari and Karen Livescu
Computer speech & language, Vol.46, pp.209-232
11/01/2017
DOI: 10.1016/j.csl.2017.05.009
url
https://doi.org/10.1016/j.csl.2017.05.009View
Published (Version of record) Open Access

Abstract

We study the problem of recognizing video sequences of fingerspelled letters in American Sign Language (ASL). Fingerspelling comprises a significant but relatively understudied part of ASL. Recognizing fingerspelling is challenging for a number of reasons: it involves quick, small motions that are often highly coarticulated; it exhibits significant variation between signers; and there has been a dearth of continuous fingerspelling data collected. In this work we collect and annotate a new data set of continuous fingerspelling videos, compare several types of recognizers, and explore the problem of signer variation. Our best-performing models are segmental (semi-Markov) conditional random fields using deep neural network-based features. In the signer dependent setting, our recognizers achieve up to about 92% letter accuracy. The multi-signer setting is much more challenging, but with neural network adaptation we achieve up to 83% letter accuracies in this setting. (C) 2017 Elsevier Ltd. All rights reserved.
Computer Science Computer Science, Artificial Intelligence Science & Technology Technology

Details

Metrics

Logo image