Similarity learning for cnn-based asl alphabet recognition
Perez-Daniel, Karina Ruby
MetadataShow full item record
Sign language is an important communication way to convey information among the deaf community, and it is primarily used by people who have hearing or speech impairments. Besides, sign language represents a direct Human-Computer-Interaction (HCI) similar to voice commands therefore, the purpose of this study is to investigate and develop a system for American Sign Language (ASL) alphabet recognition using convolutional neural networks. Our proposal is based on semantic similarity learning using Siamese Convolutional Neural Network to reduce the intra-class variation and inter-class similarity among sign images in a Euclidean space the results of the siamese architecture applied to the ASL alphabet dataset outperform previous works found in the literature. From these results, using t-SNE visualization, we demonstrate that our hypothesis is correct; the ASL recognition improves when increasing the similarity among encoding of the images belonging to the same class and reducing it otherwise. ©2021 The authors and IOS Press. All rights reserved.
Showing items related by title, author, creator and subject.
Álvarez, Víctor M.; Velázquez, Ramiro; Gutiérrez, Sebastián; Enríquez Zárate, Josué (Institute of Electrical and Electronics Engineers Inc., 2018)Current facial recognition techniques allow to automatically determine human emotions through a digital image of the face. The present study employs interest points as landmarks in facial images affected by some emotion ...
Ponce, Hiram; González Mora, José Guillermo; Martinez-Villaseñor, Lourdes; Miralles, Luis (Springer Verlag, 2018)Human activity recognition (HAR) aims to classify and identify activities based on data-driven from different devices, such as sensors or cameras. Particularly, mobile devices have been used for this recognition task. ...
Álvarez, Víctor M.; Sánchez-Gómez, Claudia; Gutiérrez, Sebastián; Domínguez Soberanes, Julieta; Velázquez, Ramiro (Institute of Electrical and Electronics Engineers Inc., 2018)As stated by Ekman in his Facial Action Coding System (FACS), facial expressions can be interpreted as the activation of different sets of facial muscles. This recognition skill, however, demands extensive training and is ...