Gesture dataset
Korean symbolic gestures to substitute and complement communications of the disabled
TAGS
Gestures
Speech disturbance
Communications
Assistive Technology
Analysis on guidelines for efficient data collection
We started this project in hopes of tearing down the communication barriers. This dataset of Korean symbolic gestures based on sign language aims to complement the communication process of those with severe disabilities. The technology will allow better communication between the disabled and the non-disabled people.
About
Soft Computing & Interaction Lab
Datumo provides high quality data for smarter AI. As part of Datumo's Data Sponsorship Program, Datumo cooperated with POSTECH in building the following dataset.
POSTECH is Korea’s first research university, established in a period when Korea was still chasing after advanced technologies of developed nations. It was founded on the belief that Korea needed a university that could spearhead the global science and technology community. POSTECH has been a university that constantly pioneered change and innovation in higher education.
The professors, including Minsu Cho, Suha Kwak, and Jaesik Park are continuing their research in visual correspondence, metric learning, GAN, 3D vision, and so on. Every year, they publish papers on the so called 'Top 3 most authoritative computer vision conferences', which include CVPR(Computer Vision and Pattern Recognition), ICCV(International Conference on Computer Vision), and ECCV(European Conference on Computer Vision).
Testimonial
“It has been a valuable experience as we were able to participate in a project building datasets in such scale that is not easy for research labs to approach. During the approximate five months after being selected for the program, there were some difficulties in carrying out our project. However, owing to Datumo, who was quick and kind to answer our questions and took care of small details throughout the project, we were able to finish the project strong. We were grateful to participate in a program with such good purpose and nice people involved. We hope the datasets we have built via 2021 AI Training Data Sponsorship Program will be used in developing various communication services to support the disabled.”
Dataset specification
- 4,204 video datasets on 205 categories
- Total of 21,020 video datasets of gestures built- each gesture repeated five times per video produced by a crowd-worker
Process of annotation
Data collected by providing video guidelines for 204 "Sondam" categories to crowd-workers
- Main apparatus: Datumo's crowd-sourcing platform "Cash Mission"
- Standards and means of data collection and guidance set, based on client's needs
- Project designed and guidelines (utilizing "Sondam" video tutorials) and tutorials set for crowd-workers
- Pilot test carried out before the actual project for assurance
- Collected "Sondam" video data via crowd-workers
- Metadata tagging carried out by crowd-workers
- Final data delivered after modification and inspection by in-house workers
Data Collection
Video data were collected and labeled using Datumo's crowd-sourcing platform "Cash Mission(mobile ver.)"
Sample Data
Tutorial video of "Sondam" gesture meaning "to be fine", from the guidelines in Cash Mission
Tutorial video of "Sondam" gesture meaning "older brother", from the guidelines in Cash Mission
Tutorial video of "Sondam" gesture meaning "mountain", from the guidelines in Cash Mission
Applications
Analysis and research on human gestures and movements
CC BY-SA
Reusers are allowed to distribute, remix, adapt, and build upon the material in any medium or format, even commercially, so long as attribution is given to the creator. If you remix, adapt, or build upon the material, you must license the modified material under identical terms.
https://creativecommons.org/licenses/by-sa/3.0/deed.en
Gesture dataset
Korean symbolic gestures to substitute and complement communications of the disabled