Slovo: Russian Sign Language Dataset
We introduce a large-scale video dataset Slovo for Russian Sign Language task. Slovo dataset size is about 16 GB, and it contains 20400 RGB videos for 1000 sign language gestures from 194 singers. Each class has 20 samples. The dataset is divided into training set and test set by subject user_id. The training set includes 15300 videos, and the test set includes 5100 videos. The total video recording time is ~9.2 hours. About 35% of the videos are recorded in HD format, and 65% of the videos are in FullHD resolution. The average video length with gesture is 50 frames.
Annotation file is easy to use and contains some useful columns, see annotations.csv file:
| | attachment_id | user_id | width | height | length | text | train | begin | end | |---:|:--------------|:--------|------:|-------:|-------:|-------:|:--------|:------|:----| | 0 | de81cc1c-... | 1b... | 1440 | 1920 | 14 | привет | True | 30 | 45 | | 1 | 3c0cec5a-... | 64... | 1440 | 1920 | 32 | утро | False | 43 | 66 | | 2 | d17ca986-... | cf... | 1920 | 1080 | 44 | улица | False | 12 | 31 |
where:
attachment_id- video file nameuser_id- unique anonymized user IDwidth- video widthheight- video heightlength- video lengthtext- gesture class in Russian Langaugetrain- train or test boolean flagbegin- start of the gesture (for original dataset)end- end of the gesture (for original dataset)