ISLTranslate: Dataset for Translating Indian Sign Language
Abstract: Sign languages are the primary means of communication for many hard-of-hearing people worldwide. Recently, to bridge the communication gap between the hard-of-hearing community and the rest of the population, several sign language translation datasets have been proposed to enable the development of statistical sign language translation systems. However, there is a dearth of sign language resources for the Indian sign language. This resource paper introduces ISLTranslate, a translation dataset for continuous Indian Sign Language (ISL) consisting of 31k ISL-English sentence/phrase pairs. To the best of our knowledge, it is the largest translation dataset for continuous Indian Sign Language. We provide a detailed analysis of the dataset. To validate the performance of existing end-to-end Sign language to spoken language translation systems, we benchmark the created dataset with a transformer-based model for ISL translation.
- BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues. In ECCV.
- BOBSL: BBC-Oxford British Sign Language Dataset.
- Neural sign language translation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7784–7793.
- Sign language transformers: Joint end-to-end sign language recognition and translation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- Content4all open research sign language translation datasets. In 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), page 1–5. IEEE Press.
- Mathieu De Coster and Joni Dambre. 2022. Leveraging frozen pretrained written language models for neural sign language translation. Information, 13(5).
- Frozen pretrained transformers for neural sign language translation. In 1st International Workshop on Automated Translation for Signed and Spoken Languages.
- Speech recognition techniques for a sign language recognition system. In Proc. Interspeech 2007, pages 2513–2516.
- How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language. In Conference on Computer Vision and Pattern Recognition (CVPR).
- RÂ Elakkiya and BÂ Natarajan. 2021. Isl-csltr: Indian sign language dataset for continuous sign language translation and recognition. Mendeley Data.
- Skeleton aware multi-modal sign language recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
- CISLR: Corpus for Indian Sign Language recognition. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP).
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
- Neural sign language translation based on human keypoint estimation. ArXiv, abs/1811.11436.
- Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers. Computer Vision and Image Understanding, 141:108–125.
- Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison. In The IEEE Winter Conference on Applications of Computer Vision, pages 1459–1469.
- Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain. Association for Computational Linguistics.
- Purdue rvl-slll asl database for automatic recognition of american sign language. Proceedings. Fourth IEEE International Conference on Multimodal Interfaces, pages 167–172.
- Real-time sign language detection using human pose estimation. In European Conference on Computer Vision, pages 237–248. Springer.
- Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 311–318, Philadelphia, Pennsylvania, USA. Association for Computational Linguistics.
- Robust speech recognition via large-scale weak supervision.
- Progressive Transformers for End-to-End Sign Language Production. In Proceedings of the European Conference on Computer Vision (ECCV).
- OpenHands: Making sign language recognition accessible with pose-based pretrained models across languages. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2114–2133, Dublin, Ireland. Association for Computational Linguistics.
- Open-domain sign language translation learned from online video.
- Ozge Mercanoglu Sincan and Hacer Yalim Keles. 2020. Autsl: A large scale multi-modal turkish sign language dataset and baseline methods. IEEE Access, 8:181340–181355.
- Natural language processing with transformers. " O’Reilly Media, Inc.".
- Stochastic transformer networks with linear competing units: Application to end-to-end sl translation. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 11926–11935.
- Automatic gloss dictionary for sign language learners. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 83–92, Dublin, Ireland. Association for Computational Linguistics.
- Including signed languages in natural language processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 7347–7360, Online. Association for Computational Linguistics.
- Kayo Yin and Jesse Read. 2020. Better sign language translation with STMC-transformer. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5975–5989, Barcelona, Spain (Online). International Committee on Computational Linguistics.
- Improving sign language translation with monolingual data by sign back-translation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 1316–1325.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.