Bornil: An open-source sign language data crowdsourcing platform for AI enabled dialect-agnostic communication

The absence of annotated sign language datasets has hindered the development of sign language recognition and translation technologies. In this paper, we introduce Bornil; a crowdsource-friendly, multilingual sign language data collection, annotation, and validation platform. Bornil allows users to...

Full description

Saved in:
Bibliographic Details
Main Authors: Dhruvo, Shahriar Elahi, Rahman, Mohammad Akhlaqur, Mandal, Manash Kumar, Shihab, Md. Istiak Hossain, Ansary, A. A. Noman, Shithi, Kaneez Fatema, Khanom, Sanjida, Akter, Rabeya, Arib, Safaeid Hossain, Ansary, M. N, Mehnaz, Sazia, Sultana, Rezwana, Rahman, Sejuti, Chowdhury, Sayma Sultana, Chowdhury, Sabbir Ahmed, Sadeque, Farig, Sushmit, Asif
Format: Journal Article
Language:English
Published: 29-08-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The absence of annotated sign language datasets has hindered the development of sign language recognition and translation technologies. In this paper, we introduce Bornil; a crowdsource-friendly, multilingual sign language data collection, annotation, and validation platform. Bornil allows users to record sign language gestures and lets annotators perform sentence and gloss-level annotation. It also allows validators to make sure of the quality of both the recorded videos and the annotations through manual validation to develop high-quality datasets for deep learning-based Automatic Sign Language Recognition. To demonstrate the system's efficacy; we collected the largest sign language dataset for Bangladeshi Sign Language dialect, perform deep learning based Sign Language Recognition modeling, and report the benchmark performance. The Bornil platform, BornilDB v1.0 Dataset, and the codebases are available on https://bornil.bengali.ai
DOI:10.48550/arxiv.2308.15402