Passionate about Tamil? Contribute your voice!
Read and validate Tamil sentences to contribute to the development of open Tamil voice technologies.
Tamil Common Voice is undertaking the work to develop and release datasets that anyone can use. Currently, there is less than 25 hours of validated hours of voice data for Tamil. We need a minimum 300 validated hours of voice data. Let's make our small contribution towards to reach this goal.
Please go to the Tamil Common Voice platform to read and validate sentences.
How to Contribute
- Review written Instructions for how to contribute
- Video of how to contribute in Tamil #1
- Video of how to contribute in Tamil #2
- Fill out this form to receive acknowledgement of your participation in the Tamil Common Voice project.
What else are you doing?
At UTSC Library, we are participating as part of our larger Tamil Digital Studies Project, which seeks to promote Tamil Digital Scholarship, and develop an intellectually focused digital content hub that will encourage Tamil Studies scholarship. Follow our work in Open Data by visiting our catalog. However, a wide network of staff and volunteers have worked hard to find open texts, extract appropriate sentences, review, validate and add them to Mozilla’s Common Voice project. This is a group project that has also created open datasets and scripts that might be useful in other contexts and that are available on Github.