The focus of this Industry Connections program is the identification of use cases that could lead to proposals for standards to govern language resources for Indian languages. Language resources include speech, language data, and descriptions which are made available in a machine readable form and used for developing, evaluating, and improving algorithms in the area of natural language and speech processing. Such standards can also be used for language studies, localization of software, electronic publishing, and any purpose for researchers, subject area specialists, etc. Examples of language resources are spoken and written corpora, computational lexica, terminology databases, etc. The work will also include any tools that are used for the above stated purposes.
We welcome new participants from large and small corporations, academia, industry, and government agencies that are interested in Pre-Standardization Studies for Indian Language Resources.
Possible Industry Outcomes
Proposed deliverables and outcomes from this Industry Connections activity may include documents (e.g., white papers, reports), proposals for standards, conferences and workshops, databases, computer code, etc.
- List of language resources required for the officially recognized Indian languages
- Identification of standards for each of the identified language resources
- Database with sample collateral for each of the listed language resources across official Indian language
- Computer code for a sample list of common use cases using the language resources made available in database
- Workshops and conferences
How to Participate
To join the Pre-Standardization Studies for Indian Language Resources team, please express your interest by sending an inquiry to: