This contributed volume discusses in detail the process of construction of a WordNet of 18 Indian languages, called "Indradhanush" (rainbow) in Hindi. It delves into the major challenges involved in developing a WordNet in a multilingual country like India, where the information spread across the languages needs utmost care in processing, synchronization and representation. The project has emerged from the need of millions of people to have access to relevant content in their native languages, and it provides a common interface for information sharing and reuse across the Indian languages.
The chapters discuss important methods and strategies of language computation, language data processing, lexical selection and management, and language-specific synset collection and representation, which are of utmost value for the development of a WordNet in any language. The volume overall gives a clear picture of how WordNet is developed in Indian languages and how this can be utilized in similar projects for other languages. It includes illustrations, tables, flowcharts, and diagrams for easy comprehension.
This volume is of interest to researchers working in the areas of language processing, machine translation, word sense disambiguation, culture studies, language corpus generation, language teaching, dictionary compilation, lexicographic queries, cross-lingual knowledge sharing, e-governance, and many other areas of linguistics and language technology.
Publisher: Springer Verlag, Singapore
Number of pages: 264
Weight: 6017 g
Dimensions: 235 x 155 mm
Edition: 1st ed. 2017