English-Hindi Cross Language Information Retrieval System: Query Perspective
- 1 Amity University, India
- 2 Ambalika Institute of Management and Technology, India
Abstract
The abundance of multilingual content on internet other than English gives an urge to develop information retrieval system that can cross language boundaries. Such cross lingual information retrieval systems will bridge this language gap and allow user to ask a query in regional language and retrieve relevant documents in a different language. The problem of finding relevant document in language different from source language is the most challenging application of any cross lingual information retrieval. This paper discusses the development process of complete English to Hindi cross language information retrieval system along with the contribution of individual components to the system. The main focus of this paper is to discuss how optimization is done to our disambiguation approach, which we named as ‘Two level Disambiguation method’. The experimental results obtained affirm that the addition of a component ‘Analyzer’ to our CLIR architecture increases the efficiency of our proposed disambiguation algorithm.
DOI: https://doi.org/10.3844/jcssp.2018.705.713
Copyright: © 2018 Pratibha Bajpai, Parul Verma and Syed Q. Abbas. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 4,846 Views
- 2,981 Downloads
- 2 Citations
Download
Keywords
- Cross Lingual Information Retrieval System
- Translation Analyzer
- Disambiguation
- Translation Ambiguity