Authors : M. Sundara Rajan and S.P. Rajagopalan
Abstract: We propose a mixed language query disambiguation approach by using co-occurrence information from monolingual data only. A mixed language query consists of words in a primary language and a secondary language. Our method translates the query into monolingual queries in either language. Two novel features for disambiguation, namely contextual word voting and 1-best contextual word, are introduced and compared to a baseline feature, the nearest neighbor. Average query translation accuracy for the 2 features improved considerably compared to the baseline accuracy.
M. Sundara Rajan and S.P. Rajagopalan , 2007. Applications of Word Sense Disambiguation in Mixed Language Information Retrieval . Asian Journal of Information Technology, 6: 1187-1191.