Structured translation for cross-language information retrieval

TitleStructured translation for cross-language information retrieval
Publication TypeConference Papers
Year of Publication2000
AuthorsSperer R, Oard D
Conference NameProceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Date Published2000///
Conference LocationNew York, NY, USA
ISBN Number1-58113-226-3

The paper introduces a query translation model that reflects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled as a two-stage process, with the system first determining the intended meaning of a query term and then selecting translations appropriate to that meaning that might appear in the document collection. An implementation of structured translation based on automatic dictionary clustering is described and evaluated by using Chinese queries to retrieve English documents. Structured translation achieved an average precision that was statistically indistinguishable from Pirkola's technique for very short queries, but Pirkola's technique outperformed structured translation on long queries. The paper concludes with some observations on future work to improve retrieval effectiveness and on other potential uses of structured translation in interactive cross-language retrieval applications.