Data Management

A Method Using Language Grid and Concept Base for Japanese- English Cross-Language Information Retrieval

Download Now Free registration required

Executive Summary

This paper describes query translation using language resources and a concept base method for Cross-Language Information Retrieval (CLIR). In the proposed method, queries are translated by multiple machine translation systems on the Language Grid. The queries are then expanded by using a bilingual dictionary to translate compound words or word phrases. In addition, documents related to the translated query are retrieved with a TFIDF term weighting model. The top 100 retrieved documents are re-ranked by a specificity-considered concept base with the noun phrases and compound words extracted from the query.

  • Format: PDF
  • Size: 704.48 KB