

Author: Wilbur W. Kim Won Xie Natalie
Publisher: Springer Publishing Company
ISSN: 1386-4564
Source: Information Retrieval, Vol.9, Iss.5, 2006-11, pp. : 543-564
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
It is known that users of internet search engines often enter queries with misspellings in one or more search terms. Several web search engines make suggestions for correcting misspelled words, but the methods used are proprietary and unpublished to our knowledge. Here we describe the methodology we have developed to perform spelling correction for the PubMed search engine. Our approach is based on the noisy channel model for spelling correction and makes use of statistics harvested from user logs to estimate the probabilities of different types of edits that lead to misspellings. The unique problems encountered in correcting search engine queries are discussed and our solutions are outlined.
Related content






PubSearch: a Web citation-based retrieval system
Library Hi Tech, Vol. 19, Iss. 3, 2001-09 ,pp. :

