Method and system for performing multilingual document searches
First Claim
1. A method to be executed at least in part in a computing device for searching multilingual documents, the method comprising:
- on the computing device, receiving a search result set ranked in order of relevance in response to submitting a query based on a request from a user for documents, the submitted query including a list of languages from which the received search result set is filtered, the documents being indexed in a search engine index, each document in the search engine index including an identifier property and a language property, the identifier property identifying the document and the language property specifying a language version for the document, each document in the received search result set including an identifier property and a language property, the identifier property for each document in the received search result set identifying the document and the language property for each document in the received search result set specifying the language version for the document;
determining which language version of each document in the received search result set is returned based on the list of languages included in the submitted query;
processing the received search result set based on the determination;
transforming the processed search result set into a relevance based ordered list;
generating a final list based on the relevance based ordered list using localized and non-localized content of the documents, the generated final list including at least one localized document having a higher position in the generated final list than a non-localized document ranked higher in the order of relevance than the at least one localized document in the received search result set, the at least one localized document having a language property for a local language from the list of languages included in the submitted query, and the non-localized document having a language property for a non-local language from the list of languages included in the submitted query; and
providing the generated final list to the user.
2 Assignments
0 Petitions
Accused Products
Abstract
Multilingual document searches are performed by ordering and filtering localized documents in different languages based on user preferences. Documents are indexed in a global search engine index using an identifier property and a language property permitting all relevant documents to be retrieved in a single search engine query. By storing documents with their identifier and language properties, documents from different cultures but with same content can be treated as equivalent. Results of the initial search include a multilingual set of documents ranked in order of relevance. Result sets are then post-processed using a data resource indicating for each multilingual user interface combination, which language version of each document should be returned. The initial search result are then transformed into a relevance based ordered list treating documents in different languages having same content as equivalent. Based on the ordered list, a final document list is prepared using localized and non-localized content.
31 Citations
19 Claims
-
1. A method to be executed at least in part in a computing device for searching multilingual documents, the method comprising:
-
on the computing device, receiving a search result set ranked in order of relevance in response to submitting a query based on a request from a user for documents, the submitted query including a list of languages from which the received search result set is filtered, the documents being indexed in a search engine index, each document in the search engine index including an identifier property and a language property, the identifier property identifying the document and the language property specifying a language version for the document, each document in the received search result set including an identifier property and a language property, the identifier property for each document in the received search result set identifying the document and the language property for each document in the received search result set specifying the language version for the document; determining which language version of each document in the received search result set is returned based on the list of languages included in the submitted query; processing the received search result set based on the determination; transforming the processed search result set into a relevance based ordered list; generating a final list based on the relevance based ordered list using localized and non-localized content of the documents, the generated final list including at least one localized document having a higher position in the generated final list than a non-localized document ranked higher in the order of relevance than the at least one localized document in the received search result set, the at least one localized document having a language property for a local language from the list of languages included in the submitted query, and the non-localized document having a language property for a non-local language from the list of languages included in the submitted query; and providing the generated final list to the user. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for searching multilingual documents, the system comprising:
-
a memory; a processor coupled to the memory, wherein the processor is configured to execute program modules including; a global search index module that maintains a list of available documents based on a document identifier property and a document language property; a search engine configured to; submit a query based on a request from a user for documents, the submitted query including a list of languages from which search results are filtered, the documents being indexed in a search engine index of the search engine, each document in the search engine index including an identifier property and a language property, the identifier property identifying the document and the language property specifying a language version for the document; receive search results ranked in order of relevance, the received search results including a plurality of documents, each document in the received search results including an identifier property and a language property, the identifier property for each document in the received search results identifying the document and the language property for each document in the received search results specifying a language version for the document; determine which language version of each document in the received search results is returned based on the list of languages included in the submitted query; de-prioritize documents in the received search results based on the determination; transform the received search results into a relevance based ordered list; generate a final list based on the relevance based ordered list using localized and non-localized content of the documents, the generated final list including at least one localized document having a higher position in the generated final list than a non-localized document ranked higher in the order of relevance than the at least one localized document in the received search result set, the at least one localized document having a language property for a local language from the list of languages included in the submitted query, and the non-localized document having a language property for a non-local language from the list of languages included in the submitted query; and provide the generated final list to the user. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage medium with instructions encoded thereon for searching multilingual documents, the instructions comprising:
-
receiving a search request from a user; submitting a query based on the received request for documents, wherein a list of available documents is maintained in a local search index based on a document identifier property and a document language property; receiving a search result set ranked in order of relevance, the received search result set including a plurality of documents, each document in the received search result set including an identifier property and a language property, the identifier property for each document in the received search result set identifying the document and the language property for each document in the received search result set specifying a language version for the document; determining which language version of each document in the received search result set is returned based on a user language preference; processing the received search result set based on the determination and the local search index; limiting the documents in the received search result set to those in languages of a desired multilingual user interface (MUI) by filtering on a list of localizations specified at query time; transforming the processed received search result set into a relevance based ordered list; generating a final list based on the relevance based ordered list using localized and non-localized content of the documents, the generated final list including at least one localized document having a higher position in the generated final list than a non-localized document ranked higher in the order of relevance than the at least one localized document in the received search result set, the at least one localized document having a language property for a local language, and the non-localized document having a language property for a non-local language; and providing the generated final list to the user. - View Dependent Claims (16, 17, 18, 19)
-
Specification