Method and system for semantic search and retrieval of electronic documents
First Claim
1. A system for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
- a corpus including a plurality of electronic documents that are tagged at a document level to identify general domain of each electronic document, and are analyzed based at least partially on said tags to identify word usage patterns in said plurality of electronic documents;
an index of word usage patterns that indexes said plurality of documents in said corpus according to word usage patterns and said domain tags of said plurality of electronic documents;
a query pre-processing module that receives a query from a user, and analyzes said query to determine probable word usage patterns in said query; and
a processor that uses said index to identify at least one of said electronic documents having word usage patterns that matches said probable word usage patterns in said query as a candidate electronic document, and retrieves said candidate electronic document.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query. The system includes a corpus including a plurality of electronic documents that are domain tagged at a document level and analyzed based on the tags to identify word usage patterns. An index of word usage patterns is provided that indexes the plurality of documents in the corpus according to their word usage patterns. The system also includes a query pre-processing module that receives a query from a user, and analyzes the query to determine probable word usage patterns in the query. The system further includes a processor that uses the index to identify documents having word usage patterns that matches the probable word usage patterns in the query as a candidate electronic document, and retrieves the candidate electronic document.
-
Citations
68 Claims
-
1. A system for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
a corpus including a plurality of electronic documents that are tagged at a document level to identify general domain of each electronic document, and are analyzed based at least partially on said tags to identify word usage patterns in said plurality of electronic documents;
an index of word usage patterns that indexes said plurality of documents in said corpus according to word usage patterns and said domain tags of said plurality of electronic documents;
a query pre-processing module that receives a query from a user, and analyzes said query to determine probable word usage patterns in said query; and
a processor that uses said index to identify at least one of said electronic documents having word usage patterns that matches said probable word usage patterns in said query as a candidate electronic document, and retrieves said candidate electronic document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer implemented method for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
providing a corpus including a plurality of electronic documents that are tagged at a document level to identify general domain of each electronic document, and are analyzed based at least partially on said tags to identify word usage patterns in said plurality of electronic documents;
providing an index of word usage patterns that indexes said plurality of electronic documents in said corpus according to word usage patterns and said domain tags of said plurality of electronic documents;
receiving a query from a user;
analyzing said query to derive probable word usage patterns in said query;
using said index to identify at least one of said electronic documents that has word usage patterns matching said probable word usage patterns in said query as a candidate electronic document; and
retrieving said candidate electronic document. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
-
37. A system for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
a corpus of a plurality of electronic documents;
a tagging module that tags said plurality of electronic documents in said corpus at a document level to identify general domain of each electronic document;
a word usage module that determines word usage patterns in said plurality of electronic documents in said corpus based at least partially on said tags of said plurality of electronic documents; and
an indexing module that indexes said plurality of electronic documents in said corpus at least according to word usage patterns and domain tags. - View Dependent Claims (38, 39, 40, 41)
-
-
42. A computer implemented method for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
providing a corpus of a plurality of electronic documents;
tagging said plurality of electronic documents in said corpus at a document level to identify general domain of each electronic document;
determining word usage patterns in said plurality of electronic documents in said corpus based at least partially on said tags of said plurality of electronic documents; and
generating an index of word usage patterns that indexes said plurality of documents in said corpus according to said word usage patterns and said domain tags of said plurality of electronic documents. - View Dependent Claims (43, 44, 45, 46)
-
-
47. A computer readable medium with executable instructions for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
instructions for receiving a query from a user;
instructions for analyzing said query to derive probable word usage patterns in said query;
instructions for accessing an index of word usage patterns that indexes a plurality of electronic documents according to word usage patterns in said plurality of electronic documents, said plurality of electronic documents being tagged at a document level to identify general domain of each electronic document;
instructions for identifying at least one of said electronic documents that has word usage patterns matching said probable word usage patterns in said query as a candidate electronic document; and
instructions for retrieving said candidate electronic document. - View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63)
-
-
64. A computer readable medium with executable instructions for semantic search for electronic documents stored on a computer readable media, and providing a search result in response to a query, comprising:
-
instructions for accessing a corpus of a plurality of electronic documents;
instructions for tagging said plurality of electronic documents in said corpus at a document level to identify general domain of each electronic document;
instructions for determining word usage patterns in said plurality of electronic documents in said corpus based at least partially on said tags of said plurality of electronic documents; and
instructions for generating an index of word usage patterns that indexes said plurality of documents in said corpus according to said word usage patterns and said domain tags of said plurality of electronic documents. - View Dependent Claims (65, 66, 67, 68)
-
Specification