Retrieval of structured documents
Abstract
This disclosure relates to performing a query for a search term against a database containing a plurality of structured documents. Structured documents that do not include the search term are filtered out during an initial search. Matched structured documents, that is, those that do contain the search term, are evaluated by ranking their individual elements according to how well each element matches the search term. The ranking of the individual elements is indicated to the user, who can then access those elements.
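The two stages described in the abstract (filter out documents lacking the term, then rank the elements of the remaining documents) can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the corpus layout, the `paragraph` tag name, the `search` and `element_text` helpers, and the raw term count used as the score are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET

def element_text(elem):
    """Return all text found under an element, lowercased."""
    return " ".join(elem.itertext()).lower()

def search(corpus, term):
    """Filter out documents lacking the term, then rank paragraph elements.

    corpus: dict mapping a document id to an XML string (hypothetical layout).
    Returns a list of (score, doc_id, paragraph_text), best match first.
    """
    term = term.lower()
    hits = []
    for doc_id, xml_text in corpus.items():
        root = ET.fromstring(xml_text)
        if term not in element_text(root).split():
            continue  # initial search: drop documents without the term
        for para in root.iter("paragraph"):
            words = element_text(para).split()
            score = words.count(term)  # placeholder score: raw term count
            if score:
                hits.append((score, doc_id, " ".join(words)))
    hits.sort(key=lambda h: h[0], reverse=True)
    return hits

# Hypothetical three-document corpus; d2 never mentions the term.
corpus = {
    "d1": "<document><section><paragraph>xml search and xml ranking</paragraph>"
          "<paragraph>unrelated text</paragraph></section></document>",
    "d2": "<document><section><paragraph>nothing relevant here</paragraph>"
          "</section></document>",
    "d3": "<document><section><paragraph>one xml mention</paragraph>"
          "</section></document>",
}
results = search(corpus, "xml")
```

Here `d2` is dropped by the initial filter, and the paragraphs of `d1` and `d3` that contain the term are returned in descending score order. Any per-element scoring function could replace the raw count.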
8 Claims
1. A computer-implemented method, comprising:
under control of one or more computing devices comprising one or more processors, ranking structured elements within a structured document, the structured document includes a document element or root element, at least one section element, and at least one paragraph element, the ranking including;
for each paragraph element, in which Weight(ti, Pj) stands for the weight of the term ti in the paragraph Pj, “tf(ti, Pj)” is the term frequency of ti in this paragraph, N denotes the number of documents in the corpus, and ni represents the number of documents containing the term ti, calculating the terms' weight according to the calculation;
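The calculation referred to in claim 1 does not appear in this text. The quantities it defines (a term frequency tf(ti, Pj), a corpus size N, and a document count ni) are the usual inputs of a tf-idf weight, so the sketch below assumes the common form tf(ti, Pj) × log(N / ni). That form is an assumption for illustration and may differ from the claimed calculation; the function name `term_weight` and its parameters are likewise hypothetical.

```python
import math

def term_weight(term, paragraph_tokens, corpus_size, docs_with_term):
    """Assumed tf-idf form: tf(ti, Pj) * log(N / ni).

    term:             the term ti
    paragraph_tokens: tokens of the paragraph Pj
    corpus_size:      N, the number of documents in the corpus
    docs_with_term:   ni, the number of documents containing ti
    """
    tf = paragraph_tokens.count(term)  # term frequency of ti in Pj
    if tf == 0 or docs_with_term == 0:
        return 0.0  # term absent from the paragraph or from the corpus
    return tf * math.log(corpus_size / docs_with_term)

# Example: "xml" occurs twice in the paragraph and in 10 of 100 documents.
tokens = "xml search ranks xml elements".split()
w = term_weight("xml", tokens, corpus_size=100, docs_with_term=10)
```

Under this assumed form, a term that occurs in every document (ni = N) receives weight zero, and rarer terms weigh more for the same term frequency.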
Specification