System and methods for ranking documents based on content characteristics
First Claim
1. A system for assessing information in natural language contents, comprising:
- a computer processing system configured to receive, from a user interface, an object name as a query term from a user; and
a computer storage configured to store an object-specific data set related to the object name and to store a plurality of documents containing text in a natural language, wherein the object-specific data set includes a plurality of property names and association-strength values, each property name being associated with an association-strength value, wherein the association strength values of the plurality of property names are above a predetermined threshold value, wherein the plurality of property names includes a first property name and a second property name,wherein the computer processing system is configured to count a first frequency of the first property name in one of the plurality of documents, to count a second frequency of the second property name in the one of the plurality of documents, to calculate a relevance score as a function of the first frequency and the second frequency, to rank the plurality of documents using their respective relevance scores, and to return one or more documents to the user interface based on the ranking of the plurality of documents.
1 Assignment
0 Petitions
Accused Products
Abstract
A system is described for assessing information in natural language contents. A user interface receives an object name as a query term and a value for a customized ranking parameter from a user. A computer storage device stores an object-specific data set related to the object name, wherein the object-specific data set includes a plurality of property names and association-strength values. A computer processing system can count a first frequency of a first property name and count a second frequency of a second property name in a document containing text in a natural language, calculate a relevance score as a function of the first frequency and the second frequency, and rank the plurality of documents using their respective relevance scores, and return one or more documents to the user based on the ranking of the plurality of documents. The function is in part defined by the customized ranking parameter.
12 Citations
20 Claims
-
1. A system for assessing information in natural language contents, comprising:
-
a computer processing system configured to receive, from a user interface, an object name as a query term from a user; and a computer storage configured to store an object-specific data set related to the object name and to store a plurality of documents containing text in a natural language, wherein the object-specific data set includes a plurality of property names and association-strength values, each property name being associated with an association-strength value, wherein the association strength values of the plurality of property names are above a predetermined threshold value, wherein the plurality of property names includes a first property name and a second property name, wherein the computer processing system is configured to count a first frequency of the first property name in one of the plurality of documents, to count a second frequency of the second property name in the one of the plurality of documents, to calculate a relevance score as a function of the first frequency and the second frequency, to rank the plurality of documents using their respective relevance scores, and to return one or more documents to the user interface based on the ranking of the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for assessing information in natural language contents, comprising:
-
a computer processing system configured to receive, from a user interface, an object name as a query term from a user; and a computer storage configured to store an object-specific data set related to the object name and to store a plurality of documents containing text in a natural language, wherein the object-specific data set includes a plurality of property names and association-strength values, each property name being associated with an association-strength value, wherein the computer processing system is configured to separate the plurality of property names in the object-specific data set into a first group and a second group, wherein the first group of one or more property names have their respective association strength values at or above a predetermined value, wherein the second group of one or more property names have their respective association strength values below the predetermined value, wherein the computer processing system is configured to count the frequencies of the property names in the first group and count the frequencies of the property names in the second group in each of the plurality of documents, wherein the computer processing system is configured to calculate a relevance score as a function of the frequencies of the property names in the first group and the frequencies of property names in the second group, wherein the computer processing system is configured to rank the plurality of documents using their respective relevance scores and to return one or more documents to the user interface based on the ranking of the plurality of documents. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A method for assessing information in natural language contents, comprising:
-
receiving an object name as a query term from a user interface by a computer processing system; retrieving an object-specific data set related to the object name from a computer storage system, wherein the object-specific data set includes a plurality of property names and association-strength values, each property name being associated with an association-strength value, wherein the association strength values of the plurality of property names are above a predetermined threshold value, wherein the plurality of property names includes a first property name and a second property name; retrieving, by the computer processing system, a plurality of documents containing text in a natural language; counting a first frequency of the first property name in one of the plurality of documents by the computer processing system; counting a second frequency of the second property name in the in one of the plurality of documents by the computer processing system; calculating a relevance score as a function of the first frequency and the second frequency; ranking the plurality of documents using their respective relevance scores; and returning one or more documents to the user interface based on the ranking of the plurality of documents. - View Dependent Claims (17, 18, 19, 20)
-
Specification