INFORMATION RETRIEVAL USING SUBJECT-AWARE DOCUMENT RANKER
First Claim
1. Computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method of determining a document score, which suggests a relevance of a document to a search query, the method comprising:
- receiving the search query comprised of one or more terms that represent a subject;
identifying an equivalent subject that is semantically similar to the subject, wherein the subject and the equivalent subject comprise a subject group; and
determining the document score of the document,wherein the document score is comprised of a subject-group score, andwherein the subject-group score is calculated using both a subject frequency, which includes a number of times the subject is found in the document, and an equivalent-subject frequency, which includes a number of times the equivalent-subject is found in the document.
2 Assignments
0 Petitions
Accused Products
Abstract
Subject matter described herein is related to determining a document score, which suggests a relevance of a document (e.g., webpage) to a search query. For example, a search query is received that is comprised of one or more terms, which represent a subject. An equivalent subject is identified that is semantically similar to the subject. The document score is determined by accounting for both a subject frequency and an equivalent-subject frequency.
18 Citations
20 Claims
-
1. Computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method of determining a document score, which suggests a relevance of a document to a search query, the method comprising:
-
receiving the search query comprised of one or more terms that represent a subject; identifying an equivalent subject that is semantically similar to the subject, wherein the subject and the equivalent subject comprise a subject group; and determining the document score of the document, wherein the document score is comprised of a subject-group score, and wherein the subject-group score is calculated using both a subject frequency, which includes a number of times the subject is found in the document, and an equivalent-subject frequency, which includes a number of times the equivalent-subject is found in the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. The computer-storage media of claim 11, wherein the document score of the document is comprised of a sum of the plurality of subject-group scores.
-
12. Computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method of determining a document score, which suggests a relevance of a document to a search query, the method comprising:
-
receiving the search query comprised of one or more terms that represent a first subject and a second subject; identifying a first equivalent subject that is semantically similar to the first subject and a second equivalent subject that is semantically similar to the second subject, wherein each pair of a subject and an equivalent subject comprises a respective subject group; determining a first-subject-group frequency comprised of both a first-subject frequency, which includes a number of times the first subject is found in the document, and a first-equivalent-subject frequency, which includes a number of times the first equivalent subject is found in the document; determining a second-subject-group frequency comprised of both a second-subject frequency, which includes a number of times the second subject is found in the document, and a second-equivalent-subject frequency, which includes a number of times the second equivalent subject is found in the document; and calculating the document score of the document, wherein the document score is comprised of a first-subject-group score and a second-subject-group score, and wherein each subject-group score is calculated by applying a saturation function using a respective subject-group frequency. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
-
19. A computer system that determines a document score, which suggests a relevance of a document to a search query, the computer system including a processor coupled to computer-storage media, which includes computer software executable by the processor, the computer software comprising:
-
a search-query receiver that receives the search query, which is comprised of one or more terms that represent a subject; an equivalent-subject identifier that references an equivalent-subject datastore to identify an equivalent subject, which is semantically similar to the subject, wherein the subject and the equivalent subject comprise a subject group; and a document ranker that calculates the document score of the document, wherein the document ranker; (A) determines a subject-group frequency including both a subject frequency, which includes a number of times the subject is found in the document, and an equivalent-subject frequency, which includes a number of times the equivalent subject is found in the document, and (B) calculates the document score by applying a saturation function using the subject-group frequency. - View Dependent Claims (20)
-
Specification