ARTICLE AND METHOD OF AUTOMATICALLY FILTERING INFORMATION RETRIEVAL RESULTS USING TEXT GENRE
First Claim
1. A processor implemented method of searching a heterogeneous corpus of untagged machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including at least a first text genre and a second text genre, the corpus including a multiplicity of topics, the processor implemented method comprising the steps of:
- a) searching the corpus for a first multiplicity of texts that have a first topic;
b) identifying a first set of texts of the first multiplicity that are instances of the first text genre;
c) identifying a second set of texts of the first multiplicity that are instances of the second text genre; and
d) identifying the first multiplicity of texts to a computer user in an order based upon the first type and the second type of text genre.
7 Assignments
0 Petitions
Accused Products
Abstract
A method of filtering according to text genre the results of a topic search of a heterogeneous corpus of untagged, machine-readable texts. Because each text of the corpus has a topic and a text genre, the corpus includes multiple text genres and covers multiple topics. According to the method, a processor first searches the corpus for a first multiplicity of texts that have a first topic. Next, the processor identifies a first set of texts of the first multiplicity that are instances of a first text genre and identifies a second set of texts of the first multiplicity that are instances of a second text genre. Finally, the processor identifies to a computer user the first multiplicity of texts in an order based upon the first text genre and second text genre.
-
Citations
22 Claims
-
1. A processor implemented method of searching a heterogeneous corpus of untagged machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including at least a first text genre and a second text genre, the corpus including a multiplicity of topics, the processor implemented method comprising the steps of:
-
a) searching the corpus for a first multiplicity of texts that have a first topic;
b) identifying a first set of texts of the first multiplicity that are instances of the first text genre;
c) identifying a second set of texts of the first multiplicity that are instances of the second text genre; and
d) identifying the first multiplicity of texts to a computer user in an order based upon the first type and the second type of text genre. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An article of manufacture comprising:
-
a) a memory; and
b) instructions stored in the memory for a method of searching a heterogeneous corpus of untagged machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including at least a first text genre and a second text genre, the corpus including a multiplicity of topics, the method being implemented by a processor coupled to the memory, the method comprising the steps of;
1) searching the corpus for a first multiplicity of texts that have a first topic;
2) identifying a first set of texts of the first multiplicity that are instances of the first text genre;
3) identifying the first set of texts to a computer user.
-
-
12. A processor implemented method of searching a heterogeneous corpus of machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including a first multiplicity of text genres and a second multiplicity of topics, the processor implemented method comprising the steps of:
-
a) receiving from a computer user a search request for texts having a first topic and a first text genre, the search request also identifying a second text genre to be excluded;
b) identifying a third multiplicity of texts of the corpus having the first topic;
c) determining a text genre of each text of the third multiplicity of texts; and
d) identifying to the computer user those texts of the third multiplicity that are instances of the first text genre and not identifying any text of the third multiplicity that are instances of the second text genre.
-
-
13. An article of manufacture comprising:
-
a) a memory; and
b) instructions stored in the memory for a method of searching a heterogeneous corpus of machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including a first multiplicity of text genres and a second multiplicity of topics, the method being implemented by a processor coupled to the memory, the method comprising the steps of;
1) receiving from a computer user a search request for texts having a first topic and a first text genre, the search request also identifying a second text genre to be excluded;
2) identifying a third multiplicity of texts of the corpus having the first topic;
3) determining a text genre of each text of the third multiplicity of texts; and
4) identifying to the computer user those texts of the third multiplicity that are instances of the first text genre and not identifying any text of the third multiplicity that are instances of the second text genre. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. An article of manufacture comprising:
-
a) a memory; and
b) instructions stored in the memory for a method of searching a heterogeneous corpus of machine-readable texts, each text of the corpus having a text genre and a topic, the corpus including a first multiplicity of text genres and a second multiplicity of topics, the method being implemented by a processor coupled to the memory, the method comprising the steps of;
1) receiving from a computer user a search request for texts having a first topic and a first text genre to be excluded;
2) identifying a third multiplicity of texts of the corpus having the first topic;
3) determining a text genre of each text of the third multiplicity of texts; and
4) identifying to the computer user those texts of the third multiplicity that have a text genre other than the first text genre.
-
-
20. An article of manufacture comprising:
-
a) a memory; and
b) instructions stored in the memory for a method of searching a heterogeneous corpus of machine-readable texts, each text of the corpus having a topic and a facet value for each facet of a first multiplicity of facets, the corpus including a second multiplicity of topics, the method being implemented by a processor coupled to the memory, the method comprising the steps of;
1) receiving from a computer user a search request for texts having a first topic and a first value of a first facet of the first multiplicity of facets;
2) identifying a third multiplicity of texts of the corpus having the first topic;
3) for each text of the third multiplicity determining for a value of the first facet; and
4) identifying to the computer user those texts of the third multiplicity that have the first value of the first facet. - View Dependent Claims (21, 22)
-
Specification