MAILBOX SEARCH ENGINE USING QUERY MULTI-MODAL EXPANSION AND COMMUNITY-BASED SMOOTHING
Abstract
A retrieval method on a database of documents including text and names of participants associated with the documents includes: receiving a text query facet of keywords and a persons query facet of participant names; computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the keywords, a cross-modal expansion of the text query facet based on the participant names, and a topic expansion of the text query facet based on a topic model associating words and topics; computing an enriched persons query as an aggregation of the persons query facet, a monomodal expansion of the persons query facet based on the participant names, a cross-modal expansion of the persons query facet based on the keywords, and a community expansion of the persons query facet based on a community model associating persons and communities.
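The enriched text query described above can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy mailbox `DOCS`, the precomputed topic model `TOPICS`, the mixing weights `alpha`, `beta`, and `gamma`, the use of cosine similarity, and the particular way each expansion is scored.

```python
import math

# Toy mailbox (hypothetical data): word counts and participants per document.
DOCS = [
    {"words": {"budget": 2, "meeting": 1}, "people": {"alice", "bob"}},
    {"words": {"budget": 1, "forecast": 2}, "people": {"alice"}},
    {"words": {"picnic": 2, "meeting": 1}, "people": {"carol"}},
]

# Hypothetical topic model, assumed precomputed: one word distribution per topic.
TOPICS = [
    {"budget": 0.5, "forecast": 0.4, "meeting": 0.1},
    {"picnic": 0.7, "meeting": 0.3},
]


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def add_scaled(target, vec, scale):
    """In place: target += scale * vec (zero scales are skipped)."""
    if scale == 0:
        return
    for k, w in vec.items():
        target[k] = target.get(k, 0.0) + scale * w


def normalize(vec):
    """Rescale a sparse vector so its weights sum to 1."""
    total = sum(vec.values())
    return {k: w / total for k, w in vec.items()} if total else {}


def enrich_text_query(keywords, names, alpha=0.5, beta=0.5, gamma=0.5):
    """Aggregate the text facet with three expansions (assumed scoring)."""
    base = normalize({k: 1.0 for k in keywords})
    mono, cross, topic = {}, {}, {}
    for doc in DOCS:
        # Monomodal: words of documents textually similar to the keywords.
        add_scaled(mono, doc["words"], cosine(base, doc["words"]))
        # Cross-modal: words of documents involving the queried participants.
        add_scaled(cross, doc["words"], len(doc["people"] & set(names)))
    for dist in TOPICS:
        # Topic expansion: weight each topic by the mass it gives the keywords.
        add_scaled(topic, dist, sum(dist.get(k, 0.0) for k in keywords))
    enriched = dict(base)
    add_scaled(enriched, normalize(mono), alpha)
    add_scaled(enriched, normalize(cross), beta)
    add_scaled(enriched, normalize(topic), gamma)
    return enriched


def rank_documents(enriched):
    """Sort document indices by similarity to the enriched text query."""
    scores = [(cosine(enriched, d["words"]), i) for i, d in enumerate(DOCS)]
    return [i for _, i in sorted(scores, key=lambda s: (-s[0], s[1]))]
```

Calling `rank_documents(enrich_text_query(["budget"], ["alice"]))` on this toy data ranks the two budget-related documents ahead of the unrelated one, and the enriched query gains weight on "forecast" even though that word was not in the original keywords.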
20 Claims
1. A non-transitory computer-readable storage medium storing instructions executable by a computer to perform a retrieval method on a database of documents including text and names of participants associated with the documents by operations including:
receiving a multi-faceted retrieval query having a text query facet comprising one or more keywords and a persons query facet comprising one or more participant names;
computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the one or more keywords, a cross-modal expansion of the text query facet based on the one or more participant names, and a topic expansion of the text query facet based on a topic model associating words and topics;
computing an enriched persons query as an aggregation of the persons query facet, a monomodal expansion of the persons query facet based on the one or more participant names, a cross-modal expansion of the persons query facet based on the one or more keywords, and a community expansion of the persons query facet based on a community model associating persons and communities; and
performing ranking including at least one of:
(1) generating a ranking of documents by sorting similarities between the enriched text query and documents of the database, and
(2) generating a ranking of persons by sorting the enriched persons query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A retrieval method operating on a database of documents including text and names of participants associated with the documents, the retrieval method comprising:
receiving a retrieval query having at least one of a text query facet comprising one or more keywords and a persons query facet comprising one or more participant names;
performing query enrichment including at least one of:
(1) computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the one or more keywords, a cross-modal expansion of the text query facet based on the one or more participant names, and a topic expansion of the text query facet based on a topic model associating words and topics, and
(2) computing an enriched persons query as an aggregation of the persons query facet, a monomodal expansion of the persons query facet based on the one or more participant names, a cross-modal expansion of the persons query facet based on the one or more keywords, and a community expansion of the persons query facet based on a community model associating persons and communities; and
performing ranking including at least one of:
(1) generating a ranking of documents by sorting similarities between the enriched text query and documents of the database, and
(2) generating a ranking of persons by sorting the enriched persons query;
wherein the retrieval method is performed by a computer.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
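The persons side of the enrichment recited in this claim can be sketched in the same spirit. This is only an illustration under stated assumptions: the toy mailbox `DOCS`, the community model `COMMUNITIES` (assumed precomputed), the weights `alpha`, `beta`, and `gamma`, and the co-occurrence scoring are all hypothetical and are not specified by the claim.

```python
# Toy mailbox (hypothetical data): keywords and participants per document.
DOCS = [
    {"words": {"budget", "meeting"}, "people": {"alice", "bob"}},
    {"words": {"budget", "forecast"}, "people": {"alice", "dave"}},
    {"words": {"picnic"}, "people": {"carol"}},
]

# Hypothetical community model, assumed precomputed:
# one distribution over persons per community.
COMMUNITIES = [
    {"alice": 0.4, "bob": 0.3, "dave": 0.3},
    {"carol": 1.0},
]


def normalize(vec):
    """Rescale a sparse vector so its weights sum to 1."""
    total = sum(vec.values())
    return {k: w / total for k, w in vec.items()} if total else {}


def enrich_persons_query(keywords, names, alpha=0.5, beta=0.5, gamma=0.5):
    """Aggregate the persons facet with three expansions (assumed scoring)."""
    base = normalize({n: 1.0 for n in names})
    mono, cross, community = {}, {}, {}
    for doc in DOCS:
        shared = len(doc["people"] & set(names))      # queried names present
        matched = len(doc["words"] & set(keywords))   # queried keywords present
        for person in doc["people"]:
            if shared:
                # Monomodal: people who co-occur with the queried names.
                mono[person] = mono.get(person, 0.0) + shared
            if matched:
                # Cross-modal: people on documents matching the keywords.
                cross[person] = cross.get(person, 0.0) + matched
    for dist in COMMUNITIES:
        # Community expansion: weight each community by the mass it
        # assigns to the queried names, then spread it over its members.
        weight = sum(dist.get(n, 0.0) for n in names)
        if weight:
            for person, prob in dist.items():
                community[person] = community.get(person, 0.0) + weight * prob
    enriched = dict(base)
    for part, coef in ((mono, alpha), (cross, beta), (community, gamma)):
        for person, w in normalize(part).items():
            enriched[person] = enriched.get(person, 0.0) + coef * w
    return enriched


def rank_persons(enriched):
    """Rank persons by sorting the enriched persons query (ties by name)."""
    return sorted(enriched, key=lambda p: (-enriched[p], p))
```

In this sketch the ranking of persons is obtained directly by sorting the enriched query weights, which mirrors option (2) of the ranking step; no document similarity is computed.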
18. A non-transitory computer-readable storage medium storing instructions executable by a computer to generate a set of topics comprising word distributions and a set of communities comprising participant distributions by performing the non-negative matrix factorizations:
Specification