Bayesian visual interactive search
First Claim
1. A method for user identification of a desired document, comprising:
- providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space;
calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document;
a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; and
for each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1) and in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, identifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on Posterior probability scores for at least a portion of the documents in the candidate list, Ni being smaller than the number of documents in the candidate list, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for identifying a desired document is provided to include calculating a Prior probability score for each document of a candidate list including a portion of documents of an embedding space, the Prior probability score indicating a preliminary probability, for each document of the candidate list, that the document is the desired document, and identifying an initial (i=0) collection of N0>1 candidate documents from the candidate list in dependence on the calculated Prior probability scores, the initial collection of candidate documents having fewer documents than the candidate list. The method further includes, for each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1) and in response to user selection of an i'"'"'th selected document from the (i−1)'"'"'th collection of candidate documents, identifying an i'"'"'th collection of Ni>1 candidate documents from the candidate list in dependence on Posterior probability scores.
-
Citations
45 Claims
-
1. A method for user identification of a desired document, comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1) and in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, identifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on Posterior probability scores for at least a portion of the documents in the candidate list, Ni being smaller than the number of documents in the candidate list, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A method for user identification of a desired document, comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents of the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, calculating a Posterior probability score for each document of the candidate list, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the calculated Posterior probability scores for the documents in the candidate list, Ni being smaller than the number of documents of the candidate list,wherein the calculated Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
32. A method for user identification of a desired document, comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; initializing a neighbor graph G to include a corresponding vertex for each document of the candidate list; developing a plurality of tours each touching all the vertices in the neighbor graph G corresponding to documents of the candidate list, including, for each k'"'"'th tour of the plurality of tours, beginning with a first (k=1) tour; determining a perpendicular projection of each document of the candidate list onto a k'"'"'th randomly oriented projection line that passes through an origin of the embedding space, inserting into the neighbor graph G an edge from each particular vertex of G, to all vertices of the particular vertex whose projections onto the k'"'"'th line neighbor the projection of the particular vertex onto the k'"'"'th line, and inserting into the neighbor graph G an edge between the vertices corresponding to two documents of the candidate list having the outermost projections onto the k'"'"'th line, a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, inserting documents into an i'"'"'th collection of Ni>
1 candidate documents in dependence on a walk through the neighbor graph G beginning with the vertex in the graph corresponding to the it’
selected document and guided by Posterior probability scores calculated for documents corresponding to vertices encountered during the walk, Ni being smaller than the number of documents in the candidate list, andidentifying toward the user the i'"'"'th collection of candidate documents from the candidate list, wherein the Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
33. A method for user identification of a desired document, comprising:
-
providing, accessibly to a computer system, a database identifying (i) a catalog of documents in an embedding space and (ii) a distance between each pair of the documents in the embedding space and the distance corresponds to a predetermined measure of dissimilarity between the pair of documents; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, assigning a weight w to each given document D in dependence on a Posterior probability score for each given document D, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the assigned weights w, the Posterior probability scores and a distance from each given document D to a closest document in the candidate list, such that i'"'"'th collection is a collection of Ni>
1 documents from the candidate list having a lowest weighted average of distances to closest documents that are weighted based on the corresponding weights w and Posterior probability scores, Ni being smaller than the number of documents of the candidate list,wherein, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
34. A non-transitory computer-readable storage medium impressed with computer program instructions to provide user identification of a desired document, the instructions, when executed on a processor, implement a method comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1) and in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, identifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on Posterior probability scores for at least a portion of the documents in the candidate list, Ni being smaller than the number of documents in the candidate list, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D. - View Dependent Claims (35, 36)
-
-
37. A non-transitory computer-readable storage medium impressed with computer program instructions to provide user identification of a desired document, the instructions, when executed on a processor, implement a method comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents of the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, calculating a Posterior probability score for each document of the candidate list, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the calculated Posterior probability scores for the documents in the candidate list, Ni being smaller than the number of documents of the candidate list,wherein the calculated Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
38. A non-transitory computer-readable storage medium impressed with computer program instructions to provide user identification of a desired document, the instructions, when executed on a processor, implement a method comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; initializing a neighbor graph G to include a corresponding vertex for each document of the candidate list; developing a plurality of tours each touching all the vertices in the neighbor graph G corresponding to documents of the candidate list, including, for each k'"'"'th tour of the plurality of tours, beginning with a first (k=1) tour; determining a perpendicular projection of each document of the candidate list onto a k'"'"'th randomly oriented projection line that passes through an origin of the embedding space, inserting into the neighbor graph G an edge from each particular vertex of G, to all vertices of the particular vertex whose projections onto the k'"'"'th line neighbor the projection of the particular vertex onto the k'"'"'th line, and inserting into the neighbor graph G an edge between the vertices corresponding to two documents of the candidate list having the outermost projections onto the k'"'"'th line, a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, inserting documents into an i'"'"'th collection of Ni>
1 candidate documents in dependence on a walk through the neighbor graph G beginning with the vertex in the graph corresponding to the it’
selected document and guided by Posterior probability scores calculated for documents corresponding to vertices encountered during the walk, Ni being smaller than the number of documents in the candidate list, andidentifying toward the user the i'"'"'th collection of candidate documents from the candidate list, wherein the Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
39. A non-transitory computer-readable storage medium impressed with computer program instructions to provide user identification of a desired document, the instructions, when executed on a processor, implement a method comprising:
-
providing, accessibly to a computer system, a database identifying (i) a catalog of documents in an embedding space and (ii) a distance between each pair of the documents in the embedding space and the distance corresponds to a predetermined measure of dissimilarity between the pair of documents; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, assigning a weight w to each given document D in dependence on a Posterior probability score for each given document D, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the assigned weights w, the Posterior probability scores and a distance from each given document D to a closest document in the candidate list, such that i'"'"'th collection is a collection of Ni>
1 documents from the candidate list having a lowest weighted average of distances to closest documents that are weighted based on the corresponding weights w and Posterior probability scores, Ni being smaller than the number of documents of the candidate list,wherein, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
40. A system including one or more processors coupled to memory, the memory loaded with computer instructions to provide for user identification of a desired document, the instructions, when executed on the processors, implement actions comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1) and in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, identifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on Posterior probability scores for at least a portion of the documents in the candidate list, Ni being smaller than the number of documents in the candidate list, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D. - View Dependent Claims (41, 42)
-
-
43. A system including one or more processors coupled to memory, the memory loaded with computer instructions to provide for user identification of a desired document, the instructions, when executed on the processors, implement actions comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents of the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, calculating a Posterior probability score for each document of the candidate list, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the calculated Posterior probability scores for the documents in the candidate list, Ni being smaller than the number of documents of the candidate list,wherein the calculated Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
44. A system including one or more processors coupled to memory, the memory loaded with computer instructions to provide for user identification of a desired document, the instructions, when executed on the processors, implement actions comprising:
-
providing, accessibly to a computer system, a database identifying a catalog of documents in an embedding space; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; initializing a neighbor graph G to include a corresponding vertex for each document of the candidate list; developing a plurality of tours each touching all the vertices in the neighbor graph G corresponding to documents of the candidate list, including, for each k'"'"'th tour of the plurality of tours, beginning with a first (k=1) tour; determining a perpendicular projection of each document of the candidate list onto a k'"'"'th randomly oriented projection line that passes through an origin of the embedding space, inserting into the neighbor graph G an edge from each particular vertex of G, to all vertices of the particular vertex whose projections onto the k'"'"'th line neighbor the projection of the particular vertex onto the k'"'"'th line, and inserting into the neighbor graph G an edge between the vertices corresponding to two documents of the candidate list having the outermost projections onto the k'"'"'th line, a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of candidate documents, inserting documents into an i'"'"'th collection of Ni>
1 candidate documents in dependence on a walk through the neighbor graph G beginning with the vertex in the graph corresponding to the it’
selected document and guided by Posterior probability scores calculated for documents corresponding to vertices encountered during the walk, Ni being smaller than the number of documents in the candidate list, andidentifying toward the user the i'"'"'th collection of candidate documents from the candidate list, wherein the Posterior probability score for each given document D is given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
-
45. A system including one or more processors coupled to memory, the memory loaded with computer instructions to provide for user identification of a desired document, the instructions, when executed on the processors, implement actions comprising:
-
providing, accessibly to a computer system, a database identifying (i) a catalog of documents in an embedding space and (ii) a distance between each pair of the documents in the embedding space and the distance corresponds to a predetermined measure of dissimilarity between the pair of documents; calculating a Prior probability score for each document of a candidate list including at least a portion of the documents of the embedding space, the Prior probability score indicating a preliminary probability, for each particular document of the candidate list, that the particular document is the desired document; a computer system identifying toward the user an initial (i=0) collection of N0>
1 candidate documents from the candidate list in dependence on the calculated Prior probability scores for the documents in the candidate list, the initial collection of candidate documents having fewer documents than the candidate list; andfor each i'"'"'th iteration in a plurality of iterations, beginning with a first iteration (i=1); in response to user selection of an i'"'"'th selected document from the (i−
1)'"'"'th collection of documents, assigning a weight w to each given document D in dependence on a Posterior probability score for each given document D, andidentifying toward the user an i'"'"'th collection of Ni>
1 candidate documents from the candidate list in dependence on the assigned weights w, the Posterior probability scores and a distance from each given document D to a closest document in the candidate list, such that i'"'"'th collection is a collection of Ni>
1 documents from the candidate list having a lowest weighted average of distances to closest documents that are weighted based on the corresponding weights w and Posterior probability scores, Ni being smaller than the number of documents of the candidate list,wherein, the Posterior probability score for each given document D being given by P(C|D)P(D), where C is the sequence of documents c1, . . . , ci selected by the user up through the i'"'"'th iteration, where P(C|D) is the system'"'"'s view of the probability of C if the desired document is D and where P(D) is the calculated Prior probability score for document D.
-
Specification