Noise mitigation in vector space representations of item collections
First Claim
1. A computer-implemented method to mitigate noise in vector space representations of item collections, the method comprising:
- receiving, by one or more computing devices, a collection of items, each item characterized by a plurality of features;
preparing, by the one or more computing devices, a first vector space representative of the collection, wherein each item in the first vector space is represented by a vector that is a function of the features characterizing the item;
first identifying, by the one or more computing devices and based on information characteristic of the first vector space, only a first one of;
items contributing to first vector space noise, and features contributing to first vector space noise;
first weighting, by the one or more computing devices, the first identified items and features to decrease the influence of the first identified items and features on vector spaces representative of the collection;
preparing, by the one or more computing devices, a second vector space representative of the collection based on the first weighted items and features;
second identifying, based on information characteristic of the second vector space, the first one of;
items contributing to second vector space noise andfeatures contributing to second vector space noise;
second weighting the second identified items and features to decrease the influence of the second identified items and features on vector spaces representative of the collection; and
preparing a third vector space representative of the collection based on the cumulatively weighted items and features, wherein each item is represented by a vector that is sum of the vectors of the features comprising the item.
1 Assignment
0 Petitions
Accused Products
Abstract
Mitigation of noise in vector space representations of item collections for applications such as facial recognition and spam detection. After receiving a collection of items, each item characterized by a plurality of features, a first vector space representative of the collection is prepared. Each item in the first vector space is represented by a vector that is a function of the features characterizing the item. Based on information characteristic of the first vector space, at least one of items contributing to first vector space noise, and features contributing to first vector space noise are identified. The first identified items and features are weighted to decrease the influence of the first identified items and features on vector spaces representative of the collection. A second vector space representative of the collection is prepared based on the first weighted items and features.
16 Citations
18 Claims
-
1. A computer-implemented method to mitigate noise in vector space representations of item collections, the method comprising:
-
receiving, by one or more computing devices, a collection of items, each item characterized by a plurality of features; preparing, by the one or more computing devices, a first vector space representative of the collection, wherein each item in the first vector space is represented by a vector that is a function of the features characterizing the item; first identifying, by the one or more computing devices and based on information characteristic of the first vector space, only a first one of;
items contributing to first vector space noise, and features contributing to first vector space noise;first weighting, by the one or more computing devices, the first identified items and features to decrease the influence of the first identified items and features on vector spaces representative of the collection; preparing, by the one or more computing devices, a second vector space representative of the collection based on the first weighted items and features; second identifying, based on information characteristic of the second vector space, the first one of; items contributing to second vector space noise and features contributing to second vector space noise; second weighting the second identified items and features to decrease the influence of the second identified items and features on vector spaces representative of the collection; and preparing a third vector space representative of the collection based on the cumulatively weighted items and features, wherein each item is represented by a vector that is sum of the vectors of the features comprising the item. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product, comprising:
a non-transitory computer-readable storage device having computer-executable program instructions embodied thereon that when executed by a computer cause the computer to mitigate noise in vector space representations of item collections, the computer-executable program instructions comprising; computer-executable program instructions to receive a collection of items, each item characterized by a plurality of features; computer-executable program instructions to prepare a first vector space representative of the collection, wherein each item in the first vector space is represented by a vector that is a function of the features characterizing the item; computer-executable program instructions to first identify, based on information characteristic of the first vector space, only a first one of; items contributing to first vector space noise, and features contributing to first vector space noise; computer-executable program instructions to first weight the first identified items and features to decrease the influence of the first identified items and features on vector spaces representative of the collection; computer-executable program instructions to prepare a second vector space representative of the collection based on the first weighted items and features; and computer-executable program instructions to second identify, based on information characteristic of the second vector space, the first one of; items contributing to second vector space noise and features contributing to second vector space noise; computer-executable program instructions to second weight the second identified items and features to decrease the influence of the second identified items and features on vector spaces representative of the collection; and computer-executable program instructions to prepare a third vector space representative of the collection based on the cumulatively weighted items and features, wherein each item is represented by a vector that is sum of the vectors of the features comprising the item. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. A system to mitigate noise in vector space representations of item collections, the system comprising:
-
a storage device; and a processor communicatively coupled to the storage device, wherein the processor executes application code instructions that are stored in the storage device to cause the system to; receive a collection of items, each item characterized by a plurality of features; prepare a first vector space representative of the collection, wherein each item in the first vector space is represented by a vector that is a function of the features characterizing the item; first identify, based on information characteristic of the first vector space, only one of; items contributing to first vector space noise, and features contributing to first vector space noise; first weight the first identified items and features to decrease the influence of the first identified items and features on vector spaces representative of the collection; prepare a second vector space representative of the collection based on the first weighted items and features; second identify, based on information characteristic of the second vector space, the first one of; items contributing to second vector space noise and features contributing to second vector space noise; second weight the second identified items and features to decrease the influence of the second identified items and features on vector spaces representative of the collection; and prepare a third vector space representative of the collection based on the cumulatively weighted items and features, wherein each item is represented by a vector that is sum of the vectors of the features comprising the item. - View Dependent Claims (16, 17, 18)
-
Specification