System and method for appearance search
DCFirst Claim
1. An appearance search system comprising:
- one or more cameras configured to capture video of a scene, the video having images of objects, wherein at least one of the one or more cameras is further configured to identify, using a first learning machine at the camera, one or more of the objects within the images of the objects;
one or more processors and memory comprising computer program code stored on the memory; and
a network configured to send images comprising the one or more identified objects from the camera to the one or more processors,wherein the computer program code is configured when executed by the one or more processors to cause the one or more processors to perform a method comprising;
generating, as output from a second learning machine, one or more signatures of the respective one or more identified objects and a signature of an object of interest;
comparing the one or more signatures of the respective one or more identified objects with the signature of the object of interest to generate one or more similarity scores for the respective one or more identified objects; and
transmitting an instruction for presenting on a display one or more of the images of the one or more identified objects based on the one or more similarity scores.
3 Assignments
Litigations
0 Petitions
Accused Products
Abstract
There is provided an appearance search system comprising one or more cameras configured to capture video of a scene, the video having images of objects. The system comprises one or more processors and memory comprising computer program code stored on the memory and configured when executed by the one or more processors to cause the one or more processors to perform a method. The method comprises identifying one or more of the objects within the images of the objects. The method further comprises implementing a learning machine configured to generate signatures of the identified objects and generate a signature of an object of interest. The system further comprises a network configured to send the images of the objects from the camera to the one or more processors. The method further comprises comparing the signatures of the identified objects with the signature of the object of interest to generate similarity scores for the identified objects, and transmitting an instruction for presenting on a display one or more of the images of the objects based on the similarity scores.
21 Citations
16 Claims
-
1. An appearance search system comprising:
-
one or more cameras configured to capture video of a scene, the video having images of objects, wherein at least one of the one or more cameras is further configured to identify, using a first learning machine at the camera, one or more of the objects within the images of the objects; one or more processors and memory comprising computer program code stored on the memory; and a network configured to send images comprising the one or more identified objects from the camera to the one or more processors, wherein the computer program code is configured when executed by the one or more processors to cause the one or more processors to perform a method comprising; generating, as output from a second learning machine, one or more signatures of the respective one or more identified objects and a signature of an object of interest; comparing the one or more signatures of the respective one or more identified objects with the signature of the object of interest to generate one or more similarity scores for the respective one or more identified objects; and transmitting an instruction for presenting on a display one or more of the images of the one or more identified objects based on the one or more similarity scores. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer-readable medium having stored thereon computer program code executable by one or more processors and configured when executed by the one or more processors to cause the one or more processors to perform a method comprising:
- receiving images of one or more identified objects, the one or more identified objects having been identified by a first learning machine at a video camera that captured video of a scene, the video having the one or more images;
generating, as output from a second learning machine, one or more signatures of the respective one or more identified objects, and a signature of an object of interest;
generating one or more similarity scores for the respective one or more identified objects by comparing the one or more signatures of the respective one or more identified objects with the signature of the object of interest; and
presenting on a display one or more of the images of the one or more identified objects based on the one or more similarity scores. - View Dependent Claims (7, 8, 9)
- receiving images of one or more identified objects, the one or more identified objects having been identified by a first learning machine at a video camera that captured video of a scene, the video having the one or more images;
-
10. A system comprising:
-
one or more cameras configured to capture video of a scene; and one or more processors and memory comprising computer program code stored on the memory and configured when executed by the one or more processors to cause the one or more processors to perform a method comprising; extracting chips from the video, wherein the chips comprise images of objects; for each of at least one of the chips; determining a confidence level for the chip; and if the confidence level does not meet a confidence requirement, then; identifying, using a first learning machine, multiple objects within the chip; dividing, using the first learning machine, the chip into multiple divided chips, each of the divided chips comprising at least a portion of one of the identified objects; and generating, using a second learning machine, respective feature vectors from the multiple divided chips;
orif the confidence level meets the confidence requirement, then using the second learning machine to generate a feature vector from the chip. - View Dependent Claims (11, 12)
-
-
13. A method comprising:
-
capturing video of a scene, the video having images of objects; identifying, using a first learning machine at a video camera, one or more of the objects within the images of the objects; generating, as output from a second learning machine, one or more signatures of the respective one or more identified objects and a signature of an object of interest; generating one or more similarity scores for the respective one or more identified objects by comparing the one or more signatures of the respective one or more identified objects with the signature of the object of interest; and presenting on a display one or more of the images of the one or more identified objects based on the one or more similarity scores. - View Dependent Claims (14, 15, 16)
-
Specification