Method for voice assistant, location tagging, multi-media capture, transmission, speech to text conversion, photo/video image/object recognition, creation of searchable metatags/contextual tags, storage and search retrieval
DCFirst Claim
1. A method for capturing image and audio information for storage in a database at a location on a network, comprising the steps of:
- interfacing a microphone with an external audio information source that generates external audio information and converting with a first data converter the external audio information from the microphone,interfacing a camera with an external image source to capture an image therefrom;
the first data converter processing the captured external audio information and storing it in a first digital audio format as stored digital audio within the capture device, the camera for processing the captured image and storing it as a stored digital image;
capturing with a data capture device, as captured data, location information and time information associated with at least the capture of the image and storing the captured data as stored captured data;
combining with a data combiner for the stored digital audio, stored digital image and stored captured data as a composite data set;
encrypting the composite data set as an encrypted composite data set;
transmitting with a transmitter the encrypted composite data set to the location on the network; and
wherein a system disposed at the location on the network operates to;
receive the transmitted encrypted composite data set from the transmitter,decrypt the received encrypted composite data set as a decrypted composite data set to provide the decrypted composite data set as a received set of decrypted captured information,converting with a system data converter the received digital audio in the decrypted composite data set to a text based searchable file as a text context tag and creating an image recognition searchable context tag with image recognition of at least a portion of the digital image in the decrypted composite data set and associating the text and image recognition context tags with the digital image in the received decrypted composite data set, andstoring in the database the digital image in the decrypted composite data set in association with the text and image recognition context tags as a stored context based digital image and in association with the received captured data in the decrypted composite data set.
4 Assignments
Litigations
1 Petition
Accused Products
Abstract
This invention relates to a network interface device. A first capture device interfaces with a first external information source to capture first external information. A processor processes the captured first external information and stores it in a first media. The processor initiates the storage of the first captured information at an initial time and completes storage of the first captured information at a completion time, thus providing a stored defined set of first captured information. A transmitter transmits the defined set of stored captured information to a remote location on a network. A remote processing system is disposed at the remote node on the network and includes a database and a receiver for receiving the transmitted defined set of first captured information. A data converter is operable to convert the received defined set of first captured information to a second format. The database stores the set of converted captured information.
313 Citations
17 Claims
-
1. A method for capturing image and audio information for storage in a database at a location on a network, comprising the steps of:
-
interfacing a microphone with an external audio information source that generates external audio information and converting with a first data converter the external audio information from the microphone, interfacing a camera with an external image source to capture an image therefrom; the first data converter processing the captured external audio information and storing it in a first digital audio format as stored digital audio within the capture device, the camera for processing the captured image and storing it as a stored digital image; capturing with a data capture device, as captured data, location information and time information associated with at least the capture of the image and storing the captured data as stored captured data; combining with a data combiner for the stored digital audio, stored digital image and stored captured data as a composite data set; encrypting the composite data set as an encrypted composite data set; transmitting with a transmitter the encrypted composite data set to the location on the network; and wherein a system disposed at the location on the network operates to; receive the transmitted encrypted composite data set from the transmitter, decrypt the received encrypted composite data set as a decrypted composite data set to provide the decrypted composite data set as a received set of decrypted captured information, converting with a system data converter the received digital audio in the decrypted composite data set to a text based searchable file as a text context tag and creating an image recognition searchable context tag with image recognition of at least a portion of the digital image in the decrypted composite data set and associating the text and image recognition context tags with the digital image in the received decrypted composite data set, and storing in the database the digital image in the decrypted composite data set in association with the text and image recognition context tags as a stored context based digital image and in association with the received captured data in the decrypted composite data set. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for capturing image and audio information for storage, comprising:
executing a capture operation with a capture device by the steps of; providing internal storage; interfacing a microphone with an external audio information source that generates external audio information and converting with a first data converter the first external audio information from the microphone, interfacing a camera with an external image source to capture an image therefrom; the first data converter processing the captured external audio information and storing it in a first digital audio format as stored digital audio in internal storage within the capture device, the camera for processing the captured image and storing it as a stored digital image in internal storage; capturing with a data capture device, as captured data, location information, and time information associated with at least the capture of the image and storing the captured data as stored captured data; converting with a media data converter the received digital audio to a text based searchable file as a text context tag and creating an image recognition searchable context tag with image recognition of at least a portion of the digital image and associating the text and image recognition context tags with the digital image and the captured data, and storing in the internal storage the digital image in association with the text and image recognition context tags in addition to the stored captured data. - View Dependent Claims (7, 8, 9, 10, 11, 12)
-
13. A method for capturing image and audio information for storage, comprising the steps of:
-
providing internal storage; interfacing a microphone with an external audio information source that generates external audio information and converting with a first data converter the external audio information from the microphone; interfacing a camera with an image source to capture an image therefrom; capturing with a capture device, as captured data, location information and time information associated with at least the capture of the image and storing the captured data as stored captured data; the first data converter processing the captured external audio information and storing it in a first digital audio format as stored digital audio within the capture device, the camera for processing the captured image and storing it as a stored digital image; converting with a second data converter the received digital audio to a text based searchable file as a text context tag and creating an image recognition searchable context tag with image recognition of at least a portion of the digital image and associating the text and image recognition context tags with the digital image and with the stored captured data; and storing in the internal storage the digital image in association with the text and image recognition context tags in addition to the stored captured data. - View Dependent Claims (14, 15, 16, 17)
-
Specification