Recording and providing for display images of events associated with power equipment
First Claim
1. A method, comprising:
recording, via a wearable image recorder, images associated with a maintenance procedure;
recording, via a wearable audio recorder, audio associated with the maintenance procedure;
filtering images associated with events in the maintenance procedure from images not associated with events in the maintenance procedure based upon whether each image is associated with events in the maintenance procedure or is not associated with events in the maintenance procedure, the filtering comprising:
analyzing the audio and the images to identify a plurality of events, comprising a first event, a second event, a third event and a fourth event, of the maintenance procedure, the analyzing comprising:
identifying the first event based upon a non-verbal cue, in the audio, indicative of a vibration determined to indicate a failure of a first part in power equipment based upon a match between the non-verbal cue and a first audio indication paired with the first event in an audio library;
identifying the second event based upon a vocal change determined to indicate an increase in stress associated with a technician based upon a match between the vocal change and a second audio indication paired with the second event in the audio library;
identifying the third event based upon a keyword recognized via voice recognition in the audio, the identifying the third event based upon a match between the keyword and a third audio indication paired with the third event in the audio library; and
identifying the fourth event based upon a trigger image, recognized via image analysis in the images, indicative of a start of the fourth event;
determining that a first portion of the images and a first portion of the audio are associated with the first event based upon an analysis of timestamps in metadata associated with the first portion of the images and timestamps in metadata associated with the first portion of the audio;
determining that a second portion of the images and a second portion of the audio are associated with the second event based upon an analysis of timestamps in metadata associated with the second portion of the images and timestamps in metadata associated with the second portion of the audio;
determining that a third portion of the images and a third portion of the audio are associated with the third event based upon an analysis of timestamps in metadata associated with the third portion of the images and timestamps in metadata associated with the third portion of the audio;
determining that a fourth portion of the images and a fourth portion of the audio are associated with the fourth event based upon an analysis of timestamps in metadata associated with the fourth portion of the images and timestamps in metadata associated with the fourth portion of the audio; and
determining that a fifth portion of the images and a fifth portion of the audio are associated with routine actions in the maintenance procedure based upon an analysis of timestamps in metadata associated with the fifth portion of the images and timestamps in metadata associated with the fifth portion of the audio;
generating a library of events associated with the power equipment, the generating comprising:
responsive to determining that the first portion of the images and the first portion of the audio are associated with the first event, storing the first portion of the images and the first portion of the audio in a database of the library in association with the first event;
responsive to determining that the second portion of the images and the second portion of the audio are associated with the second event, storing the second portion of the images and the second portion of the audio in the database of the library in association with the second event;
responsive to determining that the third portion of the images and the third portion of the audio are associated with the third event, storing the third portion of the images and the third portion of the audio in the database of the library in association with the third event;
responsive to determining that the fourth portion of the images and the fourth portion of the audio are associated with the fourth event, storing the fourth portion of the images and the fourth portion of the audio in the database of the library in association with the fourth event; and
responsive to determining that the fifth portion of the images and the fifth portion of the audio are associated with the routine actions in the maintenance procedure, discarding the fifth portion of the images and the fifth portion of the audio to mitigate storage space to be consumed by a presentation associated with the maintenance procedure and to mitigate a temporal length of the presentation;
responsive to receiving a request from a user for images corresponding to performance of an event, identifying, in the library, a portion of the images and a portion of the audio indicative of the event;
analyzing the event to identify one or more keywords associated with the event specified by the request from the user;
responsive to identifying the one or more keywords, searching one or more databases, based upon the one or more keywords, to identify one or more documents associated with the event;
responsive to identifying the one or more documents, programmatically annotating textual information associated with the one or more documents to an event image in the portion of the images indicative of the event;
generating an audio-visual depiction of performance of the event based upon a combination of the portion of the images indicative of the event, the portion of the audio indicative of the event and the annotated textual information; and
controlling a graphical user interface to cause a screen to display an image stream of the audio-visual depiction of performance of the event while controlling a speaker to play an audio stream of the audio-visual depiction of performance of the event.
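The keyword-based identification recited for the third event can be illustrated with a minimal sketch. It assumes the audio has already been transcribed into timestamped words by some voice recognition step (not shown), and it models the audio library as a plain dictionary. The names `AUDIO_LIBRARY` and `identify_events`, the keywords, and the event labels are hypothetical and are not taken from the patent.

```python
# Hypothetical audio library: each keyword is paired with an event label.
AUDIO_LIBRARY = {
    "torque": "tighten_bushing_bolts",
    "drain": "drain_oil",
}


def identify_events(transcript, library=AUDIO_LIBRARY):
    """Match recognized words against the library.

    transcript: list of (timestamp_seconds, word) pairs, assumed to come
    from a separate voice recognition step.
    Returns a list of (timestamp_seconds, event_label) pairs.
    """
    return [
        (ts, library[word.lower()])
        for ts, word in transcript
        if word.lower() in library
    ]


# Illustrative transcript; words not in the library are ignored.
transcript = [(3.0, "okay"), (12.5, "Drain"), (40.2, "torque"), (55.0, "done")]
events = identify_events(transcript)
print(events)
```

The timestamps returned here are what a later step could use to decide which portions of the images and audio belong to each event.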
Abstract
Among other things, one or more techniques and/or systems are provided for identifying event images captured during an event associated with power equipment, such as a maintenance action. An entity, such as a maintenance technician, is equipped with an image recorder and an audio recorder. In some embodiments, the image recorder and/or audio recorder are wearable, such as mounted to eyewear and/or headwear worn by the entity. Audio that is recorded by the audio recorder is processed to identify events (e.g., key actions in a maintenance procedure). Based upon the identification of an event from the audio, event images (e.g., images captured temporally proximate to when the event was identified in the audio) associated with the event are identified and stored in a library. In some embodiments, one or more of such event images are later displayed to entities via a heads-up display, for example.
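As a rough illustration of selecting images "captured temporally proximate to when the event was identified in the audio", the sketch below groups image frames by their distance in time from each detected event and treats the remainder as routine footage. The `Frame` type, the function names, and the 5-second window are assumptions made for this example only; the abstract does not specify a window size or data model.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    timestamp: float  # seconds since start of recording, from frame metadata
    name: str


def frames_near_event(frames, event_time, window=5.0):
    """Return the frames captured within `window` seconds of event_time."""
    return [f for f in frames if abs(f.timestamp - event_time) <= window]


def split_frames(frames, event_times, window=5.0):
    """Group frames per event; frames near no event are returned as routine."""
    by_event = {t: frames_near_event(frames, t, window) for t in event_times}
    kept = {id(f) for group in by_event.values() for f in group}
    routine = [f for f in frames if id(f) not in kept]
    return by_event, routine


# One frame every 4 seconds for a minute; events detected at 10 s and 41 s.
frames = [Frame(t, f"img_{t:03d}.jpg") for t in range(0, 60, 4)]
by_event, routine = split_frames(frames, event_times=[10.0, 41.0], window=5.0)
print({t: [f.name for f in group] for t, group in by_event.items()})
print(len(routine), "routine frames could be discarded")
```

In this sketch the per-event groups correspond to what would be stored in the library, and the routine list corresponds to footage that could be discarded to save storage and shorten any later presentation.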
20 Claims
1. A method, comprising: (recited in full under First Claim above)
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system, comprising:
an image recorder configured to capture images of power equipment associated with a maintenance procedure;
an audio recorder configured to capture audio associated with the maintenance procedure;
a filter configured to filter images associated with events in the maintenance procedure from images not associated with events in the maintenance procedure based upon whether each image is associated with events in the maintenance procedure or is not associated with events in the maintenance procedure, the filtering comprising:
analyzing the audio and the images to identify a plurality of events, comprising a first event, a second event, a third event, and a fourth event, of the maintenance procedure, the analyzing comprising:
identifying the first event based upon a non-verbal cue, in the audio, indicative of a vibration determined to indicate a failure of a first part in the power equipment based upon a match between the non-verbal cue and a first audio indication paired with the first event in an audio library;
identifying the second event based upon a vocal change determined to indicate an increase in stress associated with a technician based upon a match between the vocal change and a second audio indication paired with the second event in the audio library;
identifying the third event based upon a keyword recognized via voice recognition in the audio, the identifying the third event based upon a match between the keyword and a third audio indication paired with the third event in the audio library; and
identifying the fourth event based upon a trigger image, recognized via image analysis in the images, indicative of a start of the fourth event;
determining that a first portion of the images and a first portion of the audio are associated with the first event based upon an analysis of timestamps in metadata associated with the first portion of the images and timestamps in metadata associated with the first portion of the audio;
determining that a second portion of the images and a second portion of the audio are associated with the second event based upon an analysis of timestamps in metadata associated with the second portion of the images and timestamps in metadata associated with the second portion of the audio;
determining that a third portion of the images and a third portion of the audio are associated with the third event based upon an analysis of timestamps in metadata associated with the third portion of the images and timestamps in metadata associated with the third portion of the audio;
determining that a fourth portion of the images and a fourth portion of the audio are associated with the fourth event based upon an analysis of timestamps in metadata associated with the fourth portion of the images and timestamps in metadata associated with the fourth portion of the audio; and
determining that a fifth portion of the images and a fifth portion of the audio are associated with routine actions in the maintenance procedure based upon an analysis of timestamps in metadata associated with the fifth portion of the images and timestamps in metadata associated with the fifth portion of the audio;
a generator configured to generate a library of events associated with the power equipment, the generating comprising:
responsive to determining that the first portion of the images and the first portion of the audio are associated with the first event, storing the first portion of the images and the first portion of the audio in a database of the library in association with the first event;
responsive to determining that the second portion of the images and the second portion of the audio are associated with the second event, storing the second portion of the images and the second portion of the audio in the database of the library in association with the second event;
responsive to determining that the third portion of the images and the third portion of the audio are associated with the third event, storing the third portion of the images and the third portion of the audio in the database of the library in association with the third event;
responsive to determining that the fourth portion of the images and the fourth portion of the audio are associated with the fourth event, storing the fourth portion of the images and the fourth portion of the audio in the database of the library in association with the fourth event; and
responsive to determining that the fifth portion of the images and the fifth portion of the audio are associated with the routine actions in the maintenance procedure, discarding the fifth portion of the images and the fifth portion of the audio to mitigate storage space to be consumed by a presentation associated with the maintenance procedure and to mitigate a temporal length of the presentation; and
an image retrieval component configured to:
responsive to receiving a request from a user for images corresponding to performance of an event, identify, in the library, a portion of the images and a portion of the audio indicative of the event;
analyze the event to identify one or more keywords associated with the event specified by the request from the user;
responsive to identifying the one or more keywords, search one or more databases, based upon the one or more keywords, to identify one or more documents associated with the event;
responsive to identifying the one or more documents, programmatically annotate textual information associated with the one or more documents to an event image in the portion of the images indicative of the event;
generate an audio-visual depiction of performance of the event based upon a combination of the portion of the images indicative of the event, the portion of the audio indicative of the event and the annotated textual information; and
control a graphical user interface to cause a screen to display an image stream of the audio-visual depiction of performance of the event while controlling a speaker to play an audio stream of the audio-visual depiction of performance of the event.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A non-transitory computer-readable medium comprising instructions that when executed perform a method, comprising:
recording, via an image recorder, images associated with a maintenance procedure;
recording, via an audio recorder, audio associated with the maintenance procedure;
filtering images associated with events in the maintenance procedure from images not associated with events in the maintenance procedure based upon whether each image is associated with events in the maintenance procedure or is not associated with events in the maintenance procedure, the filtering comprising:
analyzing the audio and the images to identify a plurality of events, comprising a first event, a second event, a third event, and a fourth event, of the maintenance procedure, the analyzing comprising:
identifying the first event based upon a non-verbal cue, in the audio, indicative of a vibration determined to indicate a failure of a first part in power equipment based upon a match between the non-verbal cue and a first audio indication paired with the first event in an audio library;
identifying the second event based upon a vocal change determined to indicate an increase in stress associated with a technician based upon a match between the vocal change and a second audio indication paired with the second event in the audio library;
identifying the third event based upon a keyword recognized via voice recognition in the audio, the identifying the third event based upon a match between the keyword and a third audio indication paired with the third event in the audio library; and
identifying the fourth event based upon a trigger image, recognized via image analysis in the images, indicative of a start of the fourth event;
determining that a first portion of the images and a first portion of the audio are associated with the first event based upon an analysis of timestamps in metadata associated with the first portion of the images and timestamps in metadata associated with the first portion of the audio;
determining that a second portion of the images and a second portion of the audio are associated with the second event based upon an analysis of timestamps in metadata associated with the second portion of the images and timestamps in metadata associated with the second portion of the audio;
determining that a third portion of the images and a third portion of the audio are associated with the third event based upon an analysis of timestamps in metadata associated with the third portion of the images and timestamps in metadata associated with the third portion of the audio;
determining that a fourth portion of the images and a fourth portion of the audio are associated with the fourth event based upon an analysis of timestamps in metadata associated with the fourth portion of the images and timestamps in metadata associated with the fourth portion of the audio; and
determining that a fifth portion of the images and a fifth portion of the audio are associated with routine actions in the maintenance procedure based upon an analysis of timestamps in metadata associated with the fifth portion of the images and timestamps in metadata associated with the fifth portion of the audio;
generating a library of events associated with the power equipment, the generating comprising:
responsive to determining that the first portion of the images and the first portion of the audio are associated with the first event, storing the first portion of the images and the first portion of the audio in a database of the library in association with the first event;
responsive to determining that the second portion of the images and the second portion of the audio are associated with the second event, storing the second portion of the images and the second portion of the audio in the database of the library in association with the second event;
responsive to determining that the third portion of the images and the third portion of the audio are associated with the third event, storing the third portion of the images and the third portion of the audio in the database of the library in association with the third event;
responsive to determining that the fourth portion of the images and the fourth portion of the audio are associated with the fourth event, storing the fourth portion of the images and the fourth portion of the audio in the database of the library in association with the fourth event; and
responsive to determining that the fifth portion of the images and the fifth portion of the audio are associated with the routine actions in the maintenance procedure, discarding the fifth portion of the images and the fifth portion of the audio to mitigate storage space to be consumed by a presentation associated with the maintenance procedure and to mitigate a temporal length of the presentation;
responsive to receiving a request from a user for images corresponding to performance of an event, identifying, in the library, a portion of the images and a portion of the audio indicative of the event;
analyzing the event to identify one or more keywords associated with the event specified by the request from the user;
responsive to identifying the one or more keywords, searching one or more databases, based upon the one or more keywords, to identify one or more documents associated with the event;
responsive to identifying the one or more documents, programmatically annotating textual information associated with the one or more documents to an event image in the first portion of the images indicative of the event;
generating an audio-visual depiction of performance of the event based upon a combination of the portion of the images indicative of the event, the portion of the audio indicative of the event, and the annotated textual information; and
controlling a graphical user interface to cause a screen to display an image stream of the audio-visual depiction of performance of the event while controlling a speaker to play an audio stream of the audio-visual depiction of performance of the event.
Specification