Apparatus, systems and methods for presenting text identified in a video image
Abstract
Systems and methods are operable to present text identified in a presented video image of a media content event. An exemplary embodiment receives a complete video frame that is associated with a presented video image of a video content event, wherein the presented video image includes a region of text; finds the text in the complete video frame; uses an optical character recognition (OCR) algorithm to translate the found text; and presents the translated text. The translated text may be presented on a display concurrently with the video image that is presented on the display. Alternatively, or additionally, the translated text may be presented as audible speech emitted from at least one speaker.
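The four steps named in the abstract (receive a frame, find text, apply OCR, present the result) can be sketched as a small pipeline. The Python below is illustrative only: the `Frame` and `Box` types and the `find_text`, `recognize`, and `present` callables are hypothetical placeholders introduced for this sketch, not part of the patent or of any OCR library. A real system would plug in an actual text detector and OCR engine.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Sequence, Tuple

# Hypothetical stand-ins: a frame is a grid of pixel values,
# and a box is (left, top, width, height) in pixel coordinates.
Frame = Sequence[Sequence[int]]
Box = Tuple[int, int, int, int]


@dataclass
class TextResult:
    box: Box   # where the text was found in the frame
    text: str  # what the OCR step produced for that region


def present_text_in_frame(
    frame: Frame,
    find_text: Callable[[Frame], Iterable[Box]],
    recognize: Callable[[Frame, Box], str],
    present: Callable[[TextResult], None],
) -> List[TextResult]:
    """Find text regions in one complete frame, OCR each, present the results."""
    results: List[TextResult] = []
    for box in find_text(frame):       # step 2: find the text
        text = recognize(frame, box)   # step 3: OCR the found text
        if text:                       # skip regions where OCR produced nothing
            results.append(TextResult(box, text))
    for result in results:             # step 4: present the translated text
        present(result)
    return results
```

Because `present` is just a callable, the same sketch covers both presentation options the abstract mentions: one implementation could draw the text on the display, another could hand it to a speech synthesizer.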
20 Claims
1. A method of presenting text identified in a presented video image of a media content event, the method comprising:
receiving a complete video frame that is associated with a presented video image of a captured scene of a video content event, wherein the presented video image includes text disposed on an object that has been captured in the scene;
finding the text on the object that is part of the captured scene in the complete video frame;
using an optical character recognition (OCR) algorithm to translate the found text on the object into translated text; and
presenting the translated text associated with the text on the object that is part of the captured scene.
Dependent claims: 8, 9, 10, 12, 13, 14, 15, 16
2. A method of presenting text identified in a presented video image of a media content event, the method comprising:
receiving a complete video frame that is associated with a presented video image of a captured scene of a video content event, wherein the presented video image includes text that has been captured in the scene;
finding the text in the complete video frame based on a text search region of the presented video image, wherein a location of the text search region is user specified based on a received signal from a remote control that initiates presentation of a pointer icon, wherein a location of the pointer icon defines a location of the text search region on the presented video image;
using an optical character recognition (OCR) algorithm to translate the found text; and
presenting the translated text.
Dependent claims: 3, 4, 5, 6, 7
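One possible reading of the pointer-defined text search region in claim 2 is a fixed-size rectangle centered on the pointer icon and clamped so that it stays inside the video image. The sketch below assumes that reading. The claim does not specify the region's shape or size, so the `Region` type, the `search_region_at` function, and the default 200 by 80 pixel size are all hypothetical.

```python
from typing import NamedTuple


class Region(NamedTuple):
    left: int
    top: int
    width: int
    height: int


def search_region_at(
    pointer_x: int,
    pointer_y: int,
    frame_w: int,
    frame_h: int,
    region_w: int = 200,  # illustrative default, not from the claim
    region_h: int = 80,   # illustrative default, not from the claim
) -> Region:
    """Return a search region centered on the pointer, kept inside the frame."""
    # Never let the region be larger than the frame itself.
    w = min(region_w, frame_w)
    h = min(region_h, frame_h)
    # Center on the pointer, then clamp so the region does not leave the frame.
    left = min(max(pointer_x - w // 2, 0), frame_w - w)
    top = min(max(pointer_y - h // 2, 0), frame_h - h)
    return Region(left, top, w, h)
```

With a 1920 by 1080 frame, a pointer at the center yields `Region(860, 500, 200, 80)`, while a pointer near the top-left corner is clamped to `Region(0, 0, 200, 80)`. Only the pixels inside the returned region would then be searched for text.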
11. A method of presenting text identified in a presented video image of a media content event, the method comprising:
receiving a complete video frame that is associated with the presented video image of a captured scene of a video content event, wherein the presented video image includes text that has been captured in the scene;
finding the text in the complete video frame;
using an optical character recognition (OCR) algorithm to translate the found text; and
presenting the translated text on a display concurrently with the video image in a text balloon on the display at a location that overlays the presented video image, wherein a pointer portion of the text balloon indicates a location of the found text in the presented video image.
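The text balloon of claim 11 needs two pieces of geometry: where the balloon body overlays the image, and where its pointer portion touches the found text. The following sketch shows one way this could be computed. It assumes a simple policy (balloon above the text when there is room, otherwise below) that the claim itself does not prescribe. `Rect`, `Balloon`, `place_balloon`, and the `gap` parameter are hypothetical names.

```python
from typing import NamedTuple, Tuple


class Rect(NamedTuple):
    left: int
    top: int
    width: int
    height: int


class Balloon(NamedTuple):
    body: Rect                    # where the balloon overlays the video image
    pointer_tip: Tuple[int, int]  # point on the found text the balloon points at


def place_balloon(
    text_box: Rect,
    balloon_w: int,
    balloon_h: int,
    frame_w: int,
    frame_h: int,
    gap: int = 12,  # illustrative spacing between balloon body and text
) -> Balloon:
    """Place a balloon near the found text and aim its pointer at that text."""
    center_x = text_box.left + text_box.width // 2
    # Center the balloon horizontally over the text, clamped to the frame.
    left = min(max(center_x - balloon_w // 2, 0), max(frame_w - balloon_w, 0))
    top_if_above = text_box.top - gap - balloon_h
    if top_if_above >= 0:
        # Enough room above: pointer tip touches the top edge of the text.
        top = top_if_above
        tip = (center_x, text_box.top)
    else:
        # Otherwise place the balloon below; tip touches the bottom edge.
        below = text_box.top + text_box.height + gap
        top = min(below, max(frame_h - balloon_h, 0))
        tip = (center_x, text_box.top + text_box.height)
    return Balloon(Rect(left, top, balloon_w, balloon_h), tip)
```

A renderer would draw the balloon body at `body`, draw the pointer portion from the body's nearest edge to `pointer_tip`, and composite both over the presented video image.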
17. A media device, comprising:
a media content stream interface that receives a media content event comprising a stream of video frames that are serially presented, wherein each video frame includes a video image of an object that is part of a captured scene of the media content event, wherein the object that is part of the captured scene includes text thereon;
a presentation device interface that communicates the stream of video frames to a display of a media presentation device; and
a processor system communicatively coupled to the media content stream interface and the presentation device interface, wherein the processor system is configured to:
select a complete video frame from the received stream of video frames;
find the text on the object in the video image of the captured scene of the selected complete video frame;
translate the found text on the object using an optical character recognition (OCR) algorithm into translated text; and
communicate the translated text to the display via the presentation device interface, wherein the translated text associated with the text on the object that is part of the captured scene is presented on the display.
18. A media device, comprising:
a media content stream interface that receives a media content event comprising a stream of video frames that are serially presented, wherein each video frame includes a video image of a captured scene;
a presentation device interface that communicates the stream of video frames to a display of a media presentation device;
a remote interface that receives a signal from at least one of a remote control and a remote device; and
a processor system communicatively coupled to the media content stream interface, the remote interface and the presentation device interface, wherein the processor system is configured to:
select a complete video frame from the received stream of video frames;
initiate presentation of a pointer icon in response to receiving a first signal from the remote control or the remote device, wherein a location of the pointer icon is associated with a location of a text search region on the presented video image;
find text in the text search region of the video image of the selected complete video frame;
translate the found text using an optical character recognition (OCR) algorithm;
communicate the translated text to the display via the presentation device interface; and
adjust the location of the pointer icon in response to receiving a second signal from the remote control or the remote device, wherein the adjusted location of the pointer icon adjusts the location of the text search region.
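The two-signal behavior in claim 18 (a first signal shows the pointer icon, a second signal moves it and thereby moves the text search region) can be modeled as a small state holder. This is a sketch under stated assumptions, not the device's actual firmware: the signal names (`SHOW_POINTER`, `LEFT`, `RIGHT`, `UP`, `DOWN`), the 20-pixel step, the initial pointer position at the frame center, and the region size are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

STEP = 20  # pixels moved per directional signal; illustrative value
MOVES = {"LEFT": (-STEP, 0), "RIGHT": (STEP, 0), "UP": (0, -STEP), "DOWN": (0, STEP)}


def clamp(value: int, low: int, high: int) -> int:
    return min(max(value, low), high)


@dataclass
class PointerState:
    frame_w: int
    frame_h: int
    region_w: int = 200  # illustrative search-region size
    region_h: int = 80
    pointer: Optional[Tuple[int, int]] = None  # None until the first signal

    def handle_signal(self, signal: str) -> None:
        if signal == "SHOW_POINTER":
            # First signal: initiate the pointer icon (here, at frame center).
            self.pointer = (self.frame_w // 2, self.frame_h // 2)
        elif signal in MOVES and self.pointer is not None:
            # Second signal: adjust the pointer, keeping it inside the frame.
            dx, dy = MOVES[signal]
            x = clamp(self.pointer[0] + dx, 0, self.frame_w - 1)
            y = clamp(self.pointer[1] + dy, 0, self.frame_h - 1)
            self.pointer = (x, y)

    def search_region(self) -> Optional[Tuple[int, int, int, int]]:
        """Return (left, top, width, height) following the pointer, or None."""
        if self.pointer is None:
            return None
        x, y = self.pointer
        left = clamp(x - self.region_w // 2, 0, max(self.frame_w - self.region_w, 0))
        top = clamp(y - self.region_h // 2, 0, max(self.frame_h - self.region_h, 0))
        return (left, top, self.region_w, self.region_h)
```

Because `search_region` is derived from the pointer each time it is called, any adjustment to the pointer location automatically adjusts the region in which text is sought, which mirrors the final limitation of the claim.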
19. A method of operating a media device, the method comprising:
presenting a video image of a captured scene of a media content event, wherein the captured scene includes an object that is part of the captured scene, wherein the object has visible text thereon;
receiving an input that activates a text translation mode of operation of the media device;
selecting a complete video frame corresponding to the video image in response to activation of the text translation mode of operation;
finding the text on the object in the complete video frame based on a text search region of the presented video image, wherein the text search region encompasses at least part of the text;
using an optical character recognition (OCR) algorithm to translate the found text on the object into translated text; and
presenting the translated text associated with the text on the object that is part of the captured scene.
Dependent claims: 20
Specification