Method and system for looking up words on a display screen by OCR comprising a set of base forms of recognized inflected words
First Claim
1. A computer implemented method comprising:
- displaying text image content on a display screen;
detecting user input associated with a portion of the display screen;
establishing coordinates associated with the user input;
identifying a rectangular region of the text image content indicated by the coordinates, wherein the rectangular region contains text;
performing character recognition on the identified rectangular region to extract text from the text image content resulting in a recognition result;
comparing the recognition result with similar word forms in a first morphology dictionary to correct errors in the recognition result;
identifying the extracted text associated with the rectangular region of the text image content;
determining a set of base forms of any inflected form of the word in the extracted text using a second morphological dictionary;
performing a dictionary lookup based on the identified extracted text comprising;
determining a set of base forms of any inflected form of a word in the extracted text using a second morphological dictionary;
identifying a set of translations of the set of base forms from a first dictionary;
determining a result translation of the word from the retrieved set of translations, wherein the result translation is a most likely part of speech;
and displaying the result translation of the dictionary lookup on the display screen.
3 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the present invention disclose a dictionary lookup method and an electronic device that implements the dictionary lookup method. The dictionary lookup method allows a user to quickly obtain meanings and translations of words from electronic dictionaries while reading a text on a display screen of the electronic device, wherein reading text is utilized by performing an optical character recognition comprising of determining a set of base forms of each inflected recognized word. Advantageously, in one embodiment the meanings (e.g., the base forms) and translations may be displayed in a balloon, in a pop-up window, as subscript, as superscript, or in any other suitable manner when the user touches a word on the display screen, in one embodiment.
-
Citations
27 Claims
-
1. A computer implemented method comprising:
-
displaying text image content on a display screen; detecting user input associated with a portion of the display screen; establishing coordinates associated with the user input; identifying a rectangular region of the text image content indicated by the coordinates, wherein the rectangular region contains text; performing character recognition on the identified rectangular region to extract text from the text image content resulting in a recognition result; comparing the recognition result with similar word forms in a first morphology dictionary to correct errors in the recognition result; identifying the extracted text associated with the rectangular region of the text image content; determining a set of base forms of any inflected form of the word in the extracted text using a second morphological dictionary; performing a dictionary lookup based on the identified extracted text comprising; determining a set of base forms of any inflected form of a word in the extracted text using a second morphological dictionary; identifying a set of translations of the set of base forms from a first dictionary; determining a result translation of the word from the retrieved set of translations, wherein the result translation is a most likely part of speech; and displaying the result translation of the dictionary lookup on the display screen. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An electronic device comprising:
-
a processor; and a memory coupled to the processor, the memory storing instructions which when executed by the processor cause the processor to perform operations comprising; displaying text image content on a display screen; detecting user input associated with a portion of the display screen; establishing coordinates associated with the user input; identifying a rectangular region of the text image content indicated by the coordinates, wherein the rectangular region contains text; performing character recognition on the identified rectangular region to extract text from the text image content resulting in a recognition result; comparing the recognition result with similar word forms in a first morphology dictionary to correct errors in the recognition result; identifying the extracted text associated with the rectangular region of the text image content; determining a set of base forms of any inflected form of the word in the extracted text using a second morphological dictionary; performing a dictionary lookup based on the identified extracted text comprising; determining a set of base forms of any inflected form of a word in the extracted text using a second morphological dictionary; identifying a set of translations of the set of base forms from a first dictionary; determining a result translation of the word from the retrieved set of translations, wherein the result translation is a most likely part of speech; and displaying the result translation of the dictionary lookup on the display screen. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A non-transitory computer-readable medium having stored thereon a sequence of instruction which when executed by a system cause the system to perform a method, comprising:
-
displaying text image content on a display screen; detecting user input associated with a portion of the display screen; establishing coordinates associated with the user input; identifying a rectangular region of the text image content indicated by the coordinates, wherein the rectangular region contains text; performing character recognition on the identified rectangular region to extract text from the text image content resulting in a recognition result; comparing the recognition result with similar word forms in a first morphology dictionary to correct errors in the recognition result; identifying the extracted text associated with the rectangular region of the text image content; determining a set of base forms of any inflected form of the word in the extracted text using a second morphological dictionary; performing a dictionary lookup based on the identified extracted text comprising; determining a set of base forms of any inflected form of a word in the extracted text using a second morphological dictionary; identifying a set of translations of the set of base forms from a first dictionary; determining a result translation of the word from the retrieved set of translations, wherein the result translation is a most likely part of speech; and displaying the result translation of the dictionary lookup on the display screen. - View Dependent Claims (27)
-
Specification