Generating alternative descriptions for images
Abstract
Mechanisms are provided for generating alternative text descriptions for images in electronic documents. An original image embedded in an electronic document is analyzed to generate a data pattern for the image. A matching operation is performed to identify similar images in other electronic documents from sources of electronic documents based on the generated data pattern. Textual description information associated with the similar images is extracted from data associated with the similar images. An alternative text description for the original image is generated based on the extracted textual description information associated with the similar images. The alternative text description for the original image is stored in association with the original image.
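The flow summarized in the abstract (analyze, match, extract, generate, store) can be illustrated with a minimal sketch. Nothing below is taken from the patent: the average-hash "data pattern", the Hamming-distance match, the `max_distance` threshold, the most-frequent-keyword generator, and every function name are assumptions chosen only to make the steps concrete.

```python
from collections import Counter

def data_pattern(pixels):
    """Assumed data pattern: an average hash, one bit per pixel,
    set when the pixel is brighter than the image mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if p > mean else 0 for p in flat)

def hamming(a, b):
    """Number of positions at which two equal-length patterns differ."""
    return sum(x != y for x, y in zip(a, b))

def find_similar(pattern, corpus, max_distance=1):
    """Matching operation: return the descriptions of corpus images whose
    pattern is within max_distance bits of the original's pattern.
    corpus is a list of (pattern, description) pairs."""
    return [desc for pat, desc in corpus if hamming(pattern, pat) <= max_distance]

def generate_alt_text(descriptions, top_n=2):
    """Build an alternative text from the most frequent words found in the
    descriptions of the similar images (hypothetical strategy)."""
    words = Counter(w for d in descriptions for w in d.lower().split())
    return " ".join(w for w, _ in words.most_common(top_n))

# Hypothetical 2x2 grayscale image and a tiny corpus of already-described images.
original = [[10, 200], [220, 30]]
corpus = [
    ((0, 1, 1, 0), "red sports car"),
    ((0, 1, 1, 1), "red car parked"),
    ((1, 0, 0, 1), "mountain lake"),
]

pattern = data_pattern(original)
similar = find_similar(pattern, corpus)
alt_text = generate_alt_text(similar)

# Store the alternative text in association with the original image.
store = {"original.png": alt_text}
```

A real system would likely use a more robust image signature and a proper text source (alt attributes, captions, surrounding text); the sketch only shows how the five steps connect.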
20 Claims
1. A method, in a data processing system comprising at least one processor and at least one memory, for generating alternative text descriptions for images in electronic documents, comprising:
analyzing, by the data processing system, an original image embedded in an electronic document to generate a data pattern for the image;
performing, by the data processing system, a matching operation to identify one or more similar images in other electronic documents from one or more sources of electronic documents based on the generated data pattern;
extracting, by the data processing system, textual description information associated with the one or more similar images from data associated with the one or more similar images;
generating, by the data processing system, an alternative text description for the original image based on the extracted textual description information associated with the one or more similar images;
storing, by the data processing system, the alternative text description for the original image in association with the original image;
generating a confidence level value for the alternative text description, wherein the confidence level value identifies a level of confidence that the alternative text description accurately describes content of the original image; and
storing the confidence level value in association with the alternative text description, wherein the confidence level value is generated based on scoring each keyword in the alternative text description according to a frequency of occurrence of the keyword in textual description information associated with the one or more similar images and wherein the confidence level value is generated using the following relationship:
Confidence level value = (v/(v+m))*R + (m/(v+m))*C, where R is an average score for the keywords in the alternative text description, v is a number of keywords in the alternative text description, m is a minimum number of keywords in the alternative text description required, and C is a mean confidence level value.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer program product, for generating alternative text descriptions for images in electronic documents, comprising a non-transitory computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
analyze an original image embedded in an electronic document to generate a data pattern for the image;
perform a matching operation to identify one or more similar images in other electronic documents from one or more sources of electronic documents based on the generated data pattern;
extract textual description information associated with the one or more similar images from data associated with the one or more similar images;
generate an alternative text description for the original image based on the extracted textual description information associated with the one or more similar images;
store the alternative text description for the original image in association with the original image;
generate a confidence level value for the alternative text description, wherein the confidence level value identifies a level of confidence that the alternative text description accurately describes content of the original image; and
store the confidence level value in association with the alternative text description, wherein the confidence level value is generated based on scoring each keyword in the alternative text description according to a frequency of occurrence of the keyword in textual description information associated with the one or more similar images and wherein the confidence level value is generated using the following relationship:
Confidence level value = (v/(v+m))*R + (m/(v+m))*C, where R is an average score for the keywords in the alternative text description, v is a number of keywords in the alternative text description, m is a minimum number of keywords in the alternative text description required, and C is a mean confidence level value.
Dependent claims: 9, 10, 11, 12, 13
14. An apparatus, comprising:
a processor; and
a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to:
analyze an original image embedded in an electronic document to generate a data pattern for the image;
perform a matching operation to identify one or more similar images in other electronic documents from one or more sources of electronic documents based on the generated data pattern;
extract textual description information associated with the one or more similar images from data associated with the one or more similar images;
generate an alternative text description for the original image based on the extracted textual description information associated with the one or more similar images;
store the alternative text description for the original image in association with the original image;
generate a confidence level value for the alternative text description, wherein the confidence level value identifies a level of confidence that the alternative text description accurately describes content of the original image; and
store the confidence level value in association with the alternative text description, wherein the confidence level value is generated based on scoring each keyword in the alternative text description according to a frequency of occurrence of the keyword in textual description information associated with the one or more similar images and wherein the confidence level value is generated using the following relationship:
Confidence level value = (v/(v+m))*R + (m/(v+m))*C, where R is an average score for the keywords in the alternative text description, v is a number of keywords in the alternative text description, m is a minimum number of keywords in the alternative text description required, and C is a mean confidence level value.
Dependent claims: 15, 16, 17, 18, 19, 20
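The confidence relationship recited in the independent claims is a weighted average of R and C: with few keywords (v small relative to m) the value stays close to the mean C, and with many keywords it approaches the average keyword score R. The sketch below works through that arithmetic. The formula itself comes from the claims; the scoring rule in `keyword_scores` (the fraction of similar-image descriptions that contain the keyword) is only one assumed reading of "frequency of occurrence", and all function names are hypothetical.

```python
def keyword_scores(alt_keywords, similar_descriptions):
    """Assumed scoring rule: a keyword's score is the fraction of
    similar-image descriptions in which it appears (0.0 to 1.0)."""
    token_sets = [set(d.lower().split()) for d in similar_descriptions]
    return [
        sum(kw.lower() in tokens for tokens in token_sets) / len(token_sets)
        for kw in alt_keywords
    ]

def confidence_level(scores, m, c):
    """Confidence level value = (v/(v+m))*R + (m/(v+m))*C, where
    v = number of keywords, R = average keyword score,
    m = minimum number of keywords required, C = mean confidence level value."""
    v = len(scores)
    if v + m == 0:
        raise ValueError("v + m must be greater than zero")
    r = sum(scores) / v if v else 0.0
    return (v / (v + m)) * r + (m / (v + m)) * c

# Worked example with hypothetical numbers: v = 3, R = 0.8, m = 2, C = 0.5
# gives (3/5)*0.8 + (2/5)*0.5.
example = confidence_level([0.8, 0.6, 1.0], m=2, c=0.5)

# Scores derived from hypothetical similar-image descriptions.
scores = keyword_scores(["red", "car"], ["red sports car", "red car parked", "blue car"])
derived = confidence_level(scores, m=2, c=0.5)
```

With no keywords at all (v = 0) the function returns C, which matches the relationship's behavior of falling back on the mean when there is no evidence.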
Specification