Content availability for natural language processing tasks
First Claim
Patent Images
1. A method, in an information handling system comprising a processor and a memory, of making content available to natural language processing (NLP) tasks, the method comprising:
- determining that a document section comprises non-textual image data;
in response to the determining, converting the document section into a natural language format, wherein the natural language format comprises textual data compatible with the NLP tasks, and wherein the converting comprises;
selecting a screen reader application, wherein the selected screen reader application does not provide textual output;
inputting a screen view of the document section into the selected screen reader application;
in response to the inputting, receiving, from the selected screen reader application, an audible speech output; and
inputting the audible speech output as an audible speech input to a speech recognition application, wherein the speech recognition application converts the audible speech input into a natural language textual output that describes the non-textual image data; and
performing a NLP operation on the natural language textual output, wherein the NLP operation is carried out by a question and answer system.
1 Assignment
0 Petitions
Accused Products
Abstract
An approach is provided to make content available to natural language processing (NLP) tasks. In the approach, a screen view of a document section is provided as input to a screen reader application. The screen reader application converts information displayed on the screen into a natural language format. A NLP operation is then performed on the natural language format.
69 Citations
14 Claims
-
1. A method, in an information handling system comprising a processor and a memory, of making content available to natural language processing (NLP) tasks, the method comprising:
-
determining that a document section comprises non-textual image data; in response to the determining, converting the document section into a natural language format, wherein the natural language format comprises textual data compatible with the NLP tasks, and wherein the converting comprises; selecting a screen reader application, wherein the selected screen reader application does not provide textual output; inputting a screen view of the document section into the selected screen reader application; in response to the inputting, receiving, from the selected screen reader application, an audible speech output; and inputting the audible speech output as an audible speech input to a speech recognition application, wherein the speech recognition application converts the audible speech input into a natural language textual output that describes the non-textual image data; and performing a NLP operation on the natural language textual output, wherein the NLP operation is carried out by a question and answer system. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An information handling system comprising:
-
one or more processors; a memory coupled to at least one of the processors; a display; and a set of instructions stored in the memory and executed by at least one of the processors to make content available to natural language processing (NLP) tasks, wherein the set of instructions perform actions of; determining that a document section comprises non-textual image data; in response to the determining, converting the document section into a natural language format, wherein the natural language format comprises textual data compatible with the NLP tasks, and wherein the converting comprises; selecting a screen reader application, wherein the selected screen reader application does not provide textual output; inputting a screen view of the document section into the selected screen reader application; in response to the inputting, receiving, from the selected screen reader application, an audible speech output; and inputting the audible speech output as an audible speech input to a speech recognition application, wherein the speech recognition program converts the audible speech input into a natural language textual output that describes the non-textual image data; and performing a NLP operation on the natural language textual output, wherein the NLP operation is carried out by a question and answer system. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer readable storage medium, comprising computer instructions stored thereon, that, when executed by an information handling system, causes the information handling system to make content available to natural language processing (NLP) tasks by performing actions comprising:
-
determining that a document section comprises non-textual image data; in response to the determining, converting the document section into a natural language format, wherein the natural language format comprises textual data compatible with the NLP tasks, and wherein the converting comprises; selecting a screen reader application, wherein the selected screen reader application does not provide textual output; inputting a screen view of the document section into the selected screen reader application; in response to the inputting, receiving, from the selected screen reader application, an audible speech output; and inputting the audible speech output as an audible speech input to a speech recognition application, wherein the speech recognition application converts the audible speech input into a natural language textual output that describes the non-textual image data; and performing a NLP operation on the natural language textual output, wherein the NLP operation is carried out by a question and answer system. - View Dependent Claims (12, 13, 14)
-
Specification