Intelligent retrieval and classification of information from a product manual
First Claim
1. A method of extracting intelligent information from text sources in electronic format, comprising:
- receiving unstructured data from text sources in an electronic format;
transforming the unstructured data into unstructured neutral format data;
converting the transformed unstructured neutral format data into structured data based on desired intelligent information;
extracting desired intelligent information from the converted structured data by using a multilayer self-organizing maps (MSOM) algorithm; and
storing the desired intelligent information, wherein converting the transformed unstructured neutral format data into structured data based on desired intelligent information, comprises;
extracting multiple packets from the transformed unstructured neutral format data based on desired intelligent information, wherein each extracted packet includes multiple sentence structured text data that is translated to single sentence structured text data including information that facilitates in categorizing and classifying the extracted multiple packets; and
forming multiple preprocessed packets by transforming each single sentence structured data into context-based data based on contextual information and occurrence frequency.
0 Assignments
0 Petitions
Accused Products
Abstract
An intelligent information-mining system extracts desired intelligent information from unstructured text data in documents to categorize the unstructured text data based on the extracted desired intelligent information. This is accomplished by obtaining unstructured text data from documents in an electronic format. The obtained unstructured text data is then transformed into unstructured neutral format data. The transformed unstructured neutral format data is then converted into structured data based on desired intelligent information. Desired intelligent information is then extracted from the structured data by using a multilayer self-organizing maps (MSOM) algorithm.
45 Citations
17 Claims
-
1. A method of extracting intelligent information from text sources in electronic format, comprising:
-
receiving unstructured data from text sources in an electronic format; transforming the unstructured data into unstructured neutral format data; converting the transformed unstructured neutral format data into structured data based on desired intelligent information; extracting desired intelligent information from the converted structured data by using a multilayer self-organizing maps (MSOM) algorithm; and storing the desired intelligent information, wherein converting the transformed unstructured neutral format data into structured data based on desired intelligent information, comprises; extracting multiple packets from the transformed unstructured neutral format data based on desired intelligent information, wherein each extracted packet includes multiple sentence structured text data that is translated to single sentence structured text data including information that facilitates in categorizing and classifying the extracted multiple packets; and forming multiple preprocessed packets by transforming each single sentence structured data into context-based data based on contextual information and occurrence frequency. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of extracting intelligent information from unstructured text data in an electronic data format, comprising:
-
transforming the unstructured text data into unstructured neutral format text data; converting the transformed unstructured neutral format text data into structured text data based on desired intelligent information by extracting multiple packets of finer resolution information from the transformed unstructured neutral format text data, wherein each extracted packet includes a single sentence structure data including fault information that facilitates in categorizing and classifying the extracted multiple packets; inputting the extracted multiple packets into a multi-layer self-organizing maps (MSOM) algorithm; extracting desired fault information from each inputted packet by using the MSOM algorithm; and categorizing and classifying each extracted packet based on the extracted desired fault information; and storing such categorization and classification of each extracted packet. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A physical computer-readable medium having computer-executable instructions for extracting desired intelligent information from unstructured data in aircraft maintenance manuals, comprising:
-
receiving unstructured text data in the aircraft maintenance manuals in an electronic format; extracting multiple packets of finer resolution from the unstructured text data, wherein each extracted packet includes multiple sentence structured text data that is translated to single sentence structured text data, wherein each single sentence structured text data includes aircraft fault information that can facilitate in categorizing and classifying the extracted multiple packets based on type of aircraft fault; obtaining a predetermined number of packets from the multiple extracted packets; generating a template based on a two-dimensional structured document reap using the obtained predetermined number of packets from the multiple extracted packets; and extracting the desired category and classification information from each extracted packet by projecting the extracted packet onto the generated template; and storing such categorization and classification of each extracted packet, wherein generating the template based on a two-dimensional structured document map using the obtained predetermined number of packets, comprises; extracting key phrases by sequentially inputting the obtained predetermined number of packets; generating a first layer contextual relation man by mapping each of the extracted key phrases to a two-dimensional map using a self-organizing map and a function approximation neighborhood technique; forming phrase clusters for the generated first layer of contextual relation map; and constructing a first key phrase frequency histogram consisting of the frequency of occurrences of key phrases from the generated first layer contextual relation map; and generating the template based on the two-dimensional structured document map of the predetermined number of packets from the constructed first key phrase frequency histogram and the generated first layer contextual relation map by using the function approximation neighborhood technique in the self-organizing map. - View Dependent Claims (13)
-
-
14. A computer system to perform data transfer operations between a remotely located communication module and one or more security devices in a COM-based computer network system having multiple users, comprising:
-
a processor, an output device; and a storage device to store instructions that are executable by the processor for extracting desired intelligent information from unstructured data in aircraft maintenance manuals, comprising; receiving unstructured text data in the aircraft maintenance manuals in an electronic format; extracting multiple packets of finer resolution from the unstructured text data, wherein each extracted packet includes multiple sentence structured text data that is translated to single sentence structured text data, wherein each single sentence structured text data includes aircraft fault information that can facilitate in categorizing and classifying the extracted multiple packets based on type of aircraft fault; obtaining a predetermined number of packets from the multiple extracted packets; generating a template based on a two-dimensional structured document map using the obtained predetermined number of packets from the multiple extracted packets; and extracting the desired category and classification information from each extracted packet by projecting the extracted packet onto the generated template; and storing such categorization and classification of each extracted packet. - View Dependent Claims (15, 16, 17)
-
Specification