SYSTEM AND METHOD FOR AUTOMATICALLY DETECTING AND INTERACTIVELY DISPLAYING INFORMATION ABOUT ENTITIES, ACTIVITIES, AND EVENTS FROM MULTIPLE-MODALITY NATURAL LANGUAGE SOURCES
First Claim
1. A non-transitory computer program storage device embodying instructions executable by a processor to interactively display information about entities, activities and events from multiple-modality natural language sources, the non-transitory computer program storage device comprising storage memory configured to store:
- an information extraction module having instruction code for downloading document content from text and audio/video, for parsing the document content, for detecting mentions, for co-referencing, for cross-document co-referencing and for extracting relations;
an information gathering module having instruction code for extracting acquaintances, biography and involvement in events from the information extraction module; and
an information display module having instruction code for displaying information from the information gathering module.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for automatically extracting and organizing information by a processing device from a plurality of data sources is provided. A natural language processing information extraction pipeline that includes an automatic detection of entities is applied to the data sources. Information about detected entities is identified by analyzing products of the natural language processing pipeline. Identified information is grouped into equivalence classes containing equivalent information. At least one displayable representation of the equivalence classes is created. An order in which the at least one displayable representation is displayed is computed. A combined representation of the equivalence classes that respects the order in which the displayable representation is displayed is produced.
76 Citations
10 Claims
-
1. A non-transitory computer program storage device embodying instructions executable by a processor to interactively display information about entities, activities and events from multiple-modality natural language sources, the non-transitory computer program storage device comprising storage memory configured to store:
-
an information extraction module having instruction code for downloading document content from text and audio/video, for parsing the document content, for detecting mentions, for co-referencing, for cross-document co-referencing and for extracting relations; an information gathering module having instruction code for extracting acquaintances, biography and involvement in events from the information extraction module; and an information display module having instruction code for displaying information from the information gathering module. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer program storage device embodying instructions executable by a processor to automatically extract and organize information from a plurality of data sources, the non-transitory computer program storage device comprising storage memory configured to store:
-
instruction code for applying to the data sources a natural language processing information extraction pipeline that includes an automatic detection of entities; instruction code for identifying information about detected entities by analyzing products of the natural language processing pipeline; instruction code for grouping identified information into equivalence classes containing equivalent information; instruction code for creating at least one displayable representation of the equivalence classes; instruction code for computing an order in which the at least one displayable representation is displayed; and instruction code for producing a combined representation of the equivalence classes that respects an order in which said displayable representation is displayed. - View Dependent Claims (7, 8, 9, 10)
-
Specification