Methods and systems for detecting and extracting information
Abstract
Systems and methods for detecting and extracting information are described. In one aspect, target rules are defined for detecting target hits in an article, including defining a target article region; extraction rules are defined, based on the target rules, for extracting extracts from the article, including defining an extraction article region; the target rules are applied to each target article region of the article to determine target hits; and the extraction rules are applied to detect at least one extract from the article based on the determined target hits.
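The flow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the target article region is taken to be a sentence, the extraction article region is the enclosing paragraph, a target rule is a weighted term list with a threshold, and the names `TargetRule`, `score_region` and `extract` are hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class TargetRule:
    # Hypothetical rule: term -> weight, plus the minimum score for a target hit.
    concepts: dict
    threshold: float


def split_sentences(paragraph):
    # Naive sentence splitter; sentences are the assumed target article regions.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", paragraph) if s.strip()]


def score_region(region, rule):
    # Sum the weights of the rule's terms that occur in the region.
    words = set(re.findall(r"[a-z]+", region.lower()))
    return sum(w for term, w in rule.concepts.items() if term in words)


def extract(article, rule):
    # Assumed extraction article region: the paragraph containing a target hit.
    extracts = []
    for paragraph in article.split("\n\n"):
        sentences = split_sentences(paragraph)
        if any(score_region(s, rule) >= rule.threshold for s in sentences):
            extracts.append(paragraph.strip())
    return extracts


article = (
    "The merger was announced on Monday. Shares rose after the acquisition news.\n\n"
    "Weather in the region stayed mild. No storms are expected."
)
rule = TargetRule(
    concepts={"merger": 1.0, "acquisition": 0.8, "takeover": 0.6},
    threshold=0.8,
)
hits = extract(article, rule)  # only the first paragraph contains a target hit
```

Keeping the scoring scope (sentence) separate from the extraction scope (paragraph) is the point of the sketch: a hit is detected in a small region, while a larger surrounding region is returned.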
43 Claims
1. A method for extracting information from an electronic document using a computer server system having one or more processors, the method comprising:
receiving, at the server system, a request for information that includes a definition of a concept list comprising an origin concept, a relationship between the origin concept and an evaluated concept, and a distance representing a strength of the relationship between the origin concept and the evaluated concept, and a target scope that characterizes a size of document regions to which the concept list is to be applied, wherein the request for information and the target scope are received from a user interacting with the server system through a client device connected to the server system via a network;
receiving, at the server system a definition of an extraction rule, wherein the extraction rule definition comprises an extraction scope that characterizes document regions to be extracted;
the server system determining a target score for document regions of the article, wherein the score represents how well document regions of the size to which the concept list is to be applied relate to the concept list;
the server system applying the extraction rule to the article to extract document regions characterized by the extraction scope from the article, wherein the application of the extraction rule is based on the determined target score; and
outputting the extract from the server system for the client device in response to the request for information.
Dependent claims: 2-21
22. An article for extracting information from an electronic document using a computer server system having one or more processors, the article comprising one or more computer-readable data storage media containing program code operable to cause one or more machines of a server system to perform operations, the operations comprising:
receiving, from a user, a request for information that includes a definition of a concept list comprising an origin concept, a relationship between the origin concept and an evaluated concept, and a distance representing a strength of the relationship between the origin concept and the evaluated concept, and a target scope that characterizes a size of document regions to which the target rule is to be applied;
receiving, from the user, a definition of an extraction rule, wherein the extraction rule definition comprises an extraction scope that characterizes document regions to be extracted;
determining a target score for document regions of the article, wherein the score represents how well document regions of the size to which the concept list is to be applied relate to the concept list;
applying the extraction rule to the article to extract document regions characterized by the extraction scope from the article, wherein the application of the extraction rule is based on the determined target score; and
outputting the extract in response to the request for information for use by a client device.
Dependent claims: 23-41
42. A computer-implemented method for extracting a subset of a document using a computer server system having one or more processors, the method comprising:
receiving, at the server system, a request for information that describes a combination of two or more concept lists, wherein each concept list is defined by an origin concept, a relationship between the origin concept and an evaluated concept, and a distance representing a strength of the relationship between the origin concept and the evaluated concept, wherein the two or more concept lists are combined using an operation to define a target definition that is to be detected, wherein the request for information is received from a user interacting with the server system through a client device connected to the server system via a network;
receiving, at the server system, a description of a document region targeted for extraction;
accessing a document stored in a document database using the server system;
based on the target definition and the document regions targeted for extraction, using the server system to extract one or more regions of the accessed document; and
the server system outputting the extracted regions for the client device in response to the request for information.
Dependent claims: 43
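As an illustration only, the combination of two concept lists recited in claim 42 might be sketched as follows. The concept graph, the reading of "distance" as a small integer where lower means a stronger relationship, the AND/OR operations, and the names `GRAPH`, `concept_list`, `matches` and `detect` are all assumptions for this sketch and are not drawn from the patent.

```python
# Hypothetical concept graph: (origin, relationship) -> {evaluated concept: distance}.
# Assumption: a smaller distance means a stronger relationship.
GRAPH = {
    ("car", "synonym"): {"automobile": 1, "vehicle": 2, "transport": 3},
    ("recall", "related"): {"defect": 1, "safety": 2, "lawsuit": 3},
}


def concept_list(origin, relationship, max_distance):
    # The origin plus every evaluated concept within max_distance of it.
    related = GRAPH.get((origin, relationship), {})
    return {origin} | {c for c, d in related.items() if d <= max_distance}


def matches(region, concepts):
    # True if any concept term occurs as a word in the region.
    words = set(region.lower().replace(".", " ").replace(",", " ").split())
    return bool(words & concepts)


def detect(region, list_a, list_b, operation="AND"):
    # Combine two concept lists with an operation to form a target definition.
    a, b = matches(region, list_a), matches(region, list_b)
    return (a and b) if operation == "AND" else (a or b)


cars = concept_list("car", "synonym", 2)         # car, automobile, vehicle
recalls = concept_list("recall", "related", 1)   # recall, defect
regions = [
    "The automobile maker confirmed a defect in the brakes.",
    "A new vehicle was shown at the fair.",
    "Transport ministers met on Tuesday.",
]
extracted = [r for r in regions if detect(r, cars, recalls, "AND")]
```

With the AND operation only the first region satisfies both lists; switching to OR would also admit the second region, while the third is excluded either way because "transport" lies beyond the chosen distance.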
Specification