Systems and methods for automatic identification of potential material facts in documents
First Claim
1. A system to identify potential material fact sentences in electronic legal documents obtained from electronic repositories, the system comprising:
- a processing device; and
a non-transitory, processor-readable storage medium in communication with the processing device, the non-transitory, processor-readable storage medium comprising one or more programming instructions that, when executed, cause the processing device to;
obtain an electronic legal document from a repository,parse text within the electronic legal document to determine whether each one of one or more paragraphs in the legal document is a fact paragraph, a discussion paragraph, or an outcome paragraph based on at least one of a heading associated with the paragraph and one or more features of the paragraph, andfor each one of the one or more paragraphs that is a fact paragraph;
extract each one of one or more sentences in the fact paragraph,direct a trained sentence classifier to determine whether each one of the one or more sentences is a potential material fact sentence or a non-material fact sentence based on one or more features of the sentence, wherein;
determining the potential material fact sentence comprises determining that a sentence potentially contains a material fact therein,determining the non-material fact sentence comprises determining that a sentence does not contain a material fact, andthe material fact is a fact that is germane to a particular topic of the electronic legal document,identify one or more potential material fact sentences from the one or more sentences based on the determination; and
provide the one or more potential material fact sentences to an external device.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods to identify potential material fact sentences in electronic legal documents obtained from electronic repositories are disclosed. A system includes a processing device and a storage medium in communication with the processing device. The storage medium includes programming instructions that cause the processing device to obtain a document and parse text within the document to determine whether each paragraph in the document is a fact paragraph, a discussion paragraph, or an outcome paragraph based on at least one of a heading associated with the paragraph and features of the paragraph. The storage medium further includes programming instructions that cause the processing device to extract each sentence in the fact paragraph, direct a trained sentence classifier to determine whether each sentence is a potential material fact sentence or a non-material fact sentence based on features of the sentence, and identify potential material fact sentences.
-
Citations
20 Claims
-
1. A system to identify potential material fact sentences in electronic legal documents obtained from electronic repositories, the system comprising:
-
a processing device; and a non-transitory, processor-readable storage medium in communication with the processing device, the non-transitory, processor-readable storage medium comprising one or more programming instructions that, when executed, cause the processing device to; obtain an electronic legal document from a repository, parse text within the electronic legal document to determine whether each one of one or more paragraphs in the legal document is a fact paragraph, a discussion paragraph, or an outcome paragraph based on at least one of a heading associated with the paragraph and one or more features of the paragraph, and for each one of the one or more paragraphs that is a fact paragraph; extract each one of one or more sentences in the fact paragraph, direct a trained sentence classifier to determine whether each one of the one or more sentences is a potential material fact sentence or a non-material fact sentence based on one or more features of the sentence, wherein; determining the potential material fact sentence comprises determining that a sentence potentially contains a material fact therein, determining the non-material fact sentence comprises determining that a sentence does not contain a material fact, and the material fact is a fact that is germane to a particular topic of the electronic legal document, identify one or more potential material fact sentences from the one or more sentences based on the determination; and provide the one or more potential material fact sentences to an external device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method to identify potential material fact sentences in electronic legal documents obtained from electronic repositories, the method comprising:
-
obtaining, by a processing device, an electronic legal document from a repository; parsing, by the processing device, text within the electronic legal document to determine whether each one of one or more paragraphs in the legal document is a fact paragraph, a discussion paragraph, or an outcome paragraph based on at least one of a heading associated with the paragraph and one or more features of the paragraph; and for each one of the one or more paragraphs that is a fact paragraph; extracting, by the processing device, each one of one or more sentences in the fact paragraph, directing, by the processing device, a trained sentence classifier to determine whether each one of the one or more sentences is a potential material fact sentence or a non-material fact sentence based on one or more features of the sentence, wherein; determining the potential material fact sentence comprises determining that a sentence potentially contains a material fact therein, determining the non-material fact sentence comprises determining that a sentence does not contain a material fact, and the material fact is a fact that is germane to a particular topic of the electronic legal document, identifying, by the processing device, one or more potential material fact sentences from the one or more sentences based on the determination, and providing the one or more potential material fact sentences to an external device. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method to identify potential material fact sentences in electronic legal documents obtained from electronic repositories, the method comprising:
-
obtaining, by a processing device, an electronic legal document from a repository; parsing, by the processing device, text within the electronic legal document to determine whether each one of one or more paragraphs in the legal document is a fact paragraph, a discussion paragraph, or an outcome paragraph based on at least one of a heading associated with the paragraph and one or more features of the paragraph; and for each one of the one or more paragraphs that is a fact paragraph; extracting, by the processing device, each one of one or more sentences in the fact paragraph, directing, by the processing device, a natural language parser to parse each one of the one or more sentences in the fact paragraph to determine a number of noun phrases and a number of verb phrases, extracting, by the processing device, one or more features selected from a number of dates, a number of time stamps, a number of monetary values, a number of lower court actions, a number of present court actions, a number of plaintiff actions, a number of defendant actions, a number of legal phrases, a number of legal concepts, a number of non-material fact words, and a number of non-material fact phrases from each one of the one or more sentences, scoring, by the processing device, each one of the one or more sentences based on the number of noun phrases, the number of verb phrases, and the one or more features, determining, by the processing device, whether each one of the one or more sentences is a potential material fact sentence or a non-material fact sentence based on the scoring, wherein; determining the potential material fact sentence comprises determining that a sentence potentially contains a material fact therein, determining the non-material fact sentence comprises determining that a sentence does not contain a material fact, and the material fact is a fact that is germane to a particular topic of the electronic legal document, and providing the one or more potential material fact sentences to an external device. - View Dependent Claims (20)
-
Specification