Segmenting and interpreting a document, and relocating document fragments to corresponding sections
First Claim
1. A method, comprising:
receiving a document including an item via a computer input device;
comparing the item to a plurality of lexicons using a processor;
determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons;
storing, using the processor, the fragment in a section of the document determined based on the at least one of the plurality of lexicons;
receiving at least one other item via the computer input device after the fragment is stored in the section;
segmenting the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types;
segmenting items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type;
determining a section type of each of the multiple fragments in the first section;
determining whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and
based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions:
determining that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section;
generating a new section corresponding to a section type that is different than the multiple section types; and
re-locating the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section.
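The last steps of the claim (typing each fragment, testing for interspersed fragments, and relocating them) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the claim does not define "interspersed among each other in even proportions", so the test below assumes it means that at least two foreign section types occur in near-equal counts and alternate rather than sitting in contiguous blocks. The names `Fragment`, `evenly_interspersed`, `relocate_interspersed`, the `tolerance` parameter, and the `"mixed"` section type are all hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical representation: (fragment text, section type determined for it).
Fragment = Tuple[str, str]


def evenly_interspersed(types: List[str], tolerance: float = 0.2) -> bool:
    """Assumed reading of 'interspersed in even proportions'.

    True when at least two types occur, their counts differ by no more than
    `tolerance` of the total, and the types alternate more often than a
    simple block-by-block layout would.
    """
    counts = Counter(types)
    if len(counts) < 2:
        return False
    even = max(counts.values()) - min(counts.values()) <= tolerance * len(types)
    # A blocked layout (A, A, B, B) has len(counts) - 1 switches; require more.
    switches = sum(1 for a, b in zip(types, types[1:]) if a != b)
    return even and switches >= len(counts)


def relocate_interspersed(
    sections: Dict[str, List[Fragment]],
    first_type: str,
    new_type: str = "mixed",  # hypothetical name for the new section type
) -> Dict[str, List[Fragment]]:
    """Move evenly interspersed foreign fragments out of one section."""
    # The new section type must differ from every existing section type.
    assert new_type not in sections
    fragments = sections[first_type]
    foreign = [f for f in fragments if f[1] != first_type]
    if not evenly_interspersed([t for _, t in foreign]):
        return sections  # nothing to relocate
    result = {k: list(v) for k, v in sections.items()}
    result[first_type] = [f for f in fragments if f[1] == first_type]
    result[new_type] = foreign
    return result
```

For example, a section of type "ingredients" holding fragments typed ingredients, notes, errands, notes, errands would keep only the first fragment, and the other four would move to the new section. If the foreign fragments arrived as notes, notes, errands, errands, this sketch would leave the section unchanged.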
Abstract
A method comprising receiving an item via a computer input device, comparing the item to a plurality of lexicons using a processor, and determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons. The method also comprises storing, using the processor, the fragment in a section of a document determined based on the at least one of the plurality of lexicons, and receiving at least one other item via the computer input device after the fragment is stored in the section.
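As a minimal sketch of the lexicon comparison and storage steps summarized in the abstract, the example below treats each lexicon as a set of terms tied to one section type and files a fragment under the section whose lexicon it overlaps most. The lexicon contents, the word-overlap scoring, and the names `LEXICONS`, `match_lexicon`, and `store_fragment` are assumptions made for illustration; the source does not specify how the comparison is performed.

```python
from typing import Dict, List, Optional, Set

# Hypothetical lexicons: section type -> terms. Contents are illustrative only.
LEXICONS: Dict[str, Set[str]] = {
    "ingredients": {"flour", "sugar", "eggs", "butter"},
    "steps": {"mix", "bake", "stir", "preheat"},
}


def match_lexicon(fragment: str, lexicons: Dict[str, Set[str]]) -> Optional[str]:
    """Return the section type whose lexicon shares the most words with the
    fragment, or None when no lexicon matches at all."""
    words = {w.strip(".,;:").lower() for w in fragment.split()}
    best_type, best_hits = None, 0
    for section_type, terms in lexicons.items():
        hits = len(words & terms)
        if hits > best_hits:
            best_type, best_hits = section_type, hits
    return best_type


def store_fragment(
    document: Dict[str, List[str]],
    fragment: str,
    lexicons: Dict[str, Set[str]],
) -> Optional[str]:
    """Store the fragment in the section selected by the matching lexicon."""
    section_type = match_lexicon(fragment, lexicons)
    if section_type is not None:
        document.setdefault(section_type, []).append(fragment)
    return section_type
```

Because the document stays an ordinary mapping of section type to fragments, further items can be passed to `store_fragment` after earlier fragments are stored, which mirrors the "receiving at least one other item" step.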
16 Claims
1. A method, comprising:
receiving a document including an item via a computer input device;
comparing the item to a plurality of lexicons using a processor;
determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons;
storing, using the processor, the fragment in a section of the document determined based on the at least one of the plurality of lexicons;
receiving at least one other item via the computer input device after the fragment is stored in the section;
segmenting the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types;
segmenting items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type;
determining a section type of each of the multiple fragments in the first section;
determining whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and
based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions:
determining that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section;
generating a new section corresponding to a section type that is different than the multiple section types; and
re-locating the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section.
Dependent claims: 2, 3, 4, 5, 6
7. A system, comprising:
an input device configured to receive a document including an item; and
a processor communicably coupled to the input device and configured to:
compare the item to a plurality of lexicons;
determine, based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons;
store the fragment in a section of the document determined based on the at least one of the plurality of lexicons;
receive at least one other item via the computer input device after the fragment is stored in the section;
segment the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types;
segment items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type;
determine a section type of each of the multiple fragments in the first section;
determine whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and
based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions:
determine that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section;
generate a new section corresponding to a section type that is different than the multiple section types; and
re-locate the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section.
Dependent claims: 8, 9, 10, 11
12. A computer program product comprising a non-transitory computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to:
receive a document including an item via a computer input device;
compare the item to a plurality of lexicons using a processor;
determine, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons;
store, using the processor, the fragment in a section of the document determined based on the at least one of the plurality of lexicons;
receive at least one other item via the computer input device after the fragment is stored in the section;
segment the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types;
segment items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type;
determine a section type of each of the multiple fragments in the first section;
determine whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and
based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions:
determine that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section;
generate a new section corresponding to a section type that corresponds to a section type that is different than the multiple section types; and
re-locate the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section.
Dependent claims: 13, 14, 15, 16
Specification