SYSTEM AND METHOD FOR BLOCK SEGMENTING, IDENTIFYING AND INDEXING VISUAL ELEMENTS, AND SEARCHING DOCUMENTS
First Claim
Patent Images
1. A method for processing documents, the method comprising:
- generating preliminary metadata for a document, the document having source content and presentation semantics;
dividing the document into one or more identified blocks using the preliminary metadata;
preparing a block list of the one or more identified blocks;
processing the one or more identified blocks in the block list using block operations; and
identifying at least one title block item for at least one of the identified blocks in the block list.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for segmenting, identifying and indexing visual elements, and searching documents comprises for each document generating metadata, segmenting the document into blocks using the metadata, performing block operations on the identified blocks, identifying and indexing inline visual elements using data and metadata rules, identifying and indexing block visual elements using profiles, and searching for documents containing visual elements.
-
Citations
97 Claims
-
1. A method for processing documents, the method comprising:
-
generating preliminary metadata for a document, the document having source content and presentation semantics; dividing the document into one or more identified blocks using the preliminary metadata; preparing a block list of the one or more identified blocks; processing the one or more identified blocks in the block list using block operations; and identifying at least one title block item for at least one of the identified blocks in the block list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A method for processing and identifying documents in accordance with their relevance to a search query, the method comprising:
-
generating preliminary metadata for a document, the document having source content and presentation semantics; dividing the document into blocks using the preliminary metadata; preparing a list of the identified blocks; processing the blocks in the block list using block operations; identifying a title block for each block in the block list; identifying at least one inline visual element within a block item in the block list using predefined data and metadata rules, if an inline visual element exists; identifying at least one block visual element within at least one of the blocks in the block list using predefined profiles, if a block visual element exists; generating an index of the found visual elements; receiving search query data via a Graphical User Interface (GUI) wherein the search query data comprises at least one selected visual element type; and generating a response to the search query with identification of each document that is determined to be relevant to the search query. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95)
-
-
96. A segmenting system, comprising:
-
a network device communicatively coupled to one or more web servers; and logic configured to receive at least one document from the one or more web servers via the network device, the logic further configured to generate preliminary metadata for the document, the document having document content, segment the document content, dividing the document into one or more identified blocks using the preliminary metadata, and preparing a list of the one or more identified blocks, each block comprising at least one block item, the logic further configured to process the identified blocks using block operations and identify any title block item within each of the identified blocks.
-
-
97. A search system, comprising:
-
a network device communicatively coupled to one or more web servers; and logic configured to retrieve documents from via the network device from the one or more web servers, the logic further configured to generate preliminary metadata for the document, the document having document content, segment the document content, dividing the document into blocks using the preliminary metadata, and preparing a list of the identified blocks, each block comprising at least one block item, the logic further configured to process the found blocks using block operations and identify inline visual elements within each block item using data and the metadata, the logic further configured to identify any block visual elements within each of the identified blocks and block items using profiles and generate an index of the found visual elements, the logic further configured to receive a request for one or more visual elements from a user as part of search query, the search query further comprising a visual element type and generate a response to the user for his search query.
-
Specification