SEARCHING AND ACCESSING DOCUMENTS ON PRIVATE NETWORKS FOR USE WITH CAPTURES FROM RENDERED DOCUMENTS
3 Assignments
0 Petitions
Accused Products
Abstract
A facility for exposing an index of private documents is described. In a private network, the facility (1) identifies electronic versions of documents that are available inside the private network, including a distinguished document; (2) constructs an index covering the identified electronic versions of documents; and (3) exports the constructed index from the private network to an index publication server. At the index publication server, the facility (1) receives the exported index; (2) receives a query via a public network; and (3) uses an index, based upon the received index, to generate a query result for the received query that contains the distinguished document.
202 Citations
59 Claims
-
1-23. -23. (canceled)
-
24. A method in a computing system for searching private documents, comprising:
-
in a distinguished private network comprising a plurality of content storage computer systems and a private network indexing computer system; identifying, by the private network indexing computer system, electronic versions of documents available inside the distinguished private network, wherein each identified electronic version of the documents corresponds to a rendered document; constructing a full-text index, by the private network indexing computer system, covering the complete textual content of the identified electronic versions of documents; exporting the constructed full-text index, by the private network indexing computer system, from the distinguished private network to an index publication server not in the distinguished private network; at the index publication server; receiving the exported full-text index; receiving a full-text query via a public network based on a text capture from a rendered document; and using an index based upon the received full-text index to generate a query result for the received full-text query based on the text capture that identifies the rendered document, the query result indicating that an electronic version of the identified rendered document is available in the distinguished private network. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49)
-
-
50. A computer-readable storage medium having stored thereon instructions that, when executed, cause a computing system to perform a method for exposing an index of private documents, the method comprising:
-
in a private network comprising a plurality of content storage computer systems and a private network indexing computer system; identifying electronic versions of documents each having textual content and being available inside a distinguished private network, including a distinguished document; constructing a full-text index covering the complete textual content of the identified electronic versions of documents; exporting the constructed full-text index from the distinguished private network to an index publication server not in the distinguished private network; at the index publication server; receiving the exported full-text index; receiving a full-text query via a public network, wherein the full-text query includes an explicit indication that at least a portion of the full-text query was obtained by optically scanning a rendered document; and using an index based upon the received full-text index to generate a query result for the received full-text query that contains the distinguished document, the query result indicating that an electronic version of the distinguished document is available in the private network.
-
-
51. A computing system comprising at least one storage hardware component and at least one processing hardware component for exposing an index of private documents, comprising:
-
in a distinguished private network comprising a plurality of content storage computer systems, a private network indexing computer system that; identifies electronic versions of documents each having textual content and being available inside the distinguished private network, including a distinguished document; constructs a full-text index usable to determine whether contiguous phrases are present among the textual content of the identified electronic versions of documents; and exports the constructed full-text index from the distinguished private network to an index publication server not in the distinguished private network; and an index publication server that; receives the exported full-text index; receives a continuous phrase query via a public network, wherein the continuous phrase query includes an explicit indication that at least a portion of the full-text query was obtained by optically scanning a rendered document; and uses an index based upon the received full-text index to generate a query result for the received full-text query that contains the distinguished document, the query result indicating that an electronic version of the distinguished document is available in the distinguished private network.
-
-
52. A device comprising at least one storage hardware component and at least one processing hardware component that can be connected to a private network comprising a plurality of content storage computer systems to expose an index of private documents, comprising:
-
a document identification subsystem that identifies electronic versions of documents, each having textual content and available inside the distinguished private network, including a distinguished document; an indexing subsystem that constructs a full-text index usable to determine whether continuous phrases are present among textual content of the identified electronic versions of documents; and an index exportation subsystem that exports the constructed full-text index from the distinguished private network to an index publication server not in the distinguished private network capable of generating a query result based on a text-based query derived from a text capture from a rendered document performed using a hand-held text capture device, the query result indicating that an electronic version of the rendered document is available in the distinguished private network.
-
-
53. An index aggregation system comprising at least one storage hardware component and at least one processing hardware component, comprising:
an index subsystem, the index subsystem comprising; a first full-text index obtained from a first private network comprising a first plurality of content storage computer systems and a first private network indexing computer system; a second full-text index obtained from a second private network comprising a second plurality of content storage computer systems and a second private network indexing computer system; a query subsystem that receives queries from users generated by capturing text from rendered documents via hand-held text capture devices associated with the users and searches both the first full-text index and the second full-text index based on each of the received queries; and a result subsystem that returns results of the searches, the results indicating that electronic versions of the rendered documents are available in the first or second private network. - View Dependent Claims (54, 55, 56, 57, 58)
-
59. An index aggregation system comprising at least one storage hardware component and at least one processing hardware component, comprising:
-
an index subsystem, the index subsystem comprising; a first full-text index obtained from a private network comprising a plurality of content storage computer systems and a private network indexing computer system; a second full-text index obtained from a public network; and a query subsystem that generates query results based on queries from users against both the first and second full-text indices, wherein at least a portion of the serviced queries include text segments captured from rendered documents using a hand-held text capture device, and wherein at least a portion of the query results indicate that electronic versions of the rendered documents are available in the private network.
-
Specification