METHOD FOR ENTITY ENRICHMENT OF DIGITAL CONTENT TO ENABLE ADVANCED SEARCH FUNCTIONALITY IN CONTENT MANAGEMENT SYSTEMS
First Claim
1. A computer-implemented method comprising:
- receiving, by a computer, a plurality data streams associated with a plurality of data sources respectively;
responsive to the computer detecting a triggering condition associated with data of a data stream;
generating, by the computer, geographic data associated with the data of the data stream; and
updating, by the computer, metadata associated with the data of the data stream, the metadata containing the geographic data associated with the data;
responsive to the computer not detecting the triggering condition for the data;
mapping, by the computer, the metadata for the data source to a set of managed properties associated with a search index; and
responsive to the computer determining that the data is image data;
determining, by the computer, a storage location of a machine-readable document file containing the image data, based upon the metadata associated with the image data;
executing, by the computer, an optical character recognition routine on the document file containing the image data received from the data source, thereby generating text data for the data of the data source; and
updating, by the computer, the metadata of the data from the data source, in response to identifying geographic data associated with the text data.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a system and method for extending search capabilities of contentment management systems, such as SharePoint 2013®, to enable geographic and name entity based searches. Geographic and named entity searches are enabled by a content enrichment web service. The content enrichment web service calls a geotagging or a named entity tagger web service application to tag crawled managed properties as input and return geographically or entity modified managed properties as output. The system associates one or more geographically and named entity modified managed properties with content and stores this information as metadata in a SharePoint 2013® search index. Thus, the search system allows users to identify a particular geographic entity the user is interested in finding, and to receive search results directly related to that geographic entity on SharePoint 2013®.
8 Citations
10 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a computer, a plurality data streams associated with a plurality of data sources respectively; responsive to the computer detecting a triggering condition associated with data of a data stream; generating, by the computer, geographic data associated with the data of the data stream; and updating, by the computer, metadata associated with the data of the data stream, the metadata containing the geographic data associated with the data; responsive to the computer not detecting the triggering condition for the data; mapping, by the computer, the metadata for the data source to a set of managed properties associated with a search index; and responsive to the computer determining that the data is image data; determining, by the computer, a storage location of a machine-readable document file containing the image data, based upon the metadata associated with the image data; executing, by the computer, an optical character recognition routine on the document file containing the image data received from the data source, thereby generating text data for the data of the data source; and updating, by the computer, the metadata of the data from the data source, in response to identifying geographic data associated with the text data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
Specification