Collection of intranet activity data
Abstract
Systems, methods and computer program products for facilitating the collection of data within a computer network (especially an intranet) while complying with applicable privacy laws and regulations, as well as individual organizations' rules addressing intranet users' privacy, are disclosed. Such systems, methods and computer program products allow for the collecting of activity information related to computer-based activities performed by users while logged into an organization's intranet. Such activity includes navigating to URLs, opening and editing documents, writing, opening and reading email and instant messages, and the like. The collecting, consolidating, storing and exposing of such activity information, while ensuring privacy requirements, serves as a basis for high-value services (e.g., augmenting documents with extra information, improving search results, automatic news feeds, social networking announcements, etc.) to be offered and provisioned to such users.
15 Claims
1. A system for collection of activity data related to a plurality of authenticated computer network users, comprising:
a data collection server, deployed within said computer network, configured to collect raw activity data related to said plurality of users from sources within said computer network, wherein said sources include at least some of a web content management server, a document management server, a web server, a proxy server, a directory service information server, an email server, or a client-side logging application, the data collection server being configured to normalize the raw activity data to provide normalized activity data associated with the document, wherein normalization of the raw activity data resolves differences of actions on the document by unifying saving the document, directly opening the document, and opening the document via textually different URLs such that the activity data reflects activity data associated with the document, wherein the textually different URLs are resolved to be logically equivalent by disassembling the textually different URLs and reconstructing a URL having a unified format; and
a control server coupled to said data collection server, said control server having a processor and a memory storing at least one configuration table containing at least one rule based on which said control server is configured to regulate the collection, transformation, aggregation, and anonymization of said raw activity data related to said plurality of users to generate user activity data on said computer network in compliance with at least one privacy law and/or at least one organizational privacy policy, wherein personally identifiable information is removed from the user activity data, wherein said at least one rule includes a schedule to collect activity data from said sources and an exclusion rule which defines a subset of said plurality of users and/or sources from which collection of activity data is not allowed.
Dependent claims: 2, 3, 4, 5
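The URL normalization recited in claim 1 (disassembling textually different URLs and reconstructing a URL having a unified format) could be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the canonicalization choices (lowercasing the host, dropping default ports and fragments, sorting query parameters, collapsing `.` and `..` path segments) are assumptions, as are the function names `normalize_url` and `count_by_document` and the host `intranet.example.com`.

```python
from collections import Counter
import posixpath
from urllib.parse import parse_qsl, quote, unquote, urlencode, urlsplit, urlunsplit

# Assumed default ports; a URL that spells them out is treated as equivalent.
DEFAULT_PORTS = {"http": 80, "https": 443}


def normalize_url(url: str) -> str:
    """Disassemble a URL and rebuild it in one unified textual form."""
    parts = urlsplit(url.strip())
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port
    # Keep the port only when it differs from the scheme's default.
    netloc = host if port in (None, DEFAULT_PORTS.get(scheme)) else f"{host}:{port}"
    # Decode, collapse "." and ".." segments, then re-encode consistently.
    raw_path = unquote(parts.path) or "/"
    path = quote(posixpath.normpath(raw_path))
    # Sort query parameters so their order does not matter.
    query = urlencode(sorted(parse_qsl(parts.query, keep_blank_values=True)))
    # The fragment is dropped (assumption: it does not identify the document).
    return urlunsplit((scheme, netloc, path, query, ""))


def count_by_document(events):
    """Count (action, url) events per normalized document URL.

    Saving, opening directly, and opening via a variant URL all count
    toward the same document key.
    """
    return Counter(normalize_url(url) for _action, url in events)
```

Under these assumptions, `HTTP://Intranet.Example.com:80/docs/./a/../Report%20Q1.docx?b=2&a=1#top` and `http://intranet.example.com/docs/Report Q1.docx?a=1&b=2` resolve to the same key, so a save and an open recorded under the two spellings are attributed to one document.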
6. A computer-implemented method for collecting activity data related to a plurality of authenticated computer network users, said method comprising:
collecting, by a processor, raw activity data related to said plurality of users from sources within said computer network, wherein said sources include at least some of a web content management server, a document management server, a web server, a proxy server, a directory service information server, an email server, or a client-side logging application;
regulating, by a processor, collection, transformation, aggregation, and anonymization of said raw activity data related to said plurality of users from said sources based on at least one rule in a configuration table to generate user activity data on said computer network in compliance with at least one privacy law and/or at least one organizational privacy policy, wherein personally identifiable information is removed from said user activity data, wherein said at least one rule includes an exclusion rule that defines a subset of said plurality of users from whom collection of activity data is not allowed, said subset of said plurality of users being from a particular geographical location or a group within an organization;
normalizing, by a processor, the raw activity data to provide normalized activity data associated with the document, wherein normalizing the raw activity data resolves differences of actions on the document by unifying saving the document, directly opening the document, and opening the document via textually different URLs such that the activity data reflects activity data associated with the document, wherein the textually different URLs are resolved to be logically equivalent by disassembling the textually different URLs and reconstructing a URL having a unified format; and
storing, by a processor, said user activity data in a database.
Dependent claims: 7, 8, 9, 10, 11
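The exclusion rule of claim 6 (a subset of users, identified by geographical location or organizational group, from whom collection is not allowed) might be applied at collection time along the following lines. This is a minimal sketch under stated assumptions: the `User` and `ExclusionRule` types, the event dictionary layout, and the choice to skip events from unknown users are all hypothetical and are not taken from the claims.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class User:
    user_id: str
    location: str  # e.g. a country code (hypothetical field)
    group: str     # e.g. an organizational unit (hypothetical field)


@dataclass(frozen=True)
class ExclusionRule:
    """One row of a hypothetical configuration table."""
    locations: frozenset = frozenset()
    groups: frozenset = frozenset()

    def excludes(self, user: User) -> bool:
        # A user is excluded if either the location or the group matches.
        return user.location in self.locations or user.group in self.groups


def collectable_events(events, users, rule):
    """Yield only events whose user is known and not excluded.

    Events from unknown users are skipped as a conservative default
    (an assumption of this sketch, not a requirement of the claim).
    """
    for event in events:
        user = users.get(event["user_id"])
        if user is None or rule.excludes(user):
            continue
        yield event
```

Filtering before anything is stored, rather than after, matches the claim's wording that collection itself is not allowed for the excluded subset.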
12. A computer-implemented method for collecting activity data related to a plurality of authenticated computer network users, said method comprising:
collecting, by a processor, raw activity data of said plurality of users from sources within said computer network, wherein said sources include at least some of a web content management server, a document management server, a web server, a proxy server, a directory service information server, an email server, or a client-side logging application, and wherein said activity data include data corresponding to at least one of navigating to a URL, opening a document, editing a document, writing an email, opening an email, reading an email, sending an instant message, or receiving an instant message;
generating, by a processor, user activity data in compliance with at least one privacy law and/or at least one organizational privacy policy via transformation, aggregation, and anonymization of said raw activity data based on at least one rule in a configuration table such that personally identifiable information is removed from the user activity data, wherein said at least one rule includes an access rule, an aggregation rule, a transformation rule, an exclusion rule, or a consent rule, wherein the exclusion rule defines a subset of said plurality of users from whom collection of activity data is not allowed, said subset of said plurality of users being from a particular geographical location or a group within an organization;
normalizing, by a processor, the raw activity data to provide normalized activity data associated with the document, wherein normalizing the raw activity data resolves differences of actions on the document by unifying saving the document, directly opening the document, and opening the document via textually different URLs such that the activity data reflects activity data associated with the document, wherein the textually different URLs are resolved to be logically equivalent by disassembling the textually different URLs and reconstructing a URL having a unified format; and
storing, by a processor, said user activity data in a database.
Dependent claims: 13, 14, 15
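The aggregation and anonymization steps of claim 12 could look roughly like the sketch below, which strips personally identifiable fields from individual events and reduces raw events to per-document counts of distinct users. The field names in `PII_FIELDS`, the event layout, and the `min_users` suppression threshold are assumptions made for illustration; the claims do not specify any of them.

```python
from collections import defaultdict

# Hypothetical names of fields treated as personally identifiable.
PII_FIELDS = {"user_id", "email", "ip_address"}


def anonymize(event: dict) -> dict:
    """Return a copy of the event with PII fields removed."""
    return {key: value for key, value in event.items() if key not in PII_FIELDS}


def aggregate(events, min_users: int = 2) -> dict:
    """Count distinct users per (document, action) pair.

    Pairs seen from fewer than `min_users` distinct users are suppressed,
    so that a count cannot be traced back to a single person. The
    threshold is an assumption of this sketch.
    """
    users_by_key = defaultdict(set)
    for event in events:
        users_by_key[(event["document"], event["action"])].add(event["user_id"])
    return {
        key: len(users)
        for key, users in users_by_key.items()
        if len(users) >= min_users
    }
```

The output of `aggregate` contains no user identifiers at all, which is one simple way to satisfy the requirement that personally identifiable information is removed before the user activity data is stored.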
Specification