System and method for semantic analysis of intelligent device discovery
First Claim
1. A method of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the method comprising:
- crawling HTML content and text content in a set of the sites;
deep-scanning non-HTML content and non-text content in the set of sites;
reverse-scanning the set of sites;
performing a semantic analysis of the crawled content and the deep-scanned content;
correlating the results of the semantic analysis with the results of the reverse-scanning; and
comparing user navigation patterns and content from the members of the set of sites.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides a method, system, and service of analyzing electronic documents in an intranet, where the intranet includes a plurality of web sites. In an exemplary embodiment, the method, system, and service include (1) crawling HTML content and text content in a set of the sites, (2) deep-scanning non-HTML content and non-text content in the set of sites, (3) reverse-scanning the set of sites, (4) performing a semantic analysis of the crawled content and the deep-scanned content, (5) correlating the results of the semantic analysis with the results of the reverse-scanning, and (6) comparing user navigation patterns and content from the members of the set of sites. In a further embodiment, the method, system, and service further include combining the results of the performing, the results of the correlating, and the results of the comparing.
-
Citations
53 Claims
-
1. A method of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the method comprising:
-
crawling HTML content and text content in a set of the sites; deep-scanning non-HTML content and non-text content in the set of sites; reverse-scanning the set of sites; performing a semantic analysis of the crawled content and the deep-scanned content; correlating the results of the semantic analysis with the results of the reverse-scanning; and comparing user navigation patterns and content from the members of the set of sites. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the system comprising:
-
a crawling module configured to crawl HTML content and text content in a set of the sites; a deep-scanning module configured to deep-scan non-HTML content and non-text content in the set of sites; a reverse-scanning module configured to reverse-scan the set of sites; a performing module configured to perform a semantic analysis of the crawled content and the deep-scanned content; a correlating module configured to correlate the results of the semantic analysis with the results of the reverse-scanning; and a comparing module configured to compare user navigation patterns and content from the members of the set of sites. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method of providing a service to analyze electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the method comprising:
-
crawling HTML content and text content in a set of the sites; deep-scanning non-HTML content and non-text content in the set of sites; reverse-scanning the set of sites; performing a semantic analysis of the crawled content and the deep-scanned content, correlating the results of the semantic analysis with the results of the reverse-scanning; and comparing user navigation patterns and content from the members of the set of sites. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the method comprising:
-
crawling HTML content and text content in a set of the sites; deep-scanning non-HTML content and non-text content in the set of sites; reverse-scanning the set of sites; performing a semantic analysis of the crawled content and the deep-scanned content; correlating the results of the semantic analysis with the results of the reverse-scanning; comparing user navigation patterns and content from the members of the set of sites; and combining the results of the performing, the results of the correlating, and the results of the comparing. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35)
-
-
36. A system of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the system comprising:
-
a crawling module configured to crawl HTML content and text content in a set of the sites; a deep-scanning module configured to deep-scan non-HTML content and non-text content in the set of sites; a reverse-scanning module configured to reverse-scan the set of sites; a performing module configured to perform a semantic analysis of the crawled content and the deep-scanned content; a correlating module configured to correlate the results of the semantic analysis with the results of the reverse-scanning; a comparing module configured to compare user navigation patterns and content from the members of the set of sites; and a combining module configured to combine the results of the performing module, the results of the correlating module, and the results of the comparing module. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43)
-
-
44. A method of providing a service to analyze electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the method comprising:
-
crawling HTML content and text content in a set of the sites; deep-scanning non-HTML content and non-text content in the set of sites; reverse-scanning the set of sites; performing a semantic analysis of the crawled content and the deep-scanned content; correlating the results of the semantic analysis with the results of the reverse-scanning; comparing user navigation patterns and content from the members of the set of sites; and combining the results of the performing, the results of the correlating, and the results of the comparing. - View Dependent Claims (45, 46, 47, 48, 49, 50, 51)
-
-
52. A computer program product usable with a programmable computer having readable program code embodied therein of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the computer program product comprising:
-
computer readable code for crawling HTML content and text content in a set of the sites; computer readable code for deep-scanning non-HTML content and non-text content in the set of sites; computer readable code for reverse-scanning the set of sites, computer readable code for performing a semantic analysis of the crawled content and the deep-scanned content; computer readable code for correlating the results of the semantic analysis with the results of the reverse-scanning; and computer readable code for comparing user navigation patterns and content from the members of the set of sites,
-
-
53. A computer program product usable with a programmable computer having readable program code embodied therein of analyzing electronic documents in an intranet, wherein the intranet comprises a plurality of web sites, the computer program product comprising:
-
computer readable code for crawling HTML content and text content in a set of the sites; computer readable code for deep-scanning non-HTML content and non-text content in the set of sites; computer readable code for reverse-scanning the set of sites; computer readable code for performing a semantic analysis of the crawled content and the deep-scanned content; computer readable code for correlating the results of the semantic analysis with the results of the reverse-scanning; computer readable code for comparing user navigation patterns and content from the members of the set of sites; and computer readable code for combining the results of the performing, the results of the correlating, and the results of the comparing.
-
Specification