SYSTEM AND METHOD FOR DETECTING PERSONAL EXPERIENCE EVENT REPORTS FROM USER GERNERATED INTERNET CONTENT
First Claim
1. A method, implementable on a computing device, for detecting personal experience event reports from user generated content on the Internet, the method comprising:
- filtering a collection of Internet posts to include only said Internet posts containing personal experience terms; and
further filtering said filtered Internet posts by removing said Internet posts with non-personal experience terms.
4 Assignments
0 Petitions
Accused Products
Abstract
A method for retrieving Internet posts, implementable on a computing device, includes analyzing Internet posts to define segments in the Internet posts, where the segments at least contain terms consistent with user generation of a personal experience event report associated with a pre-defined search subject, and scoring each of the segments, where the score indicates a likelihood that the Internet post associated with the segment represents a user generated the personal experience report associated with the pre-defined search subject. A method for detecting personal experience event reports from user generated content on the Internet includes filtering a collection of Internet posts to include only Internet posts containing personal experience terms, and further filtering the filtered Internet posts by removing the Internet posts with non-personal experience terms.
-
Citations
42 Claims
-
1. A method, implementable on a computing device, for detecting personal experience event reports from user generated content on the Internet, the method comprising:
-
filtering a collection of Internet posts to include only said Internet posts containing personal experience terms; and further filtering said filtered Internet posts by removing said Internet posts with non-personal experience terms. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An Internet post retrieval system, implementable on a computing device, comprising:
-
a segment analyzer module to define segments in said Internet posts, wherein said segments at least contain terms consistent with user generation of a personal experience event report associated with a pre-defined search subject; and a scoring engine to calculate a score for each said segment, wherein said score indicates a likelihood that said Internet post associated with said segment represents a user generated said personal experience report associated with said pre-defined search subject. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28)
-
-
29. A method for compiling a list of Internet post collection websites, implementable on a computing device and comprising:
-
detecting “
good”
textual patterns indicative of an authentic user generated personal experience event report from a training set of authenticated said user generated personal experience event reports; anddetecting “
bad”
textual patterns indicative of a non-authentic said user generated personal experience event report from a training set of non-valid said user generated personal experience event reports. - View Dependent Claims (30, 31, 32, 33, 34, 35)
-
-
36. A method for isolating segments from Internet posts, implementable on a computing device, comprising:
-
filtering said Internet posts to remove said Internet posts that do not at least contain terms from each of a minimum number of term categories that are associated with a user generated personal experience report associated with a pre-defined subject; detecting a pair of anchors from two anchor categories, wherein said anchor categories are among said term categories and represent two essential components of said user generated personal experience event reports; defining a basic said segment as a shortest section of text between said pair of anchors; and when said shortest section of text does not include at least one said term from each of said minimum number of term categories, expanding said basic segment to extend beyond said shortest section of text to include at least one said term from each of said minimum number of term categories. - View Dependent Claims (37, 38)
-
-
39. A method for scoring segments of Internet posts, implementable on a computing device, comprises:
-
defining a set of indicating factors, wherein each said indicating factor is associated with a possible feature in said segments, wherein said possible features affect a likelihood that said Internet post associated with said segment represents a user generated said personal experience event report associated with a pre-defined search subject; and weighting said indicating factors in accordance with said likelihood, wherein each of said indicating factors is at least one of negative and positive. - View Dependent Claims (40, 41, 42)
-
Specification