IDENTIFYING PRODUCT REFERENCES IN USER-GENERATED CONTENT
First Claim
1. A method for product extraction, the method comprising:
receiving, by a computer system, a document;
identifying, by the computer system, a product type for the document according to content of the document;
extracting, by the computer system, product attributes and attribute values from the document;
comparing, by the computer system, the extracted attributes to a sufficient attribute set specific to the identified product type; and
selecting, by the computer system, an inferred product based at least in part on the product having the extracted attribute values for the extracted attributes belonging to the sufficient attribute set.
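The comparing and selecting steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the `SUFFICIENT_ATTRIBUTES` table, the `CATALOG` entries, and the function name `select_inferred_product` are hypothetical, and the sketch assumes that a product is selected only when the document supplies every attribute in the sufficient set for its product type.

```python
from typing import Optional

# Hypothetical sufficient attribute sets, one per product type (assumption).
SUFFICIENT_ATTRIBUTES = {
    "television": {"brand", "screen_size"},
    "laptop": {"brand", "model"},
}

# Hypothetical product catalog used only for illustration.
CATALOG = [
    {"id": "tv-1", "type": "television", "brand": "acme",
     "screen_size": "55in", "color": "black"},
    {"id": "tv-2", "type": "television", "brand": "acme",
     "screen_size": "65in", "color": "black"},
]


def select_inferred_product(product_type: str,
                            extracted: dict) -> Optional[str]:
    """Return the id of a catalog product matching the sufficient attributes.

    `extracted` maps attribute names to the values found in the document.
    """
    sufficient = SUFFICIENT_ATTRIBUTES.get(product_type)
    if not sufficient:
        return None  # no sufficient set is known for this product type

    # Compare the extracted attributes to the sufficient attribute set.
    if not sufficient.issubset(extracted):
        return None  # the document does not pin down a single product

    # Select a product whose values agree on every sufficient attribute.
    for product in CATALOG:
        if product["type"] != product_type:
            continue
        if all(product.get(attr) == extracted[attr] for attr in sufficient):
            return product["id"]
    return None
```

Attributes outside the sufficient set (here `color`) are ignored during selection, so a document that mentions only the brand yields no inferred product in this sketch.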
2 Assignments
0 Petitions
Abstract
Systems and methods are disclosed herein for extracting products referenced in a document. A document is analyzed to identify a product type that is referenced in the document. Attributes are extracted from the document. A set of candidate products corresponding to the extracted attributes is identified. A score is calculated for each candidate product, and the candidates are further selected or filtered based on the score, whitelist rules, and blacklist rules in order to identify one or more inferred products referenced by the document. The whitelist and blacklist rules may take as inputs a domain, a user identifier, and keywords included in the document. A set of sufficient attributes may be identified for each product type. Selection of a candidate product may be based at least in part on the document including all of the attributes in the set of sufficient attributes.
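The scoring and rule-based filtering described in the abstract could look roughly like the following sketch. The abstract does not say how the score is computed or how the rules interact with it, so everything here is an assumption made for illustration: the score is the fraction of extracted attributes a candidate matches, a matching blacklist rule rejects the whole document, and a matching whitelist rule waives the score threshold. The names `Document`, `rule_matches`, `score`, and `infer_products` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Document:
    domain: str   # site the document came from
    user_id: str  # author of the document
    text: str     # document content


def rule_matches(rule: dict, doc: Document) -> bool:
    """A rule matches when every field it specifies agrees with the document.

    Rules may specify any of: "domain", "user_id", "keyword" (assumed shape).
    """
    if "domain" in rule and rule["domain"] != doc.domain:
        return False
    if "user_id" in rule and rule["user_id"] != doc.user_id:
        return False
    if "keyword" in rule and rule["keyword"].lower() not in doc.text.lower():
        return False
    return True


def score(candidate: dict, extracted: dict) -> float:
    """Fraction of extracted attributes whose values the candidate shares."""
    if not extracted:
        return 0.0
    hits = sum(1 for k, v in extracted.items() if candidate.get(k) == v)
    return hits / len(extracted)


def infer_products(doc: Document, extracted: dict, candidates: list,
                   whitelist: list, blacklist: list,
                   threshold: float = 0.5) -> list:
    """Return candidate ids ordered by descending score after filtering."""
    # Assumption: any matching blacklist rule suppresses all inferences.
    if any(rule_matches(rule, doc) for rule in blacklist):
        return []
    # Assumption: a matching whitelist rule waives the score threshold.
    whitelisted = any(rule_matches(rule, doc) for rule in whitelist)

    scored = [(score(c, extracted), c["id"]) for c in candidates]
    kept = [(s, pid) for s, pid in scored
            if s > 0 and (whitelisted or s >= threshold)]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [pid for _, pid in kept]
```

Under these assumptions, a trusted domain listed in the whitelist lets lower-scoring candidates through, while a blacklisted keyword or user prevents any product from being inferred from the document.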
25 Citations
20 Claims
1. A method for product extraction, the method comprising:
receiving, by a computer system, a document;
identifying, by the computer system, a product type for the document according to content of the document;
extracting, by the computer system, product attributes and attribute values from the document;
comparing, by the computer system, the extracted attributes to a sufficient attribute set specific to the identified product type; and
selecting, by the computer system, an inferred product based at least in part on the product having the extracted attribute values for the extracted attributes belonging to the sufficient attribute set.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system for product extraction, the system comprising one or more processors and one or more memory devices operably coupled to the one or more processors, the one or more memory devices storing executable and operational data effective to cause the one or more processors to:
receive a document;
identify a product type for the document according to content of the document;
extract product attributes and attribute values from the document;
compare the extracted attributes to a sufficient attribute set specific to the identified product type; and
select an inferred product based at least in part on the product having the extracted attribute values for the extracted attributes belonging to the sufficient attribute set.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification