METHODOLOGIES AND ANALYTICS TOOLS FOR IDENTIFYING WHITE SPACE OPPORTUNITIES IN A GIVEN INDUSTRY
First Claim
1. A method for use with at least one keyword retrieved from a first set of documents related to a predefined subject matter, the method comprising:
- constructing snippets from textual material in said first set of documents, each of said snippets including at least one word appearing within a specified text distance of said at least one keyword;
defining a plurality of categories wherein each of said snippets is assigned to one of said categories, each of said categories designated for receiving similar snippets;
creating a respective mathematical model for each of said categories;
analyzing a second set of documents to determine an assignment for each document in said second set of documents to one of said categories, said assignment based on matching each of said documents in said second set of documents to said mathematical model for said assigned category; and
identifying at least one white space in said second set of documents, said at least one white space including fewer than a specified number of documents.
6 Assignments
0 Petitions
Accused Products
Abstract
A method for analyzing predefined subject matter in a patent database being for use with a set of target patents, each target patent related to the predefined subject matter, the method comprising: creating a feature space based on frequently occurring terms found in the set of target patents; creating a partition taxonomy based on a clustered configuration of the feature space; editing the partition taxonomy using domain expertise to produce an edited partition taxonomy; creating a classification taxonomy based on structured features present in the edited partition taxonomy; creating a contingency table by comparing the edited partition taxonomy and the classification taxonomy to provide entries in the contingency table; and identifying all significant relationships in the contingency table to help determine the presence of any white space.
45 Citations
21 Claims
-
1. A method for use with at least one keyword retrieved from a first set of documents related to a predefined subject matter, the method comprising:
-
constructing snippets from textual material in said first set of documents, each of said snippets including at least one word appearing within a specified text distance of said at least one keyword; defining a plurality of categories wherein each of said snippets is assigned to one of said categories, each of said categories designated for receiving similar snippets; creating a respective mathematical model for each of said categories; analyzing a second set of documents to determine an assignment for each document in said second set of documents to one of said categories, said assignment based on matching each of said documents in said second set of documents to said mathematical model for said assigned category; and identifying at least one white space in said second set of documents, said at least one white space including fewer than a specified number of documents. - View Dependent Claims (2, 3, 4)
-
-
5. A method of analyzing predefined subject matter in a patent database, the method being for use with a set of target patents, each of said target patents related to the predefined subject matter, the method comprising:
-
creating a feature space based on frequently occurring terms found in said set of target patents; creating a partition taxonomy based on a clustered configuration of said feature space; editing said partition taxonomy using domain expertise to produce an edited partition taxonomy; creating a classification taxonomy based on structured features present in said edited partition taxonomy; creating a contingency table by comparing said edited partition taxonomy and said classification taxonomy to provide entries in said contingency table; and identifying significant relationships in said contingency table which help determine the presence of a white space. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method for conducting an analysis to provide patent information to a customer, said method being for use with a set of customer patents, each of said customer patents related to business needs of the customer, the method comprising:
-
creating a first taxonomy for said set of customer patents, said first taxonomy related to technology underlying said customer patents; creating a second taxonomy for said set of customer patents, said second taxonomy related to an application of said technology; and creating a contingency table by comparing said first taxonomy to said second taxonomy, said contingency table providing an indication of one or more relationships of interest for the customer. - View Dependent Claims (17, 18)
-
-
19. A computer program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform a method comprising the steps of:
-
assembling a set of target documents using one or more keywords, each of said target documents related to a predefined subject matter; analyzing each of said target documents to derive a count of occurrences of said keywords in each of said target documents; creating a first taxonomy for said set of target documents, said first taxonomy related to technology underlying said target documents; partitioning said set of target documents into a plurality of categories based on words or phrases appearing within a specified distance of one of said keywords; and accepting input for applying domain expertise to selectively delete, merge, and create categories. - View Dependent Claims (20)
-
-
21. A computer program product for use with at least one keyword retrieval from a set of initial documents related to a predefined subject matter, the program comprising a computer usable medium including a computer readable program, wherein when executed on a computer the computer readable program causes the computer to:
- construct snippets from textual material in said first set of documents, each of said snippets including at least one word appearing within a specified text distance of said at least one keyword;
define a plurality of categories wherein each of said snippets is assigned to one of said categories, each of said categories designated for receiving similar snippets; create a respective mathematical model for each of said categories; analyze a second set of documents to determine an assignment for each document in said second set of documents to one of said categories, said assignment based on matching each of said documents in said second set of documents to said mathematical model for said assigned category; and identify at least one white space in said second set of documents, said at least one white space including fewer than a specified number of documents.
- construct snippets from textual material in said first set of documents, each of said snippets including at least one word appearing within a specified text distance of said at least one keyword;
Specification