System and method for hybrid text mining for finding abbreviations and their definitions
First Claim
Patent Images
1. A system for matching one or more abbreviations and one or more definitions, comprising:
- a recognition process that examines character strings and determines which character strings to be abbreviated;
an abbreviation pattern generation process that creates from said determined character strings one or more abbreviation patterns representing candidate abbreviations, each of the one or more abbreviation patterns being a template that indicates a number and a location of characters and numeric strings within a candidate abbreviation; and
a definition pattern generation process that creates from said determined character strings one or more definition patterns representing candidate definitions, each of the one or more definition patterns being a template that indicates a number and a location of numeric strings, stopwords, prefix/headword combinations and base words within a candidate definition.
1 Assignment
0 Petitions
Accused Products
Abstract
This present invention matches one or more abbreviations to one or more definitions. The invention has an abbreviation pattern generation process that generates one or more abbreviation patterns corresponding to the candidate abbreviations, and a definition pattern generation process that generates one or more definition patterns corresponding to the candidate definitions.
101 Citations
22 Claims
-
1. A system for matching one or more abbreviations and one or more definitions, comprising:
-
a recognition process that examines character strings and determines which character strings to be abbreviated; an abbreviation pattern generation process that creates from said determined character strings one or more abbreviation patterns representing candidate abbreviations, each of the one or more abbreviation patterns being a template that indicates a number and a location of characters and numeric strings within a candidate abbreviation; and a definition pattern generation process that creates from said determined character strings one or more definition patterns representing candidate definitions, each of the one or more definition patterns being a template that indicates a number and a location of numeric strings, stopwords, prefix/headword combinations and base words within a candidate definition. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for matching one or more abbreviations and one or more definitions, comprising:
-
means for examining character strings and determining which character strings to be abbreviated; means for generating from said determined character strings one or more abbreviation patterns representing candidate abbreviations, each of the one or more abbreviation patterns being a template that indicates a number and a location of characters and numeric strings within a candidate abbreviation; and means for generating from said determined character strings one or more definition patterns representing candidate definitions, each of the one or more definition patterns being a template that indicates a number and a location of numeric strings, stopwords, prefix/headword combinations and base words within a candidate definition.
-
-
22. A method for matching one or more abbreviations and one or more definitions, comprising:
-
examining character strings and determining which character strings to be abbreviated; generating from said determined character strings one or more abbreviation patterns representing candidate abbreviations, each of the one or more abbreviation patterns being a template that indicates a number and a location of characters and numeric strings within a candidate abbreviation; and generating from said determined character strings one or more definition patterns representing candidate definitions, each of the one or more definition patterns being a template that indicates a number and a location of numeric strings, stopwords, prefix/headword combinations and base words within a candidate definition.
-
Specification