Textual database system using skeletonization and phonetic replacement to retrieve words matching or similar to query words
Abstract
An electronic database search system can identify database records having textual expressions that match, or are similar to, an operator-designated search expression. The system features a mechanism for transforming linguistic expressions, e.g., words, into linguistically salient word skeletons. Skeletal modification and suffix stripping features are employed to enhance expression-matching qualities of the word skeletons and to reduce data storage requirements.
13 Claims
-
1. A digital data processing apparatus for information retrieval, said apparatus comprising
A. input means for accepting a signal representative of a search linguistic expression in conventional textual representation,
B. database storage means for storing signals representative of plural textual expressions and information pertaining thereto,
C. database matching means in circuit with said input means and with said database storage means for locating within said database a linguistic expression matching or similar to said search linguistic expression, said database matching means including skeletonization means for converting at least one said database linguistic expression to a linguistically salient word skeleton, and for converting said search linguistic expression, or a modified form thereof, to a linguistically salient word skeleton, said converting means comprising
i. means for eliminating from the word skeleton produced thereby a selected alpha set, if any, of the expression being converted which lacks isomorphy with a phonetic representation of that selected set, and
ii. means for replacing with a different linguistic symbol another selected alpha set, if any, of the expression being converted which lacks isomorphy with a phonetic representation of that other selected set,
D. output means in circuit with said matching means for generating a signal indicative of the success of locating at least one database linguistic expression matching or similar to said search linguistic expression and for generating signals representative of information pertaining to the matching or similar database linguistic expressions, if any.
-
8. A method for information retrieval for use with a digital data processing apparatus having database storage means for storing signals representative of plural textual expressions and information pertaining thereto, said method comprising the steps of
A. accepting an input signal representative of a search linguistic expression in conventional textual representation,
B. locating within said database a linguistic expression matching or similar to said search linguistic expression, said locating step including the steps of converting at least one said database linguistic expression to a linguistically salient word skeleton, and converting said search linguistic expression, or a modified form thereof, to a linguistically salient word skeleton, each said converting step comprising the steps of
i. eliminating from the word skeleton produced thereby a selected alpha set, if any, of the expression being converted which lacks isomorphy with a phonetic representation of that selected set, and
ii. replacing with a different linguistic symbol another selected alpha set, if any, of the expression being converted which lacks isomorphy with a phonetic representation of that other selected set,
C. generating for output a signal indicative of the success of locating at least one database linguistic expression matching or similar to said search linguistic expression and generating for output signals representative of information pertaining to the matching or similar database linguistic expressions, if any.
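The steps of the method claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the claims do not list the actual letter sets, so the `ELIMINATE` and `REPLACE` rules below are hypothetical examples of letter groups that do not map one-to-one onto their pronunciation, and the names `skeletonize`, `build_index`, and `search` are assumptions made for illustration only.

```python
import re
from collections import defaultdict

# Hypothetical rule sets, chosen only for illustration.
# Step i: letter sets dropped from the skeleton (e.g. silent letters).
ELIMINATE = [r"gh", r"^k(?=n)", r"(?<=m)b$"]
# Step ii: letter sets replaced with a different symbol.
REPLACE = [(r"ph", "f"), (r"ck", "k"), (r"c(?=[eiy])", "s")]


def skeletonize(word):
    """Convert a word to a simplified skeleton using the rules above."""
    s = word.lower()
    for pattern in ELIMINATE:
        s = re.sub(pattern, "", s)
    for pattern, symbol in REPLACE:
        s = re.sub(pattern, symbol, s)
    return s


def build_index(records):
    """Map each skeleton to the (word, info) records that produce it."""
    index = defaultdict(list)
    for word, info in records:
        index[skeletonize(word)].append((word, info))
    return index


def search(index, query):
    """Return (success flag, matching records) for a query word."""
    matches = index.get(skeletonize(query), [])
    return bool(matches), matches
```

Under these assumed rules, an index built from `[("photograph", "rec 1"), ("knight", "rec 2")]` answers the misspelled query `"fotograf"` with `(True, [("photograph", "rec 1")])`, because both spellings reduce to the skeleton `fotograf`. The success flag and the returned records correspond loosely to the two output signals of step C.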
Specification