Method and apparatus for maintaining and navigating a non-hierarchical personal spatial file system
Abstract
A self-organizing personal file system is disclosed that evaluates the “importance” of terms and phrases in a document in a personal corpus relative to usage in a reference corpus. A personalized term weighting scheme assigns a weight to terms or phrases based on the frequency of occurrence of the corresponding term or phrase in a reference corpus. The personalized term weighting for a given term or phrase can be used to store and access documents containing the corresponding term or phrase in the spatial file system and provides coordinates in a spatial file system for one or more documents containing the corresponding term or phrase. The location of a given document in a file space may be specified by the relative frequency distribution of the stems of its significant terms or phrases compared to the occurrence of such terms or phrases in a reference corpus. The employed term weighting scheme can be based on the fractional frequency of a term in a user's document relative to the fractional frequency of the term in the reference corpus, or alternative term weighting schemes, such as entropy weightings and traditional tf×idf weightings, can be employed. A spatial rendering system utilizes the personalized term weights as coordinates for each document in a spatial file system. The spatial rendering system positions landmarks in the visual file space indicating terms that are correlated to terms associated with axes of the file system.
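The fractional-frequency weighting described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the regular-expression tokenizer, and the `floor` smoothing value for terms absent from the reference corpus are all assumptions made for illustration, and stemming is omitted for brevity.

```python
from collections import Counter
import re


def fractional_frequencies(text):
    """Map each lowercase word to its share of all words in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def term_weights(document, reference_corpus, floor=1e-6):
    """Weight each document term by the ratio of its fractional frequency
    in the document to its fractional frequency in the reference corpus.

    `floor` is an assumed smoothing value that avoids division by zero
    for terms that never occur in the reference corpus.
    """
    doc_freq = fractional_frequencies(document)
    ref_freq = fractional_frequencies(reference_corpus)
    return {
        word: freq / max(ref_freq.get(word, 0.0), floor)
        for word, freq in doc_freq.items()
    }


def coordinates(weights, axis_terms):
    """Use the weights of the chosen axis terms as spatial coordinates."""
    return tuple(weights.get(term, 0.0) for term in axis_terms)


# Toy data, invented for this example.
reference = "the cat sat on the mat and the dog sat on the rug"
document = "the patent describes the file system"
weights = term_weights(document, reference)
position = coordinates(weights, ["patent", "file"])
```

In this toy run a common word such as "the" receives a weight close to 1, while a word that is rare in the reference corpus, such as "patent", receives a much larger weight and therefore dominates the document's position along its axis.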
22 Claims
1. A method for storing a document in a file system, comprising the steps of:
determining a term weight for terms appearing in said document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
storing said document with an indication of said term weights.
14. A method for identifying one or more documents in a file system that are related to one or more specified words, comprising the steps of:
storing each of said documents with an indication of a term weight for terms appearing in said corresponding document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
evaluating a distance between each of said documents and said one or more specified words, wherein said distance is based on said term weights.
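The distance evaluation recited in claim 14 can be sketched as follows. The claim does not fix a particular metric, so this example assumes cosine distance between a document's stored term weights and a query vector that gives each specified word a weight of 1. The function names and sample weights are hypothetical.

```python
import math


def cosine_distance(doc_weights, query_words):
    """Return 1 minus the cosine similarity between a document's stored
    term weights and a unit-weight vector over the specified words.

    The choice of cosine distance is an assumption made for this sketch.
    """
    query = {word.lower(): 1.0 for word in query_words}
    dot = sum(doc_weights.get(word, 0.0) * q for word, q in query.items())
    doc_norm = math.sqrt(sum(v * v for v in doc_weights.values()))
    query_norm = math.sqrt(sum(q * q for q in query.values()))
    if doc_norm == 0.0 or query_norm == 0.0:
        return 1.0  # treat an empty document or query as maximally distant
    return 1.0 - dot / (doc_norm * query_norm)


def rank_documents(stored, query_words):
    """Order document names from nearest to farthest from the query."""
    return sorted(
        stored, key=lambda name: cosine_distance(stored[name], query_words)
    )


# Hypothetical stored term weights, one mapping per document.
stored = {
    "notes.txt": {"patent": 3.0, "claim": 2.0},
    "recipe.txt": {"flour": 4.0, "sugar": 1.0},
}
ranking = rank_documents(stored, ["patent"])
```

Because the weights are stored with each document, ranking against a new set of specified words needs only the stored weights and never rereads the document text.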
19. A system for storing a document in a file system, comprising:
a memory that stores computer-readable code; and
a processor operatively coupled to said memory, said processor configured to implement said computer-readable code, said computer-readable code configured to:
determine a term weight for terms appearing in said document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
store said document with an indication of said term weights.
20. A system for identifying one or more documents in a file system that are related to one or more specified words, comprising:
a memory that stores computer-readable code; and
a processor operatively coupled to said memory, said processor configured to implement said computer-readable code, said computer-readable code configured to:
store each of said documents with an indication of a term weight for terms appearing in said corresponding document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
evaluate a distance between each of said documents and said one or more specified words, wherein said distance is based on said term weights.
21. An article of manufacture for storing a document in a file system, comprising:
a computer readable medium having computer readable code means embodied thereon, said computer readable program code means comprising:
a step to determine a term weight for terms appearing in said document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
a step to store said document with an indication of said term weights.
22. An article of manufacture for identifying one or more documents in a file system that are related to one or more specified words, comprising:
a computer readable medium having computer readable code means embodied thereon, said computer readable program code means comprising:
a step to store each of said documents with an indication of a term weight for terms appearing in said corresponding document, wherein said term weight is based on a frequency of occurrence of said terms in a reference corpus; and
a step to evaluate a distance between each of said documents and said one or more specified words, wherein said distance is based on said term weights.
Specification