Processing un-typed triple store data
First Claim
Patent Images
1. A method, executed by one or more processors, the method comprising:
- in response to receiving a triple comprising a subject, a predicate, and an un-typed object, wherein the un-typed object is a string literal, scanning a database storing a collection of triples for triples that match the predicate of the received triple to identify similar triples;
responsive to identifying an object of each similar triple, for each non-literal object identified, scanning the database storing the collection of triples for typing triples that (i) have a subject that matches a respective non-literal object and (ii) include a type predicate, thereby classifying objects of the typing triples as candidate data types for the un-typed object;
identifying the candidate data types for the untyped-object;
determining frequency information for each candidate data type based on a ratio of a count of the candidate data types to a count of the similar triples;
associating the un-typed object with one or more candidate data types based on the frequency information; and
processing the un-typed object based on the frequency information and the one or more candidate data types.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, executed by a computer, includes receiving a triple comprising a predicate, a first entity and an un-typed second entity, scanning a collection of triples for triples that match the predicate of the triple to produce similar triples, determining frequency information for second entity types referenced by the similar triples, and processing the un-typed second entity using the frequency information. A computer system and computer program product corresponding to the method are also disclosed herein.
11 Citations
20 Claims
-
1. A method, executed by one or more processors, the method comprising:
-
in response to receiving a triple comprising a subject, a predicate, and an un-typed object, wherein the un-typed object is a string literal, scanning a database storing a collection of triples for triples that match the predicate of the received triple to identify similar triples; responsive to identifying an object of each similar triple, for each non-literal object identified, scanning the database storing the collection of triples for typing triples that (i) have a subject that matches a respective non-literal object and (ii) include a type predicate, thereby classifying objects of the typing triples as candidate data types for the un-typed object; identifying the candidate data types for the untyped-object; determining frequency information for each candidate data type based on a ratio of a count of the candidate data types to a count of the similar triples; associating the un-typed object with one or more candidate data types based on the frequency information; and processing the un-typed object based on the frequency information and the one or more candidate data types. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system comprising:
-
one or more computer processors; one or more computer readable storage devices and program instructions stored on the one or more computer readable storage devices, the program instructions comprising instructions to perform; in response to receiving a triple comprising a subject, a predicate, and an un-typed object, wherein the un-typed object is a string literal, scanning a database storing a collection of triples for triples that match the predicate of the received triple to identify similar triples; responsive to identifying an object of each similar triple, for each non-literal object identified, scanning the database storing the collection of triples for typing triples that (i) have a subject that matches a respective non-literal object and (ii) include a type predicate, thereby classifying objects of the typing triples as candidate data types for the un-typed object; identifying the candidate data types for the untyped-object; determining frequency information for each candidate data type based on a ratio of a count of the candidate data types to a count of the similar triples; associating the un-typed object with one or more candidate data types based on the frequency information; and processing the un-typed object based on the frequency information and the one or more candidate data types. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product comprising:
one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising instructions to perform; in response to receiving a triple comprising a subject, a predicate, and an un-typed object, wherein the un-typed object is a string literal, scanning a database storing a collection of triples for triples that match the predicate of the received triple to identify similar triples; responsive to identifying an object of each similar triple, for each non-literal object identified, scanning the database storing the collection of triples for typing triples that (i) have a subject that matches a respective non-literal object and (ii) include a type predicate, thereby classifying objects of the typing triples as candidate data types for the un-typed object; identifying the candidate data types for the untyped-object; determining frequency information for each candidate data type based on a ratio of a count of the candidate data types to a count of the similar triples; associating the un-typed object with one or more candidate data types based on the frequency information; and processing the un-typed object based on the frequency information and the one or more candidate data types. - View Dependent Claims (16, 17, 18, 19, 20)
Specification