NATURAL LANGUAGE PROCESSING AND STATISTICAL TECHNIQUES BASED METHODS FOR COMBINING AND COMPARING SYSTEM DATA
First Claim
1. A method comprising:
obtaining first data comprising data elements pertaining to a first plurality of vehicles;
obtaining second data comprising data elements pertaining to a second plurality of vehicles, wherein one or both of the first data and the second data include one or more abbreviated terms;
disambiguating the abbreviated terms at least in part by:
identifying, from a domain ontology stored in a memory, respective basewords that are associated with each of the abbreviated terms;
filtering the basewords;
performing a set intersection of the basewords; and
calculating posterior probabilities for the basewords based at least in part on the filtering and the set intersection; and
combining the first data and the second data, via a processor, based on semantic and syntactic similarity between respective data elements of the first data and the second data and the disambiguating of the abbreviated terms.
Abstract
Methods and systems are provided for automatically comparing, combining and fusing vehicle data. First data is obtained pertaining to a first plurality of vehicles. Second data is obtained pertaining to a second plurality of vehicles. One or both of the first data and the second data include abbreviated terms. The abbreviated terms are disambiguated at least in part by identifying, from a domain ontology stored in a memory, respective basewords that are associated with each of the abbreviated terms, filtering the basewords, performing a set intersection of the basewords, and calculating posterior probabilities for the basewords based at least in part on the filtering and the set intersection. The first data and the second data are combined, via a processor, based on semantic and syntactic similarity between respective data elements of the first data and the second data and the disambiguating of the abbreviated terms.
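The disambiguation steps named in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy ontology (ONTOLOGY), the in-domain term list used as the filter (VEHICLE_TERMS), the co-occurrence counts (COOCCURRENCE), the uniform prior, and the Laplace-smoothed naive Bayes scoring. The text above does not say how the filtering or the posterior calculation is actually performed.

```python
from collections import Counter

# Hypothetical domain ontology: abbreviated term -> candidate basewords.
ONTOLOGY = {
    "eng": {"engine", "engineering", "english"},
    "tps": {"throttle position sensor", "tire pressure sensor"},
}

# Hypothetical list of in-domain basewords, used here as the filter.
VEHICLE_TERMS = {"engine", "throttle position sensor", "tire pressure sensor"}

# Hypothetical counts of context words observed alongside each baseword.
COOCCURRENCE = {
    "engine": Counter({"stall": 8, "misfire": 6, "oil": 4}),
    "throttle position sensor": Counter({"stall": 5, "idle": 7}),
    "tire pressure sensor": Counter({"tire": 9, "low": 6}),
}


def disambiguate(abbrev, context_words, vocab_size=50):
    """Return {baseword: posterior} for one abbreviated term.

    Assumed scoring: uniform prior over the filtered basewords and a
    Laplace-smoothed likelihood of the record's context words.
    """
    # Step 1: identify candidate basewords from the ontology.
    candidates = ONTOLOGY.get(abbrev.lower(), set())
    # Step 2: filter the basewords (assumed rule: keep in-domain terms only).
    filtered = sorted(b for b in candidates if b in VEHICLE_TERMS)
    if not filtered:
        return {}
    context = set(context_words)
    prior = 1.0 / len(filtered)
    scores = {}
    for baseword in filtered:
        counts = COOCCURRENCE.get(baseword, Counter())
        # Step 3: set intersection of the record's context words with the
        # words known to co-occur with this baseword.
        shared = context & set(counts)
        total = sum(counts.values())
        likelihood = 1.0
        for word in context:
            count = counts[word] if word in shared else 0
            likelihood *= (count + 1) / (total + vocab_size)
        scores[baseword] = prior * likelihood
    # Step 4: normalize the scores into posterior probabilities.
    norm = sum(scores.values())
    return {b: s / norm for b, s in scores.items()}
```

Under these assumptions, calling disambiguate("TPS", ["tire", "low"]) ranks "tire pressure sensor" above "throttle position sensor", because only the former shares context words with the record.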
20 Claims
1. A method comprising:
obtaining first data comprising data elements pertaining to a first plurality of vehicles;
obtaining second data comprising data elements pertaining to a second plurality of vehicles, wherein one or both of the first data and the second data include one or more abbreviated terms;
disambiguating the abbreviated terms at least in part by:
identifying, from a domain ontology stored in a memory, respective basewords that are associated with each of the abbreviated terms;
filtering the basewords;
performing a set intersection of the basewords; and
calculating posterior probabilities for the basewords based at least in part on the filtering and the set intersection; and
combining the first data and the second data, via a processor, based on semantic and syntactic similarity between respective data elements of the first data and the second data and the disambiguating of the abbreviated terms.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method comprising:
obtaining first data comprising data elements pertaining to a first plurality of vehicles, the first data comprising design failure mode and effects analysis (DFMEA) data that is generated using vehicle warranty claims;
obtaining second data comprising data elements pertaining to a second plurality of vehicles, the second data comprising vehicle field data;
combining the DFMEA data and the vehicle field data, based on syntactic similarity between respective data elements of the DFMEA data and the vehicle field data;
determining whether any particular failure modes have resulted in multiple warranty claims for the vehicle, based on the DFMEA data and the vehicle field data; and
updating the DFMEA data based on the multiple warranty claims for the vehicle caused by the particular failure modes.
Dependent claims: 12, 13, 14
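As a rough illustration of the steps in claim 11, the following Python sketch matches field claims to DFMEA failure modes, finds failure modes with repeated claims on the same vehicle, and flags them in the DFMEA rows. All specifics are assumptions for illustration only: the row layout, the names similarity and update_dfmea, the use of difflib.SequenceMatcher as the syntactic similarity measure, the 0.8 threshold, and the repeat_failure flag as the form of the "update". The claim itself does not specify any of these.

```python
from collections import Counter
from difflib import SequenceMatcher


def similarity(a, b):
    """Character-level similarity in [0, 1] (assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def update_dfmea(dfmea_rows, field_claims, threshold=0.8):
    """Flag DFMEA failure modes that drew repeat claims on one vehicle.

    dfmea_rows:   list of dicts with a "failure_mode" string (hypothetical).
    field_claims: list of (vehicle_id, claim_text) tuples (hypothetical).
    """
    per_vehicle = Counter()
    for vehicle_id, text in field_claims:
        # Combine: link each field claim to its most similar failure mode.
        best = max(
            dfmea_rows,
            key=lambda row: similarity(row["failure_mode"], text),
            default=None,
        )
        if best is not None and similarity(best["failure_mode"], text) >= threshold:
            per_vehicle[(vehicle_id, best["failure_mode"])] += 1
    # Determine: failure modes with more than one claim on the same vehicle.
    repeat_modes = {mode for (_, mode), n in per_vehicle.items() if n > 1}
    # Update: return new DFMEA rows carrying a repeat-failure flag.
    return [
        {**row, "repeat_failure": row["failure_mode"] in repeat_modes}
        for row in dfmea_rows
    ]
```

The function returns new rows rather than mutating its input, so the original DFMEA data stays available for comparison.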
15. A system comprising:
a memory storing:
first data comprising data elements pertaining to a first plurality of vehicles;
second data comprising data elements pertaining to a second plurality of vehicles wherein one or both of the first data and the second data include one or more abbreviated terms; and
a processor coupled to the memory and configured to at least facilitate:
disambiguating the abbreviated terms at least in part by:
identifying, from a domain ontology stored in a memory, respective basewords that are associated with each of the abbreviated terms;
filtering the basewords;
performing a set intersection of the basewords; and
calculating posterior probabilities for the basewords based at least in part on the filtering and the set intersection; and
combining the first data and the second data, via a processor, based on syntactic similarity between respective data elements of the first data and the second data and the disambiguating of the abbreviated terms.
Dependent claims: 16, 17, 18, 19, 20
Specification