Pharmacovigilance database
First Claim
1. A method for developing a pharmacovigilance database from source data having verbatim terms and from reference data, wherein the source data represents a plurality of cases regarding effects experienced by persons using one or more drugs, the method comprising:
- parsing the source data into a relational database;
performing cleanup on the source data stored in the relational database and storing cleaned up source data in the relational database; and
mapping verbatim terms from the cleaned up source data to at least one token, the token being at least one of (1) a term from the reference data and (2) a term selected from the verbatim terms, wherein when the token is selected from the verbatim terms, the step of mapping comprises analyzing the verbatim terms contained in the plurality of cases, nominating one of the verbatim terms as a likely correct token in view of other data associated with the verbatim terms, and selecting the nominated token as the at least one token for mapping, whereby a pharmacovigilance database comprising the cleaned up source data and mapped tokens is developed.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for developing a pharmacovigilance database from source data and reference data. The unedited source data contains verbatim terms. The method includes parsing source data into a relational safety database; performing cleanup on the relational safety database; and mapping verbatim terms from the cleaned safety database to at least one token from at least one reference source. Cleanup includes removing redundant entries, correcting misspellings, removing irrelevant non-alpha characters and noise words, and relocating dislocated terms. Mapping verbatim terms to tokens includes nominating tokens from the source data, choosing tokens from the reference sources, and linking chosen tokens to corresponding verbatim terms. In one embodiment, the history of clean-up and mapping is saved as the pedigree of the verbatim-to-token mapping.
-
Citations
20 Claims
-
1. A method for developing a pharmacovigilance database from source data having verbatim terms and from reference data, wherein the source data represents a plurality of cases regarding effects experienced by persons using one or more drugs, the method comprising:
-
parsing the source data into a relational database;
performing cleanup on the source data stored in the relational database and storing cleaned up source data in the relational database; and
mapping verbatim terms from the cleaned up source data to at least one token, the token being at least one of (1) a term from the reference data and (2) a term selected from the verbatim terms, wherein when the token is selected from the verbatim terms, the step of mapping comprises analyzing the verbatim terms contained in the plurality of cases, nominating one of the verbatim terms as a likely correct token in view of other data associated with the verbatim terms, and selecting the nominated token as the at least one token for mapping, whereby a pharmacovigilance database comprising the cleaned up source data and mapped tokens is developed. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
interactively identifying likely misspelled terms to an operator;
accepting direction from an operator; and
editing the likely misspelled term for which direction was accepted in accordance with the direction.
-
-
12. The method of claim 11, wherein suppressing misspellings further comprises nominating at least one correction for at least one likely misspelled term to an operator.
-
13. The method of claim 1, wherein performing cleanup further comprises moving at least one valid, but dislocated, entry to a proper field in the relational database for that entry.
-
14. The method of claim 1, wherein the other data associated with the verbatim terms comprises at least one of age, sex, dates, reaction, dose, outcomes, report sources, and concomitant drugs.
-
15. The method of claim 1, further comprising providing an interactive display for selecting the token.
-
16. The method of claim 1, further comprising displaying pedigree information generated in the course of performing the step of mapping verbatim terms to at least one token.
-
17. The method of claim 1, further comprising record linking to match successor case reports with predecessor case reports.
-
18. The method of claim 17, further comprising comparing predetermined fields in each of the reports.
-
19. The method of claim 1, further comprising capturing and using domain-specific lexical knowledge.
-
20. The method of claim 5, wherein the step of suppressing at least one redundancy comprises analyzing at least two separate cases of the plurality of cases to determine whether the at least two cases represent a single case with multiple events.
-
2. A method for developing a pharamcovigilance database from source data having verbatim terms and from reference data, wherein the source data represents a plurality of cases regarding effects experienced by persons using one or more drugs, the method comprising:
-
performing cleanup on the source data to obtain cleaned up source data;
parsing the cleaned up source data into a relational database; and
mapping verbatim terms from the cleaned up source data to at least one token, the token being at least one of (i) a term from the reference data and (2) a term selected from the verbatim terms, wherein when the token is selected from the verbatim terms, the step of mapping comprises analyzing the verbatim terms contained in the plurality of cases, nominating one of the verbatim terms as a likely correct token in view of other data associated with the verbatim terms, and selecting the nominated token as the at least one token for mapping, whereby a pharmacovigilance database comprising the cleaned up source data and mapped tokens is developed.
-
Specification