Automated pathway recognition system
Abstract
There is a pressing need for computer-implemented tools that can summarize and present the enormous amounts of public literature to facilitate analysis of gene expression data. The present invention provides techniques and systems for efficiently integrating public literature regarding gene function with data from gene expression profiling experiments. Information from literature databases relating to a particular set of DNA sequences of known expression pattern is retrieved, processed, cross-referenced and viewed to provide further information about a particular DNA sequence to facilitate its identification as a candidate gene.
17 Claims
1. A computer-implemented method of identifying candidate genes from a plurality of DNA sequences, the method comprising:
obtaining gene expression profile data for a plurality of DNA sequences, wherein the gene expression profile data describe behavioral patterns of gene expression;
identifying a group of DNA sequences for further analysis;
using information extraction algorithms to retrieve and extract pathway information from a database related to the group of DNA sequences;
cross-referencing said pathway information to said DNA sequences;
ranking the pathway information based on a ranking of a publication in a citation index;
viewing said cross-referenced information and said ranking; and
wherein viewing the cross-referenced information and said ranking facilitates the identification of candidate genes.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A data processing system for identifying candidate genes from a plurality of DNA sequences of known expression pattern, comprising:
a processor; and
a memory coupled to the processor, wherein the memory has instructions for execution by the processor, the instructions comprising:
instructions for accessing and extracting pathway information from a literature database comprising a biomedical publication;
instructions for cross-referencing said pathway information to said candidate genes;
instructions for ranking the biomedical publication and instructions to assign a ranking score to the pathway information extracted from a biomedical publication based on the ranking of the biomedical publication; and
instructions for viewing said cross-referenced information and said ranking score.
Dependent claims: 16
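The cross-referencing and ranking elements recited above can be illustrated with a minimal sketch. It is not the patented implementation: the `PathwayRecord` type, the `rank_pathways` function, the gene and publication identifiers, and the numeric publication scores are all hypothetical placeholders, and the claims do not say how a publication's ranking is computed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class PathwayRecord:
    """One extracted statement linking a gene to a pathway (hypothetical type)."""
    gene: str          # gene or sequence identifier
    pathway: str       # pathway name extracted from the publication
    publication: str   # identifier of the source publication


def rank_pathways(records, publication_rank):
    """Cross-reference pathway records to genes and score each record
    by the ranking of the publication it was extracted from."""
    by_gene = defaultdict(list)
    for rec in records:
        # Assumption: unranked publications get a score of 0.0.
        score = publication_rank.get(rec.publication, 0.0)
        by_gene[rec.gene].append((score, rec.pathway, rec.publication))
    # For each gene, list the highest-scoring pathway information first.
    return {
        gene: sorted(hits, key=lambda hit: hit[0], reverse=True)
        for gene, hits in by_gene.items()
    }


# Made-up example data, for illustration only.
records = [
    PathwayRecord("GENE_A", "pathway X", "pub-1"),
    PathwayRecord("GENE_A", "pathway Y", "pub-2"),
    PathwayRecord("GENE_B", "pathway X", "pub-2"),
]
publication_rank = {"pub-1": 2.5, "pub-2": 7.0}

ranked = rank_pathways(records, publication_rank)
for gene, hits in ranked.items():
    print(gene, hits)
```

Keeping the publication score next to each pathway entry lets a viewer show the cross-referenced information and its ranking score together, which is the last element of the claim.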
17. A data processing system for identifying candidate genes from a plurality of DNA sequences, comprising:
a processor; and
a memory coupled to the processor, wherein the memory has instructions for execution by the processor, the instructions comprising:
instructions for clustering the plurality of DNA sequences based on the behavioral patterns of the DNA sequences as described by gene expression profile data;
instructions for accessing and extracting pathway information from a literature database comprising a biomedical publication;
instructions for cross-referencing said pathway information to said candidate genes;
instructions for ranking the biomedical publication and instructions to assign a ranking score to the pathway information extracted from a biomedical publication based on the ranking of the biomedical publication; and
instructions for viewing said cross-referenced information and said ranking score.
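Claim 17 recites clustering the DNA sequences by the behavioral patterns in their expression profiles but does not name a clustering algorithm. The sketch below assumes a simple greedy grouping by Pearson correlation against the first member of each cluster. The function names, the 0.9 threshold, and the sample profiles are hypothetical and chosen only for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a flat profile carries no pattern to correlate
    return cov / (sd_x * sd_y)


def cluster_by_profile(profiles, threshold=0.9):
    """Greedily group sequences whose profiles correlate with a cluster's
    first member at or above `threshold` (an assumed cutoff)."""
    clusters = []
    for seq_id, profile in profiles.items():
        for cluster in clusters:
            seed_profile = profiles[cluster[0]]
            if pearson(profile, seed_profile) >= threshold:
                cluster.append(seq_id)
                break
        else:
            # No existing cluster matched, so start a new one.
            clusters.append([seq_id])
    return clusters


# Made-up expression levels across four conditions.
profiles = {
    "seq1": [1.0, 2.0, 3.0, 4.0],
    "seq2": [2.0, 4.1, 6.0, 8.2],
    "seq3": [4.0, 3.0, 2.0, 1.0],
}

clusters = cluster_by_profile(profiles)
print(clusters)
```

Sequences that rise together across conditions fall in one group, while a sequence with the opposite trend starts its own. A group produced this way could then serve as the set of sequences whose pathway information is retrieved and ranked.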
Specification