Automated pathway recognition system
First Claim
1. A computer-implemented method of identifying a candidate gene from a plurality of nucleotide sequences, the method comprising:
- obtaining gene expression profile data for a plurality of nucleotide sequences, wherein said gene expression profile data describe behavioral patterns of gene expression;
identifying a group of said sequences for further analysis;
using information extraction algorithms to retrieve and extract pathway information from a database comprising biological data;
cross-referencing said pathway information; and
viewing said cross-referenced information, wherein viewing said cross-referenced information facilitates the identification of a candidate gene.
3 Assignments
0 Petitions
Accused Products
Abstract
There is a pressing need for computer-implemented tools that can summarize and present the enormous amounts of public literature to facilitate analysis of gene expression data. The present invention provides techniques and systems for efficiently integrating public literature regarding gene function with data from gene expression profiling experiments. Information from literature databases relating to a particular set of DNA sequences of known expression pattern is retrieved, processed, cross-referenced and viewed to provide further information about a particular DNA sequence to facilitate its identification as a candidate gene.
-
Citations
20 Claims
-
1. A computer-implemented method of identifying a candidate gene from a plurality of nucleotide sequences, the method comprising:
-
obtaining gene expression profile data for a plurality of nucleotide sequences, wherein said gene expression profile data describe behavioral patterns of gene expression;
identifying a group of said sequences for further analysis;
using information extraction algorithms to retrieve and extract pathway information from a database comprising biological data;
cross-referencing said pathway information; and
viewing said cross-referenced information, wherein viewing said cross-referenced information facilitates the identification of a candidate gene. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19)
-
-
16. A data processing system for identifying candidate genes from a list of genes of known expression pattern, comprising:
-
a processor a memory coupled to the processor, the memory configured to store instructions for execution by the processor, the instructions comprising;
instructions for accessing a list of genes of known expression pattern;
instructions for accessing and extracting pathway information from a literature database relevant to individual genes on the list of genes;
instructions for cross-referencing said pathway information; and
instructions for viewing said cross-referenced information.
-
-
20. A data processing system for identifying a candidate gene from a plurality of sequences, comprising:
-
a processor a memory coupled to the processor, the memory configured to store instructions for execution by the processor, the instructions comprising;
instructions for clustering the plurality of sequences based on patterns of expression of the sequences, as described by gene expression profile data;
instructions for accessing and extracting information from a literature database;
instructions for cross-referencing said information; and
instructions for viewing said cross-referenced information.
-
Specification