×

Using lexical analysis and parsing in genome research

  • US 9,104,657 B2
  • Filed: 05/02/2013
  • Issued: 08/11/2015
  • Est. Priority Date: 07/03/2012
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for locating a genome pattern, comprising:

  • creating, using a processor of a computer, one or more lexical annotators that each identify a sequence of nucleotides of nucleotide bases selected from A, C, G, and T;

    providing (1) the one or more lexical annotators, (2) one or more dictionary entries, (3) one or more previously-defined parsing rule annotators, and (4) one or more characters that each represent a nucleotide;

    creating a parsing rule annotator that identifies an order of and a combination of at least two elements selected from (1) the one or more lexical annotators, (2) the one or more dictionary entries, (3) the one or more previously-defined parsing rule annotators, and (4) the one or more characters that each represent a nucleotide; and

    creating an Unstructured Information Management Architecture (UIMA) pipeline to locate the genome pattern using the parsing rule annotator by;

    in a first stage of the UIMA pipeline, parsing a genetic sequence that is found in a Common Analysis Structure (CAS) to determine a language used and to generate tokens that are added to the CAS with a start position and an end position for each of the tokens;

    in a second stage of the UIMA pipeline, executing the one or more lexical annotators against the genetic sequence to identify one or more lexical annotations that are added to the CAS with a start position and an end position for each of the one or more lexical annotations; and

    in a third stage of the UIMA pipeline, using the start position and the end position for each of the tokens and the start position and the end position for each of the one or more lexical annotations to identify a match to the parsing rule annotation and to form a new annotation that is added to the CAS.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×