Systems and methods for encoding genetic variation for a population

  • US 10,460,829 B2
  • Filed: 01/26/2016
  • Issued: 10/29/2019
  • Est. Priority Date: 01/26/2016
  • Status: Active Grant
First Claim

1. A method of encoding variation data for a population, comprising using at least one computer hardware processor connected to at least one non-transitory computer-readable storage medium to perform:

    receiving information describing genetic variation of a population of individuals, the information comprising a plurality of variable sites within a reference genome of the population and a plurality of genotypes of a plurality of individuals in the population with respect to those variable sites;

    determining a prevalence for each variable site within the population, wherein the prevalence comprises the frequency at which alternative alleles of a given variable site occur in the population;

    selecting an encoding strategy for each of the plurality of variable sites based on the determined prevalence of each variable site across the population, wherein if the prevalence for a variable site is less than 10%, less than 5%, less than 1%, or less than 0.1% of the population, the encoding strategy is a compression encoding strategy, and otherwise the encoding strategy is a bit field encoding strategy;

    encoding the information according to the encoding strategy selected for each of the plurality of variable sites; and

    storing the encoded information in the at least one non-transitory computer-readable storage medium.
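
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; several details are assumptions made only for illustration. Genotypes are simplified to 0 (reference) or 1 (carries an alternative allele), prevalence is taken as the fraction of individuals carrying an alternative allele, the threshold is fixed at 5% (one of the values the claim lists), the "compression" strategy is modeled as a sparse list of carrier indices, and the bit field uses one bit per individual. The names `encode_site` and `encode_population` are hypothetical.

```python
from typing import Dict, List, Tuple, Union

# Assumed threshold: the claim lists 10%, 5%, 1%, or 0.1% as alternatives.
PREVALENCE_THRESHOLD = 0.05

Encoded = Tuple[str, Union[List[int], bytes]]


def encode_site(genotypes: List[int],
                threshold: float = PREVALENCE_THRESHOLD) -> Encoded:
    """Encode one variable site, choosing the strategy by prevalence.

    genotypes[i] is 0 if individual i matches the reference and 1 if
    individual i carries an alternative allele (a simplification).
    """
    n = len(genotypes)
    carriers = [i for i, g in enumerate(genotypes) if g]
    prevalence = len(carriers) / n if n else 0.0

    if prevalence < threshold:
        # Rare site: store only the indices of carriers (a stand-in for
        # the claim's "compression encoding strategy").
        return ("sparse", carriers)

    # Common site: one bit per individual, least significant bit first.
    bits = bytearray((n + 7) // 8)
    for i in carriers:
        bits[i // 8] |= 1 << (i % 8)
    return ("bitfield", bytes(bits))


def encode_population(sites: Dict[str, List[int]],
                      threshold: float = PREVALENCE_THRESHOLD
                      ) -> Dict[str, Encoded]:
    """Apply the per-site strategy selection to every variable site."""
    return {name: encode_site(g, threshold) for name, g in sites.items()}
```

Under these assumptions, a site where 1 of 100 individuals carries the alternative allele is stored as a single carrier index, while a site carried by roughly half the population is stored as a packed bit field whose size depends only on the number of individuals. The storing step of the claim would then persist the returned mapping to whatever medium is used.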

  • 12 Assignments