Mendel - Iterative Hard Thresholding

A modern approach to analyzing data from genome-wide association studies (GWAS)

Package Features

  • Analyze large GWAS datasets intuitively.
  • Built-in support for PLINK binary files via SnpArrays.jl and VCF files via VCFTools.jl.
  • Out-of-the-box parallel computing routines for q-fold cross-validation.
  • Fits a variety of generalized linear models with any choice of link function.
  • Computation directly on raw genotype files.
  • Efficient handling of non-genetic covariates.
  • Optional acceleration (debias) step to dramatically improve speed.
  • Ability to explicitly incorporate weights for predictors.
  • Ability to enforce within-group and between-group sparsity.
  • Naive genotype imputation.
  • Estimates the nuisance parameter for negative binomial regression using Newton's method or an MM algorithm.
  • Flexible handling of different data structures; works well alongside other Julia packages.
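To make the name of the method concrete, here is a minimal sketch of textbook iterative hard thresholding for ordinary least squares, written in plain Python so it is language-neutral. It is not MendelIHT's implementation: the function names (`hard_threshold`, `iht_least_squares`), the fixed step size, and the toy data are all assumptions made for illustration, and features listed above such as GLM responses, the debias step, predictor weights, and group sparsity are left out.

```python
def hard_threshold(beta, k):
    """Keep the k largest-magnitude entries of beta and set the rest to zero."""
    order = sorted(range(len(beta)), key=lambda j: abs(beta[j]), reverse=True)
    keep = set(order[:k])
    return [b if j in keep else 0.0 for j, b in enumerate(beta)]


def iht_least_squares(X, y, k, step, iterations=50):
    """Generic IHT for 0.5 * ||y - X beta||^2 subject to at most k nonzeros.

    Hypothetical helper for illustration only; `step` is a fixed step size
    chosen by the caller.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(iterations):
        # Residual r = y - X beta.
        residual = [y[i] - sum(X[i][j] * beta[j] for j in range(p)) for i in range(n)]
        # X^T r is the negative gradient of the least-squares loss.
        direction = [sum(X[i][j] * residual[i] for i in range(n)) for j in range(p)]
        # Gradient step, then projection onto the set of k-sparse vectors.
        beta = hard_threshold([beta[j] + step * direction[j] for j in range(p)], k)
    return beta


# Toy design with orthogonal +/-1 columns, so X^T X = 4 I and step = 1/4 is safe.
X = [
    [1, 1, 1, 1],
    [1, -1, 1, -1],
    [1, 1, -1, -1],
    [1, -1, -1, 1],
]
# Response built from two large effects (2 and -1) plus two small ones (0.25 and 0.125).
y = [1.375, 0.625, 3.125, 2.875]

beta_hat = iht_least_squares(X, y, k=2, step=0.25)
print(beta_hat)
```

With `k=2` the sketch keeps only the two largest effects and zeroes the two small ones. Real genotype data are far from orthogonal, so a fixed step size like the one used here would generally not be appropriate.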

Read our paper for more detail.

MendelIHT borrows the distribution and link function implementations from GLM.jl and Distributions.jl. The table below lists each distribution with its canonical link and its support status.

Distribution        Canonical Link       Status
Normal              IdentityLink         ✓
Bernoulli           LogitLink            ✓
Poisson             LogLink              ✓
NegativeBinomial    LogLink              ✓
Gamma               InverseLink          experimental
InverseGaussian     InverseSquareLink    experimental

Examples of these distributions with their default parameter values are visualized in this post.

Available link functions (as implemented in GLM.jl):

  • CauchitLink
  • CloglogLink
  • IdentityLink
  • InverseLink
  • InverseSquareLink
  • LogitLink
  • LogLink
  • ProbitLink
  • SqrtLink
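For readers unfamiliar with these names, the sketch below writes out the standard mathematical definition of each link function g(μ) and its inverse in plain Python. It illustrates the formulas only and is not the GLM.jl code; the `LINKS` dictionary and its layout are assumptions made for this example.

```python
import math
from statistics import NormalDist

_std_normal = NormalDist()  # standard normal, used for the probit link

# Link name -> (g(mu), inverse of g applied to eta), using the standard definitions.
LINKS = {
    "CauchitLink": (lambda mu: math.tan(math.pi * (mu - 0.5)),
                    lambda eta: 0.5 + math.atan(eta) / math.pi),
    "CloglogLink": (lambda mu: math.log(-math.log(1.0 - mu)),
                    lambda eta: 1.0 - math.exp(-math.exp(eta))),
    "IdentityLink": (lambda mu: mu,
                     lambda eta: eta),
    "InverseLink": (lambda mu: 1.0 / mu,
                    lambda eta: 1.0 / eta),
    "InverseSquareLink": (lambda mu: 1.0 / mu ** 2,
                          lambda eta: 1.0 / math.sqrt(eta)),
    "LogitLink": (lambda mu: math.log(mu / (1.0 - mu)),
                  lambda eta: 1.0 / (1.0 + math.exp(-eta))),
    "LogLink": (math.log, math.exp),
    "ProbitLink": (_std_normal.inv_cdf, _std_normal.cdf),
    "SqrtLink": (math.sqrt,
                 lambda eta: eta ** 2),
}

# Map a mean of 0.3 to the linear-predictor scale and back for every link.
for name, (link, inverse) in LINKS.items():
    eta = link(0.3)
    print(f"{name:18s} eta = {eta: .4f}  round trip = {inverse(eta):.4f}")
```

The value 0.3 is used because it lies in the domain of every link listed; links such as LogitLink and ProbitLink only accept means strictly between 0 and 1.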

Manual Outline