Introgression analysis with RAD-seq data

Definitions

Many terms in the literature are used interchangeably in the discussion of gene flow. Several popular terms that are often confused, but sometimes have distinct meanings, are admixture, introgression, and hybridization.

Admixture: A detectable genetic signature of mixed ancestry from one or more distinct lineages. It is a pattern that can be detected. It is possible for two populations to appear admixed even if they have not exchanged genetic material.

Introgression: The exchange of alleles from one or more distinct lineages into another. It is a process that occurs in time. We rarely observe introgression, and therefore we aim to reconstruct introgression events in the past.

hybridization: Successful reproduction between individuals from two distinct lingeages. We often observe hybridization among extant individuals, but further investigation is necessary to determine whether hybridization (e.g., F1s) result in introgression of alleles between lineages.

Drift-based statistics for modelling admixture

Methods for detecting admixture within and between populations typically rely on the assumption that existing variation in populations is a result of the sorting of ancestral standing variation in the ancestor of those populations. Therefore it does not account for de novo mutations. Examples include STRUCTURE and TREEMIX.


Name: 32082_przewalskii Group: 0 Prop: 0.0Name: 33588_przewalskii Group: 0 Prop: 0.0Name: 41478_cyathophylloides Group: 0 Prop: 0.0Name: 41954_cyathophylloides Group: 0 Prop: 0.0001Name: 29154_superba Group: 0 Prop: 0.7468Name: 30686_cyathophylla Group: 0 Prop: 0.5126Name: 33413_thamno Group: 0 Prop: 0.07Name: 30556_thamno Group: 0 Prop: 0.159Name: 35236_rex Group: 0 Prop: 0.1137Name: 40578_rex Group: 0 Prop: 0.0729Name: 35855_rex Group: 0 Prop: 0.0895Name: 39618_rex Group: 0 Prop: 0.0Name: 38362_rex Group: 0 Prop: 0.0Name: 32082_przewalskii Group: 1 Prop: 0.0Name: 33588_przewalskii Group: 1 Prop: 0.0Name: 41478_cyathophylloides Group: 1 Prop: 0.9749Name: 41954_cyathophylloides Group: 1 Prop: 0.9746Name: 29154_superba Group: 1 Prop: 0.2527Name: 30686_cyathophylla Group: 1 Prop: 0.278Name: 33413_thamno Group: 1 Prop: 0.0Name: 30556_thamno Group: 1 Prop: 0.0Name: 35236_rex Group: 1 Prop: 0.0059Name: 40578_rex Group: 1 Prop: 0.0005Name: 35855_rex Group: 1 Prop: 0.003Name: 39618_rex Group: 1 Prop: 0.0Name: 38362_rex Group: 1 Prop: 0.0Name: 32082_przewalskii Group: 2 Prop: 0.0Name: 33588_przewalskii Group: 2 Prop: 0.0Name: 41478_cyathophylloides Group: 2 Prop: 0.0Name: 41954_cyathophylloides Group: 2 Prop: 0.0Name: 29154_superba Group: 2 Prop: 0.0Name: 30686_cyathophylla Group: 2 Prop: 0.2051Name: 33413_thamno Group: 2 Prop: 0.3041Name: 30556_thamno Group: 2 Prop: 0.1544Name: 35236_rex Group: 2 Prop: 0.2057Name: 40578_rex Group: 2 Prop: 0.1365Name: 35855_rex Group: 2 Prop: 0.1052Name: 39618_rex Group: 2 Prop: 0.0Name: 38362_rex Group: 2 Prop: 0.0Name: 32082_przewalskii Group: 3 Prop: 0.0Name: 33588_przewalskii Group: 3 Prop: 0.0Name: 41478_cyathophylloides Group: 3 Prop: 0.0Name: 41954_cyathophylloides Group: 3 Prop: 0.0Name: 29154_superba Group: 3 Prop: 0.0005Name: 30686_cyathophylla Group: 3 Prop: 0.0039Name: 33413_thamno Group: 3 Prop: 0.6237Name: 30556_thamno Group: 3 Prop: 0.6821Name: 35236_rex Group: 3 Prop: 0.6698Name: 40578_rex Group: 3 Prop: 0.7902Name: 35855_rex Group: 3 Prop: 0.8012Name: 39618_rex Group: 3 Prop: 1.0Name: 38362_rex Group: 3 Prop: 1.0Name: 32082_przewalskii Group: 4 Prop: 1.0Name: 33588_przewalskii Group: 4 Prop: 1.0Name: 41478_cyathophylloides Group: 4 Prop: 0.0251Name: 41954_cyathophylloides Group: 4 Prop: 0.0254Name: 29154_superba Group: 4 Prop: 0.0Name: 30686_cyathophylla Group: 4 Prop: 0.0005Name: 33413_thamno Group: 4 Prop: 0.0023Name: 30556_thamno Group: 4 Prop: 0.0044Name: 35236_rex Group: 4 Prop: 0.005Name: 40578_rex Group: 4 Prop: 0.0Name: 35855_rex Group: 4 Prop: 0.0011Name: 39618_rex Group: 4 Prop: 0.0Name: 38362_rex Group: 4 Prop: 0.032082_przewalskii33588_przewalskii41478_cyathophylloides41954_cyathophylloides29154_superba30686_cyathophylla33413_thamno30556_thamno35236_rex40578_rex35855_rex39618_rex38362_rex

Phylogenetic models for modelling admixture

The D-statistics (ABBA-BABA) tests.

References:

Full-genome implementations

The software admix-tools is often used to calculate ABBA-BABA scores from genome wide sequence alignments where the value if often measured using a sliding-widow approach along the genome.

https://github.com/DReichLab/AdmixTools

RAD-seq implementation

Because RAD-seq provides only a subset of the genome the data must be analyzed a little differently than for full genome data.

Phylogenetic invariants and tetrad

A more general implementation of the general markov model.