RAD-seq bioinformatics workshop

Malaga, Spain 2019

Part 2: Reproducible RAD-seq analyses


Deren Eaton, Columbia Univesity

Isaac Overcast, City College of New York

API for analysis

Even if data was assembled w/ CLI, API can be useful afterwards.


  # load the assembly in the API
  data = ip.load_json("/home/jovyan/work/simdata.json")

  # show stats of the assembly
  data.stats

  # show output file paths of the assembly
  data.outfiles
  

Phylogenomic analyses


Infer gene trees and species trees, even over relatively deep evolutionary time scales (~100 Ma).

API analysis tools for population level analysis

Many other tools can be used with the output files as well.


  # load the ipyrad analysis tools
  import ipyrad.analysis as ipa

  # run raxml with ipa
  rax = ipa.raxml(
      data="./simdata_outfiles/simdata.phy",
      name="raxml-tree",
      N=50,
      T=4,
  )

  # run the analysis
  rax.run()
  

Phylogenomic analyses

Infer gene trees and species trees from RAD-seq data.


Example github repositories with reproducible notebooks

https://github.com/dereneaton/Canarium-GBS

https://nbviewer.jupyter.org/github/dereneaton/ipyrad/blob/master/tests/cookbook-empirical-API-1-pedicularis.ipynb