Tools to identify within-species contamination
Two tools identified:
- Haplocheck
  - Estimates contamination by detecting polymorphic sites in mitochondrial DNA (mtDNA) data and classifying them into mitochondrial haplogroups.
  - Can serve as a proxy for estimating nuclear DNA (nDNA) contamination levels.
- VerifyBamID2
  - Estimates DNA contamination accurately regardless of the genetic ancestry of the intended or contaminating sample.
  - Integrates the estimation of genetic ancestry and DNA contamination in a unified likelihood framework by leveraging individual-specific allele frequencies projected from reference genotypes onto principal component coordinates.
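To make the likelihood idea more concrete, here is a toy sketch of a two-sample mixture model for estimating a contamination fraction from read counts. It is not VerifyBamID2's implementation: it assumes the allele frequency at each site is already known (VerifyBamID2 would instead derive individual-specific frequencies from principal component coordinates), uses a plain grid search, and all function names (`hwe_prior`, `site_log_likelihood`, `estimate_contamination`, `simulate_sites`) as well as the fixed error rate are hypothetical choices made for illustration.

```python
import math
import random


def hwe_prior(f):
    """Genotype probabilities (0, 1, 2 alt alleles) under Hardy-Weinberg."""
    return [(1 - f) ** 2, 2 * f * (1 - f), f ** 2]


def site_log_likelihood(alt, depth, f, alpha, err=0.01):
    """Log-likelihood of the alt read count at one site for contamination alpha.

    Sums over the unknown genotypes of the intended sample (g1) and the
    contaminating sample (g2). The binomial coefficient is dropped because
    it does not depend on alpha.
    """
    prior = hwe_prior(f)
    total = 0.0
    for g1 in range(3):
        for g2 in range(3):
            # Expected alt-allele fraction among reads from the mixture.
            q = (1 - alpha) * g1 / 2 + alpha * g2 / 2
            # Fold in a symmetric per-base error rate (assumed constant).
            p = q * (1 - err) + (1 - q) * err
            total += prior[g1] * prior[g2] * p ** alt * (1 - p) ** (depth - alt)
    return math.log(total)


def estimate_contamination(sites, grid_step=0.01, max_alpha=0.5):
    """Grid-search maximum-likelihood estimate of alpha.

    `sites` is a list of (alt_reads, depth, allele_frequency) tuples.
    """
    best_alpha, best_ll = 0.0, -math.inf
    steps = int(round(max_alpha / grid_step))
    for k in range(steps + 1):
        alpha = k * grid_step
        ll = sum(site_log_likelihood(a, d, f, alpha) for a, d, f in sites)
        if ll > best_ll:
            best_alpha, best_ll = alpha, ll
    return best_alpha


def simulate_sites(n_sites, alpha, depth=30, err=0.01, seed=0):
    """Simulate read counts from a sample contaminated at fraction alpha."""
    rng = random.Random(seed)
    sites = []
    for _ in range(n_sites):
        f = rng.uniform(0.05, 0.95)
        g1 = sum(rng.random() < f for _ in range(2))  # intended sample
        g2 = sum(rng.random() < f for _ in range(2))  # contaminating sample
        q = (1 - alpha) * g1 / 2 + alpha * g2 / 2
        p = q * (1 - err) + (1 - q) * err
        alt = sum(rng.random() < p for _ in range(depth))
        sites.append((alt, depth, f))
    return sites


if __name__ == "__main__":
    simulated = simulate_sites(1000, alpha=0.10)
    print("estimated contamination:", estimate_contamination(simulated))
```

On simulated data the estimate should land close to the simulated fraction. In this toy model the supplied allele frequencies are the only place where ancestry enters, which is why the real tool estimates ancestry and contamination jointly.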
For more on this topic, also see the issue here.
Which tool to use?