Skip to main content
Figure 1 | BioData Mining

Figure 1

From: Mycoplasma contamination in the 1000 Genomes Project

Figure 1

Schematic showing major data flows in Mycoplasma analysis of The Thousand Genome Project (top color). A random sample (≈8%) of next generation scan are copied across the Internet to the computer at UCL (black). Bowtie [8] is used to extract individual and paired-end DNA measurements which match one or more of the thirty published Mycoplasma genomes (Additional file 1: Table S1). Bowtie is used a second time to exclude DNA measurements which match the reference human genome, leaving 75 879 Mycoplasma DNA measurements from 2055 scans of the 4058 downloaded.

Back to article page