After sequence import, the quality control (QC) reports generated within MGX should be inspected (2.14) before proceeding with data analysis. MGX currently offers three types of QC reports: GC content distribution, sequence length distribution, and nucleotide distribution within the DNA sequences. These can be used to evaluate overall sequence data quality and to check for possible signs of contamination. For demonstration purposes, the data shown relate to the artificial simHC metagenome dataset created by the FAMeS project [Mavromatis et al., 2007]. The sequence data are publicly available and can be obtained from the FAMeS web site (http://fames.jgi-psf.org/Retrieve_data.html).
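To make the three report types more concrete, the following minimal sketch shows how comparable statistics could be computed from a list of reads. It is not MGX's implementation; the function names and the toy `reads` list are assumptions made purely for illustration.

```python
from collections import Counter

def gc_content(seq):
    """Fraction of G/C among the unambiguous bases (A, C, G, T) of a read."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else 0.0

def gc_histogram(seqs):
    """Map GC percentage (rounded to an integer) to the number of reads."""
    return Counter(round(100 * gc_content(s)) for s in seqs)

def length_histogram(seqs):
    """Map read length to the number of reads of that length."""
    return Counter(len(s) for s in seqs)

def per_position_counts(seqs):
    """Return one Counter per read position with the bases observed there."""
    counts = []
    for s in seqs:
        for i, base in enumerate(s.upper()):
            if i == len(counts):
                counts.append(Counter())
            counts[i][base] += 1
    return counts

# Hypothetical toy input; real reads would come from the imported sequence file.
reads = ["ACGTACGT", "GGGCCC", "ATATATATAT"]

if __name__ == "__main__":
    print("GC histogram:", dict(gc_histogram(reads)))
    print("Length histogram:", dict(length_histogram(reads)))
    print("Bases at position 0:", dict(per_position_counts(reads)[0]))
```

In plots derived from such counts, a second peak in the GC histogram or a strongly skewed base composition at particular read positions would be the kind of pattern worth a closer look.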
Depending on the kind of sequence data, different patterns may emerge (2.18), which may or may not warrant further action. While small amounts of adapter residue, for example, are sometimes encountered and might be considered acceptable, it is up to the individual researcher to check back with their sequencing provider, ask for the adapter sequences, and perform additional trimming where needed.
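As a rough illustration of what such additional trimming involves, the sketch below searches each read for an adapter and cuts the read at that point. It is not part of MGX and uses exact matching only. The `ADAPTER` value is a placeholder to be replaced with the sequence obtained from the sequencing provider, and the function names and the `min_overlap` parameter are assumptions made for this example.

```python
# Placeholder adapter; replace with the sequence supplied by the provider.
ADAPTER = "AGATCGGAAGAGC"

def find_adapter(read, adapter, min_overlap=5):
    """Return the index where the adapter starts in the read, or -1.

    A full adapter is found anywhere in the read; a partial adapter is
    accepted only at the 3' end if at least min_overlap bases match.
    """
    read, adapter = read.upper(), adapter.upper()
    pos = read.find(adapter)
    if pos != -1:
        return pos
    # Try progressively shorter adapter prefixes at the end of the read.
    for k in range(min(len(adapter) - 1, len(read)), min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return len(read) - k
    return -1

def trim_adapter(read, adapter, min_overlap=5):
    """Remove the adapter and everything following it from the read."""
    pos = find_adapter(read, adapter, min_overlap)
    return read if pos == -1 else read[:pos]

def adapter_fraction(reads, adapter, min_overlap=5):
    """Fraction of reads in which the adapter (or a 3' partial) is found."""
    if not reads:
        return 0.0
    hits = sum(find_adapter(r, adapter, min_overlap) != -1 for r in reads)
    return hits / len(reads)

# Hypothetical toy reads: full adapter, partial adapter at the 3' end, none.
example_reads = [
    "ACGTACGT" + ADAPTER + "TTT",
    "ACGTACGTAGATCGG",
    "ACGTACGT",
]

if __name__ == "__main__":
    print("Fraction with adapter:", adapter_fraction(example_reads, ADAPTER))
    print("Trimmed:", [trim_adapter(r, ADAPTER) for r in example_reads])
```

Because sequencing errors can occur within adapter residue, exact matching of this kind tends to underestimate contamination; it is meant only to show the principle.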
Sebastian Jaenicke, 2020-04-28