之前一直用fastqc + trimmomatic 數(shù)據(jù)質(zhì)控早芭,基于fastp快速方便等特點(diǎn)扬蕊,研究轉(zhuǎn)為fastp做質(zhì)控治宣。
trimmomatic:我們使用的參數(shù)(以MALBAC_1_step_lib為例)
trimmomatic的具體參數(shù)用法:
http://www.reibang.com/p/a8935adebaae
高通量測序常見的接頭序列
https://github.com/csf-ngs/fastqc/blob/master/Contaminants/contaminant_list.txt
ILLUMINACLIP="ILLUMINACLIP:" + ADAPTERS + ":2:20:6"
SLIDINGWINDOW= SLIDINGWINDOW:4:15
LEADING= "LEADING:3",
TRAILING= "TRAILING:3",
MINLEN= "MINLEN:25",
CROP= 60
HEAD_CROP=12
fastp 具體的參數(shù)用法:
http://www.reibang.com/p/6f492058da5b
https://github.com/OpenGene/fastp#base-correction-for-pe-data
http://www.biotrainee.com/thread-2540-1-1.html
fastp 對應(yīng)trimmomatic 的參數(shù)如下:
1.先做接頭切除:用默認(rèn)值
2.做滑窗處理:窗口大小:4提澎;每個窗口平均堿基質(zhì)量值:15
--cut_window_size 4
--cut_mean_quality 15
3.根據(jù)堿基質(zhì)量切
---cut_front/-5 3
--cut_front/-3 3
4.長度過濾:丟掉長度不夠的reads
--length_required 25
5.做全局剪切
reads開頭切掉的堿基數(shù)
read1: -f 12
read2: -F 12
從reads尾部開始切姚垃,使其達(dá)到指定長度
read1: -b/--max_len1 60
read2: -B, --max_len2 60