Macaca mulatta.jpg
1 軟件安裝
http://www.reibang.com/p/eb89ab4af035
linux平臺下需要安裝的軟件:fastqc咬摇,fastp贸桶,hisat2睬捶,samtools,htseq
2下載基因組序列和基因組注釋文件
獼猴基因組和注釋文件:
Macaca mulatta (ID 215) - Genome - NCBI (nih.gov)
Macaca_mulatta - Ensembl genome browser 104
3構建索引文件
hisat2-build -p 2 GCF_003339765.1_Mmul_10_genomic.fna Mmul
hisat2-build -p 2 Macaca_mulatta.Mmul_10.dna.toplevel.fa Mmul
4過濾raw reads
mkdir -p fastp
ls *1.fastq.gz|while read id;
do
fastp -5 20 -i ${id%_*}_1.fastq.gz -I ${id%_*}_2.fastq.gz \
-o ${id%_*}_1.clean.fq.gz -O ${id%_*}_2.clean.fq.gz \
-j ./fastp/${id%_*}.json -h ./fastp/${id%_*}.html;
done
5比對
ls *1.clean.fq.gz|while read id;
do
hisat2 -t -p 3 -x /media/lzx/0000678400004823/Indexs/Hisat2/Macaca_mulatta/Mmul \
-1 $id -2 ${id%_*}_2.clean.fq.gz \
2>${id%%_*}.hisat.log \
|samtools sort -@ 3 -o ${id%_*}_ht2p.bam
done