當(dāng)然不是機(jī)器人總動(dòng)員中的伊娃~但是開(kāi)發(fā)作者卻巧妙的借用了其名湿酸,還給了伊娃的圖標(biāo)婿屹,作者是想開(kāi)發(fā)一個(gè)像伊娃一樣戰(zhàn)斗力爆表的工具嗎?
EVA推溃,EV細(xì)胞外囊泡簡(jiǎn)寫(xiě)昂利,A,Analysis的縮寫(xiě),名字就是這么來(lái)的吧蜂奸?(不是犁苏,是我瞎猜的...)
軟件地址:http://bioinfo.life.hust.edu.cn/EVAtool/
軟件目的:An optimized reads assignment tool for extracellular vesicle small ncRNA quantification
軟件的工作流
支持sra、fastq扩所、gz三種格式的數(shù)據(jù)輸入围详,使用fastqc對(duì)數(shù)據(jù)進(jìn)行質(zhì)量評(píng)估,Bowtie2比對(duì)碌奉,比對(duì)過(guò)程中有一個(gè)ORAA算法短曾,主要是為了分配一個(gè)read比對(duì)到多個(gè)sRNA上時(shí),分配給哪一種sRNA比較合理赐劣。接著進(jìn)行了定量和標(biāo)準(zhǔn)化嫉拐,最后是生成結(jié)果的可視化報(bào)告。
安裝軟件
# 創(chuàng)建conda環(huán)境
conda create -y -n EVAtool python=3魁兼。7
# 安裝依賴(lài)環(huán)境
conda install -y samtools
conda install -y bowtie2
conda install -y fastq-dump
conda install -y trimmomatic-0.39.jar
conda install -y bedtools
# pip安裝
pip install evatool
## 查看是否安裝成功
evatool -h
usage: evatool [-h] -i INPUT -o OUTPUT [-c CONFIG] [-n [NCRNA ...]]
EVAtool could be used to estimate the quantification and abundence of small ncRNA from EV or other sources.
options:
-h, --help show this help message and exit
-i INPUT, --input INPUT
The path of the input file, the file type could be '.sra, .fastq.gz or .fastq'.
-o OUTPUT, --output OUTPUT
The path of output file.
-c CONFIG, --config CONFIG
The path of the Config file. User can download the config file from url, or define yourself.
-n [NCRNA ...], --ncrna [NCRNA ...]
The list of small ncRNA types. User can use default ncRNA list (miRNA, rRNA, tRNA, piRNA, snoRNA, snRNA, YRNA), or define yourself.
下載準(zhǔn)備數(shù)據(jù)
參考基因組
# 下載參考基因組 人的 和config文件
wget "http://bioinfo.life.hust.edu.cn/EVAtool/ref/refs.zip"
# 解壓
unzip refs.zip
# refs內(nèi)容如下:
├── four_elements.sort.bed
├── Homo_sapiens.GRCh38.86.chr.gtf
├── Homo_sapiens_v86
├── miRNA
├── piRNA
├── reference_config.json
├── rRNA
├── snoRNA
├── snRNA
├── sRNA.fa
├── tRNA
├── two_elements.sort.bed
└── YRNA
目前官網(wǎng)只放了人類(lèi)的婉徘,其他的物種可以參考這個(gè)表格去下載:
下載示例fastq數(shù)據(jù)
# Download example data
wget http://bioinfo.life.hust.edu.cn/EVAtool/example/example.fastq.gz
# fq內(nèi)容
@SRR8185773.1 1 length=36
NCTACAGTGCACGTGTCTCCAGAGATCGGAAGAGCA
+SRR8185773.1 1 length=36
#AAAAEEEEEEEEEEEEEEAEEAEEAEEEE6EEEEE
@SRR8185773.2 2 length=36
GGCTGGTCCGATGGTAGTGGGTTATCAGAACAGATC
+SRR8185773.2 2 length=36
AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
@SRR8185773.3 3 length=36
CCTGTCTGAGGGTCGGAGATCGGAAGAGCACACGTC
+SRR8185773.3 3 length=36
AAAAAEEEEAEE/E/EEEEEEEEEEE/EE/E/EEE<
運(yùn)行EVAtool
# Run EVAtool
evatool -i example.fastq.gz -o ./
更多詳細(xì)輸入?yún)?shù)可以如下選擇:
輸出結(jié)果
輸出結(jié)果為一個(gè)html報(bào)告
fastq的read長(zhǎng)度分布
比對(duì)到不同sRNA上的read的比例
加上沒(méi)有比對(duì)上的reads
定量結(jié)果,是一個(gè)tab鍵的表格
每種sRNA表達(dá)top5的數(shù)據(jù):
不同sRNA的表達(dá)分布圖:
鑒定到的sRNA數(shù)目:
結(jié)果還挺豐富的咐汞,可以去用用看~
ref:
Xie G Y, Liu C J, Guo A Y. EVAtool: an optimized reads assignment tool for small ncRNA quantification and its application in extracellular vesicle datasets[J]. Briefings in Bioinformatics, 2022. Link: https://doi.org/10.1093/bib/bbac310