Assemblytics弯蚜, 發(fā)表在Bioinformaticshttp://www.ncbi.nlm.nih.gov/pubmed/27318204拗慨,鑒定基因組間SV速蕊。
Githup,https://github.com/marianattestad/assemblytics
同時(shí)也可以在線使用,http://assemblytics.com
Assemblytics 首先基于nucmer比對(duì) (),然后進(jìn)行過濾,獲取單一比對(duì)結(jié)果坷檩,進(jìn)而進(jìn)行SV的檢測,可檢測如下SV改抡。
其中
- 插入和缺失, <50 bp overlap or gap
- Tandem, overmap >50 bp
依賴的工具
R相關(guān)
- ggplot2
- plyr
- RColorBrewer
-scales
Python
- argparse
- numpy
從githup下載后矢炼,給權(quán)限即可運(yùn)行
chmod a+x scripts/Assemblytics*
第一步 序列比對(duì)
nucmer -maxmatch -l 100 -c 500 REFERENCE.fa ASSEMBLY.fa -prefix OUT
第二步 過濾并鑒定SV
scripts/Assemblytics <delta_file> <output_prefix> <unique_anchor_length> <min_variant_size> <max_variant_size>
或者直接使用在線工具也是可以的,將比對(duì)好的結(jié)果拽入紅色框中即可
主要結(jié)果
-
變異類型即數(shù)量分布圖
變異結(jié)果bed文件
reference ref_start ref_stop ID size strand type ref_gap_size query_gap_size query_coordinates method
NC_000913.3 1972855 1978502 Assemblytics_b_1 5647 + Deletion 5647 0 NZ_CP009685.1:1721649-1721649:+ between_alignments
NC_000913.3 1873031 1873039 Assemblytics_b_2 777 + Insertion -8 769 NZ_CP009685.1:1821473-1822242:+ between_alignments
NC_000913.3 1096961 1097583 Assemblytics_b_3 181 + Tandem_expansion -622 -441 NZ_CP009685.1:2597877-2598318:- between_alignments
NC_000913.3 4295948 4296271 Assemblytics_b_5 113 + Tandem_contraction -323 -436 NZ_CP009685.1:4040722-4041158:- between_alignments