Reference: CEL-Seq2: sensitive highly-multiplexed single-cell RNA-Seq
GitHub: yanailab/CEL-Seq-pipeline
Download:
wget https://github.com/yanailab/CEL-Seq-pipeline/archive/refs/tags/v1.0.tar.gz
After extraction, the archive contains only these files:
CEL-Seq-pipeline-1.0/
├── bc_demultiplex.py
├── bowtie_wrapper.py
├── clean_up.py
├── htseq_count_umified.py
├── htseq_wrapper.py
├── LICENSE
├── pijpleiding.py
└── README.md
Usage:
pijpleiding config_file.txt
Prerequisites:
- your fastq files
- a barcode index file (barcode_umis.tab)
- a sample sheet txt file (sample_sheet_example.txt)
Install the software: python2, HTSeq (failed)
python2 -m pip install HTSeq
HTSeq no longer supports python2, so the installation fails.
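Before going further, it can help to confirm whether the interpreter you plan to use can import HTSeq at all. The following is a minimal sketch, not part of the pipeline: the helper name `can_import` is made up for illustration, and the snippet is written to run unchanged under both python2 and python3.

```python
import sys


def can_import(module_name):
    """Return True if module_name can be imported by this interpreter."""
    try:
        __import__(module_name)
    except ImportError:
        return False
    return True


# Report which interpreter is running and whether it can see HTSeq.
print("Python %d.%d" % sys.version_info[:2])
print("HTSeq importable: %s" % can_import("HTSeq"))
```

Run it with the same interpreter you intend to use for the pipeline, for example `python2 check_htseq.py` (the file name is arbitrary).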
If you already have a python2 build of HTSeq, you can continue.
First, create a config_file.txt and adjust the parameters and paths in it:
## pijpleiding configuration file. Run `pijpleiding --help` for more help.
##
## the pipe_run parameter decides whether to run the pipe segment or not.
## the pipe_input_files (which can be multilined) is treated as multiple shell
## patterns referring to existing files, so it is expanded and split accordingly,
## and passed as 'input_files' to the pipe segment. The rest of the parameters
## are passed as they are to the pipe segments, so check their description
[scythe_wrapper]
pipe_run = True
[bc_demultiplex]
pipe_run = True
bc_index_file= /path_to/barcodes_umis.tab
sample_sheet= /path_to/Sample_sheet.txt
pipe_input_files= /path_to/*/*R1*.fastq
output_dir= /path_to/barcode_splitted
stats_file= stats.tab
min_bc_quality= 10
bc_length = 6
umi_length = 5
cut_length = 35
[bowtie_wrapper]
pipe_run = True
pipe_input_files= /path_to/barcode_splitted/CE_*.fastq
index_file= /path_to/refs/genomes/CE/WS230/c_elegans.WS230_spikein.genomic
output_dir= /path_to/sam_files
bowtie_report_name = bt_report_full.tab
number_of_threads = 3
extra_params =
procs = 10
[htseq_wrapper]
pipe_run = True
pipe_input_files = /path_to/sam_files/*sam
gff_file = /path_to/refs/annotations/CE/WS230/c_elegans.WS230_spikein.annotations_trimmed.spikes_and_lincs.gff3
output_dir= /path_to/expression_umi
umi= true
extra_params = -q
count_filename = CE_exp.tab
[clean_up]
pipe_run = False
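The comments at the top of the config describe two behaviours: `pipe_run` switches a segment on or off, and `pipe_input_files` is expanded as shell patterns. The sketch below shows one way to inspect a config of this shape with Python 3's standard `configparser` and `glob` modules before launching anything. It is an illustration only and not how pijpleiding.py itself parses the file; the helper names `load_config`, `enabled_segments` and `expand_input_files` and the shortened `EXAMPLE_CONFIG` are assumptions made for this example.

```python
import configparser
import glob

# Shortened, hypothetical config in the same format as the one above.
EXAMPLE_CONFIG = """\
## lines starting with '#' are treated as comments
[bc_demultiplex]
pipe_run = True
pipe_input_files = /path_to/*/*R1*.fastq
[bowtie_wrapper]
pipe_run = True
extra_params =
[clean_up]
pipe_run = False
"""


def load_config(text):
    """Parse INI-style config text into a ConfigParser object."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return parser


def enabled_segments(parser):
    """List the sections whose pipe_run option is true."""
    return [name for name in parser.sections()
            if parser.getboolean(name, "pipe_run", fallback=False)]


def expand_input_files(parser, section):
    """Split pipe_input_files on whitespace and expand each shell pattern."""
    patterns = parser.get(section, "pipe_input_files", fallback="").split()
    files = []
    for pattern in patterns:
        files.extend(sorted(glob.glob(pattern)))
    return files


parser = load_config(EXAMPLE_CONFIG)
print(enabled_segments(parser))  # ['bc_demultiplex', 'bowtie_wrapper']
print(expand_input_files(parser, "bc_demultiplex"))
```

An empty list from `expand_input_files` means the pattern matches no existing files, which is worth checking before a long run.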
Finally, run:
python2 CEL-Seq-pipeline-1.0/pijpleiding.py config_file.txt
Conclusion: this code is too outdated to be worth using!