pybedtools文檔-平行在外顯子和內(nèi)含子中計(jì)算reads數(shù)

本例子展示的是如何在內(nèi)含子和外顯子中統(tǒng)計(jì)reads數(shù)量董虱。不但包括了這個(gè)內(nèi)容，還包括了其他的命名展示申鱼。例如

1愤诱、BAM file support (for more, see?Working with BAM files)

2、indexing into Interval objects (for more, see?Intervals)

3润讥、filtering (for more, see?Filtering)

4转锈、streaming (for more, see?Using BedTool objects as iterators/generators)

5、ability to use parallel processing

下面是含有注釋信息的命令行展示

import sys

import multiprocessing

import pybedtools

# get?example GFF and BAM filenames 獲得例子中的GFF和BAM的內(nèi)容

gff = pybedtools.example_filename('gdc.gff')

bam = pybedtools.example_filename('gdc.bam')

# Some?GFF files have invalid entries -- like chromosomes with negative coords GFF文件中含有不實(shí)的內(nèi)容楚殿，例如染色為負(fù)數(shù)坐標(biāo)的信息

# orfeatures of length = 0.? This lineremoves them and saves the result in a?tempfile 或者特征值的長(zhǎng)度為0撮慨，可以使用命令行去除這些無(wú)意義的內(nèi)容，并且存儲(chǔ)為另外一個(gè)文件脆粥。

g = pybedtools.BedTool(gff).remove_invalid().saveas()

# Next,?we create a function to pass only features for a particular?featuretype.? This is similar to a"grep" operation when applied to every#feature in a BedTool

~~隨后砌溺，我們構(gòu)建了一個(gè)函數(shù)用來(lái)只顯示符合要求的特征，類似于grep命令~~

def featuretype_filter(feature, featuretype):

??? if feature[2] == featuretype:

??????? return True

??? return False

# This?function will eventually be run in parallel, applying the filter above?to?several different BedTools simultaneously?多種不同的BedTools同時(shí)選擇

def subset_featuretypes(featuretype):

??? result = g.filter(featuretype_filter, featuretype).saveas()

??? return pybedtools.BedTool(result.fn)

~~# Thisfunction performs the intersection of a BAM file with a GFF file and?returns the total number of hits.? Itwill eventually be run in parallel.~~

def count_reads_in_features(features_fn):

??? """

??? Callback function to count reads infeatures

??? """

??# BAM files are auto-detected; no need foran `abam` argument.? Here we?# construct a new BedTool out of the BAM?file and intersect it with the?features filename.?We use stream=True so that no?intermediate tempfile is?created, and bed=True so that the.count() method can iterate through the?resulting streamed BedTool.

??? return pybedtools.BedTool(bam).intersect(b=features_fn,?stream=True).count()

# Set?up a pool of workers for parallel processing pool = multiprocessing.Pool()

#?Create separate files for introns and exons, using the function we defined?above

featuretypes = ('intron', 'exon')

introns, exons = pool.map(subset_featuretypes, featuretypes)

#?Perform some genome algebra to get unique and shared intron/exon regions.?Here?we keep only the filename of the results, which is safer than an entire BedTool for passing around in parallel computations.

exon_only = exons.subtract(introns).merge().remove_invalid().saveas().fn

intron_only = introns.subtract(exons).merge().remove_invalid().saveas().fn

intron_and_exon = exons.intersect(introns).merge().remove_invalid().saveas().fn

# Do?intersections with BAM file in parallel, using the other function we defined above

features = (exon_only, intron_only, intron_and_exon)

results = pool.map(count_reads_in_features, features)

# Print?the results

labels = (' exon only:',?' intron only:',?'intron and exon:')

for label, reads in zip(labels, results):

??? sys.stdout.write('%s %s\n' % (label, reads))

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者

人面猴
序言：七十年代末变隔，一起剝皮案震驚了整個(gè)濱河市规伐，隨后出現(xiàn)的幾起案子，更是在濱河造成了極大的恐慌匣缘，老刑警劉巖猖闪，帶你破解...
沈念sama閱讀 218,122評(píng)論 6贊 505
死咒
序言：濱河連續(xù)發(fā)生了三起死亡事件，死亡現(xiàn)場(chǎng)離奇詭異肌厨，居然都是意外死亡培慌，警方通過(guò)查閱死者的電腦和手機(jī)，發(fā)現(xiàn)死者居然都...
沈念sama閱讀 93,070評(píng)論 3贊 395
救了他兩次的神仙讓他今天三更去死
文/潘曉璐我一進(jìn)店門柑爸，熙熙樓的掌柜王于貴愁眉苦臉地迎上來(lái)吵护，“玉大人，你說(shuō)我怎么就攤上這事表鳍∠诙” “怎么了？”我有些...
開(kāi)封第一講書(shū)人閱讀 164,491評(píng)論 0贊 354
道士緝兇錄：失蹤的賣姜人
文/不壞的土叔我叫張陵譬圣，是天一觀的道長(zhǎng)瓮恭。經(jīng)常有香客問(wèn)我，道長(zhǎng)厘熟，這世上最難降的妖魔是什么偎血？我笑而不...
開(kāi)封第一講書(shū)人閱讀 58,636評(píng)論 1贊 293
?港島之戀（遺憾婚禮）
正文為了忘掉前任诸衔，我火速辦了婚禮盯漂，結(jié)果婚禮上颇玷，老公的妹妹穿的比我還像新娘。我一直安慰自己就缆，他們只是感情好帖渠，可當(dāng)我...
茶點(diǎn)故事閱讀 67,676評(píng)論 6贊 392
惡毒庶女頂嫁案：這布局不是一般人想出來(lái)的
文/花漫我一把揭開(kāi)白布。她就那樣靜靜地躺著竭宰，像睡著了一般空郊。火紅的嫁衣襯著肌膚如雪。梳的紋絲不亂的頭發(fā)上切揭，一...
開(kāi)封第一講書(shū)人閱讀 51,541評(píng)論 1贊 305
城市分裂傳說(shuō)
那天狞甚，我揣著相機(jī)與錄音，去河邊找鬼廓旬。笑死哼审，一個(gè)胖子當(dāng)著我的面吹牛，可吹牛的內(nèi)容都是我干的孕豹。我是一名探鬼主播涩盾，決...
沈念sama閱讀 40,292評(píng)論 3贊 418
雙鴛鴦連環(huán)套：你想象不到人心有多黑
文/蒼蘭香墨我猛地睜開(kāi)眼，長(zhǎng)吁一口氣：“原來(lái)是場(chǎng)噩夢(mèng)啊……” “哼励背！你這毒婦竟也來(lái)了春霍？” 一聲冷哼從身側(cè)響起，我...
開(kāi)封第一講書(shū)人閱讀 39,211評(píng)論 0贊 276
萬(wàn)榮殺人案實(shí)錄
序言：老撾萬(wàn)榮一對(duì)情侶失蹤叶眉，失蹤者是張志新（化名）和其女友劉穎址儒，沒(méi)想到半個(gè)月后，有當(dāng)?shù)厝嗽跇?shù)林里發(fā)現(xiàn)了一具尸體衅疙，經(jīng)...
沈念sama閱讀 45,655評(píng)論 1贊 314
?護(hù)林員之死
正文獨(dú)居荒郊野嶺守林人離奇死亡莲趣，尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛以下內(nèi)容為張勛視角年9月15日...
茶點(diǎn)故事閱讀 37,846評(píng)論 3贊 336
?白月光啟示錄
正文我和宋清朗相戀三年，在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了炼蛤。大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片妖爷。...
茶點(diǎn)故事閱讀 39,965評(píng)論 1贊 348
活死人
序言：一個(gè)原本活蹦亂跳的男人離奇死亡，死狀恐怖理朋，靈堂內(nèi)的尸體忽然破棺而出絮识，到底是詐尸還是另有隱情，我是刑警寧澤嗽上，帶...
沈念sama閱讀 35,684評(píng)論 5贊 347
?日本核電站爆炸內(nèi)幕
正文年R本政府宣布次舌，位于F島的核電站，受9級(jí)特大地震影響兽愤，放射性物質(zhì)發(fā)生泄漏彼念。R本人自食惡果不足惜挪圾，卻給世界環(huán)境...
茶點(diǎn)故事閱讀 41,295評(píng)論 3贊 329
男人毒藥：我在死后第九天來(lái)索命
文/蒙蒙一、第九天我趴在偏房一處隱蔽的房頂上張望逐沙。院中可真熱鬧哲思，春花似錦、人聲如沸吩案。這莊子的主人今日做“春日...
開(kāi)封第一講書(shū)人閱讀 31,894評(píng)論 0贊 22
一樁弒父案，背后竟有這般陰謀
文/蒼蘭香墨我抬頭看了看天上的太陽(yáng)徘郭。三九已至靠益，卻和暖如春，著一層夾襖步出監(jiān)牢的瞬間残揉，已是汗流浹背胧后。一陣腳步聲響...
開(kāi)封第一講書(shū)人閱讀 33,012評(píng)論 1贊 269
情欲美人皮
我被黑心中介騙來(lái)泰國(guó)打工，沒(méi)想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留抱环，地道東北人壳快。一個(gè)月前我還...
沈念sama閱讀 48,126評(píng)論 3贊 370
代替公主和親
正文我出身青樓，卻偏偏與公主長(zhǎng)得像江醇，于是被迫代替她去往敵國(guó)和親濒憋。傳聞我的和親對(duì)象是個(gè)殘疾皇子，可洞房花燭夜當(dāng)晚...
茶點(diǎn)故事閱讀 44,914評(píng)論 2贊 355

pybedtools文檔-平行在外顯子和內(nèi)含子中計(jì)算reads數(shù)

1愤诱、BAM file support (for more, see?Working with BAM files)

2、indexing into Interval objects (for more, see?Intervals)

3润讥、filtering (for more, see?Filtering)

4转锈、streaming (for more, see?Using BedTool objects as iterators/generators)

5、ability to use parallel processing

推薦閱讀更多精彩內(nèi)容