I. Data Processing and Statistics
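MiSeq sequencing produces paired-end reads. The two reads of each pair are merged into a single full-length sequence based on their overlapping (complementary) region. The sequences are then quality-filtered on both base quality and merge quality. Finally, the index and primer sequences at the two ends of each sequence are used to assign it to its sample of origin, which yields the set of high-quality sequences.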
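The merge step can be illustrated with a toy sketch. This is not the tool used by any particular pipeline: the function names `reverse_complement` and `merge_pair` and the `min_overlap` parameter are hypothetical, and the sketch assumes an exact-match overlap, whereas real mergers tolerate mismatches and weigh base quality scores.

```python
# Toy illustration only: exact-match overlap merging of one read pair.
# Function names and the min_overlap default are assumptions for this sketch.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (upper-case ACGTN)."""
    return seq.translate(_COMPLEMENT)[::-1]


def merge_pair(forward, reverse, min_overlap=10):
    """Merge a read pair on its longest exact overlap.

    `reverse` is the second read as sequenced (5'->3' on the opposite strand).
    Returns the merged sequence, or None if no overlap of at least
    `min_overlap` bases is found.
    """
    rc = reverse_complement(reverse)
    longest = min(len(forward), len(rc))
    # Try the longest candidate overlap first, then shrink.
    for k in range(longest, min_overlap - 1, -1):
        if forward[-k:] == rc[:k]:
            return forward + rc[k:]
    return None


# Hypothetical fragment, split into two overlapping reads for demonstration.
fragment = "ACGTTGCAAGGCTTAACCGGTT"
read1 = fragment[:15]
read2 = reverse_complement(fragment[7:])
print(merge_pair(read1, read2, min_overlap=5) == fragment)
```

Pairs for which `merge_pair` returns `None` would be the ones discarded by the merge-quality filter in this simplified picture.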
The summary statistics table reports the key figures for this batch of samples, including fragment length, the number of raw reads, the total number of bases, and the number of reads per sample.
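As a rough sketch of how such a summary table could be computed, the snippet below assumes the sequences have already been assigned to samples and loaded into a dictionary. The function name `summarize`, the dictionary layout, and the toy sequences are all assumptions made for illustration.

```python
def summarize(reads_by_sample):
    """Summarize a batch of reads.

    reads_by_sample: dict mapping a sample name to a list of sequence strings
    (a hypothetical in-memory layout chosen for this sketch).
    """
    lengths = [len(seq) for reads in reads_by_sample.values() for seq in reads]
    total_reads = len(lengths)
    total_bases = sum(lengths)
    return {
        "total_reads": total_reads,
        "total_bases": total_bases,
        # Guard against an empty batch to avoid dividing by zero.
        "mean_length": total_bases / total_reads if total_reads else 0.0,
        "reads_per_sample": {
            name: len(reads) for name, reads in reads_by_sample.items()
        },
    }


# Toy data, for illustration only.
batch = {"S1": ["ACGT", "ACGTAC"], "S2": ["AC"]}
for key, value in summarize(batch).items():
    print(key, value)
```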
II. Describing the Data Processing in a Paper
a) Describing the total data volume
Example: All raw reads were merged based on their overlap and quality filtered. XX,XXXX reads were obtained from XXX samples. The average read length was XXX bp. ……
b) Describing the average data volume per sample
Example: XXX samples were sequenced on the Illumina MiSeq platform, and X,XXX±XXX high-quality reads per sample were obtained for bioinformatic analysis. ……
Notes:
The sequencing depth per sample can be reported as Mean ± SD. To do so, you may need a tool such as Excel to tabulate the read count of each sample and then compute the mean and the SD.
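The same calculation can be scripted instead of done in a spreadsheet. The sketch below uses Python's standard `statistics` module. The function name `mean_sd` and the read counts are hypothetical, and it assumes the sample standard deviation is wanted (the counterpart of Excel's `STDEV.S`); use `statistics.pstdev` if the population SD is intended.

```python
import statistics


def mean_sd(read_counts):
    """Return (mean, sample SD) of per-sample read counts."""
    counts = list(read_counts)
    if len(counts) < 2:
        raise ValueError("need at least two samples to compute an SD")
    return statistics.mean(counts), statistics.stdev(counts)


# Hypothetical per-sample read counts, for illustration only.
example_counts = {"S1": 10000, "S2": 12000, "S3": 14000}
mean, sd = mean_sd(example_counts.values())
print(f"{mean:,.0f} ± {sd:,.0f} high-quality reads per sample")
```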