要做什么桥状?
文章里面的:
我自己的表達(dá)矩陣的的話:有60564個(gè)基因
文章里面的“GSE130437_genes.fpkm_table”,有60603個(gè)基因
(少了四十幾個(gè)硝清?)
先用自己的數(shù)據(jù)辅斟,去差異性分析,看看符不符合芦拿?
exprset_my <- read.table('all.id.txt',header = T,sep = '\t',fill = T)
exprset_my=exprset_my[!duplicated(exprset_my$Geneid),]
row.names(exprset_my) <- exprset_my$Geneid
exprset_my <- exprset_my[,-1]
exprset_my1 <- exprset_my[,6:11]
colnames(exprset_my1) <- c("MCF7pR1","MCF7pR2","MCF7pR3","MCF7pS1","MCF7pS2","MCF7pS3")
colData <- read.csv("pdata_溶瘤病毒耐藥1.csv", header = T)
row.names(colData) <- colData$X
coldata2 <- colData[2]
#DESeq2差異性分析
library(DESeq2)
dds <- DESeqDataSetFromMatrix(countData = exprset_my1,colData = colData,design = ~ condition)
dds <- DESeq(dds)
res <- results(dds, contrast=c("condition","control","treatment"))
DEG <- as.data.frame(res)
DEG <- na.omit(DEG)
diff_gene <-subset(DEG, padj <= 0.05 & abs(log2FoldChange) > 1)
diff_gene_up <- subset(diff_gene, log2FoldChange > 1)
diff_gene_down <- subset(diff_gene, log2FoldChange < -1)
文章的分析結(jié)果是:
Using a q-value cutoff ≤ 0.05 with |log2FC| ≥1, we identified 2183 up-regulated genes and 1548 down-regulated transcripts in MCF7/pR cells
自己的diff-gene有5227個(gè)
up的:3538
down的有:1689
也就是up基因那里士飒,多了一千多個(gè)?
找表1顯示了與MCF7 / pS細(xì)胞相比蔗崎,MCF7 / pR細(xì)胞中排名前20位的上調(diào)和下調(diào)基因酵幕。
我自己找的,和文章的缓苛,完全對(duì)不上號(hào)芳撒?
用文章里面的數(shù)據(jù)試試吧?
文章給的處理好的GEO數(shù)據(jù)未桥。