Entering edit mode
I need to try to compare some TCGA data with my rnaseq data. I take this 1098 sample and I try to use on deseq2 package.
SampleTable SampleName fileName condition 1 1 0000772b-773d-4cf8-8baf-0e1e6dbf55e8.htseq.counts A 2 2 003d640d-3bfc-4791-b241-4a3206d49468.htseq.counts A 3 3 00511204-3512-4a5e-b664-60271e968903.htseq.counts B 4 4 00587432-3004-4d4b-9dc2-8c90c81f7640.htseq.counts B 5 5 0094a627-a55f-48dc-bbc3-ea66035e514a.htseq.counts A 6 6 009ff7e3-8bd3-4683-adf5-7b68a0859f35.htseq.counts A 7
This is the code I use. Unfortunately there is no way to obtain an Image. R kill the run when I try to use rlog()
Here you can se the code. What can I do for complete this task. My PC have 35 GB ram and Is Centos 7
metadata<-read.table("ID_provami.csv",sep=",",header=T) SampleTable <-data.frame(SampleName=metadata$ID,fileName=metadata$NOMIFILE,condition=factor(metadata$CONDITION)) ddHTSeq <- DESeqDataSetFromHTSeqCount(sampleTable=SampleTable,directory="TuttoCount", design= ~ 1) # dds <- estimateSizeFactors(ddHTSeq) #filtro 10 conti there are less than 0,3 samples with counts greater than or equal to 10. Given that the inference i idx <- rowSums( counts(dds) >= 10 ) >= 0.3 dds <- dds[idx,] dds <- DESeq(dds) rld = rlog(dds) data<-DESeq2::plotPCA(rld,intgroup=c("condition"),returnData=TRUE) p1<-qplot(PC1,PC2,color=condition,data=data,size=I(9))+geom_text(aes(label=ID),hjust=-2,vjust=1) ggsave("PCA_condition.png",p1)