Entering edit mode
Hello everyone, I am trying to do a co-expression network analysis of my RNA seq data in R, but I am getting an error. Please help me to understand why my data got terminated.
# >library(WGCNA)
> options(stringsAsFactors = FALSE)
> HBV <- read.csv("HBV.csv",row.names=1)
> dim(HBV)
[1] 26692 23
> names(HBV)
[1] "A" "B" "C" "D" "E" "F" "G" "H" "I" "J" "K" "L" "M" "N" "O" "P" "Q" "R" "S"
[20] "T" "U" "V" "W"
> gsg = goodSamplesGenes(HBV, verbose = 3)
Flagging genes and samples with too many missing values...
..step 1
> gsg$allOK
[1] TRUE
> sampleTree = hclust(dist(HBV), method = "average")
# Terminated
I am following the standard tutorial and from there I have used these codes.
Can you please elaborate? - which part of the tutorial? You neither answered my first question, but that is okay. It just means that I / we have less ability to assist.
Please follow this link : https://horvath.genetics.ucla.edu/html/CoexpressionNetwork/Rpackages/WGCNA/Tutorials/FemaleLiver-01-dataInput.pdf I am first trying to preprocess and clean my data.
This is my system's memory information.
============================================================
Thanks for sharing the information. It seems that your computer's memory is relatively low, compared to other computers. This may not be enough to generate the required distance matrix, and this is what is likely causing the crash / error.
You could try to generate the distance tree via pvclust, as this uses bootstrapping; however, with this amount of memory at your disposal, you are going to frequently run into problems in the future.
You could also try to skip this part of the WGCNA workflow. I think that the other WGCNA functions make better use of memory.
Thanks for your patience and the information I will soon upgrade my system.