Dear Bioconductor users,
I am using WGCNA software for RNA-seq data (320 patient samples) and I want to decipher potential mechanisms underlying hepatocellular carcinoma progression. First, I removed all features that have a count of less than 10 in more than 90% of the samples and I transformed my data using varianceStabilizingTransformation function implemented in DEseq2 package. Second, I removed all potential outliers and then I used the pickSoftThreshold function to select the beta power for signed network construction. As you can see in the attached plot, the SFT fit index of power of 12 (default option) fails to reach the 0.8. The power of 16 fits the Scale free topology. Although the power of 16 means network modules with low mean connectivity. Which is your opinion about the selection of soft threshold power in my case? Is it absolutely necessary the selection of the soft threshold power based in SFT criterion?
https://www.dropbox.com/s/yyb3a4houef2utl/SFT_MEAN_CONN.pdf?dl=0
Thank you for your time in advance!!
Sincerely,
Panagiotis Mokos
If you are using data from the TCGA dataset, plot it with all the variables. When I analyzed some of subset of them I could find some batch effect. Did you look up for them? I recommend the library sva for that purpose.
Also it is hard to make sure, if there is a grouping factor or not, I recommend ploting a PCA or a MDS too (and colour by each variable)
Dear Lluis,
Thank you very much for your useful information!
Which variable did you use as variable for BATCH EFFECT detection and correction? Tissue source site(TSS), or another variable?
Thank you in advance!!
Panagiotis
I used the barcode to check if the sample vial had any effect, see the study here
Dear Lluis,
Thank you very very much for your quick and informative reply!!!
Panagiotis