Hi,
I am working with the transcriptomics expression data (a 96 targeted gene panel) and many samples (consisting of 4 different healthy/diseases conditions including Healthy
, Disease A
, Disease B
, and Disease C
. These different disease conditions are again sub-categorized into different stages (Moderate
, Mild
, Severe
, Remission
). Further, these were stimulated with 10 different stimulations/treatments (Stim 1
, Stim 2
, Stim 3
, Stim 4
, ..............., Stim 10
). Basically, this is a multi-level experiment with multiple samples from each subject, and analysis was perfomed using multi-level experiments in the limma R package User's Guide
, I performed statistical analysis identified differentially expressed genes between groups/comparisons as stated below.
My question is, what is the best way to represent this data in the form of visualization as I have many comparisons? One way is to extract the differentially expressed genes with p.val = 0.05 and logFC = +/-1.0 per each comparison and plot the heatmap
, etc., But this will be tedious and time consuming task. The other way, I was thinking it would be nice if there are common or unique genes/biomarker across all these conditions and then plot heatmap
. With these, certain biological contextualization can be performed like pathways/ontologies
Groups compared in limma
:
Cont.matrix <- makeContrasts(
DaRS2.1 = Disease_A_Remission_Stim_2 - Disease_A_Remission_Stim_1
DaRS3.1 = Disease_A_Remission_Stim_3 - Disease_A_Remission_Stim_1
DaRS4.1 = Disease_A_Remission_Stim_4 - Disease_A_Remission_Stim_1
etc.,
DbRS2.1 = Disease_B_Remission_Stim_2 - Disease_B_Remission_Stim_1
DbRS3.1 = Disease_B_Remission_Stim_3 - Disease_B_Remission_Stim_1
DbRS4.1 = Disease_B_Remission_Stim_4 - Disease_B_Remission_Stim_1
etc.,
DbMS2.1 = Disease_B_Mild_Stim_2 - Disease_B_Mild_Stim_1
DbMS3.1 = Disease_B_Mild_Stim_3 - Disease_B_Mild_Stim_1
DbMS4.1 = Disease_B_Mild_Stim_4 - Disease_B_Mild_Stim_1
etc.,
DbaMS2.1 = Disease_B_Mild_Stim_2 - Disease_A_Mild_Stim_1
DbaMS3.1 = Disease_B_Mild_Stim_3 - Disease_A_Mild_Stim_1
DbaMS4.1 = Disease_B_Mild_Stim_4 - Disease_A_Mild_Stim_1
etc.,
levels=design)
Thank you, Toufiq
Ali Barry Thank you for the information and inputs. I will work on the design further. Yes,
facet_wrap
is a good suggestion. I will try this. One question I have at the moment is how do I calculate the variance or highest variance. I have adata.frame
with fold changes and I am interested in extracting highly variance genes and plotting the data. Is there any specific package to perform this or base function in R.Would
rowVars
work for your data structure?Ali Barry
Sure, let me try
rowVars
.