I ran a time course with a resistant and a susceptible genotype over 22 days (0 dpi, 1 dpi, 7 dpi, 14 dpi, 22 dpi). I have a dds object from DESeq2 made from RSEM for all thirty time points. I would like to filter out lowly expressed genes by rowMeans, but want to do so in two batches (filter out the low row means for all of the susceptible samples, then the resistant ones). I should note that the two groups of 15 are separated by a condition (Genotype) included in the DESeqDataSetFromTximport function. Would I be able to just run a subset function on the two genotypes?
Thanks, Michael. After filtering, I ended up with this error:
Would you have any insight on how I could resolve said error? Thank you.
This reflects probably some aspects that make DESeq2 model not appropriate. Why are there no genes that are expressed for all samples?
The RSEM values were assembled through Trinity. I created my DDS object using transcript values. Might that explain it?
Eyeball your gene counts. Usually that error means you have a few samples that have zero reads for nearly every gene.
Thanks. Not entirely sure how I should progress. Do you think I should filter those transcripts with gene counts lower than 1 in any column?
Swbarnes2 said "samples".
Take a look at the column sum of the counts.
colSums(counts(dds))