Hi,
I have 4 .csv files containing gene level (transcripts collapsed) counts coming from infected (2 biological replicates) and non-infected (2 biological replicates) generated via RSEM. I dont have access to raw data files. Their contents are as follows;
gene_id transcript_id(s) length expected_count FPKM
AT1G01010 AT1G01010.1 1688 88 2.68
However, number of rows in each file is different (My guess is 0 counts were removed). Can someone please help me by advising (or share a R script)me on how to get these data combined to match the input of tximport? so I can do the differential expression analysis using DESeq2. Thank you!
Thank Michael! I agree. These data coming from an outside provider. They have provided RSEM output (.csv ). Their pipeline includes RSEM > DEseq2. I am trying to recreate their DESeq2 output (also provided as .csv). In a way this is me trying to see the reproducibility of their output. I appreciate if you can give me some guidance to achieve that.
I have no idea what they did. Presumably used estimated counts without an offset. Why not reach out to them for their scripts.
Thank you. I will do that.