Hello everyone. I have some RNAseq data from different conditions and samples. Using tximport function, I was able to map transcript ids to gene ids. Now I have a data matrix where rows are gene ids and columns are TPM values for 21 samples. Since I have several transcript variants for each gene, their transcript ids are different, but they map to a same gene id. To make some biological meaning out of this, I want to ignore transcript variants and merge all TPM values for the same gene id. What is the best way to do this? Sum them up? Or a weighed sum (based on length)?
Since I am new to RNAseq, I will appreciate your help :)