Hi all,
My data is a data frame of estimated TPM counts, where rows are the genes and columns are the samples. I'm using "library(biomaRt)" to get ensembl symbols and hgnc symbols. When trying to change the rownames from enemble to hgnc symbols I get an error which stems from duplicates in hgnc symbols the way I understand it.
The error I get:
"Error in
.rowNamesDF<-
(x, value = value) : duplicate 'row.names' are not allowed In addition: Warning message: non-unique values when setting 'row.names': ‘’, ‘ABCF2’, ‘LINC01238’, ‘POLR2J3’, ‘POLR2J4’, ‘TBCE"
How can I solve this issue?
EDITED:
using .rowNamesDF(TPM_countdata, make.names=TRUE)
I've managed to force row names to be hgnc coded but I don't understand the reason it creates duplicates initially and not unique names of hgnc.
Thanks and all the best
data screenshot https://ibb.co/8M853bm