I have a serious problem with gene ID annotation :-)
I am converting the ensemble_ID to gene names
this is an example of IDs
ENSG00000122718
ENSG00000130201
ENSG00000150076
ENSG00000150526
ENSG00000155640
ENSG00000166748
ENSG00000168260
ENSG00000168787
ENSG00000170590
ENSG00000170803
ENSG00000171484
ENSG00000172381
ENSG00000172774
here is what I do
data = read.table("list_ensembl_gid.tsv")
colnames(data)[1] <- "ensembl_gene_id"
colnames(data)[2] <- "counts"
library('biomaRt')
hsapiens = useMart("ensembl",
dataset="hsapiens_gene_ensembl")
hsapiens_infos <- getBM(attributes=c('ensembl_gene_id',
'external_gene_name'),
mart = hsapiens)
merge_infos <- merge(x = data,
y = hsapiens_infos,
by = "ensembl_gene_id",
all.x = TRUE)
What exactly is the problem? You're using a number of packages and functions here, but haven't said what is going wrong, or what output you're currently generating.
@Mike Smith I dont have any problem with programing. The problem is that I cannot annotate them to gene name. Aparently these are old biomart database. Is there any possibility to annotate them ? can you please just run and see it by yourself?