I have trouble reading gtf files with branchpointer::gtfToExons. While the supplied example file (gencode.v26.annotation.small.gtf) can be read, my own gtf files, or any modified version of the example file, lead to "Error: subscript contains invalid names". E.g. keeping only the gene_id and transcript_id attributes from the example file renders it unreadable. I suspect that gtfToExons relies on specific attributes in the group/attribute field, but I cannot pinpoint which. I work with non-model organisms and can only provide transcript-exon information with non-public identifiers. Also, gff3 files cannot be read.

An example of a minimal gtf file that cannot be read is:

chr1    gmap    transcript      1       1000    .       +       .       transcript_id "tx1";
chr1    gmap    exon    100     900     .       +       0       transcript_id "tx1";

Any hint on how to construct my gtf files?

exons <- gtfToExons("minimal.gtf")

Error: subscript contains invalid names

sessionInfo():

Hi Frank,

Your example gtf is missing a gene_id attribute. The old code also required a transcript_type/transcript_biotype attribute and a gene_type/gene_biotype attribute. The code on GitHub (betsig/branchpointer) has been updated so that these are no longer required.
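
In case it helps, here is a rough sketch of how the missing attributes could be appended to your minimal file in base R. It is only an illustration: add_required_attributes is a hypothetical helper (not part of branchpointer), reusing the transcript ID as the gene ID assumes one transcript per gene, and "protein_coding" is a placeholder biotype. I have not confirmed that every branchpointer version accepts the result.

```r
# The two records from the question, rebuilt with tab separators as GTF requires.
gtf_lines <- c(
  paste("chr1", "gmap", "transcript", "1", "1000", ".", "+", ".",
        'transcript_id "tx1";', sep = "\t"),
  paste("chr1", "gmap", "exon", "100", "900", ".", "+", "0",
        'transcript_id "tx1";', sep = "\t")
)

# Hypothetical helper (not part of branchpointer): append gene_id, gene_type
# and transcript_type to every non-comment record that lacks a gene_id.
add_required_attributes <- function(lines, biotype = "protein_coding") {
  needs_fix <- nzchar(lines) &
    !grepl("^#", lines) &
    !grepl('gene_id "', lines, fixed = TRUE)
  # Assumption: one transcript per gene, so the transcript ID doubles as gene ID.
  tx_id <- sub('.*transcript_id "([^"]+)".*', "\\1", lines[needs_fix])
  extra <- sprintf('gene_id "%s"; gene_type "%s"; transcript_type "%s";',
                   tx_id, biotype, biotype)
  lines[needs_fix] <- paste(lines[needs_fix], extra)
  lines
}

fixed_lines <- add_required_attributes(gtf_lines)

# Write to a temporary file; pass this path to gtfToExons() to try it out.
out_file <- file.path(tempdir(), "minimal_fixed.gtf")
writeLines(fixed_lines, out_file)
```

If your transcripts group into real genes, replace the tx_id shortcut with your own transcript-to-gene mapping. Records that already carry a gene_id are left unchanged.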


