Entering edit mode
zen
•
0
@zen-22107
Last seen 4.2 years ago
Hello,
I have res
output using DESeq2
which looks like:
baseMean log2FoldChange lfcSE stat pvalue padj
B_08g068390 1266.29961 -3.271902 0.4937087 -6.627191 3.421356e-11 6.201550e-07
I have imported gff
file for annotation using:
GFF <- makeTxDbFromGFF(file = "gene_models.gff")
gff has following columns:
##gff-version 3
B3ch01 maker CDS 218 220 . - 0 ID=B_06g024400.1.1:cds;Parent=B_06g024400;
B3ch01 maker exon 218 220 . - . ID=B_06g024400.1.1:exon:2;Parent=B_06g024400;
B3ch01 maker gene 218 220 . - . ID=B_06g024400.1;Name=B_06g024400;Note=Similar to 24400: LOW QUALITY:50S ribosomal protein;ref_id=24400;
B3ch01 maker mRNA 218 220 126 - . ID=B_06g024400.1.1;Parent=B_06g024400;Name=B_06g024400;Note=Similar to 24400.1.1: LOW QUALITY:50S ribosomal protein;ref_id=24400;
The column names for gff
file are:
seqnames ranges strand | source type score phase ID Parent Name Note ref_id Dbxref Ontology_term
I want to add gene annotations (such as LOW QUALITY:50S ribosomal protein
) for differentially expressed genes in res
from GFF
file.
I will appreciate any help. Thank you!
Thank you, I used it and got this error:
I also tried using:
But it got error:
Cool. What is the output of:
?
Thank you, here is the output:
Thank you. I am confident that you can resolve the problem from this point. please take time to review the example in the vignette to which I linked in my original answer.
Thank you, I looked at the example you shared. The information from
gff
file that I want to join toDESeq2
output is underNote
. I searched that there is aattribute
column ingff
file andNote
comes in that. But in mygff
file withcolumns(GFF)
, there is no column namedattribute
orNote
. I tried looking into all the columns from the output and did not get the information I want.Thank you, I looked at the example you shared. The information from
gff
file that I want to join toDESeq2
output is underNote
. I searched that there is aattribute
column ingff
file andNote
comes in that. But in mygff
file withcolumns(GFF)
, there is no column namedattribute
orNote
. I tried looking into all the columns from the output and did not get the information I want.Is it even reading the GFF file correctly? From what I can see, the file should be GTF or GFF3. If you do this, what output appears?
The output from above command is:
Here is an example line from gff file:
I am not sure why
Note
section is not showing up in thehead
command. Thank you!Hi, it seems that this may be a limitation of this package. I would recommend instead to read the GFF into a data frame using, for example,
ape::read.gff()
, and then annotate that way.Thank you! It does show
attributes
column fromgff
now. I tried using sameselect
method to useNote
information fromattributes
column withDESeq2
res
output file but it didn't work. Can you suggest if there is any other way to do this.Thank you! It does show
attributes
column fromgff
now. I tried using sameselect
method to useNote
information fromattributes
column withDESeq2
res
output file but it didn't work. Can you suggest if there is any other way to do this.Thank you! It does show
attributes
column fromgff
now. I tried using sameselect
method to useNote
information fromattributes
column withDESeq2
res
output file but it didn't work. Can you suggest if there is any other way to do this.If you use
ape::read.gff()
, then you cannot useselect()
- these methods are incompatible. By usingape::read.gff()
, you can then match the GFF annotation to the genes in yourres
object via other functions, such aswhich()
andmatch()
Thank you! Based on the
attributes
column fromgff
, how can I mentionID
to match and addNote
tores
dataset. I tried:Ideally I want to do,
Hi, for this secondary issue of matching up the annotation data to the DESeq2 results object, i would suggest that you take some time to go through tutorials about this online. There should be many out there. Thanks!