I am trying to find differentially expressed genes between sample and control data using ballgown. My directory structure is below:
- extdata
  - ALL control1
    - t_data.ctab
    - i2t.ctab
    - i_data.ctab
    - e2t.ctab
    - e_data.ctab
  - ALL control2
  - ALL control3
  - ALL sample1
  - ALL sample2
  - ALL sample3
  - ALL sample4
  - ALL sample5
I want to use the stattest() function in R to find the genes that vary most between the samples (n=5) and the controls (n=3). However, I cannot load the sample and control data together, because the intron IDs (i_id) differ between samples and controls, and I get the error shown below. How should I proceed? Should I load the samples and controls into separate ballgown objects? If so, how can I test for differential expression between the two groups? If I have to process them together, I might need to delete the extra intron records from the samples, which would mean losing data. Please help.
Code and Output:
pheno_data=read.csv(file="pheno_data.csv")
`pheno_data`
id state
1 ALL control1 control
2 ALL control2 control
3 ALL control3 control
4 ALL sample1 sample
5 ALL sample2 sample
6 ALL sample3 sample
7 ALL sample4 sample
8 ALL sample5 sample
bg=ballgown(dataDir = "C:/Users/lak/Documents/extdata", samplePattern = "ALL", pData= pheno_data)
Wed Sep 09 17:21:18 2020
Wed Sep 09 17:21:19 2020: Reading linking tables
Wed Sep 09 17:21:29 2020: Reading intron data files
Wed Sep 09 17:24:12 2020: Merging intron data
Error in ballgown(dataDir = "C:/Users/lak/Documents/extdata", samplePattern = "ALL", :
**intron ids were either not the same or not in the same order across samples. double check i_data.ctab for each sample.**
In addition: Warning messages:
1: In x$i_id != intronAll[[1]]$i_id :
longer object length is not a multiple of shorter object length
2: In x$i_id != intronAll[[1]]$i_id :
longer object length is not a multiple of shorter object length
3: In x$i_id != intronAll[[1]]$i_id :
longer object length is not a multiple of shorter object length
4: In x$i_id != intronAll[[1]]$i_id :
longer object length is not a multiple of shorter object length
5: In x$i_id != intronAll[[1]]$i_id :
longer object length is not a multiple of shorter object length
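To see which folders trigger the check in the error message (`x$i_id != intronAll[[1]]$i_id`), one option is to compare the `i_id` column of every `i_data.ctab` against the first sample's. The sketch below is a minimal diagnostic using base R only. `compare_intron_ids` is a hypothetical helper, not part of ballgown, and it assumes each sample folder sits directly under the data directory and contains a tab-delimited `i_data.ctab` with an `i_id` header column.

```r
# Hypothetical helper (not part of ballgown): compare the i_id column of
# each sample's i_data.ctab against the first sample's.
compare_intron_ids <- function(data_dir, sample_pattern = "ALL") {
  # Sample folders are the immediate subdirectories whose names match the pattern
  dirs <- list.dirs(data_dir, full.names = TRUE, recursive = FALSE)
  dirs <- sort(dirs[grepl(sample_pattern, basename(dirs))])
  stopifnot(length(dirs) > 0)

  # Keep only the i_id column from each intron table
  ids <- lapply(dirs, function(d) {
    tab <- read.delim(file.path(d, "i_data.ctab"), stringsAsFactors = FALSE)
    tab$i_id
  })

  # One row per sample: intron count and whether the ids (and their order)
  # match the first sample exactly
  data.frame(
    sample = basename(dirs),
    n_introns = vapply(ids, length, integer(1)),
    same_as_first = vapply(ids, function(x) identical(x, ids[[1]]), logical(1)),
    stringsAsFactors = FALSE
  )
}
```

Calling `compare_intron_ids("C:/Users/lak/Documents/extdata")` would return one row per sample folder. Differing `n_introns` values would be consistent with the "longer object length is not a multiple of shorter object length" warnings above, since those arise when vectors of unequal length are compared.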