Hi,
I am new to R and DESeq2 package for RNAseq analysis. I am trying to analyze the read counts of 2 samples (M1, M2) as 3 biological triplicates (M11, M12, M13 and M21, M22, M23). I read the files into R as .csv but when I try to create a dds I get the following error. Could you please help me? Is something wrong with my data format?
Read_Counts <- read.csv("Read Counts.csv", header = TRUE)
head(Read_Counts)
ï..Gene_ID SP_18 SP_23 SP_28 SP_20 SP_25 SP_30
1 ENSG00000000003 88 45 30 70 100 151
2 ENSG00000000419 604 920 828 905 596 1047
3 ENSG00000000457 258 242 153 252 119 135
4 ENSG00000000460 77 70 51 152 76 75
5 ENSG00000000938 3074 3672 2948 5560 5434 7641
6 ENSG00000000971 4521 115 55 42 1 0
Meta_Data <- read.csv("Meta Data.csv", header = TRUE)
head(Meta_Data)
ï..Sample_ID Condition CellType
1 SP_18 M1_1 M1
2 SP_23 M1_2 M1
3 SP_28 M1_3 M1
4 SP_20 M2_1 M2
5 SP_25 M2_2 M2
6 SP_30 M2_3 M2
dds <- DESeqDataSetFromMatrix(countData = Read_Counts, colData = Meta_Data, design = ~ CellType)
Error in DESeqDataSetFromMatrix(countData = Read_Counts, colData = Meta_Data, :
ncol(countData) == nrow(colData) is not TRUE.
Thanks a lot, Kevin. I just learn as I go, bits from here and there. I just removed the column header for the gene symbols and now its working. I appreciate your advice and will keep it in my script.