I download dataset,
geoq <- getGEO("GSE9514")
At the end, the warning message shown,
Warning message:
In readLines(fname) :
incomplete final line found on '/var/folders/k2/kdrnsbws5gz8vrt83yjmlbdm0000gn/T//Rtmp6T1Fwv/GSE9514_series_matrix.txt.gz'
I the dataset downloaded is 9.2 MB, but the teaching video (in edX PH525x Data Analysis for Genomics) shown that it is 9.9 MB
Besides, when look into the dataset using dim(e) (after e <- geo[[1]]), the features of my dataset shows that it is 4370 only, but the video shows 9335. Besides, I use pData(e)$data_row_count the features in each column is 9335. Apparently, I the dataset I downloaded is truncated.
How can I solve this problem?
> sessionInfo()
R version 3.0.2 (2013-09-25)
Platform: x86_64-apple-darwin10.8.0 (64-bit)
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] parallel stats graphics grDevices utils datasets methods
[8] base
other attached packages:
[1] GEOquery_2.28.0 GenomicRanges_1.14.4 XVector_0.2.0
[4] IRanges_1.20.7 Biobase_2.22.0 BiocGenerics_0.8.0
loaded via a namespace (and not attached):
[1] RCurl_1.95-4.3 stats4_3.0.2 tools_3.0.2 XML_3.95-0.2