Problem acessing indexed information in hdf5 database
1
0
Entering edit mode
@maria-pedroto-6482
Last seen 9.6 years ago
Hello, I'm trying to use the rhdf5 package to read and iterate in a hdf5 database. The structure of the database is as follows: » h5ls(filename, datasetinfo=TRUE) group name otype dclass dim 0 / _i_events H5I_GROUP 1 /_i_events msisdn H5I_GROUP 2 /_i_events/msisdn abounds H5I_DATASET INTEGER 105408 3 /_i_events/msisdn bounds H5I_DATASET INTEGER 287 x 366 4 /_i_events/msisdn indices H5I_DATASET INTEGER 4718592 x 366 5 /_i_events/msisdn indicesLR H5I_DATASET INTEGER 4718592 6 /_i_events/msisdn mbounds H5I_DATASET INTEGER 105408 7 /_i_events/msisdn mranges H5I_DATASET INTEGER 366 8 /_i_events/msisdn ranges H5I_DATASET INTEGER 2 x 366 9 /_i_events/msisdn sorted H5I_DATASET INTEGER 4718592 x 366 10 /_i_events/msisdn sortedLR H5I_DATASET INTEGER 4718881 11 /_i_events/msisdn zbounds H5I_DATASET INTEGER 105408 12 /_i_events timestamp H5I_GROUP 13 /_i_events/timestamp abounds H5I_DATASET STRING 105408 14 /_i_events/timestamp bounds H5I_DATASET STRING 287 x 366 15 /_i_events/timestamp indices H5I_DATASET INTEGER 4718592 x 366 16 /_i_events/timestamp indicesLR H5I_DATASET INTEGER 4718592 17 /_i_events/timestamp mbounds H5I_DATASET STRING 105408 18 /_i_events/timestamp mranges H5I_DATASET STRING 366 19 /_i_events/timestamp ranges H5I_DATASET STRING 2 x 366 20 /_i_events/timestamp sorted H5I_DATASET STRING 4718592 x 366 21 /_i_events/timestamp sortedLR H5I_DATASET STRING 4718881 22 /_i_events/timestamp zbounds H5I_DATASET STRING 105408 23 / events H5I_DATASET COMPOUND 1729572595. I'm not being able to find out how to use the indexes to find the information I need. That is, I think i need to access the timestamp index and return a value to be inserted in the h5read function at the index field. Dunnow if i'm using the best function because I haven't found on the web an example so complicated. Best regards, Maria Pedroto [[alternative HTML version deleted]]
rhdf5 rhdf5 • 1.6k views
ADD COMMENT
0
Entering edit mode
Bernd Fischer ▴ 550
@bernd-fischer-5348
Last seen 7.3 years ago
Germany / Heidelberg / DKFZ
Dear Maria! You can access the different datasets in you HDF5 file by > A = h5read(file=filename, name="/_i_events/timestamp/abounds") > A Or you can access a subset (e.g. elements 101 to 110) of a dataset by > A = h5read(file=filename, name="/_i_events/timestamp/abounds", index=list(101:110)) > A Or a subset of columns from a two dimensional dataset by > B = h5read(file=filename, name="/_i_events/timestamp/bounds", index=list(,101:110)) > B or a subset of rows and columns by > B = h5read(file=filename, name="/_i_events/timestamp/bounds", index=list(20:25,101:110)) > B If you what to know the content of a dataset (e.g. " /_i_events/timestamp/indices") means, you may ask the data provider of the HDF5 file to describe you the file format in detail. rhdf5 provides you with methods to read a dataset, but not to interpret its content. Bernd On 02.04.2014, at 16:58, Maria Pedroto <maria.pedroto at="" gmail.com=""> wrote: > Hello, > I'm trying to use the rhdf5 package to read and iterate in a hdf5 database. > The structure of the database is as follows: > > ? h5ls(filename, datasetinfo=TRUE) > group name otype dclass dim > 0 / _i_events H5I_GROUP > 1 /_i_events msisdn H5I_GROUP > 2 /_i_events/msisdn abounds H5I_DATASET INTEGER 105408 > 3 /_i_events/msisdn bounds H5I_DATASET INTEGER 287 x 366 > 4 /_i_events/msisdn indices H5I_DATASET INTEGER 4718592 x 366 > 5 /_i_events/msisdn indicesLR H5I_DATASET INTEGER 4718592 > 6 /_i_events/msisdn mbounds H5I_DATASET INTEGER 105408 > 7 /_i_events/msisdn mranges H5I_DATASET INTEGER 366 > 8 /_i_events/msisdn ranges H5I_DATASET INTEGER 2 x 366 > 9 /_i_events/msisdn sorted H5I_DATASET INTEGER 4718592 x 366 > 10 /_i_events/msisdn sortedLR H5I_DATASET INTEGER 4718881 > 11 /_i_events/msisdn zbounds H5I_DATASET INTEGER 105408 > 12 /_i_events timestamp H5I_GROUP > 13 /_i_events/timestamp abounds H5I_DATASET STRING 105408 > 14 /_i_events/timestamp bounds H5I_DATASET STRING 287 x 366 > 15 /_i_events/timestamp indices H5I_DATASET INTEGER 4718592 x 366 > 16 /_i_events/timestamp indicesLR H5I_DATASET INTEGER 4718592 > 17 /_i_events/timestamp mbounds H5I_DATASET STRING 105408 > 18 /_i_events/timestamp mranges H5I_DATASET STRING 366 > 19 /_i_events/timestamp ranges H5I_DATASET STRING 2 x 366 > 20 /_i_events/timestamp sorted H5I_DATASET STRING 4718592 x 366 > 21 /_i_events/timestamp sortedLR H5I_DATASET STRING 4718881 > 22 /_i_events/timestamp zbounds H5I_DATASET STRING 105408 > 23 / events H5I_DATASET COMPOUND 1729572595. > > I'm not being able to find out how to use the indexes to find the > information I need. That is, I think i need to access the timestamp index > and return a value to be inserted in the h5read function at the index > field. > > Dunnow if i'm using the best function because I haven't found on the web an > example so complicated. > > Best regards, > Maria Pedroto > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT
0
Entering edit mode
Hello Mr. Bernd, first of all I appreciate for the quick answer. Secondly I believe I wasn't very clear. I apologise. My main problem is that the timestamp and msisdn data are *index datasets (please check http://stackoverflow.com/questions/1686869/searching-a-hdf5-dataset <http: stackoverflow.com="" questions="" 1686869="" searching-a-hdf5-dataset="">) .* But I'm a bit puzzled about what field has the index so I can query my database (last line of the source code I sent). I mean if I select the timestamp index I don't understand in which h5i_dataset from the list below should I search for a reference to the records where I have this timestamp. /_i_events timestamp H5I_GROUP 13 /_i_events/timestamp abounds H5I_DATASET STRING 105408 14 /_i_events/timestamp bounds H5I_DATASET STRING 287 x 366 15 /_i_events/timestamp indices H5I_DATASET INTEGER 4718592 x 366 16 /_i_events/timestamp indicesLR H5I_DATASET INTEGER 4718592 17 /_i_events/timestamp mbounds H5I_DATASET STRING 105408 18 /_i_events/timestamp mranges H5I_DATASET STRING 366 19 /_i_events/timestamp ranges H5I_DATASET STRING 2 x 366 20 /_i_events/timestamp sorted H5I_DATASET STRING 4718592 x 366 21 /_i_events/timestamp sortedLR H5I_DATASET STRING 4718881 22 /_i_events/timestamp zbounds H5I_DATASET STRING 105408 After having the selected data I believe I would be able to go to the events database and return the record with the data I need. Best regards, Maria Pedroto On Wed, Apr 2, 2014 at 7:10 PM, Bernd Fischer <bernd.fischer@embl.de> wrote: > Dear Maria! > > You can access the different datasets in you HDF5 file by > > > A = h5read(file=filename, name="/_i_events/timestamp/abounds") > > A > > Or you can access a subset (e.g. elements 101 to 110) of a dataset by > > > A = h5read(file=filename, name="/_i_events/timestamp/abounds", > index=list(101:110)) > > A > > Or a subset of columns from a two dimensional dataset by > > > B = h5read(file=filename, name="/_i_events/timestamp/bounds", > index=list(,101:110)) > > B > > or a subset of rows and columns by > > > B = h5read(file=filename, name="/_i_events/timestamp/bounds", > index=list(20:25,101:110)) > > B > > If you what to know the content of a dataset (e.g. " > /_i_events/timestamp/indices") means, > you may ask the data provider of the HDF5 file to describe you the file > format in detail. > > rhdf5 provides you with methods to read a dataset, but not to interpret > its content. > > Bernd > > > On 02.04.2014, at 16:58, Maria Pedroto <maria.pedroto@gmail.com> wrote: > > > Hello, > > I'm trying to use the rhdf5 package to read and iterate in a hdf5 > database. > > The structure of the database is as follows: > > > > » h5ls(filename, datasetinfo=TRUE) > > group name otype dclass dim > > 0 / _i_events H5I_GROUP > > 1 /_i_events msisdn H5I_GROUP > > 2 /_i_events/msisdn abounds H5I_DATASET INTEGER 105408 > > 3 /_i_events/msisdn bounds H5I_DATASET INTEGER 287 x 366 > > 4 /_i_events/msisdn indices H5I_DATASET INTEGER 4718592 x 366 > > 5 /_i_events/msisdn indicesLR H5I_DATASET INTEGER 4718592 > > 6 /_i_events/msisdn mbounds H5I_DATASET INTEGER 105408 > > 7 /_i_events/msisdn mranges H5I_DATASET INTEGER 366 > > 8 /_i_events/msisdn ranges H5I_DATASET INTEGER 2 x 366 > > 9 /_i_events/msisdn sorted H5I_DATASET INTEGER 4718592 x 366 > > 10 /_i_events/msisdn sortedLR H5I_DATASET INTEGER 4718881 > > 11 /_i_events/msisdn zbounds H5I_DATASET INTEGER 105408 > > 12 /_i_events timestamp H5I_GROUP > > 13 /_i_events/timestamp abounds H5I_DATASET STRING 105408 > > 14 /_i_events/timestamp bounds H5I_DATASET STRING 287 x 366 > > 15 /_i_events/timestamp indices H5I_DATASET INTEGER 4718592 x 366 > > 16 /_i_events/timestamp indicesLR H5I_DATASET INTEGER 4718592 > > 17 /_i_events/timestamp mbounds H5I_DATASET STRING 105408 > > 18 /_i_events/timestamp mranges H5I_DATASET STRING 366 > > 19 /_i_events/timestamp ranges H5I_DATASET STRING 2 x 366 > > 20 /_i_events/timestamp sorted H5I_DATASET STRING 4718592 x 366 > > 21 /_i_events/timestamp sortedLR H5I_DATASET STRING 4718881 > > 22 /_i_events/timestamp zbounds H5I_DATASET STRING 105408 > > 23 / events H5I_DATASET COMPOUND 1729572595. > > > > I'm not being able to find out how to use the indexes to find the > > information I need. That is, I think i need to access the timestamp index > > and return a value to be inserted in the h5read function at the index > > field. > > > > Dunnow if i'm using the best function because I haven't found on the web > an > > example so complicated. > > > > Best regards, > > Maria Pedroto > > > > [[alternative HTML version deleted]] > > > > _______________________________________________ > > Bioconductor mailing list > > Bioconductor@r-project.org > > https://stat.ethz.ch/mailman/listinfo/bioconductor > > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > > [[alternative HTML version deleted]]
ADD REPLY

Login before adding your answer.

Traffic: 481 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6