Question

Killed 9 on macOS Mojave for Rsubread buildindex

0

Entering edit mode

raabec • 0

@raabec-21046

Last seen 4.9 years ago

Hi,

I am trying to index the human genome with buildindex from the Rsubread package. However, I receive a fatal error with macOSX Mojave.

The genome has been downloaded from the UCSC server and all chromosomes are combined to a single fastA file. Repeats are softmasked.

# path to genome file
ref = normalizePath(path ="~/Desktop/work/fusion/work/alignments/storrage/hsa.fa")
# building the index
buildindex(basename = "hsa_index", reference = ref) 

//================================= Running ==================================\\
||                                                                            ||
|| Check the integrity of provided reference sequences ...                    ||
|| No format issues were found                                                ||
|| Scan uninformative subreads in reference sequences ...                     ||
|| 676060 uninformative subreads were found.                                  ||
|| These subreads were excluded from index building.                          ||
|| Build the index...                                                         ||
||    8%,  24 mins elapsed, rate=178.8k bps/s                                 ||
||   16%,  38 mins elapsed, rate=223.6k bps/s                                 ||
||   24%,  55 mins elapsed, rate=232.0k bps/s                                 ||
||   33%,  75 mins elapsed, rate=227.1k bps/s                                 ||
||   41%, 103 mins elapsed, rate=208.0k bps/s                                 ||
Killed: 9

About the Mac

MacBook Pro (Retina, 13-inch, Late 2013) Processor 2,8 GHz Intel Core i7 Memory 16 GB 1600 MHz DDR3

As of now I am not clear how to handle this problem, advice is very welcome.

Carsten

Rsubread buildindex killed 9 Mojave • 1.4k views

ADD COMMENT • link updated 4.8 years ago by Wei Shi ★ 3.6k • written 4.9 years ago by raabec • 0

score 1 · Answer 1 · 2019-06-19

1

Entering edit mode

Wei Shi ★ 3.6k

@wei-shi-2183

Last seen 17 days ago

Australia/Melbourne/Olivia Newton-John …

Your index building process was likely to be terminated by the system due to large amount of memory used by buildindex for building a full index for human genome.

You may use the following command to build a gapped index to save memory. This will result in an increase in read mapping time but it is still fast.

buildindex(basename = "hsa_index", reference = ref, gappedIndex=TRUE)

ADD COMMENT • link 4.9 years ago Wei Shi ★ 3.6k

0

Entering edit mode

Great. Worked very well. What would be the suggested memory requirements to effectively work with the Rsubread package and the mouse and human genomes.

Thanks in advance

Carsten

ADD REPLY • link 4.9 years ago raabec • 0

score 0 · Answer 2 · 2019-06-23

To build a full index for human/mouse genome, buildindex function will need 15GB of memory. Mapping functions in Rsubread including align and subjunc will need 17.8GB of memory to perform read alignment using this full index. This enables fastest mapping speed, but it is recommended for use on a unix server due to relatively large memory use.

To use Rsubread in a personal computer that typically has 16GB of memory or less, a gapped index is recommended. buildindex only needs 5.7GB of memory to build a gapped index for human/mouse genome. align will need 8.2GB of memory for mapping with this index and subjunc will need 8.8GB of memory.

Memory use for index building and read mapping can be further reduced by building a split index using the indexSplit and memory options in buildindex.