Were you able to find a workaround for this? I have tried running glm.LRT in RStudio Server a few times, and each time it kept running for over 30 minutes and I had to abort. Is there a workaround?
I'm having the same issue: a single call to glm.LRT() can literally take an hour or more, and I only have 50 samples. I do not understand how they could release a library that takes this long to do DEG on ~800 genes.
If someone has developed a workaround, even one that involves modifying the glm.LRT() function in the NanoStringDiff source code (which I have no problem doing), I would really appreciate it!
I've given up on NanoStringDiff; it's basically unusable and can take HOURS just to do a simple DEG analysis. The authors do mention in their paper that the algorithm can be slow when the dataset contains counts >1000, but this is worse than slow: the library simply isn't usable. They should fix their algorithm, period.
I had the same problem, and I figured out that it was because the positive controls were not ordered in the raw data I got from the client (POS_C(8), POS_A(128), POS_F(0.125), POS_D(2), POS_B(32), POS_E(0.5)). NanoStringDiff assumes that the positive controls are ordered. I would recommend ordering the data by Code.Class and Name; see the sketch below. After this change, the analysis for ~800 genes and ~20 samples took 30 minutes at most.
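For what it's worth, here is a minimal R sketch of that fix, assuming the raw counts sit in a CSV laid out the usual way for NanoStringDiff (Code.Class, Name, Accession, then one count column per sample). The file names, group labels, and design matrix below are placeholders; the downstream calls just follow the standard vignette workflow.

```r
library(NanoStringDiff)

## Read the raw counts (assumed columns: Code.Class, Name, Accession, <samples>)
raw <- read.csv("raw_counts.csv", stringsAsFactors = FALSE)

## Reorder rows by Code.Class and Name so the positive controls come out as
## POS_A, POS_B, ..., POS_F, the order NanoStringDiff assumes
raw_ordered <- raw[order(raw$Code.Class, raw$Name), ]
write.csv(raw_ordered, "raw_counts_ordered.csv", row.names = FALSE)

## Placeholder design: adjust to your own samples and groups
designs <- data.frame(group = c(rep("control", 10), rep("treatment", 10)))

## Build the NanoStringSet from the reordered file and run the usual workflow
NanoStringData <- createNanoStringSetFromCsv("raw_counts_ordered.csv",
                                             header = TRUE, designs)
NanoStringData <- estNormalizationFactors(NanoStringData)

group <- pData(NanoStringData)$group
design.full <- model.matrix(~ 0 + group)
result <- glm.LRT(NanoStringData, design.full, contrast = c(1, -1))
```

The only substantive change from the documented workflow is the order() call on Code.Class and Name before the data ever reach createNanoStringSetFromCsv().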