Any pointers on simulating cell outliers?
luke.zappia ▴ 50
@lukezappia-11973

Received via GitHub https://github.com/Oshlack/splatter/issues/47

I was wondering if it is possible to use Splatter to simulate cell outliers.

In the documentation of Splatter, there are expression outlier parameters, but I did not find any specific information on cell outliers. Excuse me if I missed anything. I only searched the keyword "outlier" in the documentation.

I thought about generating very small DE groups with large DE scale in Splatter as cell outliers, but I am not sure if it is appropriate or not.

I am trying to benchmark methods for detecting cell outliers. Although it is not very meaningful to identify a very small number of outliers in scRNA-seq datasets that usually contain thousands of cells, it may be a straightforward approach to reduce the noise in the datasets.

Any pointers would be appreciated!

By the way, I have been using Splatter to benchmark clustering methods, and it worked very smoothly. I appreciate your efforts on making the package reliable and easy to use.

luke.zappia ▴ 50
@lukezappia-11973

Cell outliers aren't part of the current model for the Splat simulation. I think your idea of having some groups with very small probabilities and relatively large DE factors is probably a good approach to try. Maybe something like:

```
library(splatter)

# Three "real" groups (0.5, 0.4, 0.09) plus ten rare "outlier" groups (0.001 each)
sim <- splatSimulateGroups(group.prob = c(0.5, 0.4, 0.09, rep(0.001, 10)))
```

That would give you three "real" clusters and ten kinds of "outlier" cells. You will probably need to play around with the exact probabilities to get something that looks like what you want, and you might want to do something similar with the DE parameters.
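To make the DE part of this more concrete, here is a rough sketch of how per-group DE parameters could be set alongside the group probabilities. The specific values (`batchCells = 10000`, the `de.prob` and `de.facLoc` numbers, the seed) and the variable names are assumptions chosen only for illustration, not recommended settings. Note that with the default of 100 cells, a group probability of 0.001 will usually produce no cells at all, so the sketch increases the cell count.

```
library(splatter)

# Three "real" groups followed by ten rare "outlier" groups
n_real <- 3
n_outlier <- 10
group_prob <- c(0.5, 0.4, 0.09, rep(0.001, n_outlier))

# Illustrative (assumed) values: outlier groups get more DE genes
# and larger DE factors than the real groups
de_prob <- c(rep(0.1, n_real), rep(0.3, n_outlier))
de_fac_loc <- c(rep(0.1, n_real), rep(1, n_outlier))

# Set the group probabilities first so the per-group DE vectors
# match the number of groups
params <- newSplatParams(batchCells = 10000, group.prob = group_prob, seed = 1)
params <- setParams(params, de.prob = de_prob, de.facLoc = de_fac_loc)

sim <- splatSimulateGroups(params, verbose = FALSE)

# Label cells from the rare groups as outliers for benchmarking
group_id <- as.integer(sub("Group", "", sim$Group))
is_outlier <- group_id > n_real
table(is_outlier)
```

Because cells are assigned to groups at random, the number of cells in each outlier group will vary between runs, and some groups may end up empty. It is worth checking `table(sim$Group)` before using the labels as ground truth.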
