Hi,
I'm running the barcodeRanks() function from DropletUtils (1.2.1) and noticing some inconsistencies in the output.
I run the command like this:
br.out <- barcodeRanks(counts(single_cell_experiment))
single_cell_experiment is an object parsed from the pbmc3k dataset (2700 barcodes).
From the docs I expect br.out$rank, br.out$total and br.out$fitted to be of length 2700 corresponding to the columns of single_cell_experiment. The vectors are the right length, but I noticed the labels of the output vectors don't match the input SCE (or each other):
> head(colData(single_cell_experiment))
DataFrame with 6 rows and 4 columns
                      Sample          Barcode barcodeRank barcodeTotal
                 <character>      <character>   <numeric>    <numeric>
AAACATACAACCAC-1  /test_data AAACATACAACCAC-1        2599           NA
AAACATTGAGCTAC-1  /test_data AAACATTGAGCTAC-1      2322.5         2149
AAACATTGATCAGC-1   test_data AAACATTGATCAGC-1         109         1566
AAACCGTGCTTCCG-1   test_data AAACCGTGCTTCCG-1         836           NA
AAACCGTGTATGCG-1  /test_data AAACCGTGTATGCG-1       966.5           NA
AAACGCACTGGTAC-1  /test_data AAACGCACTGGTAC-1       177.5         2211
> head(br.out$rank)
TTACTCGAACGTTG-1 AGAGGTCTACAGCT-1 GGCACGTGTGAGAA-1 GCGAAGGAGAGCTT-1 ACGAACTGGCTATG-1 GGGCCAACCTTGGA-1
          1042.0             77.0            429.0            784.5           2555.5           1415.0
> head(br.out$total)
CCAGTCTGCGGAGA-1 TTACTCGAACGTTG-1 AGAGGTCTACAGCT-1 GGCACGTGTGAGAA-1 GCGAAGGAGAGCTT-1 ACGAACTGGCTATG-1
            2421             4903             3149             2639              981             2164
If I try to match on the vector names, I discover there are multiple entries for some single_cell_experiment barcodes and none for others.
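For reference, the kind of name check described above can be sketched in base R on toy data. The barcodes and values below are made up for illustration (they are not from pbmc3k), and the variable names barcodes and rank_vec are hypothetical stand-ins for colnames(single_cell_experiment) and br.out$rank.

```r
# Toy stand-ins (hypothetical values, not the real pbmc3k data).
# 'barcodes' plays the role of colnames(single_cell_experiment).
barcodes <- c("AAAC-1", "AAAG-1", "AAAT-1", "AACA-1")

# 'rank_vec' plays the role of br.out$rank: a named numeric vector
# whose labels do not line up with the input barcodes.
rank_vec <- c("AAAG-1" = 3, "AAAG-1" = 1, "AACA-1" = 2, "AAAC-1" = 4)

# Labels that appear more than once in the output vector.
duplicated_labels <- unique(names(rank_vec)[duplicated(names(rank_vec))])

# Input barcodes that never appear among the output labels.
missing_labels <- setdiff(barcodes, names(rank_vec))

# TRUE only if the labels match the input barcodes one-to-one and in order.
labels_match <- identical(names(rank_vec), barcodes)

print(duplicated_labels)
print(missing_labels)
print(labels_match)
```

On the real objects, the same three checks would use colnames(single_cell_experiment) in place of barcodes and br.out$rank (or br.out$total) in place of rank_vec.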
All this to say, I suspect the vector labels in br.out are meaningless relics. If so, they should probably be unset to prevent confusion ;-).
Thanks,
Jon

Great, thanks.