Sporadic missingness with admixmmap
1
0
Entering edit mode
@genomic_region-13050
Last seen 3.1 years ago

I'm running into an issue with sporadic missingness when using admixmmap function from Genesis.

Column `rsids` joining factor and character vector, coercing into character vector
Reading in Phenotype and Covariate Data...
Fitting Model with 7485 Samples
Computing Variance Component Estimates using AIREML Procedure...
Sigma^2_A     Sigma^2_E     logLik     RSS
Running analysis with 7485 Samples and 43221 SNPs
Beginning Calculations...
Block 1 of 9 Completed - 1.645 mins
Block 2 of 9 Completed - 1.128 mins
Block 3 of 9 Completed - 38.08 secs
Block 4 of 9 Completed - 47.14 secs
Block 5 of 9 Completed - 42.99 secs
Block 6 of 9 Completed - 1.203 mins
Error in prepareGenotype(genoData = admixDataList[[i]], snp.read.idx = snp.include$index[bidx],  :
  genoData has sporadic missingness in block size > 1
Calls: admixMapMM -> prepareGenotype
Execution halted

I'm left with one chromosome to wrap up the analysis.
Is it possible to identify region the function is giving issues for missingness? Is it due to large number of SNPs, or low number of SNPs, or high linkage equilibrium?

SNPRelate_1.12.2    argparse_2.0.1      GENESIS_2.8.1     dplyr_0.8.0.1       gdsfmt_1.14.1       GWASTools_1.24.1     Biobase_2.38.0      BiocGenerics_0.24.0  
R version 3.4.2 (2017-09-28)

Thanking you in advance.

genesis admixmap • 1.0k views
ADD COMMENT
1
Entering edit mode
@stephanie-m-gogarten-5121
Last seen 15 hours ago
University of Washington

admixMapMM cannot handle missing genotype values in the input data. The fastest way to identify which SNPs have missing values is to use the SNPRelate function snpgdsSNPRateFreq, which returns missing call rate as well as allele frequency. You could then exclude those SNPs from your analysis.

Also, your versions of R and Bioconductor are two years old, so I would recommend upgrading to the current versions. In GENESIS 2.14.3, admixMapMM has been replaced by a new function admixMap which will give you an warning (instead of an error) and return NA for any blocks which contain missing values.

ADD COMMENT
0
Entering edit mode

I've small concern. I perform analysis using genetic related matrix (GRM) to adjust for random effect. Data size is 10K individuals. Here, the most time consuming step is fitting null-model for each CHR. I submit analysis separately for each chromosome.

Now, my model needs to adjust for another matrix, that is, I've two matrices as random effect. Is it possible to store output of fitted null model in an R object (.rd or such) that would allow to read it and use when performing analysis. This would cull the most time intensive process every time, thus, expediting the analysis. I hope my concern isn't confusing. I'm using libraries tied with R v3.5 and admixMap function.

ADD REPLY
1
Entering edit mode

Yes, this is exactly what we do in our own analyses. script 1:

nullmod <- fitNullModel(...)
save(nullmod, file="my_null_model.RData")

script 2:

load("my_null_model.RData")
adm <- admixMap(admixDataList, nullmod)
ADD REPLY
0
Entering edit mode

Oh wow, awesome. Thanks much :)

ADD REPLY

Login before adding your answer.

Traffic: 646 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6