How to run a CDHIT command of "BioSeqClass" package on DNAStringSet object of DNA sequences to get non-redundent set of sequences?
0
0
Entering edit mode
vinod.acear ▴ 50
@vinodacear-8884
Last seen 3.7 years ago
India

I am trying to run CDHIT , on DNAStringSet object of DNA sequences to get non-redundent set of sequences

library(Biostrings)

library("BioSeqClass")

## flank is DNAStrinSet object of equal length sequences

seq = as.character(flank)  
## Homolog reduction of whole-length sequence by cd-hit need cd-hit program;

I have downloaded latest CD-HIT programme and named its fiolder as "cd-hit" from the given link http://www.bioinformatics.org/download/cd-hit/  and given path of the given folder to cdhit.path in above command.

> reducSeq50 = hr(seq, method="cdhit", identity=0.5, cdhit.path="cd-hit")
Error in cdhitHR(seq, identity, cdhit.path) :
  cdhit.path cd-hit is not corrected

Please let me know where I am wrong.

 

 

 

 

 

CD-HIT BioStrings cdhit biostrings BioSeqClass • 1.0k views
ADD COMMENT

Login before adding your answer.

Traffic: 955 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6