Question

'x' has "out of limits" views ,Issue with the file size!! Compareequence function is not working

0

Entering edit mode

susobhanbaidya • 0

@susobhanbaidya-13780

Last seen 6.9 years ago

Sir, It is not working according to your guidance.

My location of exdata folder is "C:\Users\SUSHOVAN\Documents\R\win-library\3.4\CRISPRseek\extdata" , where I have put two similar fasta sequences X1.fa and X2.fa I have tried to run the following code and got that error. The file fize of X1.fa and X2.fa is 1 Kb and 1Kb. variation among the sequences is about 10%-20%.

I cut and paste from X1 and X2.fa to the predefined , preexisting files rs362331C.fa and rs362331T.fa

but same problem.

What is the meaning of 'x' has "out of limits" views

search for gRNAs for input file1...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
search for gRNAs for input file2...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
[1] "Scoring ..."
Error in searchHits(gRNAs = gRNAs1, PAM = PAM, PAM.pattern = PAM.pattern, :
object 'gRNAs1' not found
In addition: Warning messages:
1: In dir.create(outputDir) :
'C:\Users\SUSHOVAN\Documents\rs362331C.fa-Aug-19-2017' already exists
2: In dir.create(outputDir) :
'C:\Users\SUSHOVAN\Documents\rs362331T.fa-Aug-19-2017' already exists
> inputFile1Path <- system.file("extdata", "rs362331C.fa", package = "CRISPRseek")
> inputFile1Path <- system.file("extdata", "rs362331C.fa", package = "CRISPRseek")
> inputFile2Path <- system.file("extdata", "rs362331T.fa", package = "CRISPRseek")
> seqs <- compare2Sequences(inputFile1Path, inputFile2Path,outputDir = outputDir , REpatternFile = REpatternFile,overwrite = TRUE)
search for gRNAs for input file1...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
search for gRNAs for input file2...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
[1] "Scoring ..."
Error in searchHits(gRNAs = gRNAs1, PAM = PAM, PAM.pattern = PAM.pattern, :
object 'gRNAs1' not found
In addition: Warning messages:
1: In dir.create(outputDir) :
'C:\Users\SUSHOVAN\Documents\rs362331C.fa-Aug-19-2017' already exists
2: In dir.create(outputDir) :

**************** Should I give very small sequence for comparing two sequence with only 1SNP.

***************** My input files were about 1Kb with 10-20 % variation (of nucleotide between those files).

Sir/Madam please help me

compilation error filesize bioconductor • 3.3k views

ADD COMMENT • link 6.9 years ago susobhanbaidya • 0

score 0 · Answer 1 · 2017-08-19

0

Entering edit mode

Julie Zhu ★ 4.3k

@julie-zhu-3596

Last seen 8 months ago

United States

Can you try these?

readDNAStringSet(inputFile1Path)

readDNAStringSet(inputFile2Path)

You should see your sequences. If not, then you need to make sure your files are in plain text format.

Alternatively, you can avoid storing input sequences in the file, and try to run the following code instead. Please replace the DNA sequences with your own two sequences.

inputFile1Path <- DNAStringSet("GACCCACGCCTGCTCCCTCATCTACTGTGTGCACTTCATCCTGGA")

names(inputFile1Path) <- "seq1"

inputFile2Path <- DNAStringSet("GACCCACGCCTGCTCCCTCATCCACTGTGTGCACTTCATCCTGGA")

names(inputFile2Path) <- "seq2"

seqs <- compare2Sequences(inputFile1Path, inputFile2Path,

    inputNames=c("Seq1", "Seq2"),

scoring.method = "CFDscore",

        outputDir = getwd(), 
        overwrite = TRUE)

Best，

Julie

ADD COMMENT • link 6.9 years ago Julie Zhu ★ 4.3k

0

Entering edit mode

Thank u Julie, but

My question was about the length of the sequences and the difference between them.

Please see the the question

**************** Should I give very small sequence for comparing two sequence with only 1SNP.

***************** My input files were about 1Kb with 10-20 % variation (of nucleotide between those files).

ADD REPLY • link 6.9 years ago susobhanbaidya • 0

0

Entering edit mode

Please try with your entire sequences. Best, Julie On Aug 19, 2017, at 11:11 AM, susobhanbaidya [bioc] <noreply@bioconductor.org<mailto:noreply@bioconductor.org>> wrote: Activity on a post you are following on support.bioconductor.org<https: support.bioconductor.org=""> User susobhanbaidya<https: support.bioconductor.org="" u="" 13780=""/> wrote Comment: 'x' has "out of limits" views ,Issue with the file size!! Compareequence function is not working <https: support.bioconductor.org="" p="" 99429="" #99439=""> : Thank u Julie, but My question was about the length of the sequences and the difference between them. Please see the the question **************** Should I give very small sequence for comparing two sequence with only 1SNP. ***************** My input files were about 1Kb with 10-20 % variation (of nucleotide between those files). ________________________________ Post tags: compilation error, filesize, bioconductor You may reply via email or visit C: 'x' has "out of limits" views ,Issue with the file size!! Compareequence functio

ADD REPLY • link 6.9 years ago Julie Zhu ★ 4.3k

score 0 · Answer 2 · 2017-08-19

Madam I tried to follow your suggestion , but same problem

*************** Should I give small sequence for comparing two sequence with only 1SNP.

***************** My input files were about 1Kb with 10-20 % variation (of nucleotide between those files).

> inputFile1Path <- DNAStringSet("TAATATTTTAAAATCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTCTCTCCTTTCACATCAAAAAGCGCGAAAGCAACCAGACGGCTGTGATCAAACCCTTCTCCAAGACATCTGAGGGACGCTACTGCGGAGGCAGCGACATAAACACGGACAACAGCAACAACAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATCACCTACCGGCCCCAAACGCCCTCTCCAATCAGCACCGTCACACAAAGAGCTAGCATCGCTTTAGCACAAAGCTCTGACGCGCAGGTCATGAGTGTTTCTACGTACATGTGCCAGCCTCCACAGCATCGCGTGCCATGTCCTGGCGGAAGCGCGTCACAGGGCTCTCTAATGGAGCAGATTAGCTGTGTGGTAAACCGCTTCACTGCTAACATCAGCGAGCTTAATAGCATGATGCTCACCAGTTCTCCTCCAGGGGGAGCTGTCGGTTTATCCACTAGCACTTCAGCACACGTCCACTCACCTGAACCATGCTGTCCACCATCTTACCGCCTGTCCCGGGAGGTTTCCCTACCATCCACTATGACAACCTATGCTGAAATACAGCCACTCCCACCGGTGGAGTCAAACGTGGGTCGAGCTGCCACCTCCTGCCATCTTTCTGGACCATCACGACTGAAAACATCTCCAACAACCAGTGGGAATGAGCTAAAAGTGGGAGTGGCATCAGCTCTGGAGTCGCCGTCCAATCAACCGGAGCTTGAGGAACTGCTGGCGCTGACTCCGCCCTCCCCCTTTAGGGACTCGGTTGGGTCAGGCAGCGCTTCACCCAGTTCTCCAACATCCGATGCTGAAACTTCAGTACTTCCGACCATCCCTAAATACGAGTGCCTGGTGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA")
> names(inputFile1Path) <- "seq1"
> inputFile2Path <- DNAStringSet("TAATATTTTAAAACCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTATCTCCTTTCACATTAAAAAGCGCGAAAGCAACCTGACAGCTGTGAACGAACCCTACTCAAAGACATCTGTTGGACGATACTGCGGCGGCAGTGACATAAACACGGACAAGTGCAACACCAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATTACCTACCTGCCCCAAACGCCCTCTCCAATCAGCACCGTCACACAAAGAGCGAGCATCGCTATAGCACAAAGCTCTGACGCGCAGGTCATGAGTGTTTCTACGTACATGTCCCAGCCTCCACAGCATCGCGTTCCATGTCCCGGCGGAAGAGCTTCACAGGGCTCTCTAATGGAGCAGATTAGCTTTGTGGTAAACCGCTACACTGCTAACATCATCGAGCTTCATAGAATGAGGCTCACCACTTCTGCTCCAGGGGGAGCTGTCAGATTATCCACTATCACTTCAGCACACGTCCAATCACCTGTACGATGCTCTCCGCCATCTTACCGCCCGTCCCGGGAGGTTTCCCTACCTTCCACTATGACAACCTATGTTGAAATTCAACCACTCCGACCGGTGGAGTCAAACGTAGGTGGAGCAGCCACCTCCTGCCATCTTTGTGGACCATCACGACTGAAAACATCTCCAACAACCAGTGGCAATGAGCTATATGTGGGACTGGCAGCAGCTCTGGACTTGTCGTCCAATTAACCGGAGCTTGAGGAACTGCTGGCGCTGCCTCCGCCCTCCCCCTTTAGGGACTCGGTTGGGTCAGGCAGCGCTTCACCCAGTTCTCCAACATCCGATGCTGAAACTTCAGTATTTCCGAGCATCCCTAAATACGAGTGCCTGGTGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA"
+ )
> names(inputFile2Path) <- "seq2"
> seqs <- compare2Sequences(inputFile1Path, inputFile2Path,inputNames=c("Seq1", "Seq2"),scoring.method = "CFDscore",outputDir = getwd(), overwrite = TRUE)
search for gRNAs for input file1...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
search for gRNAs for input file2...
Validating input ...
Searching for gRNAs ...
<simpleError in fromXStringViewsToStringSet(x, out.of.limits = out.of.limits, use.names = use.names): 'x' has "out of limits" views>
[1] "Scoring ..."
Error in searchHits(gRNAs = gRNAs1, PAM = PAM, PAM.pattern = PAM.pattern, :
object 'gRNAs1' not found

score 0 · Answer 3 · 2017-08-21

Thanks Julie for your enormous effort.

I have 2/ 3 precise question

Qs1 > Can I compare more than 2 that is 3 sequences by any other command.

Qs2. What is the meaning of Target in sequence 1 one and Target in sequence 2.

if in same xls file , we are getting common GRNAs , then why two separate file.

QS3.

What is significance of field A in xls (file2_gr7)

Qs4. Can we get common GRNAs from the given 2 input sequences.

Last and very imnportant for me

Q5: what will be the commands for only potential off target analysis of a given input sequence only.

Not species specific

I do not want chromosome and spices specific data.

gRNAFilePath <- system.file('extdata', 'testHsap_GATA1_ex2_gRNA1.fa',

+ package = 'CRISPRseek')

> results <- offTargetAnalysis(inputFilePath = gRNAFilePath,

+ findgRNAsWithREcutOnly = FALSE, REpatternFile = REpatternFile,

+ findPairedgRNAOnly = FALSE, findgRNAs = FALSE,

+ BSgenomeName = Hsapiens, chromToSearch = 'chrX',

+ txdb = TxDb.Hsapiens.UCSC.hg19.knownGene,

+ orgAnn = org.Hs.egSYMBOL,

+ max.mismatch = 1, outputDir = outputDir, overwrite = TRUE)

score 0 · Answer 4 · 2017-08-21

Susobhan,

Qs1 > Can I compare more than 2 that is 3 sequences by any other command.

Yes, you need to perform 3 separate analysis, i.e., seq1 vs seq2 and seq3, seq2 vs seq1 and seq3, seq3 vs seq1 and seq2.

Qs2. What is the meaning of Target in sequence 1 one and Target in sequence 2.

if in same xls file , we are getting common GRNAs , then why two separate file.

Please type ?compare2Sequences for more detailed description. For your convenience, here is the section relevant to you.

Value

Return a data frame with all potential gRNAs from both sequences. In addition, a tab delimited file scoresFor2InputSequences.xls is also saved in the outputDir, sorted by scoreDiff descending.

`name`	name of the gRNA
`gRNAPlusPAM`	gRNA plus PAM sequence
`targetInSeq1`	target/off-target sequence including PAM in the 1st input sequence file
`targetInSeq2`	target/off-target sequence incuding PAM in the 2nd input sequence file
`guideAlignment2Offtarget`	alignment of gRNA to the other input sequence (off-target sequence)
`offTargetStrand`	strand of the other sequence (off-target sequence) the gRNA align to
`scoreForSeq1`	score for the target sequence in the 1st input sequence file
`scoreForSeq2`	score for the target sequence in the 1st input sequence file
`mismatch.distance2PAM`	distances of mismatch to PAM, e.g., 14 means the mismatch is 14 bp away from PAM
`n.mismatch`	number of mismatches between the off-target and the gRNA
`targetSeqName`	the name of the input sequence where the target sequence is located
`scoreDiff`	scoreForSeq1 - scoreForSeq2

QS3.

What is significance of field A in xls (file2_gr7)

See above (the name field)

Qs4. Can we get common GRNAs from the given 2 input sequences.

Yes, scoreDiff = 0 will be the common ones (you only need to look at one of the output files I sent to you via email)

Last and very imnportant for me

Q5: what will be the commands for only potential off target analysis of a given input sequence only.

Not species specific

Can you please clarify? Do you mean you are not interested in the genome-wide off-target analysis?

I do not want chromosome and spices specific data.

Best regards,

Julie

score 0 · Answer 5 · 2017-08-21

Thanks Julie again,

Your previous reply was

Susobhan,

Bioconductor recently transitioned from SVN to github. I need to set up account to clone my package, do testing and make changes if needed. It will take me quite sometime maybe days if I do have a big chunk of time to do this.

Therefore, I decied to send you some results for you to look at in case you are in a hurry. Please let me know if this makes sense.Here are the code I used to generate the outputfile.

inputFile1Path <- DNAStringSet("TAATATTTTAAAATCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTCTCTCCTTTCACATCAAAAAGCGCGAAAGCAACCAGACGGCTGTGATCAAACCCTTCTCCAAGACATCTGAGGGACGCTACTGCGGAGGCAGCGATGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA")

names(inputFile1Path) <- "file1"

inputFile2Path <- DNAStringSet("TAATATTTTAAAACCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTATCTCCTTTCACATTAAAAAGCGCGAAAGCAACCTGACAGCTGTGAACGAACCCTACTCAAAGACATCTGTTGGACGATACTGCGGCGGCAGTGACATAAACACGGACAAGTGCAACACCAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATTACCTACCTGCCCCAAACGCGACACTACAGCCAAAGCTCGTCTTCATTA")

names(inputFile2Path) <- "file2"

gRNAs1 <- findgRNAs(inputFile1Path, pairOutputFile ="testPair.xls", calculategRNAEfficacy = FALSE)

if (!file.exists("gRNAs4seq1"))

dir.create(file.path(getwd(), "gRNAs4seq1"))

compare2Sequences(gRNAs1, inputFile2Path, inputNames=c("Seq1", "Seq2"),findgRNAs = c(FALSE, TRUE), searchDir = "1to2",

scoring.method = "CFDscore",outputDir = "gRNAs4seq1", overwrite = TRUE)

gRNAs2 <- findgRNAs(inputFile2Path, pairOutputFile ="testPair.xls", calculategRNAEfficacy = FALSE)

if (!file.exists("gRNAs4seq2"))

dir.create(file.path(getwd(), "gRNAs4seq2"))

compare2Sequences(gRNAs2, inputFile1Path, inputNames=c("Seq2", "Seq1"),findgRNAs = c(FALSE, TRUE), searchDir = "1to2",

scoring.method = "CFDscore",outputDir = "gRNAs4seq2", overwrite = TRUE)

Best,

Julie

Susobhan,

Bioconductor recently transitioned from SVN to github. I need to set up account to clone my package, do testing and make changes if needed. It will take me quite sometime maybe days if I do have a big chunk of time to do this.

Therefore, I decied to send you some results for you to look at in case you are in a hurry. Please let me know if this makes sense.

Here are the code I used to generate the outputfile.

**************************************************************************************************************************************

1. Yes then it is ok

I need to run the above code to compare two large sequence.

But I am following the above code but it is not running.

Giving same error that I have mentioned . The out put u sent it will help me lots.

You told you need to change few package.

Please confirm me then if it is modified.

2. I want to get all potential off-target GRNAs from a single sequence.

Where, I do not want to mention any chromosome, species etc.

Just input information will be Long Input genome sequence.

What will be the command in that case

3. Julie, can you help me to clear the concept of Off target?

What i understood from papers, materials that it is similar kind of pattern over the sequence, which is very complex to handle genome edit. because it can be wrongly cleaved to another side.

score 0 · Answer 6 · 2017-08-21

Susobhan,

Please try the following code, it should work and produce a file called scoresFor2InputSequences.xls in your output directory.

Please note that I removed one base at the beginning of your sequences. I need to fix the code in the github. However, I need wait for a day to be able to download CRISPRseek with read and write access. Will let you know once the package is updated. Meanwhile, the following approach should work (remove one base from the beginning of your sequences).

inputFile1Path <- DNAStringSet("AATATTTTAAAATCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTCTCTCCTTTCACATCAAAAAGCGCGAAAGCAACCAGACGGCTGTGATCAAACCCTTCTCCAAGACATCTGAGGGACGCTACTGCGGAGGCAGCGACATAAACACGGACAACAGCAACAACAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATCACCTACCGGCCCCAAACGCCCTCTCCAATCAGCACCGTCACACAAAGAGCTAGCATCGCTTTAGCACAAAGCTCTGACGCGCAGGTCATGAGTGTTTCTACGTACATGTGCCAGCCTCCACAGCATCGCGTGCCATGTCCTGGCGGAAGCGCGTCACAGGGCTCTCTAATGGAGCAGATTAGCTGTGTGGTAAACCGCTTCACTGCTAACATCAGCGAGCTTAATAGCATGATGCTCACCAGTTCTCCTCCAGGGGGAGCTGTCGGTTTATCCACTAGCACTTCAGCACACGTCCACTCACCTGAACCATGCTGTCCACCATCTTACCGCCTGTCCCGGGAGGTTTCCCTACCATCCACTATGACAACCTATGCTGAAATACAGCCACTCCCACCGGTGGAGTCAAACGTGGGTCGAGCTGCCACCTCCTGCCATCTTTCTGGACCATCACGACTGAAAACATCTCCAACAACCAGTGGGAATGAGCTAAAAGTGGGAGTGGCATCAGCTCTGGAGTCGCCGTCCAATCAACCGGAGCTTGAGGAACTGCTGGCGCTGACTCCGCCCTCCCCCTTTAGGGACTCGGTTGGGTCAGGCAGCGCTTCACCCAGTTCTCCAACATCCGATGCTGAAACTTCAGTACTTCCGACCATCCCTAAATACGAGTGCCTGGTGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA")

names(inputFile1Path) <- "file1"

inputFile2Path <- DNAStringSet("AATATTTTAAAACCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTATCTCCTTTCACATTAAAAAGCGCGAAAGCAACCTGACAGCTGTGAACGAACCCTACTCAAAGACATCTGTTGGACGATACTGCGGCGGCAGTGACATAAACACGGACAAGTGCAACACCAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATTACCTACCTGCCCCAAACGCCCTCTCCAATCAGCACCGTCACACAAAGAGCGAGCATCGCTATAGCACAAAGCTCTGACGCGCAGGTCATGAGTGTTTCTACGTACATGTCCCAGCCTCCACAGCATCGCGTTCCATGTCCCGGCGGAAGAGCTTCACAGGGCTCTCTAATGGAGCAGATTAGCTTTGTGGTAAACCGCTACACTGCTAACATCATCGAGCTTCATAGAATGAGGCTCACCACTTCTGCTCCAGGGGGAGCTGTCAGATTATCCACTATCACTTCAGCACACGTCCAATCACCTGTACGATGCTCTCCGCCATCTTACCGCCCGTCCCGGGAGGTTTCCCTACCTTCCACTATGACAACCTATGTTGAAATTCAACCACTCCGACCGGTGGAGTCAAACGTAGGTGGAGCAGCCACCTCCTGCCATCTTTGTGGACCATCACGACTGAAAACATCTCCAACAACCAGTGGCAATGAGCTATATGTGGGACTGGCAGCAGCTCTGGACTTGTCGTCCAATTAACCGGAGCTTGAGGAACTGCTGGCGCTGCCTCCGCCCTCCCCCTTTAGGGACTCGGTTGGGTCAGGCAGCGCTTCACCCAGTTCTCCAACATCCGATGCTGAAACTTCAGTATTTCCGAGCATCCCTAAATACGAGTGCCTGGTGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA")

names(inputFile2Path) <- "file2"

seqs <- compare2Sequences(inputFile1Path, inputFile2Path, inputNames=c("Seq1", "Seq2"),scoring.method = "CFDscore",outputDir = getwd(), overwrite = TRUE)

I do not see use cases for #2.

Offtargets here means sites, not intended for cut, might get cut.

Best regards,

Julie

score 0 · Answer 7 · 2017-08-21

Your reply was

Q5: what will be the commands for only potential off target analysis of a given input sequence only.

Not species specific

Can you please clarify? Do you mean you are not interested in the genome-wide off-target analysis?

Yes

I want to get all potential off-target GRNAs from a single sequence.

Where, Just input information will be Long Input genome sequence.

What will be the command in that case .

regards

Baidya

score 0 · Answer 8 · 2017-08-22

0

Entering edit mode

Julie Zhu ★ 4.3k

@julie-zhu-3596

Last seen 8 months ago

United States

Baidya,

Please try the function compare2Sequences with

inputFile1Path and inputFile2Path containing the same sequence,  and set

searchDirection = "1to2".

Best,

Julie

ADD COMMENT • link 6.9 years ago Julie Zhu ★ 4.3k

score 0 · Answer 9 · 2017-08-22

0

Entering edit mode

susobhanbaidya • 0

@susobhanbaidya-13780

Last seen 6.9 years ago

I cant get your answer, if you do not mind, can you please write complete code for that example,

I am very new with R.

Keenly waiting for your reply

ADD COMMENT • link 6.9 years ago susobhanbaidya • 0

score 0 · Answer 10 · 2017-08-22

inputFile1Path <- DNAStringSet("AATATTTTAAAATCGGTGACGTGGGCCCAAAACGAGTGCAGTTCCAAAGGCACCCACCTGTGGCAGCGTCTCTCCTTTCACATCAAAAAGCGCGAAAGCAACCAGACGGCTGTGATCAAACCCTTCTCCAAGACATCTGAGGGACGCTACTGCGGAGGCAGCGACATAAACACGGACAACAGCAACAACAAGATGCTCTATGAAGTCACGGAGGCGGAGGAACGCTACCCAATCACCTACCGGCCCCAAACGCCCTCTCCAATCAGCACCGTCACACAAAGAGCTAGCATCGCTTTAGCACAAAGCTCTGACGCGCAGGTCATGAGTGTTTCTACGTACATGTGCCAGCCTCCACAGCATCGCGTGCCATGTCCTGGCGGAAGCGCGTCACAGGGCTCTCTAATGGAGCAGATTAGCTGTGTGGTAAACCGCTTCACTGCTAACATCAGCGAGCTTAATAGCATGATGCTCACCAGTTCTCCTCCAGGGGGAGCTGTCGGTTTATCCACTAGCACTTCAGCACACGTCCACTCACCTGAACCATGCTGTCCACCATCTTACCGCCTGTCCCGGGAGGTTTCCCTACCATCCACTATGACAACCTATGCTGAAATACAGCCACTCCCACCGGTGGAGTCAAACGTGGGTCGAGCTGCCACCTCCTGCCATCTTTCTGGACCATCACGACTGAAAACATCTCCAACAACCAGTGGGAATGAGCTAAAAGTGGGAGTGGCATCAGCTCTGGAGTCGCCGTCCAATCAACCGGAGCTTGAGGAACTGCTGGCGCTGACTCCGCCCTCCCCCTTTAGGGACTCGGTTGGGTCAGGCAGCGCTTCACCCAGTTCTCCAACATCCGATGCTGAAACTTCAGTACTTCCGACCATCCCTAAATACGAGTGCCTGGTGCTGCGACACTACAGCCAAAGCTCGTCTTCATTA")

inputFile2Path <- inputFile1Path

names(inputFile1Path) <- "file1"

names(inputFile2Path) <- "file2"

seqs <- compare2Sequences(inputFile1Path, inputFile2Path, inputNames=c("Seq1", "Seq2"),scoring.method = "CFDscore",

searchDirection = "1to2", outputDir = getwd(), overwrite = TRUE)

I think you will benefit greatly from learning some basics about R.

Best,

Julie

score 0 · Answer 11 · 2017-08-25

0

Entering edit mode

susobhanbaidya • 0

@susobhanbaidya-13780

Last seen 6.9 years ago

Thank you julie,

Can you tell me, in the column of F in xls sheet sgrna efficiency means what?

What is the scoring mechanism ( Doench Activity score or Efficiency score )

ADD COMMENT • link 6.9 years ago susobhanbaidya • 0

0

Entering edit mode

Baidya, I gave a workshop at Bioc2017 in Boston last month. Please refer page 8 of the slides at https://www.bioconductor.org/help/course-materials/2017/BioC2017/Day1/Workshops/CRISPR/CRISPRseek-GUIDEseq-Bioc2017-forPosting.pdf. BTW, the code has been enhanced to work with your sequences on the development version of CRISPRseek (1.7.3). I re-analyzed your two input sequences and please find attached the updated results. Best regards, Julie From: "susobhanbaidya [bioc]" <noreply@bioconductor.org> Reply-To: "reply+8f49d4c4+code@bioconductor.org" <reply+8f49d4c4+code@bioconductor.org> Date: Friday, August 25, 2017 at 11:00 AM To: "Zhu, Lihua (Julie)" <julie.zhu@umassmed.edu> Subject: [bioc] A: 'x' has "out of limits" views , Issue with the file size!! Compareequence functio Activity on a post you are following on support.bioconductor.org<https: support.bioconductor.org=""> User susobhanbaidya<https: support.bioconductor.org="" u="" 13780=""/> wrote Answer: 'x' has "out of limits" views ,Issue with the file size!! Compareequence function is not working <https: support.bioconductor.org="" p="" 99429="" #99728=""> : Thank you julie, Can you tell me, in the column of F in xls sheet sgrna efficiency means what? What is the scoring mechanism ________________________________ Post tags: compilation error, filesize, bioconductor You may reply via email or visit A: 'x' has "out of limits" views ,Issue with the file size!! Compareequence functio

ADD REPLY • link 6.9 years ago Julie Zhu ★ 4.3k

score 0 · Answer 12 · 2017-08-25

0

Entering edit mode

susobhanbaidya • 0

@susobhanbaidya-13780

Last seen 6.9 years ago

Yes I got your attached file.

Thank you Julie.

I understood Score difference Zero means Common GRNA.

Q1. But Score Difference One means what ?

Q2. Still I want to know the formula of sgRNA efficiency score which is fractional.

How it is coming.

Q3. High Score means what ? is it stable sgRNA chunk to be edited by CRISPR CAS9?

Is it calculated by DOENCH ACTIVITY SCORE.

I read your Workshop pdf.

There it is not mentioned

ADD COMMENT • link 6.9 years ago susobhanbaidya • 0

0

Entering edit mode

Q1. But Score Difference One means what ? Score difference of 1 means that the gRNA is unique to one of the two sequences, Q2. Still I want to know the formula of sgRNA efficiency score which is fractional. The paper has the details. 1. Doench et al., 2014 Rational design of highly active sgRNAs for CRISPR-Cas9-mediated gene inactivation. Nat Biotechnol. 2014 Sep 3. doi: 10.1038 nbt.3026 Q3. High Score means what ? is it stable sgRNA chunk to be edited by CRISPR CAS9? Is it calculated by DOENCH ACTIVITY SCORE? Yes (the reference is on slide 8) Best, Julie From: "susobhanbaidya [bioc]" <noreply@bioconductor.org> Reply-To: "reply+60d492b5+code@bioconductor.org" <reply+60d492b5+code@bioconductor.org> Date: Friday, August 25, 2017 at 12:35 PM To: "Zhu, Lihua (Julie)" <julie.zhu@umassmed.edu> Subject: [bioc] A: 'x' has "out of limits" views , Issue with the file size!! Compareequence functio Activity on a post you are following on support.bioconductor.org<https: support.bioconductor.org=""> User susobhanbaidya<https: support.bioconductor.org="" u="" 13780=""/> wrote Answer: 'x' has "out of limits" views ,Issue with the file size!! Compareequence function is not working <https: support.bioconductor.org="" p="" 99429="" #99732=""> : Yes I got your attached file. Thank you Julie. I understood Score difference Zero means Common GRNA. Q1. But Score Difference One means what ? Q2. Still I want to know the formula of sgRNA efficiency score which is fractional. How it is coming. Q3. High Score means what ? is it stable sgRNA chunk to be edited by CRISPR CAS9? Is it calculated by DOENCH ACTIVITY SCORE. I read your Workshop pdf. There it is not mentioned ________________________________ Post tags: compilation error, filesize, bioconductor You may reply via email or visit A: 'x' has "out of limits" views ,Issue with the file size!! Compareequence functio

ADD REPLY • link 6.9 years ago Julie Zhu ★ 4.3k

0

Entering edit mode

hanks Julie,

Can you please tell me about the the off target region.

I mean to say. suppose the location is 1000 for a frame of 20bp in the sequence. Now what will be the region of consideration for the off target analysis.

As far my knowledge it should be plus/minus 500bp at max.

That means I have to consider the locus from 500bp to 1500bp to find out any potential of target for the frame at location 1000.

If I am wrong please guide me.

ADD REPLY • link 6.9 years ago susobhanbaidya • 0

0

Entering edit mode

Please start a new thread with a relevant title so that others can find it easily. I suggest use "CRISPRSEEK package compare2Sequences function offTarget region" as the title . Best, Julie On Aug 29, 2017, at 6:49 AM, susobhanbaidya [bioc] <noreply@bioconductor.org<mailto:noreply@bioconductor.org>> wrote: Activity on a post you are following on support.bioconductor.org<https: support.bioconductor.org=""> User susobhanbaidya<https: support.bioconductor.org="" u="" 13780=""/> wrote Comment: 'x' has "out of limits" views ,Issue with the file size!! Compareequence function is not working <https: support.bioconductor.org="" p="" 99429="" #99791=""> : hanks Julie, Can you please tell me about the the off target region. I mean to say. suppose the location is 1000 for a frame of 20bp in the sequence. Now what will be the region of consideration for the off target analysis. As far my knowledge it should be plus/minus 500bp at max. That means I have to consider the locus from 500bp to 1500bp to find out any potential of target for the frame at location 1000. If I am wrong please guide me. ________________________________ Post tags: compilation error, filesize, bioconductor You may reply via email or visit C: 'x' has "out of limits" views ,Issue with the file size!! Compareequence functio

ADD REPLY • link 6.9 years ago Julie Zhu ★ 4.3k