![]() Of which is first, five followed by six, or six followed by five, wouldĬause a difference in the boundary position between the two. Sequence is five characters, another is six characters, then the order If coded sequences could be of variable length, then you cannot be sure The first find and on to any subsequent one if you have to cope with You would have to work with an offset in FindString() to work beyond To ensure any match begins on a boundary, such as 1, 6, 11, 16, etc. If the sequences you are looking for are allįive characters in length, then you could use the MOD(offset,5)=1 test That say that each found sequence must be positioned on a certainīoundary to be valid. Whether overlapping sequences are valid - it may be bound by rules ![]() But the DNA sequence may have its own rules as to GATAC, beginning at the first character, and CGCGT, beginning at theĥth character. ![]() Sequence that was coded GATACGCGT, you would get a match for both Two example sequences above, if you found a section of the DNA Sequence and the leading part of the other. That is, two sequences appear to both be in in the DNA sequence, butĪ portion of the DNA sequence appears to be the trailing part of one One problem to recognize is the possibility of overlapping sequences. You find you have to put the DNA sequence into a memory block, youĬan use the CompareMemoryString() function instead. Value indicates its offset from the beginning of the DNA$ string. IF the value returned is non-zero, the sequence was found, and the Space to the end of the line, then using FindString(DNA$,Sequence$). Sequence file one line at a time, discarding everything from the first Searching for one of your sequences is then just a matter of reading the Will have to find another way to hold the sequence, possibly in an array PureBasic has a maximum string length for any one string of just overĦ4,000 characters, the DNA sequence cannot be longer than that, or you Static, or could be read from the same file, or a different one. The DNA sequence can either be embedded into the program if it is Your program can easily process the whole DNA sequence. The contents of this file whenever you want. The format permits any combination of characters. You create a text file with one sequence per line, such as: Since you are talking about 8000 or so sequences, I would suggest that
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |