How-to-Use

On the Linux command prompt. Enter the command of your needs as the following description.

Usage: ./localConvergence [-s SbjFileName] [-o OutputFileName] [OPTIONS]

SbjFileName ........: File contains FASTA-formated subject sequences.

OutputFileName .....: File for text output.

OPTIONS:

-b Base ............: Number of NTs' as a motif.

-t MotifsInSeed ....: # of motifs in a seed.

-x MotifScore ......: Score for a matched motif.

-y mismatchedScore .: Score for a mismatched NT.

-v Threshold .......: Minimum value to accept a window.

-w WindowSize ......: Window size.

-f ExtendedWindows .: Number of extended Windows to be accepted.

-c CutoffScore .....: Minimum Score to accept this found segment.

-l SegmentLength ...: Minimum Length of the found segment to be accepted.

-p OutputOption ....: Output format. [=0,1,2,3]

[=0]: Local Convergence regions only.

[=1]: Complex output for Local Convergence regions.

[=2]: Mask Local Convergence regions as 'n' in original sequence.

[=3]: Mask Local Convergence regions in virtual sequence.

Note: The virtual sequence contains symbols only 0's and 1's.

0's indicate the Local Convergence regions corresponding

to the original DNA sequence.

Notes:

-t option gives a barrier to initiate a motif search. -t 1 has no restriction, but -t 4 requires an occurrence of four consecutive and identical motifs in a subject sequence.
-x, -y and -v options provide a rule for further extensions if on a window.
-w option shows the size of a motif observation. -w 16 gives a window size 16 as the following fashion where X=”tga”.

DNA Segment: ...tgatgaaatgagatgaaatgaaatgatgaa...
Window: ........XXaaXgaXaaXaaXXa...
The -p option files the result to various formats. See the examples of -p 0, -p 1, -p 2 and -p 3 for DNA sequence U47110 as the options -b 2 -t 2 -x 3 -y -1 -v 0 -w 16 -f 1.
Any number, which is applied to or resulted by this program, referring to the positions of subject sequence starts from 0.