motif by motif compute_motif_frequencies_for_all_motifs.pl $inputFile1 $inputFile2 $inputWindowSize3 $outputFile1 .. class:: infomark **What it does** This program computes the frequencies of each motif at a window size, determined by the user, in both upstream and downstream sequences flanking indels in all chromosomes. - The first input is a TABULAR format file containing the motif names and sequences, one line per motif, such that the file consists of two columns: - The left column represents the motif names - The right column represents the motif sequence, as follows:: dnaPolPauseFrameshift1 GAG dnaPolPauseFrameshift2 ACG xSites1 CCG - The second input is a TABULAR format file representing the windows of both upstream and downstream flanking sequences. It consists of multiple left columns representing the windows of the upstream flanking sequences, followed by one column representing the indels, then followed by multiple right columns representing the windows of the downstream flanking sequences, as follows:: cgaggtcagg agatcgagac catcctggct aacatggtga aatcccgtct ctactaaaaa indel aaatttatat ttataaacaa ttttaataca cctatgttta ttatacattt GCCAGTTTAT GGTCTAACAA GGAGAGAAAC AGGGGGCTGA AGGGGTTTCT TAACCTCCAG indel TTCCGGGCTC TGTCCCTAAC CCCCAGCTAG GTAAGTGGCA AAGCACTTCT CAGTGGGACC AAGCACTGAA CCACTTTGGG GAGAATCTCA CACTGGGGCC CTCTGACACC indel tatatatttt tttttttttt tttttttttt tttttttttg agatggtgtc AGAGCAGCAG CACCCACTTT TGCAGTGTGT GACGTTGGTG GAGCCATCGA AGTCTGTGCT indel GAGCCCTCCC CAGTGCTCCG AGGAGCTGCT GTTCCCCCTG GAGCTCAGAA - The third input is an integer number representing the number of windows to be considered starting from the indel and leftward for the upstream flanking sequence and, starting from the indel and rightward for the downstream flanking sequence. - The output is a TABULAR format file consisting of three columns: - The left column represents the motif name - The middle column represents the motif frequency in the specified windows of the upstream sequence flanking an indel - The right column represents the motif frequency in the specified windows of the downstream sequence flanking an indel There is line per indel in the output file, such that the total number of lines in the output file = number of motifs x number of indels. Note: The number of windows entered by the user must be a positive integer >= 1. if negative integer or 0 is entered by the user, the program will consider it as 1.