Context Navigation

execute_dwt_cor_aVa_perClass.xml

リビジョン 2, 4.8 KB (コミッタ: hatakeyama, 14 年前)
import galaxy-central

行番号
1	<tool id="compute_p-values_correlation_coefficients_feature_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Correlation Coefficients for Feature Occurrences" version="1.0.0">
2	<description>between two datasets using Discrete Wavelet Transfoms</description>
3
4	<command interpreter="perl">
5	execute_dwt_cor_aVa_perClass.pl $inputFile1 $inputFile2 $outputFile1 $outputFile2
6	</command>
7
8	<inputs>
9	<param format="tabular" name="inputFile1" type="data" label="Select the first input file"/>
10	<param format="tabular" name="inputFile2" type="data" label="Select the second input file"/>
11	</inputs>
12
13	<outputs>
14	<data format="tabular" name="outputFile1"/>
15	<data format="pdf" name="outputFile2"/>
16	</outputs>
17
18	<help>
19
20	.. class:: infomark
21
22	What it does
23
24	This program generates plots and computes table matrix of coefficient correlations and p-values at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another using multiscale wavelet analysis technique.
25
26	The program assumes that the user has two sets of DNA sequences, S1 and S1, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of multiple intervals n such that n = 2^k, where k is a positive integer and k >= 1. Thus, n could be any value of the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales.
27
28	The program has two input files obtained as follows:
29
30	For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S1, and builds two tabular files representing the count results in each interval of S1 and S1. These are the input files of the program.
31
32	The program gives two output files:
33
34	- The first output file is a TABULAR format file representing the coefficient correlations and p-values for each feature at each scale.
35	- The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the coefficient correlation for that feature at every scale.
36
37	-----
38
39	.. class:: warningmark
40
41	Note
42
43	In order to obtain empirical p-values, a random perumtation test is implemented by the program, which results in the fact that the program gives slightly different results each time it is run on the same input file.
44
45	-----
46
47	Example
48
49	Counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S1 gives the following tabular file::
50
51	deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
52	269 366 330 238 1129
53	239 328 327 283 1188
54	254 351 358 297 1151
55	262 371 355 256 1107
56	254 361 352 234 1192
57	265 354 367 240 1182
58	255 359 333 235 1217
59	271 389 387 272 1241
60	240 305 341 249 1159
61	272 351 337 257 1169
62	275 351 337 233 1158
63	305 331 361 253 1172
64	277 341 343 253 1113
65	266 362 355 267 1162
66	235 326 329 241 1230
67	254 335 360 251 1172
68
69	And counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S2 gives the following tabular file::
70
71	deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
72	104 146 142 113 478
73	89 146 151 94 495
74	100 176 151 88 435
75	96 163 128 114 468
76	99 138 144 91 513
77	112 126 162 106 468
78	86 127 145 83 491
79	104 145 171 110 496
80	91 121 147 104 469
81	103 141 145 98 458
82	92 134 142 117 468
83	97 146 145 107 471
84	115 121 136 109 470
85	113 135 138 101 491
86	111 150 138 102 451
87	94 128 151 138 481
88
89
90	We notice that the number of scales here is 4 because 16 = 2^4. Running the program on the above input files gives the following output:
91
92	The first output file::
93
94	motif 1_cor 1_pval 2_cor 2_pval 3_cor 3_pval 4_cor 4_pval
95
96	deletionHoptspot 0.4 0.072 0.143 0.394 -0.667 0.244 1 0.491
97	insertionHoptspot 0.343 0.082 -0.0714 0.446 -1 0.12 1 0.502
98	dnaPolPauseFrameshift 0.617 0.004 -0.5 0.13 0.667 0.234 1 0.506
99	topoisomeraseCleavageSite -0.183 0.242 -0.286 0.256 0.333 0.353 -1 0.489
100	translinTarget 0.0167 0.503 -0.0714 0.469 1 0.136 1 0.485
101
102	The second output file:
103
104	.. image:: ../static/operation_icons/dwt_cor_aVa_1.png
105	.. image:: ../static/operation_icons/dwt_cor_aVa_2.png
106	.. image:: ../static/operation_icons/dwt_cor_aVa_3.png
107	.. image:: ../static/operation_icons/dwt_cor_aVa_4.png
108	.. image:: ../static/operation_icons/dwt_cor_aVa_5.png
109
110	</help>
111
112	</tool>

Note: リポジトリブラウザについてのヘルプは TracBrowser を参照してください。

Context Navigation

root/galaxy-central/tools/discreteWavelet/execute_dwt_cor_aVa_perClass.xml

異なるフォーマットでダウンロード: