1 | <tool id="rgCaCo1" name="Case Control:"> |
---|
2 | <code file="listFiles.py"/> |
---|
3 | <code file="rgCaCo_code.py"/> |
---|
4 | |
---|
5 | <description>for unrelated subjects</description> |
---|
6 | |
---|
7 | <command interpreter="python"> |
---|
8 | rgCaCo.py '$i.extra_files_path/$i.metadata.base_name' "$name" |
---|
9 | '$out_file1' '$logf' '$logf.files_path' '$gffout' |
---|
10 | </command> |
---|
11 | |
---|
12 | <inputs> |
---|
13 | <param name="i" type="data" label="RGenetics genotype data from your current history" |
---|
14 | format="pbed" /> |
---|
15 | <param name='name' type='text' size="132" value='CaseControl' label="Title for this job"/> |
---|
16 | |
---|
17 | </inputs> |
---|
18 | |
---|
19 | <outputs> |
---|
20 | <data format="tabular" name="out_file1" /> |
---|
21 | <data format="txt" name="logf" /> |
---|
22 | <data format="gff" name="gffout" /> |
---|
23 | </outputs> |
---|
24 | <tests> |
---|
25 | <test> |
---|
26 | <param name='i' value='tinywga' ftype='pbed' > |
---|
27 | <metadata name='base_name' value='tinywga' /> |
---|
28 | <composite_data value='tinywga.bim' /> |
---|
29 | <composite_data value='tinywga.bed' /> |
---|
30 | <composite_data value='tinywga.fam' /> |
---|
31 | <edit_attributes type='name' value='tinywga' /> |
---|
32 | </param> |
---|
33 | <param name='name' value='rgCaCotest1' /> |
---|
34 | <output name='out_file1' file='rgCaCotest1_CaCo.xls' ftype='tabular' compare='diff' /> |
---|
35 | <output name='logf' file='rgCaCotest1_CaCo_log.txt' ftype='txt' compare='diff' lines_diff='20' /> |
---|
36 | <output name='gffout' file='rgCaCotest1_CaCo_topTable.gff' ftype='gff' compare='diff' /> |
---|
37 | </test> |
---|
38 | </tests> |
---|
39 | <help> |
---|
40 | |
---|
41 | .. class:: infomark |
---|
42 | |
---|
43 | **Syntax** |
---|
44 | |
---|
45 | - **Genotype file** is the input case control data chosen from available library Plink binary files |
---|
46 | - **Map file** is the linkage format .map file corresponding to the genotypes in the Genotype file |
---|
47 | - **Type of test** is the kind of test statistic to report such as Armitage trend test or genotype test |
---|
48 | - **Format** determines how your data will be returned to your Galaxy workspace |
---|
49 | |
---|
50 | ----- |
---|
51 | |
---|
52 | **Summary** |
---|
53 | |
---|
54 | This tool will perform some standard statistical tests comparing subjects designated as |
---|
55 | affected (cases) and unaffected subjects (controls). To avoid bias, it is important that |
---|
56 | controls who had been affected would have been eligible for sampling as cases. This may seem |
---|
57 | odd, but it requires that the cases and controls are drawn from the same sampling frame. |
---|
58 | |
---|
59 | The armitage trend test is robust to departure from HWE and so very attractive - after all, a real disease |
---|
60 | mutation may well result in distorted HWE at least in cases. All the others are susceptible to |
---|
61 | bias in the presence of HWE departures. |
---|
62 | |
---|
63 | All of these tests are exquisitely sensitive to non-differential population stratification in cases |
---|
64 | compared to controls and this must be tested before believing any results here. Use the PCA method for |
---|
65 | 100k markers or more. |
---|
66 | |
---|
67 | If you don't see the genotype data set you want here, it can be imported using one of the methods available from |
---|
68 | the Galaxy Get Data tool page. |
---|
69 | |
---|
70 | Output format can be UCSC .bed if you want to see your |
---|
71 | results as a fully fledged UCSC track. A map file containing the chromosome and offset for each marker is required for |
---|
72 | writing this kind of output. |
---|
73 | Alternatively you can use .gg for the UCSC Genome Graphs tool which has all of the advantages |
---|
74 | of the the .bed track, plus a neat, visual front end that displays a lot of useful clues. |
---|
75 | Either of these are a very useful way of quickly getting a look |
---|
76 | at your data in full genomic context. |
---|
77 | |
---|
78 | Finally, if you can't live without |
---|
79 | spreadsheet data, choose the .xls tab delimited format. It's not a stupid binary excel file. Just a plain old tab delimited |
---|
80 | one with a header. Fortunately excel is dumb enough to open these without much protest. |
---|
81 | |
---|
82 | ----- |
---|
83 | |
---|
84 | **Attribution** |
---|
85 | |
---|
86 | When you click 'execute', this Galaxy tool will run Plink (from Shaun Purcell) for you (which is GPL, so this must be too I guess) for calculations. For full Plink |
---|
87 | attribution, source code and documentation, please see http://pngu.mgh.harvard.edu/~purcell/plink/ |
---|
88 | |
---|
89 | The Plink output files will be adjusted into UCSC compatible tracks - gg or wig, or else as tab delimited |
---|
90 | for you spreadsheet junkies out there. Python is used for all glue and data format yoga. |
---|
91 | Originally designed and written for the Rgenetics Galaxy tools by |
---|
92 | ross lazarus (ross spot lazarus ate gmail spot com), who didn't write |
---|
93 | either Galaxy or Plink but wishes he had. |
---|
94 | |
---|
95 | copyright Ross Lazarus 2007 |
---|
96 | Licensed under the terms of the LGPL as documented http://www.gnu.org/licenses/lgpl.html |
---|
97 | but is about as useful as a chocolate teapot without Plink which is GPL. |
---|
98 | |
---|
99 | I'm no lawyer, but it looks like you got GPL if you use this software. Good luck. |
---|
100 | |
---|
101 | </help> |
---|
102 | </tool> |
---|