|
Program: ldSelect All rights reserved. You have the permission to use and develop ldSelect.pl ("the
software"), provided that the following conditions are met: This software is provided ``AS IS'' and any express or implied warranties, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose are disclaimed. In particular, this disclaimer applies to any diagnostic purpose. In no event shall the authors or the University of Washington be liable for any direct, indirect, incidental, special, exemplary, or consequential damages (including, but not limited to, procurement of substitute goods or services; loss of use, data, or profits; or business interruption) however caused and on any theory of liability, whether in contract, strict liability, or tort (including negligence or otherwise) arising in any way out of the use of this software, even if advised of the possibility of such damage. |
The details of ldSelect algorithm can be referenced to the publication of Carlson et al., 2004.
The ldSelect is written in Perl programming language. It runs only via command-line input format.
ldSelect.pl
[-pb] prettybase file
<-r2> r-squared threshold, from 0.0 to 1.0, default: 0.64
<-context> a file containing genomic and sequence context for snps
<-required> a list of sites required to be tagSNPs
<-verbose> print command line args? (y or n; default in 'no')
<-freq> minor allele frequency threshold (>0.0 and <=0.5)
<-excluded> a list of sites that can not be tagSNPs
Flags in square brackets are required; flags in diamond brackets are optional.
Site Sample Allele1 Allele2
000834 D001 G G 000834 D002 G G 000834 D003 G G 000834 D004 G G 000834 D005 N N 000834 E001 G G 000834 E002 G G 000834 E003 G G 000834 E004 G G 000834 E005 G G 000963 D001 T T 000963 D002 T T 000963 D003 T T 000963 D004 T T 000963 D005 N N 000963 E001 T T 000963 E002 N N 000963 E003 G T 000963 E004 G G 000963 E005 G T
SNP-coordinate genomic-context sequence-context
644 5'-flanking unique 887 5'-flanking unique 644 5'-flanking unique 834 5'-flanking unique 3428 intron repeat 3524 intron repeat 3596 intron unique 4125 intron unique 40239 3'-flanking repeat 40265 3'-flanking repeat
D001 D002 D003 D004
644 834 3524 4125 40239
644 834 3524 4125 40239
Bin 1 total_sites: 8 average_minor_allele_frequency: 15% Bin 1 TagSnps: 11710 Bin 1 other_snps: 12954 13573 20207 26629 29517 33639 9367 Bin 2 total_sites: 7 average_minor_allele_frequency: 44% Bin 2 TagSnps: 38834 41744 Bin 2 other_snps: 39727 40147 43118 43170 43928
Bin 1 total_sites: 8 average_minor_allele_frequency: 15% Bin 1 TagSnps: UI-11710 Bin 1 other_snps: US-12954 UI-13573 UI-20207 RI-26629 UI-29517 UI-33639 US-9367 Bin 2 total_sites: 7 average_minor_allele_frequency: 44% Bin 2 TagSnps: RI-38834 RI-41744 Bin 2 other_snps: RI-39727 RI-40147 UI-43118 UI-43170 UI-43928
1. Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA. Selecting a maximally informative set of single-nucleotide polymorphisms for association analysis using linkage disequilibrium. Am J Hum Genet. 2004 Jan;74(1):106-20. Epub 2003 Dec 15.