A5A4C3A1 Color FASTA
For insertion/deletion sites, the longest form is shown in the FASTA sequence, and the variation tag contains a '-' as the alternate allele.
*Display courtesy of GeneSNPs
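The variation tags printed under the sequence rows, such as var(434):[G:0.05], can be read mechanically. The following is a minimal Python sketch, assuming every tag has the shape var(position):[allele:number] seen in this display. The names VAR_TAG, Variation and parse_variation_tags are hypothetical, and treating the number as an allele frequency is an assumption, not something the display states. An empty allele field, as in var(537):[:0.95], is returned unchanged rather than guessed.

```python
import re
from typing import List, NamedTuple

# Assumed tag shape, copied from the display: var(<position>):[<allele>:<number>]
VAR_TAG = re.compile(r"var\((\d+)\):\[([^:\]]*):(\d*\.?\d+)\]")


class Variation(NamedTuple):
    position: int   # position printed inside var(...)
    allele: str     # alternate allele; '-' marks an insertion/deletion site
    value: float    # number after the colon (assumed to be an allele frequency)

    @property
    def is_indel(self) -> bool:
        # Per the note at the top of the display, indel sites carry '-' as the allele.
        return self.allele == "-"


def parse_variation_tags(text: str) -> List[Variation]:
    """Return every variation tag found in text, in order of appearance."""
    return [
        Variation(int(pos), allele, float(value))
        for pos, allele, value in VAR_TAG.findall(text)
    ]


if __name__ == "__main__":
    sample = "450 |\nvar(434):[G:0.05]\n550 |\nvar(537):[:0.95]"
    for v in parse_variation_tags(sample):
        print(v.position, repr(v.allele), v.value, v.is_indel)
```

The parser only collects tags; it does not try to align them with the bases above, because the column layout of the original display is not preserved in this text.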
CTACTCAGTT TGATTCAGCT ATGACCCTTT CAGACATGGT CCAAATGAGA
50
GTCACGCCAG GCAAACT
T
C
T
A
T
C
A
T
A
A
A
C
A
T
C
T
A
T
T
T
C
A
T
T
T
G
A
G
C
C
A
G
A
100 | Exon 1 |
UTR
A
G
T
C
T
A
G
G
C
T
T
C
A
A
C
T
T
G
G
G
A
A
C
C
C
A
G
A
G
G
G
A
G
G
G
G
C
T
G
G
A
G
A
T
G
G
G
G
A
G
150
A
G
C
A
C
A
C
A
G
T
A
T
G
C
T
C
C
C
C
A
G
A
G
G
G
T
T
T
A
A
G
G
A
G
C
T
C
C
C
A
C
A
G
T
T
G
G
G
A
G
200
G
G
T
T
C
A
G
T
C
C
C
T
T
A
T
T
T
A
G
G
T
G
T
A
G
A
G
C
T
A
T
G
T
C
A
G
G
A
A
A
C
A
T
G
G
G
C
T
C
T
250
G
C
C
C
T
C
T
C
C
T
A
T
C
A
C
A
G
C
C
C
A
C
A
C
C
T
T
T
C
G
T
A
C
C
C
A
C
C
A
G
C
A
T
C
A
G
A
G
C
C
300
A
G
C
A
G
G
G
A
T
A
T
T
C
A
C
A
C
A
C
C
A
T
C
A
C
A
T
G
T
T
C
T
C
C
C
C
A
T
G
A
T
A
T
T
C
T
C
T
T
T
350
C
T
C
C
C
T
T
C
T
A
T
T
C
C
C
C
T
G
A
T
A
G
C
T
G
C
C
A
T
G
G
C
A
G
C
C
C
T
G
G
G
G
A
G
A
C
A
A
G
T
400
G
C
T
C
C
T
C
T
C
T
G
T
G
G
A
C
C
A
G
C
T
G
T
A
G
C
A
G
T
G
G
C
C
A
C
C
A
G
G
C
A
G
C
C
A
G
A
A
G
T
450 |
var(434):[G:0.05]
G
A
C
T
A
G
A
G
C
C
A
A
A
C
T
C
C
A
G
G
A
T
G
T
A
G
T
G
G
C
A
C
A
G
G
C
T
T
C
C
A
G
C
A
T
C
A
C
G
G
500
C
A
A
A
C
A
G
G
C
T
T
G
C
A
G
A
T
A
A
T
C
A
C
C
C
A
C
A
T
G
C
A
T
G
G
T
G
C
C
T
C
T
C
C
C
T
C
C
C
T
550 |
var(537):[:0.95]
A
C
T
C
C
C
T
C
A
C
C
C
T
T
G
A
A
T
G
G
A
G
T
A
A
C
T
C
A
T
G
A
G
C
A
T
T
C
C
C
A
A
A
T
G
A
G
C
A
C
600
T
G
G
G
A
G
G
C
T
G
G
G
T
T
T
G
C
A
A
A
G
C
C
T
G
G
T
G
A
A
T
G
T
A
A
T
G
C
A
T
C
C
A
G
A
T
T
G
G
G
650 |
var(637):[A:0.01]
G
A
G
T
C
G
C
A
G
G
A
G
G
C
T
G
G
A
T
A
T
G
C
A
G
G
A
G
A
C
A
G
C
A
G
C
C
C
C
T
T
T
G
G
T
G
G
C
C
T
700 |
var(670):[G:0.04]
C
C
C
T
G
T
C
C
T
G
C
A
C
A
G
G
A
C
C
T
T
C
C
A
C
C
C
T
C
C
A
C
C
C
A
A
C
A
G
G
C
C
A
C
T
T
T
C
A
A
750
G
G
A
C
T
G
A
A
C
C
A
T
G
C
T
A
G
A
G
G
C
T
C
A
G
A
G
C
C
A
A
G
G
C
T
C
C
C
C
A
G
A
C
A
A
G
G
A
G
C
800 |
var(752):[A:0.04]
|
var(797):[A:0.11]
T
G
G
G
A
A
T
G
G
G
C
C
T
G
G
G
C
A
G
G
T
A
G
A
T
C
C
T
C
A
G
G
G
G
T
C
C
C
C
C
A
G
A
T
G
G
C
T
G
T
850
M A S M A A V L 8
G
G
C
C
C
T
G
G
T
C
A
T
G
A
A
G
G
C
T
G
T
G
A
G
T
G
A
T
G
T
C
T
T
C
C
C
A
C
A
G
G
T
C
A
T
C
C
A
G
A
900
T W A L A L L S A F S A T Q A R 24
C
G
G
G
C
C
T
G
C
A
G
C
T
T
G
C
T
C
A
G
A
A
C
C
T
T
G
C
C
A
C
T
G
T
C
T
G
T
T
T
G
T
T
G
A
A
A
C
T
C
950
K G F W D Y F S Q T S G D K G R V 41
T
G
G
G
G
C
G
A
A
G
G
C
A
C
T
G
T
G
G
C
C
T
G
G
T
G
G
A
G
G
T
G
G
C
G
C
C
A
G
C
T
G
C
T
G
C
T
G
G
A
1000
E Q I H Q Q K M A R E P A T L K D 58
C
C
T
C
C
T
C
A
G
T
C
T
C
C
T
G
G
T
C
G
A
T
G
G
C
G
C
G
A
G
T
G
A
A
G
G
C
A
G
C
T
A
T
C
T
G
C
A
G
G
1050
S L E Q D L N N M N K F L E K L 74
T
A
G
G
T
G
T
C
C
T
G
G
C
G
G
A
A
A
G
C
C
T
G
A
A
G
T
C
G
C
T
G
G
C
G
C
A
C
C
T
C
C
T
C
G
G
A
G
A
G
1100
R P L S G S E A P R L P Q D P V G 91
C
A
T
C
T
G
G
G
G
G
T
C
C
G
G
G
C
C
G
G
C
C
C
C
T
T
C
C
T
C
A
G
T
C
C
C
A
G
T
G
C
C
T
G
C
A
A
A
G
G
1150
M R R Q L Q E E L E E V K A R L Q 108
C
T
C
T
G
C
T
G
A
G
C
T
C
T
T
C
G
C
G
C
A
G
C
T
G
G
T
C
C
A
G
G
T
T
C
T
G
C
T
G
G
A
T
G
C
G
T
G
C
G
1200
P Y M A E A H E L V G W N L E G 124
T
G
C
A
G
G
G
C
C
T
T
G
G
C
C
T
T
G
A
G
C
G
T
G
A
G
C
T
T
C
C
G
G
G
A
G
A
G
C
A
C
C
T
G
C
A
C
G
C
A
1250
L R Q Q L K P Y T M D L M E Q V A 141
G
C
G
A
C
T
G
A
G
G
C
G
C
G
C
G
G
G
G
C
T
G
G
C
G
G
G
G
G
C
G
T
G
C
G
G
A
G
C
C
A
C
A
C
T
G
C
G
G
T
1300
L R V Q E L Q E Q L R V V G E D T 158
G
C
A
G
C
T
C
C
T
G
C
A
C
G
T
G
G
C
G
C
C
C
G
A
T
G
C
C
G
C
T
C
A
C
C
A
G
G
C
T
C
T
C
G
G
C
G
T
A
T
1350
K A Q L L G G V D E A W A L L Q 174
G
G
G
T
G
G
A
A
G
A
G
C
T
C
T
T
T
G
A
A
G
C
G
G
C
C
G
G
T
G
T
G
G
T
G
C
A
C
C
A
C
G
C
G
G
C
T
C
T
G
1400
G L Q S R V V H H T G R F K E L F 191
C
A
G
T
C
C
C
T
G
C
A
G
C
A
A
A
G
C
C
C
A
A
G
C
C
T
C
G
T
C
C
A
C
G
C
C
C
C
C
C
A
G
C
A
A
C
T
G
G
G
1450
H P Y A E S L V S G I G R H V Q E 208
C
C
T
T
G
G
T
G
T
C
T
T
C
C
C
C
C
A
C
C
A
C
G
C
G
C
A
A
C
T
G
C
T
C
C
T
G
C
A
G
C
T
C
C
T
G
C
A
C
G
1500 |
var(1472):[T:0.05]
L H R S V A P H A P A S P A R L 224
C
G
C
A
G
G
G
C
C
A
C
C
T
G
C
T
C
C
A
T
C
A
G
A
T
C
C
A
T
C
G
T
G
T
A
G
G
G
C
T
T
C
A
G
T
T
G
C
T
G
1550
S R C V Q V L S R K L T L K A K A 241
C
C
G
C
A
A
G
C
C
C
T
C
C
A
A
A
T
T
C
C
A
G
C
C
C
A
C
C
A
G
C
T
C
G
T
G
C
G
C
C
T
C
T
G
C
C
A
T
G
T
1600
L H A R I Q Q N L D Q L R E E L S 258
A
G
G
G
C
T
G
G
A
G
G
C
G
A
G
C
C
T
T
C
A
C
C
T
C
C
T
C
C
A
A
C
T
C
C
T
C
C
T
G
C
A
G
C
T
G
C
C
G
C
1650
R A F A G T G T E E G A G P D P 274
C
G
C
A
T
G
C
C
C
A
C
C
G
G
G
T
C
C
T
G
T
G
G
G
A
G
C
C
G
A
G
G
A
G
C
C
T
C
G
C
T
C
C
C
A
C
T
C
A
G
1700
Q M L S E E V R Q R L Q A F R Q D 291
A
G
G
C
C
T
C
A
G
C
T
T
T
T
C
C
A
G
G
A
A
C
T
T
G
T
T
C
A
T
A
T
T
G
T
T
G
A
G
G
T
C
T
T
G
C
T
C
A
A
1750
T Y L Q I A A F T R A I D Q E T E 308
G
G
C
T
G
T
C
T
T
T
C
A
G
G
G
T
C
CTG GAGAAGGGGA CAGATATCCA GGCCGTCAGA
1800
E V Q Q Q 313
CTGCTAGCC
C
CCATCATCTC CTTTGTCCCC AAGTCATCGC GCACTGATCC
1850 |
var(1810):[T:0.04]
TCTGGGGGAA CCAAGGACGC AGGGCGTTGG CCCCAGGGTC GAGGGCTCTT
1900
GTCCTAGCCC TGGCCAGTAA CAACTCACGC ACGAAGCACA AACACATTGC
1950
CCATACAAAT CCAGACCTAC AACACTCTCC AAAGAAACAT AGATGCGCCA
2000
CATCATCCTT TGATTCTGGG GACTGCAGCG GGCGTCCTCC CGATTGATCC
2050
CAGGTCCCGC GCCTCAGTGA GCAAGGGGGC AACAGCTACG GAGTTGTCAA
2100
GGCGGGGGCT GCAGGCAGAG GGCGCTAAAG AGCCCAGGAT GGCCGGGATC
2150
TGCAGACAGA GCTAGCACCG CTCCTTTCCT CTGTCCCAGC AGCGGCCACA
2200
GAGGTTGAGG CAGCAGAGGC AGGTCATCAT GGCATGGCCC AGCTGTCTCC
2250
TCCCTTCGCC TACACCCCTT CCCCTGGGCA CTCAC
G
C
G
G
G
C
T
C
G
C
G
A
G
C
C
2300 | Exon 2
L A P P P 318
A
T
C
T
T
C
T
G
C
T
G
A
T
G
G
A
T
C
T
G
C
T
C
C
A
C
C
C
T
G
C
C
T
T
T
G
T
C
C
C
C
G
C
T
G
G
T
C
T
G
2350 |
var(2315):[T:0.15]
P G H S A F A P E F Q Q T D S G K 335
G
C
T
G
A
A
G
T
A
G
T
C
C
C
A
G
A
A
G
C
C
T
T
T
C
C
G
T
G
C
C
T
G
G
G
T
G
G
C
C
G
A
A
A
A
C
G
CTG
2400 |
var(2391):[C:0.06]
V L S K L Q A R L D D L W E D I 351
TGGAGAGGGA CTAGGTAATC AGGG
C
CTGGG CTCTCCTCCC CCAGGGTGGA
2450 |
var(2425):[T:0.01]
CAGGGC
C
CTC TGGCCAGCCT CCACCCACAC CCCCACGTTG AAGTCAGGGT
2500 |
var(2457):[A:0.04]
CGGAGACCCA C
C
T
G
A
A
A
G
A
A
G
A
G
C
C
A
G
A
G
C
C
C
A
G
G
T
G
A
G
C
A
C
G
G
C
A
G
C
C
A
2550 | Exon 3
T H S L H D Q G H S H L G 364
T
G
C
T
T
G
C
C
A
T
T
A
T
C
T
G
C
T
CT GAGAAGACAG GTGGAGGGAG GCCTGGTTAG
2600 |
UTR
|
var(2563):[C:0.09]
D P 366
GGGAAGAAGG AGACGAAGGG ACATGGCGCA GGGGACTTGC CCAGGGGGCC
2650
TCTGCAGGGG CAACTGCCCG AAATCCTGTT ACCCCTTCCT TGGGCTGGGG
2700
AGCACAGAGC TGTTGGGGCT CAAGAGGACT GACCTAGGTG AGTCAAGGAG
2750
GCTAGGGTGT CTTCCTCA
G
A CATGGGAAGA GGGCGTGCTC TTGCTACCTC
2800 |
var(2769):[A:0.02]
AGTCACATAG CAGGGAGCGT GGTGCTCTAA CCCCTTCGCA AAGGTCCCAG
2850
ACCCCAGGAA CAGTTCTCTA GGCCACTTCT ACCACCTCTC CCCTGCCCAC
2900
CTGTCTCCCT CCCTCCCATT TCATGGTGGA AAAACTGAGC CATAATGAGG
2950
GCGAAGAGGC AACTCTGCCA AAATGTTCCA AGAGGACGTC TTAGGGGCCA
3000
CCCCAGGCTC TCCCCTGAGG CCACCTGCAA TGCCCTCCCT TAGGACTGTG
3050
ACCCCCATCC CTCTGCCCCA GCTGCTCACC TGCTCACGTC TGGGCACA
G
A
3100 |
var(3099):[A:0.01]
GAGCAGACAT TCTGCTTTAT ACTCCAGGGC CCTGAGCCTC TGGCACCAAT
3150
TGCTCTGAGT AAATACCACG TGGAAGTTCA AAAGAAGTTG ACCTCAGCTG
3200
CCTCCCAGCA CTCACCTCCT GCCCTTTCCC TGGCACCCAG AGGGTTAATG
3250
AGTGCCCTGG TATCAGGGGC TGCCCCAGTA GAGAAGTGCT TCCCAGGAGC
3300
TTTACGGGGG ATGGGGCTGA ACTCCTCACC CAGTTTCTCC CAAACCCCAT
3350
GACCTTTAAC CTTCCCACTG ACCTGCTGGC TGGCCCACCA ACAGAGAAGA
3400
ACCTGTTTGT CTGCCAAGGG CCCCTCTCTT ACACAACTAC CCAGAGTCAC
3450
TGTGTCCCAG CCGGCAAGAT GGACAGTGTT CACCTACCAG CCAGAACCCG
3500
AGCAGCCCCT GAAAGCTTCA CTACAGGTTC CGCAGGCATC CTCAGCCAGC
3550
ATTCATAGGG TTAAAGACCA ACCACATCC
C
TCTTTATGAA ACAATCCTGG
3600 |
var(3580):[T:0.06]
AACAAGCAAG GGAAGCCAGG CAGGGTGAAG ATGAGATGGC AAGAGGCATC
3650
TGGGCCAG
G
G ACTCTGAGCC CCAGGAACTG GAGCGAAAGT
A
AGATTTGCC
3700 |
var(3659):[A:0.09]
|
var(3691):[G:0.09]
CCATGAGGA
A
AAGCTGAACT CCACTC
G
CAG GGCCTCTGAG GAGAGCAAGC
3750 |
var(3710):[G:0.01]
|
var(3727):[A:0.01]
CCAAATGCTC AGATCTTCTC TGATGACACA CCCACTCCGT CTACAGTACT
3800
CATACACACG TTCACAAGCT CCCGATTCTT GGTC
C
T
A
A
A
T
G
C
A
T
C
T
T
G
A
A
3850 |
var(3835):[T:0.12]
|
REPEAT
T
C
A
A
T
C
C
C
C
T
C
T
C
C
T
C
C
A
T
T
T
C
C
A
C
T
A
C
C
A
T
C
A
T
T
G
C
A
C
C
A
G
T
T
G
T
C
T
G
T
3900
C
A
C
C
T
T
G
A
T
T
G
C
A
T
T
C
A
T
A
G
C
C
T
C
C
A
A
C
A
G
G
T
C
T
T
T
C
T
A
C
C
A
C
A
C
T
C
C
T
G
3950 |
var(3950):[A:0.10]
C
C
C
A
T
T
T
A
A
T
T
C
A
T
C
C
T
C
C
A
C
TGTGGCTCA TCCTGACTCA TTTCCAGTCT
4000
CATCTGCTGC CACATAAAAC CAC
A
GCATTC CCTGAGCCTT TATACAGGCT
4050 |
var(4024):[G:0.36]
TCCCTCTGCT TGAAATAGCC TATCCCCTGG TGAATATATA TTCATTTTTT
4100
AGAGTTAGTT TGTATTAGTT AGAATTAGAC TTGGCTGCAA GGGACATATA
4150
TATGTGTGTA TATATATACA CACACACATG TATATTTTAT ATTCTTGCAT
4200
ACATATATGT ATATATATGT GTGTGTGTAT ATATACACAT ATATATATAC
4250
AAGATAC
T
G
C
T
C
T
T
A
C
C
A
C
T
C
A
T
A
C
T
G
A
C
A
T
C
C
C
A
T
T
G
G
C
C
A
C
A
A
G
T
T
A
G
4300 |
REPEAT
T
C
A
C
A
T
G
G
C
T
A
C
A
C
T
T
A
G
C
T
G
ATATATATG TGTATATATA TATGCAAGAG
4350
AATAGCTTAA ACAAAATGGA GTCTTATTTC CTTCTCATGT AAATGTAGGC
4400
CAG
C
T
C
G
G
G
C
T
G
C
T
T
T
T
A
T
C
T
T
G
T
T
G
C
A
C
T
A
T
T
A
T
C
A
T
C
A
A
C
A
A
G
A
C
G
C
T
4450 |
REPEAT
C
A
T
A
T
C
C
A
A
G
T
T
C
C
A
G
C
T
G
C
T
T
C
A
C
C
T
C
C
A
G
C
T
A
C
C
A
A
G
T
T
C
A
C
C
T
C
C
C
A
4500
G
G
G
A
A
C
A
G
G
A
A
G
G
A
G
G
A
A
A
A
G
G
A
G
A
A
G
G
A
C
A
T
G
T
T
C
C
T
T
C
C
T
T
T
T
A
A
A
G
A
4550
C
A
C
A
T
C
C
C
A
G
A
T
A
T
T
G
C
C
A
T
T
A
C
C
A
C
T
T
G
T
A
C
T
G
A
C
A
T
C
C
C
A
T
T
G
G
C
C
A
C
4600
A
A
C
T
T
A
G
T
C
A
C
A
T
G
G
C
T
A
C
A
C
T
T
A
G
C
T
G
A
A
A
A
G
G
A
G
G
C
T
G
G
G
A
A
A
T
A
T
A
G
4650
T
T
T
T
T
A
T
T
T
T
G
G
A
T
G
G
C
T
G
T
A
T
G
C
C
T
A
G
C
T
G
A
A
A
A
A
G
G
A
C
T
C
T
A
T
T
A
C
T
C
4700
A
G
G
A
A
G
A
A
G
A
G
A
A
G
A
A
A
G
G
A
T
T
T
G
A
G
G
G
A
A
C
A
G
T
A
G
T
A
G
C
C
C
C
T
G
C
T
A
C
A
4750
CAGCTCCAGT ATCACTCTGA ATGCCTTCTT CGGCCTTCAC CTTTTTCTGC
4800
TCTGGAGATA AACTGATTCC ACATTATTCA CACAACACTG TATATTCCTG
4850
GATTATAGCA CTTACTATCT AACTGCACAA TAATTTGTTG ATGAATCCGA
4900
G
A
G
C
T
C
T
T
T
G
A
G
T
G
C
A
G
G
G
G
C
T
T
T
T
T
C
T
T
A
T
T
C
A
T
C
T
C
T
A
T
A
T
C
C
A
C
T
A
G
4950 |
REPEAT
T
A
T
G
T
T
T
T
G
C
A
C
A
G
T
G
C
C
T
A CCTACTACAT ACAAAGTTAA GTGACTG
A
A
T
5000 |
REPEAT
T
C
A
A
T
C
A
T
T
C
A
T
T
T
G
A
T
T
T
A
G
G
A
A
C
T
G
A
G
G
T
T
T
G
A
G
G
A
G
A
T
C
A
A
G
T
G
G
C
C
5050
T
G
T
C
C
A
T
C
A
T
C
A
C
G
C
A
G
T
A
A
G
A
G
T
T
G
C
A
A
C
C
A
G
A
A
T
C
A
G
A
A
C
C
C
A
G
G
T
G
C
5100
T
T
G
G
C
T
C
TGG TGGTTCAAGA GAGCAGACTA TGACCAAGTC ACAAAGGGGC
5150
TTGTGCAAGC AGTCACTGAA AGGTACATGA TGGAGAGATC ACTCA
G
G
A
G
G
5200 |
REPEAT
G
T
G
G
C
C
A
G
A
G
A
G
A
A
G
G
C
A
G
A
G
G
A
A
C
T
A
T
T
T
A
G
G
G
A
T
A
T
T
T
T
A
A
T
A
A
T
C
C
A
5250
A
G
T
G
A
G
A
A
A
T
G
A
A
G
G
C
C
T
G
A
A
G
T
A
T
G
G
C
A
T
A
G
C
A
G
T
G
A
T
G
G
T
T
G
A
T
A
G
G
T
5300
A
A
G
T
A
T
A
T
A
T
T
G
A
G
G
G
C
T
A
C
T
A
A
G
G
C
A
G
C
A
G
A
G
T
T
G
A
T
A
G
C
A
C
T
T
G
G
T
G
A
5350
C
T
G
A
T
T
G
G
A
T
G
A
G
G
A
A
G
G
T
G
A
A
G
G
A
G
A
G
G
A
A
A
A
A
A
T
C
C
C
A
A
T
G
A
T
T
C
C
C
A
5400
G
G
T
T
T
C
T
G
A
G
C
A
A
C
T
A
G
G
G
G
G
A
T
G
A
T
A
T
T
T
T
C
T
T
T
C
A
C
C
A
A
A
A
A
A
A
A
T
A
A
5450
G
C
A
G
G
T
T
G
G
G
A
C
A
G
G
A
A
G
T
G
A
G
A
G
C
T
C
A
G
A
T
T
A
A
C
A
G
G
A
G
T
C
C
T
A
A
C
T
C
G
5500
A
A
C
C
T
A
A
G
C
T
G
G
G
T
T
T
C
A
G
A
T
AGGCATAGG GCAAGCCCAG TTGAGGTCTA
5550
TTTGCTTAAT GGCTAAGAAG CCTCCTAAGA GAAAATTCAT TTGAAAGGAA
5600
AAAAAGCAAC AGGAGCTACC ATCCAGTCAC AAAACCACAG TCATCAAAAG
5650
AGAAGAGAGA CTCAGAGTAT TTGGGAAGGG AACATTTCCA GGGGTTGAAA
5700
AAATGTGGGA GTGGAGAGCC ACTGAAATTG ACTTTGGGTG ATTACTTGTA
5750
CCCACAAGCT AGTGTGGCCT TGTGCCCAAA GGCTGTCCAC
T
G
G
A
G
A
T
G
T
T
5800 |
REPEAT
C
T
G
G
C
A
G
T
T
G
G
A
T
A
C
A
T
A
T
G
T
C
T
C
A
A
G
C
C
C
A
G
G
A
G
A
G
A
A
G
T
C
T
C
G
G
C
T
G
G
5850
A
G
A
C
T
G
A
G
T
T
T
T
G
G
G
A
G
T
C
A
T
C
A
G
C
A
G
A
T
A
G
G
A
G
C
A
G
T
G
G
A
A
G
A
C
T
T
G
G
G
5900
A
G
T
A
G
A
T
G
A
A
A
T
C
T
C
T
A
G
G
G
A
G
A
G
T
A
C
A
T
G
G
C
A
G
G
A
G
A
A
A
A
G
A
A
A
A
A
C
G
C
5950
T
G
A
G
G
T
C
A
G
A
A
A
A
C
ACCAGT ATT
G
G
C
C
G
G
G
T
A
C
A
G
T
G
G
C
T
C
A
C
G
C
C
T
G
T
A
6000 |
REPEAT
A
T
C
C
C
A
A
C
A
C
T
T
T
G
G
G
A
G
A
C
C
G
A
A
G
C
A
G
G
C
G
G
A
T
C
A
C
T
T
G
A
A
G
C
C
A
A
G
A
G
6050
T
T
T
G
A
G
A
C
C
A
G
C
C
T
G
G
C
C
A
A
C
A
C
G
G
T
A
A
A
A
C
C
C
T
G
T
C
T
C
T
A
C
T
A
A
A
A
A
T
A
6100
C
A
A
A
A
A
A
A
A
T
A
G
C
C
A
G
G
T
A
T
G
A
T
G
G
C
A
C
A
C
G
C
C
T
G
T
A
A
T
C
C
T
A
G
C
T
A
C
T
T
6150
G
G
G
A
G
G
C
C
A
A
G
G
C
T
G
G
A
G
G
A
G
T
G
C
T
T
G
A
A
C
C
T
G
G
G
A
G
G
T
G
G
A
G
G
T
T
G
C
A
G
6200
T
G
A
G
C
T
G
A
G
A
T
A
G
G
G
C
C
A
C
T
G
C
A
C
T
C
C
A
G
C
C
C
G
G
G
T
G
A
C
G
G
A
G
C
G
G
G
A
C
T
6250
C
C
A
T
T
T
C
A
A
A
C
A
A
C
A
A
C
A
A
C
A
A
C
A
A
A
A
G
A
A
GCCTTTGGCA GAGAAAATTA
6300
ACGCTTTCCA AAGATCCATG TGCTGTG
C
T
G
T
A
C
T
T
C
C
C
A
G
A
C
T
C
C
T
T
T
G
C
6350 |
REPEAT
A
A
T
T
A
G
A
T
T
G
G
G
G
C
C
A
T
A
C
A
G
T
T
A
G
T
T
C
T
G
G
C
C
A
A
T
G
G
G
C
T
G
T
G
A
G
C
A
G
A
6400
A
C
A
A
A
A
T
G
G
G
T
T
A
C
T
T
C
T
G
T
T
T
G
A
G
G
A
C
A
A
T
G
A
A
G
A
G
T
G
G
A
T
G
T
G
A
G
T
T
C
6450
T
C
C
A
T
A
T
T
T
T
T
T
C
C
T
T
C
C
T
T
C
C
C
C
T
A
C
A
G
T
G
G
A
A
G
C
A
A
A
G
A
A
C
T
C
C
A
A
G
C
6500
T
G
G
C
A
C
T
A
T
T
A
C
A
A
G
A
G
G
G
A
G
A
G
G
G
T
C
T
A
T
A
T
T
C
T
G
A
G
T
T
A
G
T
G
C
T
T
G
G
A
6550
G
C
A
G
C
A
C
C
T
C
C
C
A
C
CTAAAA GAGGGGCAGA GAAGTAGAAG TATAATCAAA
6600
ATAAAGAAGA ATCACAGAAA GAAAA
T
A
T
A
T
T
A
G
T
T
A
A
A
T
T
A
T
A
T
A
T
T
T
G
G
6650 |
REPEAT
C
T
G
C
T
C
T
G
A
G
A
G
A
G
A
C
C
C
A
A
A
A
T
A
A
C
A
G
T
G
G
C
T
T
A
C
A
A
A
A
G
C
T
G
G
A
A
A
T
G
6700
T
A
T
G
T
T
T
T
C
T
C
C
C
A
T
A
C
A
A
A
A
C
T
T
C
A
C
G
C
T
G
G
T
A
T
G
G
G
A
G
C
A
C
T
G
C
T
C
C
A
6750
C
A
A
A
G
C
T
G
T
C
C
A
G
G
G
T
C
C
T
A
A
G
T
T
C
C
T
T
C
T
G
T
C
T
G
G
A
G
G
T
T
C
T
A
A
T
A
T
C
C
6800
C
T
A
T
G
T
G
T
T
T
C
C
C
T
T
G
T
C
C
A
C
A
T
G
A
T
C
T
A
A
G
A
T
A
G
C
T
T
A
C
C
A
C
C
A
T
G
T
C
C
6850
A
C
A
T
C
C
A
G
C
C
A
C
T
G
G
A
A
A
G
G
G
G
G
C
A
G
G
G
T
G
G
A
G
G
A
G
A
G
G
A
G
A
G
T
G
T
A
G
A
A
6900
T
G
T
A
C
T
C
T
C
C
T
T
T
C
T
T
T
C
T
T
T
T
T
T
T
T
T
T
T
G
A
G
A
T
G
G
A
G
T
T
T
C
G
C
T
C
T
T
C
T
6950 |
REPEAT
T
G
C
C
C
A
G
G
C
T
A
G
A
G
T
G
C
G
A
T
G
G
C
A
C
A
A
T
C
T
C
G
G
C
T
C
A
T
C
A
C
A
A
C
C
T
C
C
A
C
7000
C
T
C
C
C
A
G
G
T
T
C
A
A
G
C
G
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
G
C
T
G
G
G
A
T
7050
T
A
C
A
G
G
C
A
T
G
C
G
C
C
A
C
G
A
C
G
C
C
C
A
G
C
T
A
A
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
7100
G
A
G
G
T
T
T
C
T
C
C
A
T
G
T
T
G
G
T
C
A
G
G
C
T
A
G
T
C
T
T
G
A
A
C
T
C
C
C
G
G
C
C
T
C
A
G
G
T
G
7150
A
T
C
C
G
C
C
C
A
C
C
T
C
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
A
G
C
C
A
C
7200
T
G
C
A
C
C
C
A
G
C
C
A
C
T
C
T
C
C
T
T
T
C
T
T
T
T
A
A
G
G
A
C
A
A
A
A
C
C
T
A
A
C
T
T
C
T
T
C
T
C
7250 |
REPEAT
C
C
A
T
C
T
C
A
C
T
G
G
T
C
A
G
A
A
C
T
A
C
T
T
G
T
A
A
G
G
A
A
G
G
T
T
A
G
G
A
A
A
T
G
T
C
A
T
C
T
7300
T
T
G
T
T
T
T
G
A
A
T
G
G
G
C
C
T
G
A
G
T
T
C
A
A
T
G
A
T
C
A
G
T
T
A
A
A
A
A
T
C
A
G
G
G
A
T
T
C
T
7350
A
C
C
C
T
C
A
T
A
C
A
T
T
G
C
T
G
G
T
G
G
G
A
T
T
G
T
A
C
A
A
T
G
G
T
G
C
A
A
T
T
T
C
T
T
T
G
G
A
A
7400 |
REPEAT
A
A
C
A
G
T
T
C
G
G
C
A
G
T
T
C
A
T
T
A
A
T
G
G
T
T
A
A
A
C
A
T
G
G
A
G
G
A
G
T
T
C
T
C
C
T
A
T
G
A
7450
C
T
C
A
G
C
A
A
T
T
C
T
A
T
T
C
C
T
A
G
G
T
A
T
A
A
A
A
C
C
A
A
G
A
C
A
C
A
T
G
A
A
A
A
T
A
C
A
C
A
7500
T
C
T
G
C
A
C
A
A
A
A
A
C
T
T
G
T
G
C
A
T
T
A
A
T
G
T
T
C
A
T
G
G
C
A
A
C
A
T
T
A
T
T
C
A
T
A
A
C
A
7550
G
T
C
A
A
G
A
A
A
A
T
G
G
A
A
A
C
A
A
C
C
C
A
A
A
T
G
T
C
C
A
T
C
A
A
T
T
G
A
T
G
A
A
T
G
G
A
T
A
A
7600
A
C
A
A
A
A
T
G
T
T
A
C
C
T
A
T
C
C
A
T
A
T
A
G
T
G
G
A
A
T
A
T
T
A
T
T
T
G
G
C
A
A
T
A
A
A
A
A
G
G
7650
G
T
T
G
A
A
G
T
A
C
T
G
A
T
A
C
C
T
G
C
T
A
C
A
C
C
A
C
A
G
A
T
G
A
A
C
C
T
G
G
A
A
A
A
C
A
T
T
A
T
7700
G
C
T
A
G
G
G
G
A
A
A
A
A
A
G
C
C
A
G
T
C
A
C
A
A
A
A
G
A
C
T
A
C
A
T
G
T
T
G
T
A
A
A
A
T
C
T
C
A
T
7750
T
T
G
T
A
T
A
A
A
A
T
G
T
C
C
A
G
A
A
A
A
G
A
C
A
A
A
T
A
G
G
T
A
G
A
G
A
C
A
G
A
A
A
G
T
A
G
A
C
T
7800
G
G
T
G
G
C
T
G
C
C
T
A
G
G
G
C
T
A
T
T
G
A
G
G
G
G
A
G
G
G
G
A
G
G
T
T
G
A
T
T
G
G
G
G
A
G
A
A
A
T
7850
A
G
T
G
A
A
T
A
A
C
T
G
C
T
A
A
T
G
G
G
T
G
T
G
G
G
G
T
T
T
C
T
T
T
G
T
G
G
G
A
T
G
A
T
G
A
A
A
A
A
7900
A
G
T
T
C
T
A
A
A
C
T
T
A
G
T
T
T
G
T
G
G
T
G
A
T
A
G
T
T
G
T
A
C
A
A
A
T
C
T
G
T
G
A
A
T
A
T
A
C
T
7950
A
A
A
A
A
C
C
A
T
T
C
A
A
T
T
G
T
A
G
T
G
A
A
T
G
T
T
A
T
G
G
T
A
T
G
T
G
A
A
T
T
A
T
A
T
C
T
C
A
A
8000
T
A
A
A
G
A
T
A
T
T
T
C
T
A
A
A
A
T
C
A
G
G
G
A
T
T
T
T
A
T
A
A
C
T
G
A
G
A
A
A
G
A
A
G
G
G
A
A
G
A
8050 |
REPEAT
A
T
A
A
A
G
T
T
G
C
A
G
A
G
G
A
G
A
C
T
T
G
C
A
G
C
C
T
C
T
G
C
C
A
C
A
CCAA GGACGAAAAG
8100
GTTTCAAGAG TGAAGAATTA ATGGCTTCCC ACCATGGCAG ACTGAGCTGA
8150
GCTGAGGCTG TGTTTTCAAT GCTTTTCTAT CTCTACTC
T
T
T
T
T
T
T
T
T
T
T
T
8200 |
REPEAT
T
T
T
T
T
T
T
T
T
T
G
A
G
A
C
A
G
A
G
C
C
T
T
G
C
T
C
T
G
T
C
A
C
C
A
G
G
C
C
A
T
A
C
A
G
T
G
G
T
G
8250
T
G
A
T
C
T
C
G
G
C
T
C
A
C
T
G
C
A
A
C
T
A
C
C
G
C
C
T
C
C
C
G
G
G
T
T
C
A
A
G
C
G
A
T
T
C
T
C
T
T
8300
C
C
C
T
C
A
G
C
C
T
C
C
C
A
A
G
T
A
G
C
T
G
G
G
A
G
T
G
C
A
C
A
C
C
A
C
C
A
C
A
C
C
C
A
G
C
G
A
A
T
8350
T
T
T
T
G
T
A
T
T
T
T
T
A
A
T
A
G
A
G
A
T
G
G
A
G
T
T
T
C
A
C
C
A
T
G
T
T
G
G
C
C
A
G
G
C
T
G
A
T
C
8400
T
C
A
A
A
C
T
C
C
T
G
A
C
C
C
C
A
A
G
T
G
A
T
C
C
G
C
C
C
A
C
C
T
A
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
8450
G
G
G
A
T
T
A
C
A
G
G
T
G
T
G
A
G
C
C
A
T
C
C
C
C
C
T
G
G
C
C
TCTATCTCT ACTCCTAAAA
8500
GAACCATTTC TG
A
C
C
A
C
T
T
A
A
C
A
C
C
C
A
T
T
A
G
A
A
T
G
T
T
A
T
T
T
T
A
A
A
A
A
A
T
A
8550 |
REPEAT
A
A
A
T
A
A
A
A
G
C
C
A
A
A
A
A
T
A
G
C
A
A
G
T
G
T
G
G
G
T
G
A
A
G
A
T
G
T
G
G
A
G
A
A
G
C
T
G
G
A
8600
C
C
G
C
T
T
G
T
A
C
G
C
T
G
C
T
G
C
T
G
G
A
A
A
G
G
T
A
A
A
A
T
G
G
T
G
C
A
G
C
T
A
T
G
G
T
G
G
A
G
8650
A
A
C
A
A
T
A
C
G
G
C
A
G
T
C
C
C
T
C
A
A
A
A
A
A
T
T
T
A
A
T
C
T
A
G
A
A
T
T
A
C
C
T
T
A
T
G
A
C
C
8700
C
A
G
C
A
A
T
T
C
C
A
C
C
T
C
T
G
G
G
C
A
T
A
T
A
T
C
C
A
A
A
A
G
A
T
A
A
A
A
G
C
A
G
G
G
A
C
T
G
G
8750
A
G
C
A
G
A
T
A
T
T
T
G
T
A
T
G
C
T
C
G
T
G
T
T
C
C
T
A
G
T
A
G
T
A
T
T
A
T
C
C
A
C
A
A
T
A
A
C
C
A
8800
G
G
G
G
G
T
G
G
A
A
A
C
A
A
C
C
C
A
A
A
T
G
T
C
C
A
T
T
G
A
C
A
G
A
T
G
A
A
T
G
A
A
T
A
A
A
C
A
A
A
8850
T
T
G
T
G
G
T
A
T
A
T
A
C
A
G
A
C
A
A
C
A
A
A
A
T
A
T
C
A
C
T
T
A
G
C
C
T
T
A
A
A
T
A
A
T
G
A
C
A
A
8900
T
C
G
G
G
G
G
C
C
A
G
G
C
A
T
A
G
T
G
G
C
T
C
A
C
G
C
C
T
G
T
A
A
T
C
C
C
A
G
C
A
C
T
T
T
G
G
G
A
G
8950 |
REPEAT
G
C
C
G
A
G
A
C
G
G
G
C
A
G
A
T
C
A
C
T
T
G
A
G
G
C
C
G
G
A
A
G
T
G
C
A
A
G
A
C
C
A
G
C
C
T
G
G
C
C
9000
A
A
T
A
T
G
G
T
A
A
A
A
C
C
C
C
G
T
C
T
C
T
A
C
T
A
A
A
A
A
T
A
C
A
A
A
A
A
T
T
A
G
C
C
A
G
G
T
G
T
9050
G
G
T
G
G
T
G
C
A
T
G
C
C
T
G
T
C
A
T
T
C
C
A
G
C
T
A
C
T
T
G
G
G
A
G
G
C
T
G
A
G
G
C
A
C
G
A
G
A
A
9100
T
T
G
C
T
T
G
A
A
C
C
T
G
G
A
A
G
G
C
A
G
A
G
G
C
T
G
C
A
G
T
G
A
A
C
T
G
A
G
A
T
T
G
T
G
C
T
A
C
T
9150
G
C
C
C
T
C
C
A
G
C
C
T
G
G
G
T
G
A
C
A
G
A
G
C
G
A
G
A
C
T
G
T
G
T
C
T
C
A
A
A
A
A
A
A
A
A
A
A
A
A
9200
A
A
G
A
A
A
A
G
A
A
A
A
G
A
A
A
A
G
G
A
A
A
T
C
T
G
A
C
A
C
A
T
G
C
T
A
T
A
A
C
G
T
G
G
A
T
G
A
C
C
9250 |
REPEAT
C
T
T
G
A
A
G
A
C
A
T
G
A
T
G
C
T
A
A
G
T
G
A
A
A
C
A
A
G
C
C
A
G
T
C
G
T
A
G
A
A
C
G
A
C
A
C
A
T
A
9300
C
T
G
T
G
A
T
T
C
T
G
A
G
G
T
A
T
C
C
A
G
A
A
T
A
G
T
C
A
G
A
T
T
T
G
T
A
G
A
G
A
C
A
G
A
A
A
G
T
A
9350
T
A
A
T
T
C
T
G
G
T
T
T
C
T
A
G
G
A
G
A
A
G
T
G
A
G
G
A
A
G
G
A
G
A
A
G
G
T
A
T
T
G
G
T
T
A
A
T
A
G
9400
G
T
A
C
A
A
G
T
T
T
C
T
G
T
T
T
G
G
G
A
A
G
A
T
T
A
A
A
A
G
T
T
T
C
T
G
G
A
G
C
T
A
G
A
T
A
G
T
G
G
9450
T
G
C
A
C
A
A
C
A
G
T
G
T
A
A
A
T
G
T
A
T
T
T
A
G
T
G
C
T
A
C
T
G
G
G
C
T
A
T
A
T
A
C
T
T
A
G
C
T
A
9500
C
A
A
T
G
T
A
C
A
A
T
A
G
C
T
A
C
A
A
T
G
G
T
A
A
T
T
T
T
T
A
T
G
T
T
A
T
G
T
A
T
A
T
T
T
T
G
C
C
A
9550
C
A
A
T
T
T
A
A
A
A
A
GGAATTTCT CATACTCTAA TGGTTTTTTG CTGATAACCC
9600
AAACTTAAAG AGCCTGTGCT GTAAAATACA ACTGCGCTGG ATTGTCTTTT
9650
TGCAAATAGC ATTATCAGTG CCCTTTCTAA GGTCTTTTAT GAGATGCAGA
9700
GGGGACACTT GGAACTATTA A
T
A
C
A
T
T
T
T
C
T
A
G
A
A
T
C
C
C
T
T
C
T
C
T
A
T
A
T
G
9750 |
REPEAT
A
G
C
C
T
A
G
G
T
T
A
G
A
G
T
C
A
G
C
C
A
GCGTGAGAC GGAAGAGGAG AGGCCATTAT
9800
GTTTCAATGC CGGTTGCAGA CAGAAGCATG GACAGGCTTG GAGTTTAAAG
9850
CAGCTTCTGG GTGACCTTCC CAGGAATCAC ATGCATTGAT ATTGCAGGCA
9900
TAATAAGGCG AGCTTCCCAT TTGTAGCCAC TTACTAATGA GCGTTTGAGA
9950
GTCACTCCTT CTGAACTGCA AACCAAGGTG GTCACTCCTC
C
A
G
C
C
C
T
T
C
C
10000 |
REPEAT
A
A
G
A
G
T
T
G
G
A
T
A
A
A
C
C
T
T
T
A
A
T
T
T
T
T
T
G
T
T
T
T
A
A
A
G
C
C
T
T
T
C
A
T
G
C
T
T
G
A
10050
A
A
T
A
C
G
T
G
G
A
A
T
G
G
C
T
T
T
T
G
T
T
T
T
T
C
T
G
A
C
T
G
A
A
C
C
T
G
G
A
T
T
G
A
T
A
C
A
AT
10100
GGGTAGATGC AACACACACT CATGCTTCCC TGTTTGGTGG CATCTTATAG
10150
TTTCAGTGTT GAACATGTTT TGAGTTGGTT GGAGGTATAT GTATATGTGT
10200
ATGTGGTATG TGCTTTGTGT GTG
G
A
A
C
C
A
T
T
T
G
A
A
A
T
A
A
T
T
A
T
A
G
C
C
A
A
C
10250 |
REPEAT
A
T
G
A
C
A
C
T
T
C
A
C
T
C
T
G
A
A
T
A
C
T
T
C
A
A
C
A
T
A
C
A
T
T
G
T
G
A
A
A
A
A
T
A
A
G
G
A
C
T
10300
C
T
T
T
T
A
C
A
A
A
A
G
A
A
A
A
A
A
A
C
A
T
T
A
T
C
A
T
A
C
C
T
T
T
G
A
A
A
A
T
T
A
A
C
A
C
A
A
T
T
10350
C
C
T
T
A
A
T
A
T
C
A
T
C
T
A
A
C
A
T
A
T
C
C
T
A
C
A
T
A
T
T
C
A
G
A
T
T
T
T
C
C
C
C
A
A
A
A
T
G
C
10400
T
T
T
C
A
T
A
G
T
T
G
T
T
T
T
T
C
T
C
T
C
T
C
T
G
T
A
A
A
T
T
C
A
A
G
C
T
C
T
A
A
C
A
C
A
T
C
T
C
A
10450
T
C
T
G
G
T
T
G
T
T
A
T
C
C
C
A
C
T
T
T
A
G
T
T
T
T
T
A
A
A
T
T
T
A
G
A
A
C
C
G
T
T
C
T
C
C
C
C
A
A
10500
C
C
C
C
A
C
C
A
T
T
T
G
T
T
T
G
T
T
T
T
C
A
T
G
A
C
A
T
T
G
A
T
T
T
T
T
T
T
T
T
G
G
A
A
G
A
G
T
G
C
10550
A
A
G
G
A
G
G
A
C
A
T
C
T
T
G
T
A
A
A
A
T
G
T
C
C
C
A
C
A
T
T
C
T
G
G
A
T
T
T
G
A
G
T
G
A
T
T
A
C
T
10600
T
T
C
T
C
A
T
G
A
T
T
A
G
A
T
T
A
A
G
G
T
T
A
A
A
C
A
T
T
T
T
T
G
G
C
A
A
A
A
A
T
A
C
T
A
G
G
T
G
A
10650
T
A
T
C
T
G
G
C
C
C
T
T
C
T
T
A
C
T
G
T
A
T
C
C
T
A
G
C
T
G
G
A
G
G
T
T
G
C
A
C
T
C
T
A
T
T
T
A
G
T
10700
G
A
T
G
T
T
A
A
G
T
T
T
G
A
T
C
A
C
T
C
C
A
T
T
A
A
A
G
T
G
A
T
A
A
C
A
A
C
C
A
G
A
T
T
T
C
T
C
C
A
10750
T
G
G
C
T
A
G
T
A
A
A
G
T
T
T
G
G
T
T
T
T
T
T
A
C
G
T
T
A
T
T
G
A
T
A
A
G
T
A
A
T
T
T
G
T
G
G
A
G
C
10800
A
A
T
A
C
T
T
T
G
A
A
G
C
C
A
T
G
T
G
A
G
C
A
T
C
C
C
A
T
T
C
T
G
C
A
A
C
A
A
T
T
T
T
T
C
A
C
C
C
A
10850
A
T
G
G
T
T
T
T
A
G
C
A
T
T
C
A
T
T
G
A
T
G
A
T
C
C
T
T
G
C
C
T
G
A
A
T
C
G
A
A
T
A
T
T
A
T
C
C
T
G
10900
A
G
A
G
T
T
G
C
C
A
G
A
T
A
G
T
G
A
T
T
T
T
T
A
A
A
A
A
A
T
T
C
T
A
G
G
C
C
A
G
G
T
G
C
A
G
T
A
G
C
10950 |
REPEAT
T
C
A
T
G
C
C
T
G
T
A
A
T
C
C
C
A
G
C
T
C
T
T
T
G
G
A
A
G
G
C
T
G
A
G
A
T
G
G
G
A
G
G
A
T
C
A
C
T
T
11000
G
A
A
C
C
C
A
G
G
A
A
T
T
T
G
A
G
A
C
C
A
G
C
C
T
T
G
G
C
A
A
C
A
T
G
G
C
A
A
A
A
C
C
T
G
T
C
T
C
T
11050
A
C
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
C
T
A
A
A
A
A
T
A
A
A
A
T
A
C
A
A
A
A
A
T
T
A
11100
G
C
C
T
A
G
C
A
T
G
G
T
G
G
C
A
C
A
C
A
T
G
C
T
T
G
T
A
G
T
C
C
C
A
G
C
C
A
C
T
C
A
G
G
A
G
G
C
T
G
11150
A
G
G
T
G
G
G
A
G
G
A
T
C
A
C
C
A
G
A
G
C
C
T
G
G
G
A
G
G
T
T
G
A
G
G
C
T
G
C
A
G
T
G
A
G
C
C
G
T
G
11200
A
T
C
A
A
G
C
C
A
C
T
G
C
A
C
T
C
C
A
G
C
C
T
G
G
G
C
C
A
C
A
G
A
G
T
T
G
C
C
C
A
A
A
A
C
A
A
A
A
T
11250 |
REPEAT
G
A
A
A
A
C
T
T
A
A
A
G
A
A
T
T
C
C
A
T
C
A
T
T
C
C
A
G
C
T
A
C
A
T
T
T
A
T
T
A
T
C
C
G
A
T
T
T
C
T
11300
T
C
T
G
T
A
A
G
T
A
A
G
A
G
C
T
T
T
TA GGGAACTTGT TCTTAAATCC AGAGACTTTG
11350
AACAGATTCC TTCCAGGTGA ACTTGTTTGT GCTGCCCTTT CCTTGACTGT
11400
CGTTTTGTAT CACAGTGTTT GCGGATAGCT GCATTTTTAA GAGAGTTGCC
11450
CCTCTACTTT GTTTTGTGTT TTTGTTTTTT AATACACAAG AATTGCTGAC
11500
TCATTCTGGA TGTGTAGTAA GAAGGATTTT AAAATGTAGA TTGGTAAAGG
11550
TTAGCAGGTA CCTTATTCCT TACTGCTCAG CAGCAGCACT TTTGCCTTAA
11600
ACACATGGAT TAAGCAAGTG TATGGGTCTC AAATAGACAC TTGATATGCA
11650
CCAAGTGAGA AGATCTTCTA GAGTCTTTCT TTTGTGACCT GTTCTTTAAA
11700
GCAACATTAT GACTTCTCCT TCTGAACAGT AAGTA
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
11750 |
REPEAT
T
T
T
T
T
T
T
T
T
T
G
G
C
A
G
A
G
T
C
T
C
A
C
T
C
T
G
T
C
A
C
C
C
A
G
G
G
T
G
G
C
T
G
G
A
G
T
A
C
A
11800
G
T
G
G
C
G
T
G
A
T
C
T
C
G
G
C
T
C
T
C
G
G
C
A
A
C
C
C
A
G
T
C
A
C
T
T
G
G
G
T
T
C
A
A
G
C
G
A
T
T
11850
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
T
G
C
C
A
C
C
G
T
11900
G
C
C
T
G
G
C
T
A
A
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
T
G
G
G
G
G
T
T
A
T
T
C
C
A
T
G
T
T
G
11950
G
C
C
A
G
G
C
T
G
G
T
C
T
T
G
A
A
C
T
C
C
T
G
A
C
C
T
C
A
G
G
T
G
A
T
C
C
A
C
C
C
A
C
C
T
C
G
G
A
C
12000
T
C
C
C
A
A
A
G
C
T
C
C
C
A
A
A
G
T
A
C
T
G
G
G
A
T
T
A
C
A
G
G
T
G
T
G
A
G
C
C
A
C
C
A
C
A
C
C
C
G
12050
G
GCCAAGTAA ATCTTGTTAC AAATTGTTCT CCTTCAGTCT TGTCTTCTAA
12100
GAACTCAGAT GTAAACTGTG AGGTAGCAGT CTTTACTTGG TGTTCCTGGA
12150
CTCCATCTCA GAACGCACCA AAAACATCTA TATGATTGTG GAGCCACTCT
12200
ACATTGTTTC TACTGCTATC ACCAGACCTT AATGGAGGTG TGGGTTTTTT
12250
AAAAATCAAT ATACATCTCA AATACATTTC AGAAAGAAAG GTGTTTATGC
12300
TTAACGCAGT GTTAAAACTA TCCCAGCAAG AAAGTAGTAA CCTGTGTAAG
12350
CATTTTTTTG
T
T
T
C
T
T
T
G
C
T
T
G
T
T
T
G
T
T
T
A
T
T
T
G
T
T
T
T
T
G
A
G
A
C
A
G
G
A
T
C
12400 |
REPEAT
T
C
A
C
T
T
T
G
T
C
A
C
C
C
A
G
G
C
T
A
G
A
G
T
G
C
A
G
T
G
G
C
A
T
G
A
T
C
A
C
A
G
T
T
C
A
C
T
G
C
12450
A
G
C
C
T
C
A
A
C
C
T
C
C
C
A
G
G
C
T
C
A
A
G
C
A
A
T
C
C
T
C
C
A
G
C
C
T
C
T
G
C
C
T
C
A
A
G
C
A
A
12500
T
C
C
T
C
C
A
G
C
C
T
C
T
G
C
C
T
C
C
C
A
A
G
A
A
T
C
T
G
T
A
G
T
T
T
C
T
G
G
G
A
C
T
A
C
A
G
G
A
G
12550
T
G
T
C
C
C
A
C
A
A
C
A
C
C
C
A
G
C
T
G
A
T
T
T
T
T
T
A
T
T
T
T
T
T
T
G
G
T
A
G
A
G
A
A
A
G
G
A
T
C
12600
T
C
A
C
T
A
T
G
T
T
G
C
C
C
A
G
G
C
T
G
G
C
C
T
T
G
A
A
C
T
C
C
T
G
G
G
C
T
C
A
A
G
C
A
A
T
C
C
T
C
12650
C
T
G
C
C
T
C
A
G
C
C
T
C
C
C
A
A
A
G
C
A
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
T
G
A
A
C
C
A
C
C
A
T
G
C
C
12700
T
G
G
C
C
TTTTG TTTGTTTTGA AATAAGTTCT ATTTGTCAAA AAGA
A
A
G
A
G
A
12750 |
REPEAT
A
T
T
A
A
A
T
A
G
A
A
C
A
T
C
T
A
T
A
A
G
T
G
A
A
A
A
A
A
A
T
A
T
A
G
T
C
A
T
T
A
T
A
A
T
A
T
A
A
A
12800
A
C
A
C
A
T
T
T
A
G
C
T
A
C
A
G
T
T
G
G
A
G
A
G
A
G
A
A
T
T
A
G
T
A
A
G
C
T
A
G
A
A
G
G
T
A
G
A
T
C
12850
T
G
A
G
A
A
A
A
T
C
A
C
A
C
A
G
A
A
A
G
A
A
T
C
A
C
A
G
A
A
A
T
A
A
A
A
A
G
G
A
G
G
A
TATTGA
C
12900 |
REPEAT
C
A
G
G
C
A
C
G
G
T
G
G
C
T
C
A
T
G
C
C
T
G
T
A
A
T
C
C
C
A
G
C
A
C
T
T
T
G
G
G
A
G
G
C
C
G
A
G
G
C
12950
G
G
G
T
G
ATATT
G
G
C
C
A
G
G
C
A
T
G
G
T
G
G
C
T
C
A
T
G
C
C
T
A
T
A
A
T
C
C
C
A
G
C
A
C
T
T
T
13000 |
REPEAT
G
G
G
A
G
G
C
C
G
A
G
G
T
G
G
G
T
G
G
A
T
C
A
C
C
T
G
A
G
G
T
C
A
G
G
A
G
T
T
T
C
A
G
A
C
C
A
G
C
C
13050
T
G
G
C
C
A
A
T
A
T
G
G
T
G
A
A
A
C
C
C
C
G
T
C
T
C
T
A
C
T
A
A
A
A
A
T
A
C
A
A
A
A
A
T
T
A
G
C
C
G
13100
G
G
C
G
T
G
G
T
G
G
T
G
C
A
T
G
C
C
T
G
T
A
A
T
C
C
C
A
G
C
T
A
C
T
T
G
G
G
A
G
A
C
T
G
A
G
G
C
A
G
13150
G
A
G
A
A
T
A
G
C
T
T
G
A
A
C
C
T
G
G
G
A
G
G
C
A
G
A
G
T
T
T
G
C
C
G
T
G
A
G
C
C
A
A
G
A
T
C
A
T
G
13200
C
A
A
T
T
G
C
A
C
T
C
C
G
T
C
C
T
G
G
G
C
C
A
C
A
G
A
G
C
A
A
A
A
C
T
C
T
G
T
C
T
C
A
A
A
A
A
A
A
T
13250
T
A
A
A
T
A
T
A
A
A
A
A
G
G
A
G
G
G
T
A
T
G
A
A
T
A
G
A
C
G
G
T
A
A
G
A
G
A
C
A
T
G
G
A
G
G
A
C
A
G
13300 |
REPEAT
A
A
T
G
G
G
A
G
G
T
C
C
A
A
T
A
T
A
T
G
C
C
T
G
T
A
G
G
A
T
T
T
C
C
A
C
G
A
A
G
A
G
A
A
C
A
C
A
G
G
13350
G
A
C
T
G
T
G
G
A
A
G
A
G
G
C
A
A
T
A
T
T
C
A
A
A
A
G
A
T
G
A
T
G
G
C
T
G
A
G
A
A
A
T
T
T
C
A
A
A
C
13400
A
T
T
G
A
A
G
C
A
A
G
A
T
G
T
G
A
C
C
G
G
A
A
T
C
C
T
C
A
G
G
T
A
C
A
G
G
A
A
G
C
A
T
C
C
T
G
A
G
T
13450
C
T
T
G
G
T
A
G
G
A
A
A
A
A
T
T
A
A
C
A
C
A
C
A
A
C
C
A
A
A
T
G
A
G
G
A
C
A
C
A
T
C
A
C
A
G
G
A
A
T
13500
C
T
G
C
A
G
A
A
C
A
C
T
G
A
A
G
G
C
A
A
T
G
G
A
T
A
T
C
T
T
A
A
A
A
T
C
A
A
C
C
A
A
A
G
A
G
A
A
A
G
13550
A
C
T
A
A
T
G
A
T
C
T
A
C
A
C
A
C
G
A
A
G
C
A
T
G
A
T
T
A
A
A
C
C
G
G
C
A
A
C
T
G
A
T
T
T
A
T
C
A
T
13600
C
A
A
T
G
T
A
C
C
T
T
C
T
A
A
A
G
A
C
T
G
A
A
A
G
G
A
A
A
T
TAATGGCCAA CCAAGACTTA
13650
TACTCAGGTG AACTATTATT TGAGAGGAAG AAGTTAATTT CAGAGAAAGA
13700
GAGTTTACCA CTCACAGATC CTCACTGAAA AGAACTACTA AAAATCTGAA
13750
AAGGAAACAG AACCCAGAAA AAAGGAATAG GATAGAAGGA GACATGGGGA
13800
GCAATGAAAT TAGTAAACGT TTAGGTAAAT CCAAAAAACA TTAGTGAAAA
13850
TACACACACA CACACTTATG AGACTAGTTT TCACTA
A
A
A
A
T
T
G
G
T
C
A
A
A
T
13900 |
REPEAT
A
G
T
G
G
C
A
A
T
T
T
C
A
T
A
T
A
G
T
T
C
T
A
C
C
T
A
A
CA AAAATGTATT TACACATAGT
13950
TTAGTTTGTG GGGGT
T
T
T
G
T
T
T
G
C
T
T
G
T
T
T
G
T
T
T
G
T
T
T
G
T
T
T
T
T
T
G
A
G
A
C
14000 |
REPEAT
A
G
G
G
T
C
T
C
A
C
C
G
T
T
G
C
C
C
A
G
G
A
T
G
G
A
G
T
G
C
A
G
T
G
G
T
G
T
A
A
T
C
A
C
T
G
C
A
G
C
14050
C
T
C
G
A
C
C
A
C
C
C
T
G
G
G
C
T
C
A
G
A
T
G
A
T
C
C
T
C
C
C
A
C
C
T
C
A
G
T
C
T
T
C
T
G
A
G
T
A
G
14100
C
T
A
G
G
A
C
C
A
C
A
G
G
C
A
T
A
C
A
C
C
A
C
C
A
T
G
C
C
C
A
G
C
T
A
A
T
T
T
T
T
A
T
A
T
T
T
T
T
T
14150
T
G
T
A
G
G
G
A
C
G
G
G
G
G
T
C
T
C
G
C
T
A
T
G
T
T
T
C
C
C
A
G
G
C
T
G
G
T
C
T
C
G
A
A
C
T
C
A
T
G
14200
G
G
C
T
C
A
A
G
C
G
A
G
C
T
G
C
C
C
G
C
C
T
C
G
G
C
C
T
C
A
A
A
A
G
T
G
T
T
G
G
G
A
T
T
A
C
A
G
G
T
14250
G
T
A
A
G
C
C
A
C
C
C
T
A
C
A
C
C
T
G
G
C
C
AA
G
T
T
T
A
T
C
T
C
A
T
T
T
C
C
T
A
C
T
G
T
A
C
T
C
C
14300 |
REPEAT
C
A
G
T
G
C
C
T
A
G
A
C
C
A
G
T
G
C
T
T
A
T
C
A
C
A
T
G
G
T
A
G
G
G
A
G
G
G
C
T
C
A
A
A
A
A
C
A
T
C
14350
T
A
T
T
G
A
T
G
A
A
GGGACTACTT TGGAGGAGGT GTTAAAAGTA ATGAGGAACT
14400
AAAATATCAG ACCGAAGCAA CCTGGAAATG GGAAAGGAGA GATCAAAGTG
14450
AAAGCATTCA GCAGCCTTTG CATTATTCGG GAGAAAGATA GGGATATTGA
14500
TTATCTATAG ACTTTGTTAG GTCAGACAAG AATATTAAAA TTTTAAGGAT
14550
AACCCATAAA ACGAATAGAA AGAATGGATA ACTTTTAAAC TAGTATATGG
14600
GGAAGAGAGA CTAAAGAGCA TCCGATT
G
A
C
T
T
C
T
A
G
T
A
A
C
G
G
A
C
G
G
T
G
G
A
14650 |
REPEAT
T
T
A
A
T
T
G
C
A
T
G
C
A
T
T
A
G
C
C
T
C
T
G
C
T
G
T
T
C
T
C
T
G
A
A
A
T
C
T
C
A
A
T
A
A
C
G
T
G
G
14700
C
A
T
T
G
A
A
G
T
G
C
A
C
T
T
T
T
T
A
C
A
A
A
A
T
G
C
A
T
A
A
A
C
C
C
A
A
A
G
G
G
A
A
A
A
T
A
G
A
A
14750
A
A
G
G
A
G
A
C
A
A
A
A
A
T
A
A
C
A
A
C
A
T
T
T
G
G
G
A
A
G
C
T
G
G
A
A
A
G
C
T
T
C
C
A
A
A
T
G
G
T
14800
C
A
A
A
C
G
G
T
G
G
T
G
A
C
A
G
G
C
A
C
A
C
C
T
G
A
C
A
T
G
A
A
T
T
T
T
G
A
A
G
C
A
G
C
A
G
T
G
G
A
14850
G
A
A
A
G
C
C
A
A
G
A
A
G
G
C
C
C
C
T
G
A
T
T
T
T
C
A
C
T
A
C
T
A
A
A
C
T
C
T
C
C
A
A
A
T
G
C
T
C
T
14900
G
A
C
A
T
T
A
G
T
G
T
C
A
A
C
A
T
T
C
T
T
C
T
G
G
A
A
G
A
G
T
G
G
G
T
G
G
C
A
G
T
G
G
G
A
C
T
A
G
A
14950
A
A
T
A
T
G
G
C
A
G
T
T
G
G
T
C
G
A
G
A
G
T
C
T
G
T
T
T
G
A
G
A
A
G
C
A
G
C
T
G TTCTCTTCAA
15000
CTCTACTTGG CTGGGTACCC CCAAAGCAGC AGAAAACCAG ATGCTTATTC
15050
TCTCTGAAAA GGGTAAAAGA GTGGGACTCC GGCCTGGAGA AAACTAGGTA
15100
TTGTTGGGGG CCAGGGTTTT TGTGCTAAGA AAAATTAAGG AAAACTTTAT
15150
ACATTAAATT TTGA
G
G
C
T
G
G
G
C
A
C
A
G
T
G
G
C
T
C
A
T
G
C
C
T
G
T
A
A
T
C
C
C
A
A
C
A
15200 |
REPEAT
C
T
T
T
G
A
G
A
A
G
G
C
A
A
G
G
C
A
G
G
A
G
G
A
T
C
G
C
T
T
G
A
G
C
C
C
A
G
G
A
G
T
T
C
G
A
T
A
C
C
15250
A
G
C
C
T
G
G
G
C
A
A
T
G
T
A
G
C
G
A
G
A
C
C
C
T
G
T
C
T
C
T
G
C
A
A
A
G
A
A
T
A
C
A
A
A
A
A
T
T
A
15300
G
C
C
A
G
G
T
A
T
G
G
T
G
G
C
A
C
C
T
G
T
G
G
T
C
C
T
A
G
C
T
A
C
T
C
A
G
G
A
G
G
C
T
G
A
G
G
C
A
G
15350
A
A
G
G
A
T
T
A
C
T
T
G
A
G
C
C
C
A
C
G
A
G
T
T
A
G
A
G
A
C
C
G
C
A
G
T
G
A
G
C
C
A
T
G
A
T
C
A
T
G
15400
C
T
A
C
T
G
C
T
C
T
C
T
A
G
C
C
T
G
G
G
C
G
A
C
A
G
A
A
T
G
A
G
A
C
C
C
C
A
T
C
T
C
A
A
A
A
A
T
A
A
15450
A
TTTTTTTAA AAGACTCCTC AGTTCACCAT CTTTCCCCAT ATTGAAAATG
15500
GAACAACATT CTCCCTGAGA AATTTGGCTG GCCCAAGGAA ATAAGCCTAA
15550
AGAAACTGAC TTTTGGGGGT TCCCTGCTCT CCAGATGGTC CCTCCTAAAT
15600
TGAACCATGG ACTCAGGATT TCACATCAGC TTCTTCTAAT TTATGCA
G
C
T
15650 |
REPEAT
T
T
C
T
T
C
T
A
A
T
T
T
A
C
T
G
T
G
G
G
G
T
T
T
T
T
T
C
G
G
T
G
T
T
T
T
T
T
G
T
T
T
T
G
T
T
T
T
G
T
15700 |
REPEAT
T
T
T
T
T
G
A
G
A
C
A
G
G
G
T
C
T
T
G
C
T
C
T
G
T
C
A
T
C
C
A
G
C
T
G
G
A
G
T
G
C
A
G
T
G
G
T
G
T
G
15750
A
T
C
A
T
A
G
C
T
C
A
C
T
G
C
A
G
C
C
T
C
A
A
A
C
T
C
C
T
G
G
G
C
T
C
A
A
G
T
G
A
T
C
C
T
C
C
C
A
C
15800
C
T
C
A
G
C
C
T
C
C
T
A
A
G
T
A
G
C
T
G
G
G
G
C
T
A
C
A
G
G
C
A
T
G
A
C
C
A
C
C
A
C
A
C
C
T
G
G
C
T
15850
A
A
C
T
T
T
T
T
A
A
T
T
T
T
T
T
G
T
A
G
A
G
G
T
G
G
G
G
G
T
C
T
T
G
A
C
A
T
G
T
T
G
C
T
C
A
G
G
C
T
15900
G
G
T
C
T
T
G
A
A
A
T
C
C
T
G
G
C
C
T
C
A
A
G
C
G
A
T
C
C
C
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
C
A
A
A
G
15950
T
G
C
T
G
G
G
A
T
T
T
C
A
A
G
C
A
T
G
T
G
C
C
A
C
T
G
T
G
C
C
T
G
G
C
T
A
A
G
A
A
T
A
C
T
T
A
A
A
G
16000 |
REPEAT
G
T
G
A
A
G
T
G
A
G
A
A
G
T
T
C
A
C
A
T
G
A
T
G
T
C
A
T
A
T
C
A
G
G
A
G
G
T
A
C
A
C
A
A
T
G
G
C
T
A
16050
C
T
T
G
T
C
C
C
A
C
T
A
A
T
A
G
C
A
A
T
C
C
T
T
A
G
T
T
T
T
T
T
C
A
C
T
T
C
G
T
G
A
G
G
T
G
G
T
C
A
16100
C
T
A
G
C
A
G
A
C
C
T
C
T
T
G
G
T
T
G
G
A
A
A
G
A
T
A
G
G
T
T
A
T
T
T
C
C
C
T
T
T
G
T
A
A
T
T
A
A
C
16150
A
A
G
T
A
A
C
A
G
A
A
T
T
G
G
G
A
T
A
C
T
T
T
G
G
C
A
T
C
T
T
A
A
A
A
T
A
C
C
T
T
G
G
C
A
A
C
C
T
G
16200
A
T
G
A
A
C
A
T
A
G
C
A
T
A
C
A
T
T
C
A
A
G
A
T
T
T
T
T
G
C
C
T
G
A
A
T
T
A
A
T
T
A
T
T
A
C
A
T
T
A
16250
G
A
A
G
A
G
T
T
G
C
A
G
A
A
T
G
A
G
A
A
T
G
T
T
C
T
A
A
T
T
C
T
A
C
T
A
T
T
T
C
T
T
T
T
A
T
A
T
A
T
16300
T
T
C
A
G
C
T
G
A
T
G
T
T
A
T
T
C
T
A
T
G
A
A
G
A
A
A
A
G
C
T
C
C
C
T
C
A
T
C
A GTGAAGATGA
16350
ACTGCACTTC TTTCTAAAGA GGCAATGTCA GA
C
G
G
G
C
A
C
G
G
T
G
G
C
C
T
A
T
A
16400 |
REPEAT
A
T
C
C
T
A
G
C
A
C
T
T
T
G
G
G
A
G
G
C
C
G
A
G
G
A
G
G
G
T
G
G
A
T
C
A
C
C
T
G
A
G
G
T
C
A
G
G
A
G
16450
T
T
C
G
A
G
A
C
C
A
A
C
C
T
G
G
C
C
A
A
C
A
T
G
G
T
A
A
A
A
C
C
C
T
G
T
C
T
C
T
A
C
T
T
A
A
A
A
T
A
16500
C
A
A
A
A
A
T
T
A
G
C
C
G
G
G
C
A
T
G
G
T
G
G
C
A
C
A
T
G
C
C
T
G
T
A
A
T
C
C
T
A
G
C
T
A
C
A
T
G
G
16550
G
A
G
G
C
T
G
A
G
G
C
A
G
A
A
G
A
A
T
C
G
C
T
T
G
A
A
C
C
T
G
A
G
A
G
G
C
A
G
A
G
G
T
T
G
C
A
G
T
G
16600
A
G
C
T
A
A
G
A
T
T
G
T
A
C
C
A
C
T
G
C
A
C
T
C
C
A
G
G
C
T
G
G
G
T
G
A
C
A
G
A
G
T
G
A
G
A
C
T
C
T
16650
G
T
C
T
C
A
A
A
A
A
A
A
A
A
C
A
A
A
A
C
A
A
A
A
C
G
A
A
A
C
A
A
A
A
A
A
A
CAG TTGACAGTAG
16700
TTGTTTTTGT AGAGTGAAAA ATAGAGGTGG GGGTTGAGAT ACTGCTGCTA
16750
TTTTAAACTA ACTTGTAGAT CTGTTTGACT CTTTTTTAAA AAACA
A
T
T
T
T
16800 |
REPEAT
T
T
G
G
A
G
A
G
A
T
G
A
A
G
C
C
T
T
G
C
T
A
T
G
T
A
G
C
C
C
G
G
G
C
T
G
C
T
C
T
C
A
A
A
C
C
C
C
T
G
16850
G
C
C
T
C
A
A
G
T
G
A
T
C
C
C
C
C
T
G
T
C
T
C
G
A
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
G
T
T
A
C
A
G
A
16900
C
A
T
A
A
G
T
C
A
C
C
A
C
A
C
C
T
G
G
C
T
CTGTTTGAC TCTTTATGTG CAGTTTGAAC
16950
TTTTTAACTG AATTTTTTAA AAGGTGTGAA ATTAGTCTTT AAAGGAACAC
17000
AAAACCAACA GAGAAGCAAG CAGGAAGATA CAAGAAGCAA AGAAATATCA
17050
TAGTAAAATG AAAATACAAA ATAATATGAC ATAAATAAAA CAGTAATCAC
17100
TATCAAATTA AATAGATTAA GTGAGCCCAC
A
A
G
G
C
A
T
T
G
T
C
T
C
T
G
G
G
A
C
T
17150 |
REPEAT
A
G
T
C
C
G
C
T
G
G
G
T
C
G
A
A
G
T
C
C
T
G
C
C
A
C
T
T
A
T
T
G
G
T
T
C
T
G
T
G
A
C
C
T
T
T
G
G
C
A
17200
A
A
T
T
A
T
T
T
A
A
C
T
T
C
T
C
T
G
T
G
A
T
T
C
C
A
T
T
T
C
T
T
T
C
T
T
T
T
G
T
T
T
T
C
T
T
T
C
T
T
17250 |
REPEAT
T
C
G
T
T
T
T
T
T
T
T
T
T
T
G
A
G
A
C
A
G
A
G
T
C
T
C
A
C
T
C
T
G
T
C
A
C
T
C
A
G
G
C
T
G
G
G
T
G
C
17300
A
G
T
G
G
T
G
C
A
A
T
C
A
C
C
A
A
T
C
A
C
A
A
C
T
T
A
C
T
G
C
A
G
C
C
T
C
C
A
C
C
T
C
C
T
G
G
G
C
C
17350
C
A
C
G
C
G
A
T
C
C
T
C
T
C
A
C
C
T
C
A
G
T
C
T
C
C
T
G
A
G
T
C
C
C
T
G
G
G
A
A
C
A
C
A
G
A
C
A
T
G
17400
T
G
C
C
A
C
C
C
T
G
C
C
A
A
G
C
T
A
A
T
T
T
T
T
A
A
A
T
T
T
T
T
T
G
T
A
G
C
G
A
T
T
G
G
G
T
C
T
C
A
17450
C
T
A
T
A
T
T
G
C
C
T
A
G
G
C
T
G
G
T
C
T
C
A
G
A
C
T
C
C
T
G
A
G
C
T
C
A
A
G
T
G
A
T
C
C
T
C
C
C
G
17500
C
C
T
T
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
T
G
A
G
C
C
A
C
T
G
C
A
C
C
C
A
G
17550
C
C
T
C
A
A
A
C
T
C
T
G
T
A
A
A
A
G
G
G
G
A
A
T
A
A
T
G
A
T
A
A
T
A
C
T
T
G
C
C
T
C
A
T
A
G
G
G
C
T
17600 |
REPEAT
T
G
T
G
G
G
A
A
A
T
A
A
T
G
C
T
T
T
A
T
T
A
C
T
G
T
A
G
G
T
T
A
A
A
A
A
G
C
A
C
A
C
A
A
A
C
A
T
G
C
17650
T
T
G
A
C
A
C
A
G
T
A
G
G
T
G
C
T
A
C
A
T
C
A
G
T
T
T
A
C
T
A
T
T
G
T
T
GTTG
T
T
T
T
C
A
G
A
T
T
17700 |
REPEAT
G
G
A
T
T
T
T
A
T
A
A
A
A
A
C
T
C
T
A
C
T
T
A
T
T
T
G
C
A
A
T
T
T
A
T
A
A
G
A
G
G
C
A
T
A
C
C
T
A
A
17750
A
A
T
A
T
A
A
T
G
T
C
A
G
A
G
A
A
A
G
G
T
T
A
A
A
A
A
G
T
G
G
A
G
C
C
A
G
G
G
A
A
G
G
A
A
T
T
T
A
A
17800
G
A
A
G
A
C
A
T
A
C
C
A
G
A
C
C
A
A
T
A
G
T
A
A
T
C
A
A
A
A
G
A
A
A
G
C
A
G
G
T
A
T
G
C
A
A
T
G
T
A
17850
A
C
A
T
C
A
G
A
A
C
A
A
A
T
A
A
G
C
T
T
T
G
C
T
T
A
T
T
T
G
G
A
T
T
G
A
T
A
G
G
G
A
T
A
G
A
T
A
G
G
17900
A
T
T
G
A
T
A
G
G
G
A
T
A
A
A
G
C
A
G
A
T
C
A
T
T
A
T
T
T
A
A
T
G
A
T
C
A
A
A
G
A
A
G
A
A
A
T
T
T
G
17950 |
REPEAT
G
C
T
G
G
G
C
G
T
G
G
T
G
G
C
T
C
A
C
A
C
C
T
G
T
A
A
T
C
C
C
A
G
C
G
C
T
T
T
G
G
G
A
G
G
C
C
G
G
G
18000
G
C
A
G
G
G
G
G
A
T
G
G
A
T
C
A
C
A
A
G
G
T
C
A
G
G
A
G
T
T
T
G
A
G
A
C
C
A
G
C
C
T
G
A
C
C
A
A
A
G
18050
T
G
G
T
G
A
A
A
C
C
C
T
G
T
C
T
C
T
A
C
T
A
A
A
A
A
T
A
C
A
A
A
A
A
A
T
T
A
T
C
C
G
G
G
T
G
T
G
G
T
18100
G
G
T
G
C
A
T
G
C
C
T
G
T
A
G
T
C
C
C
A
G
C
T
A
C
T
C
A
G
G
A
G
G
C
T
G
A
G
G
C
A
G
G
A
A
A
A
T
C
A
18150
T
T
T
G
A
A
C
C
G
G
G
G
A
G
G
C
A
G
A
G
G
T
T
G
C
A
G
T
G
A
A
C
T
G
A
G
A
T
C
G
C
A
C
C
A
C
T
G
C
A
18200
C
T
C
C
A
G
C
C
T
G
G
G
T
G
A
C
A
G
A
A
T
A
A
G
A
C
T
C
T
G
T
C
T
C
A
A
A
A
A
A
A
A
A
G
A
A
A
A
A
A
18250
G
A
A
G
A
A
G
A
A
G
A
A
T
A
A
A
T
T
C
A
C
T
G
G
G
A
A
G
A
T
A
A
A
A
C
A
A
T
T
C
T
G
A
A
C
C
T
G
A
C
18300 |
REPEAT
A
A
C
A
T
A
G
T
C
T
C
A
A
A
A
T
A
C
G
T
A
A
A
G
C
A
A
A
A
A
G
C
C
A
C
T
G
A
G
T
C
C
C
A
A
A
G
A
G
A
18350
A
C
T
T
G
A
C
A
A
A
T
T
T
G
G
T
A
G
G
A
G
A
T
T
T
T
A
A
T
A
C
C
A
T
T
T
C
C
T
C
A
A
A
C
T
G
A
T
G
G
18400
A
A
T
C
A
C
A
G
A
A
A
A
A
A
A
A
T
A
T
A
A
G
A
A
C
A
A
G
T
C
A
G
C
A
A
A
C
T
T
G
A
TATGGAAAG
18450
AACAGCATCC CCACCTGAGA GTCAGCAGCC TCCATGCAGG AGGACCGAGG
18500
GGAAGCCAGG GAGCAGGGCA GCCTGAGCCA AGGCCAAGCC CAGTTGAGGA
18550
CAGGAAGTGG AAGGAGGCAC CTAGGAAGGC TGAAGATGCA GAGGAGTGTG
18600
TTCCAGAGGC AGTGGGAAAA CTGGGAGGGA ATAGTTCTCG GGATGTTGGA
18650
TAGAGAGAAT AAGCAAAGAA CAATGCAGGA AGAAGAGTGG GGTGGGAGGC
18700
CGTGACCTTG CTAGAGAGCC TTATGTTTTG GGCTATGACC AGGGCTGACA
18750
GAAGTGTGAG GCATGGCGGG TTCAGAAGAG AGATCAATGA TGCATCGCAC
18800
ATCTCTAAAG CTGATGAAGG CAGCAATTGG CACTGAGGCC AACCTGGTTG
18850
GGACTCAGAG AGGTGGGACA CATATTGTGT GCCAGGCACC CTGCCACATC
18900
TTGGAGAGAA TAAGACACAA ACCCTCATCT AGTGGGTTGG GGAAATGATG
18950
CAGGGCTCAG TGTGGCTAGC GCGTTGGGGT GGGAAGCAGC CTGGGCACGC
19000
CCACAAGGCT CTCTGCCTTC CAGAATCTGG CCCAGTTGGG TTTGAA
A
A
G
A
19050 |
REPEAT
T
T
T
A
A
T
G
A
C
A
A
T
G
A
G
C
T
C
T
C
T
A
T
G
G
A
C
C
T
A
C
C
T
T
G
T
G
C
C
C
A
G
T
A
T
T
T
A
C
C
19100
T
T
T
C
T
T
A
T
C
T
G
T
A
A
C
G
C
T
C
A
A
G
T
C
A
A
T
A
C
T
C
T
A
A
G
G
T
A
G
G
A
A
T
T
G
T
T
A
T
C
19150
C
T
C
A
T
T
T
T
A
T
T
T
A
C
A
G
A
G
A
A
A
C
T
G
G
A
G
T
T
C
A
G
A
G
A
G
A
T
T
C
T
T
G
C
C
C
A
A
G
G
19200
T
C
T
T
G
C
C
C
T
G
G
G
C
C
A
A
T
T
C
C
A
G
G
T
C
T
A
T
C
T
A
A
C
T
C
C
A
A
A
G
T
C
C
A
T
G
C
T
C
A
19250
GTACACTGTC TCCTGCTTGG GAGCAAAGAG AGGACTTTGC TCCTAAGACC
19300
AGACCCATAC CACAAAGAGC CTCAACACTT AACTACTGGG AGAAAACACC
19350
GTCCTAAGTA AAAAGCCAGG TGCTCTTCCC CTTACCCCAG GGGGTTTAGG
19400
ATTTGGGAGA AAGAGGCCTG ATGGAATATA GGGGTACCAG GTCCATGTGT
19450
TAACTGATCT GGGTACAAGA ACAGTAACTC AAGGCCAGCC AGGGAAAGAA
19500
CAGCCAAGCC GAGGCACAGC TGCCAAGTCA TCACCTCTCG TTCCTTGTCC
19550
CACCATGCCT TGTCTCAGCA CGAACATAAC CCCCTCAGTT TCCCTCTTCT
19600
CCTGTCTCCT GCATAGGCAG AACCAGTTCT GAGTGATAGC CAGGACAGCA
19650
GGTCCTCGGG TCCTCAGGCT TGCCACTACA ACCTGTTTAC CCTGCGCAGA
19700
CCATGATCCA TCAGGTCGAG AAATGCCCCA CTGCTAGATG TTTGTGTCAA
19750
AAATGCCTGC CGGTGGTGCA ACAGCTGGGT GGGACCAGTG ACGTGGACTT
19800
GAGTGCCACA AAATGTACCG AACACATTCT GAGTCAACAT AGTGATGGTG
19850
TGCCAACTCC AGATAAAGAT TTTCATTATT ATTTGTTTGG GTTTTGTTTA
19900
TATGTACTCC TCTTCATCAC CAAAGATTTG AGGCTACAGA AAATGAAAAT
19950
TGAAATGGTA ATAAAAGAGA GGGGCTTGCA GAGAGGACCA CAGGATGGGG
20000
AAAATAGAAA TAATCGCAGA AAATCATGAC CCGATGAAGA GCACAATCAT
20050
AGAAGCACTA ACAGATGACA AGGTTGCCAG ATCTTCACCA TTGGTTTTGG
20100
GTTTCGTGAA AACCCGCGAA GAGAGGGATT CAGTCAATTA CATATTTTTT
20150
TTCCTTCT
T
T
C
T
T
T
T
A
T
T
T
T
T
A
T
T
T
T
T
T
A
T
T
T
T
T
T
C
T
T
G
A
A
A
T
G
G
A
G
T
C
T
20200 |
REPEAT
C
A
T
T
C
C
G
T
C
A
C
C
A
A
G
G
C
T
G
G
A
G
T
G
C
A
G
T
G
G
T
G
C
A
A
T
C
T
T
G
G
C
T
C
A
C
T
G
C
A
20250
G
C
C
T
T
C
A
C
C
T
C
C
T
G
A
G
G
T
T
C
A
A
G
C
C
A
T
T
C
T
C
C
T
G
A
T
T
C
A
G
C
T
T
C
C
T
G
A
A
T
20300
A
G
C
T
G
G
G
A
C
T
A
C
A
G
G
C
A
T
G
A
A
C
C
A
C
C
A
T
G
C
C
T
G
G
C
T
A
A
T
T
T
T
T
G
T
A
T
T
T
T
20350
T
A
G
T
A
G
A
G
A
C
T
G
T
C
A
C
A
C
G
C
G
A
G
A
C
A
G
G
G
T
T
T
C
A
C
C
A
T
G
T
T
T
G
C
C
A
G
G
C
T
20400
C
T
C
G
A
A
C
T
A
C
T
G
A
C
C
T
C
A
A
G
T
G
A
C
C
C
G
C
C
C
A
C
A
G
C
A
G
C
C
T
C
C
C
A
A
A
G
T
C
C
20450
T
G
G
A
A
T
T
A
C
A
G
G
C
A
T
G
A
G
C
C
A
C
C
G
T
G
C
C
T
G
G
C
T
TG
T
T
T
T
A
A
T
T
T
T
T
T
A
T
T
20500 |
REPEAT
A
T
A
A
A
C
A
A
C
A
C
T
G
T
G
A
T
G
A
A
C
A
T
C
C
A
C
A
T
A
G
C
T
A
A
A
T
G
C
A
T
G
C
A
T
T
C
C
A
T
20550
G
A
T
C
A
T
T
T
T
C
T
T
A
A
G
A
T
A
A
T
T
T
T
T
T
T
C
T
T
A
T
T
A
T
T
T
G
T
G
T
A
T
T
T
A
T
T
T
A
T
20600 |
REPEAT
T
T
A
T
T
T
T
T
G
A
G
A
C
G
G
A
G
T
C
T
T
G
C
T
C
T
G
T
C
A
C
C
C
A
G
G
C
A
A
G
A
G
T
G
C
A
G
T
G
G
20650
C
A
C
A
A
T
C
T
C
G
G
C
T
T
A
C
T
G
C
A
A
C
C
T
C
C
A
C
C
T
C
C
C
A
G
G
T
T
C
A
A
A
C
A
A
T
T
C
T
C
20700
C
T
G
C
C
T
C
A
G
C
C
T
C
T
T
A
A
G
T
A
G
C
T
G
G
G
A
T
A
C
A
G
G
C
A
C
A
A
G
C
C
A
C
C
A
A
G
C
C
C
20750
A
G
C
T
A
A
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
A
G
A
G
T
T
T
C
G
C
C
A
T
G
T
T
G
G
C
C
A
G
G
20800
C
T
G
G
T
C
G
C
C
A
A
C
T
C
C
T
G
A
C
C
T
C
A
G
G
C
A
A
T
C
G
C
C
T
G
C
C
T
C
G
G
C
C
T
C
C
C
A
A
A
20850
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
A
G
C
C
A
C
C
T
C
A
C
C
T
G
G
C
C
TTA AGATACATTT
20900
TTGAAGGGAA GAACATTTTC TGTCCAAGTA CATTCCTGTT CACAAACCCT
20950
TTTCCATAAC ACTGGAGACT TCCTGGGGAG GTTGGAACCA ATGGCTTAGA
21000
CTCCAAAGGA ATGAATGTAC CATAAAGCTT TCAGCAATCT TCTGTCAATG
21050
CTACTTGTAT A
G
A
C
T
T
C
C
A
G
A
C
A
A
A
C
A
T
T
G
A
A
G
A
T
T
G
A
A
T
G
C
A
T
G
C
A
T
C
T
21100 |
REPEAT
A
A
T
T
T
C
A
C
T
T
T
T
T
T
T
T
C
C
T
G
A
A
A
T
C
C
T
C
C
T
G
A
A
A
C
T
A
T
A
A
T
G
A
A
G
A
A
A
T
G
21150
T
T
T
G
G
G
A
A
A
A
G
A
T
G
T
A
G
C
C
A
G
G
C
A
C
T
G
T
G
G
C
T
T
A
T
G
C
C
T
G
T
A
A
T
C
C
T
A
G
C
21200 |
REPEAT
A
C
T
T
T
G
G
G
A
G
G
C
C
A
A
G
G
T
G
G
G
C
A
G
A
T
C
G
C
T
T
G
A
G
T
C
C
A
G
G
A
G
T
T
T
G
A
G
A
C
21250
T
A
G
C
C
T
G
G
G
C
A
A
C
A
T
G
G
C
G
A
A
A
C
C
C
T
G
T
C
T
C
T
A
C
T
G
A
A
A
A
A
A
T
A
C
A
A
A
A
A
21300
A
T
C
A
G
C
C
A
G
G
C
A
T
G
G
C
A
G
A
C
C
T
G
T
A
G
T
T
C
C
A
G
C
T
A
C
T
C
A
G
G
G
G
G
C
T
G
A
G
G
21350
T
G
T
G
A
G
G
A
T
C
A
C
T
T
G
A
G
C
C
T
G
G
G
A
G
G
T
A
G
A
A
G
C
T
G
C
C
G
T
G
A
G
C
C
C
T
G
A
T
T
21400
G
T
G
C
C
A
C
T
G
A
A
C
T
C
C
A
G
C
C
T
G
G
G
C
G
A
C
A
G
A
G
T
G
A
G
A
C
C
C
T
G
T
C
T
C
A
A
A
A
A
21450
A
C
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
G
A
G
A
C
G
G
G
A
T
C
G
G
G
A
A
A
A
G
G
A
A
G
A
A
21500 |
REPEAT
A
A
G
A
C
C
A
C
A
G
C
A
C
A
A
T
T
T
T
G
G
A
T
G
T
T
A
G
A
A
A
G
C
A
A
A
T
G
G
A
C
C
A
G
T
T
G
T
A
A
21550
C
T
G
A
T
T
T
A
G
C
C
G
G
C
C
T
T
G
G
G
A
A
T
G
C
T
G
G
T
C
A
T
G
A
G
C
T
G
G
C
A
G
T
G
A
G
A
A
A
A
21600
G
A
T
G
A
G
T
T
C
C
A
G
C
C
T
G
A
C
T
T
A
C
A
C
C
G
C
T
G
G
A
T
T
C
T
C
A
A
A
A
G
G
C
G
A
A
G
A
G
A
21650
T
G
A
T
A
G
T
A
G
G
G
C
T
C
C
T
G
C
C
A
G
C
C
C
A
C
G
C
T
G
C
A
C
C
T
G
T
G
T
G
T
G
C
A
A
T
A
G
A
A
21700 |
REPEAT
A
C
A
G
C
C
T
C
C
C
C
T
T
T
C
T
C
A
A
G
A
C
A
C
T
C
T
A
C
A
G
C
C
G
T
C
T
G
C
C
A
G
A
A
A
T
T
T
G
A
21750
C
A
T
G
A
T
A
A
A
T
A
T
A
C
A
A
A
T
C
T
G
T
C
A
T
G
T
C
T
A
C
C
C
T
G
T
G
A
G
G
G
A
G
G
T
G
G
C
C
T
21800
C
A
G
A
A
T
G
A
T
A
C
A
G
T
G
C
A
C
A
G
C
T
T
G
C
A
C
A
G
C
C
A
T
A
T
A
T
G
A
C
T
G
T
C
C
T
G
A
A
T
21850 |
REPEAT
T
G
G
G
A
C
A
C
C
A
G
T
T
T
C
T
A
C
T
G
A
A
A
G
T
A
G
A
G
G
T
G
A
C
G
G
A
G
G
A
G
G
C
T
G
T
C
A
T
A
21900
A
A
A
C
A
T
G
A
A
A
G
A
T
T
G
G
T
T
G
A
A
A
C
T
A
T
T
T
A
A
G
G
A
G
C
A
G
T
T
A
G
A
G
C
T
T
T
A
G
C
21950
T
T
C
C
C
T
G
T
C
T
C
A
C
C
T
T
A
G
T
A
G
A
A
G
A
C
T
G
G
A
A
G
T
T
T
A
T
T
C
T
C
T
T
G
A
A
C
G
T
C
22000
A
G
G
C
A
G
A
G
T
T
G
A
G
G
G
C
A
G
G
A
G
A
C
C
T
G
T
A
C
T
A
G
C
A
A
G
A
G
G
A
T
T
A
A
A
T
C
A
A
A
22050
G
T
A
T
A
C
C
C
T
C
T
G
A
A
A
T
G
C
T
G
A
A
T
C
A
C
C
C
T
C
C
T
G
C
T
T
C
T
T
C
C
A
C
T
T
G
G
C
C
C
22100
C
A
G
G
G
A
C
A
C
T
G
T
A
T
T
C
A
G
G
C
C
T
T
T
A
C
C
C
T
C
C
A
G
G
C
A
G
A
A
G
A
A
T
G
A
A
A
G
A
C
22150
A
T
T
T
C
T
T
A
G
G
G
G
A
A
T
A
T
G
T
C
C
A
G
C
T
C
A
A
G
A
G
G
T
A
A
G
A
C
C
C
A
A
A
G
A
T
T
T
T
C
22200
A
C
A
A
T
G
G
G
G
T
T
C
C
C
C
A
G
C
T
A
A
A
C
A
T
C
T
C
A
G
T
G
G
A
T
C
A
C
C
C
T
G
C
A
G
T
G
A
C
A
22250
T
T
C
A
T
A
G
T
C
A
A
T
A
A
G
T
A
C
T
A
C
G
C
A
C
G
T
A
T
T
G
G
A
G
C
T
T
C
A
A
A
T
C
A
C
C
T
T
T
A
22300
A
A
A
T
C
T
C
C
C
A
T
T
C
T
T
T
T
C
T
T
T
T
T
T
C
T
T
T
T
T
G
A
G
A
C
A
G
A
G
T
C
T
C
G
C
T
C
T
G
T
22350 |
REPEAT
C
A
C
C
C
A
G
G
C
T
G
G
A
G
T
G
C
A
G
T
G
G
T
G
C
G
A
T
C
T
C
G
G
C
T
C
A
C
T
G
C
A
A
G
C
T
C
T
G
C
22400
C
T
C
C
C
G
G
G
T
T
C
A
C
G
C
C
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
A
C
T
G
G
G
A
C
22450
T
A
C
A
G
G
C
A
C
C
T
G
C
C
A
C
C
A
C
G
C
C
T
G
G
C
T
A
A
T
T
T
T
T
T
G
C
A
T
A
T
T
T
A
G
T
A
G
A
G
22500
A
C
A
G
A
G
T
T
T
C
A
C
C
A
T
G
T
T
A
G
C
C
A
G
G
A
T
G
G
T
C
T
C
G
A
T
C
T
C
C
T
G
A
C
C
T
C
G
T
G
22550
A
T
C
C
A
C
C
C
A
C
C
T
T
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
A
G
C
C
A
C
22600
C
G
C
T
C
C
C
A
G
C
C
C
A
A
A
T
C
T
C
C
C
A
C
T
C
T
T
A
A
A
C
T
T
A
A
G
C
A
A
C
C
A
A
G
G
A
T
T
A
C
22650 |
REPEAT
C
A
G
T
C
A
T
T
T
G
A
G
G
A
A
A
G
A
T
T
C
T
A
A
C
A
T
G
A
A
A
G
A
T
A
G
A
G
A
C
C
C
A
A
A
A
C
A
A
A
22700
C
A
A
A
T
A
T
T
T
A
A
A
A
A
G
G
A
A
G
T
T
A
G
A
G
A
A
A
A
C
T
G
A
G
A
A
T
A
T
A
C
A
G
A
A
A
G
A
A
G
22750
A
A
C
A
T
T
G
A
A
A
A
A
T
G
C
C
A
T
T
A
A
T
A
A
C
A
T
C
A
G
A
G
A
A
T
G
A
A
A
A
G
A
T
T
A
C
A
T
T
G
22800
A
T
T
A
A
A
C
A
A
G
A
A
C
A
G
G
A
T
A
G
T
A
T
A
A
A
A
A
G
A
C
C
T
T
C
A
C
A
G
A
A
T
A
A
C
A
A
T
A
A
22850
C
A
A
A
A
A
C
T
C
T
T
A
G
A
A
A
T
T
A
A
A
A
A
A
A
A
G
A
T
A
G
T
A
G
A
A
A
T
A
A
A
A
A
C
T
G
A
A
T
A
22900
G
A
G
A
G
T
A
A
A
G
A
A
A
T
A
G
T
T
C
A
G
A
A
A
G
G
G
A
G
A
G
C
A
T
G
T
A
G
A
G
A
T
T
G
A
A
G
T
A
A
22950
G
A
A
C
A
A
A
G
A
T
A
A
G
A
A
A
A
T
G
A
G
A
G
GATCCAC
G
C
C
G
G
G
C
G
C
A
G
T
G
G
C
T
C
A
C
G
23000 |
REPEAT
C
C
T
G
T
A
A
T
C
C
C
A
G
C
A
C
T
T
T
G
G
G
A
TTTAATA TAAATATATT AAATATTTTA
23050
AATAATATAA TTTTAAATAA TATAAATATA TTAAATATAT TAAATATTTA
23100
AATATATTAA ATATTTAAAT ATATTAA
A
T
T
C
A
T
T
C
A
G
T
C
A
A
C
A
A
A
T
A
T
T
T
23150 |
REPEAT
A
T
T
G
A
A
T
A
C
C
T
A
C
T
A
A
G
T
T
C
T
A
G
T
C
A
T
C
A
T
G
C
T
A
G
G
C
A
C
T
G
A
C
T
G
T
T
C
A
A
23200
G
A
A
T
A
A
A
C
A
T
A
G
A
C
A
A
G
A
C
T
T
C
T
G
C
C
C
T
TT CAGAAACCAG CCAAGATTCT
23250
TCAACCCAAG GCAGCTGTGA AGTATAACAC ATCTCTGGCT CAATTTTCTC
23300
CATATGGTTT GGGGTTCCAT GAATGAGTCT GTGGGAAGAT TTCCTGTACG
23350
TAGAATGCAT GCATAAAGGA ATAATTATTC TATGGTTCGA AAGAGAAAAA
23400
AAGTAAGATC AAAGTGTAGA AGCTATGTTA AGAAAGATTC CAGTTTAATG
23450
ATTTTTGAA
A
G
A
G
C
A
G
T
A
A
A
A
A
T
A
A
G
A
C
C
A
A
G
A
T
G
T
G
G
A
A
G
C
T
G
T
A
T
T
G
A
23500 |
REPEAT
G
A
A
A
G
A
T
C
C
A
A
T
T
T
A
A
T
T
C
T
A
A
T
A
A
T
C
T
T
T
A
A
C
A
G
A
A
T
T
G
C
C
C
A
A
A
G
A
T
A
23550
G
A
A
T
G
A
A
T
T
G
T
T
T
T
G
A
G
A
G
A
T
G
G
T
A
C
A
G
T
T
A
G
C
T
G
T
C
A
G
T
G
G
G
A
G
T
G
T
T
C
23600
A
A
TACTAAGC AAG
A
T
T
C
A
C
T
C
G
C
T
C
A
A
C
C
A
T
T
G
A
G
T
G
A
T
G
A
C
C
A
C
A
T
G
C
A
23650 |
REPEAT
G
G
C
A
C
T
G
T
G
C
T
A
G
G
C
T
C
T
C
T
G
T
A
T
T
T
T
C
A
G
T
A
A
G
C
T
A
T
T
G
T
T
G
T
A
C
A
A
C
A
23700 |
REPEAT
A
A
G
C
A
C
C
T
T
A
A
A
A
C
A
G
T
G
T
C
T
T
A
G
A
A
C
A
A
C
A
G
T
A
A
A
T
G
T
T
T
A
T
G
A
T
C
T
C
T
23750
C
A
C
A
A
A
G
T
C
T
G
T
G
G
G
T
C
A
G
C
A
G
C
T
T
G
G
C
T
G
T
G
T
G
T
T
T
C
T
G
G
T
T
C
A
G
A
G
T
C
23800
T
G
T
C
A
T
G
A
G
G
T
T
G
C
A
T
C
T
G
A
A
G
G
C
T
T
G
G
C
T
G
A
G
G
C
T
G
G
A
G
G
A
T
C
T
G
T
T
T
C
23850
C
A
A
G
G
T
G
A
A
T
C
A
C
T
T
T
A
C
A
T
G
G
C
T
G
G
C
A
A
G
T
T
G
G
T
G
C
T
A
G
C
T
G
T
T
C
G
G
A
A
23900
G
G
G
G
C
T
T
C
A
G
T
T
C
C
C
C
C
C
T
G
T
G
T
T
G
A
C
C
T
C
T
T
C
A
C
A
G
G
G
C
T
G
C
T
T
G
C
A
C
G
23950
T
C
C
T
C
C
C
A
A
C
A
T
G
G
C
A
G
C
T
A
T
C
T
T
C
C
C
T
C
A
G
A
A
T
G
A
G
T
G
A
T
C
C
A
A
G
A
G
A
A
24000
A
T
C
A
A
G
G
T
A
G
G
T
G
C
C
A
C
A
A
T
G
C
C
C
T
T
T
A
T
G
A
C
T
T
A
G
A
C
T
T
A
G
A
A
G
T
G
A
A
A
24050
T
A
C
C
A
T
G
A
C
A
T
A
G
C
C
T
A
T
T
G
A
T
C
A
C
A
C
A
T
C
A
G
C
C
ACCATT GTGAGGGCTA
24100
GCTACCACAT CCTGCAAATA CAATGATGAA CAAACAAATG GTGTGCCTGA
24150
GCTCGAGGGC TGTTAGCCTA GAGTTAAGAT CATTCCTTAA ATGATATCAA
24200
AGGAATTAGG GCTGAGAGAC AAGGTTGTTC CCTCCAACCC TAATGCCCTG
24250
TGATCTTATG ATTCCAGGTC CTAGGTCTTT TGTTCCCATA TCACCAATCC
24300
ATTATCTCAG AACCAGCAGC TCACCAAACC CTCGCTCCTC AGATGTGGGC
24350
CATTCGGGGG TTTTTATTGT GCCCCTGCAT TGTCAGCTAC TTGTCATCAC
24400
ACAGCAAGAT AGCATGACTC CACCCTGCAC GCTCTCAGAC AGGAGACAAG
24450
GGAGTGCCCA AATAACCCTA AGCTTGTTGT TTCTGTCCCC AGGACATACC
24500
TCAGCCTCCT CCTGGGAGCA CAGGGCATAC TATGATAACA GCAGGAGGAC
24550
ACGGCGTCCC CATGCAGGAG CTGCTTTTGA TCCTGCCAGG ACAATCCCTC
24600
AAGAGCAGCA AGGCCCTGTA TGTACAGGAA GAGAGTGGCC CTCAGATCAC
24650
TTCCCTTTGC CCTAATCAGG GCCACGTAGT ACACCAGAGA CCTCAGGGGT
24700
AGTGACCCTG GATGTTCTCA GAGGACTTCT GACCTTTGAA CTTGGGGATG
24750
GGCATAGGAG TGTGGCTAGC ATCTCAGCCT GGCCCAAACC AGTGCTGGTC
24800
AGCATGCCTT TGAGGCACTA ACCCAGGCCC ACAGAGGAGT CATCTACTTC
24850
CAAGTCACCT GAGGAGCCAG TGGTGACCAG GGAGATAATG GCCTGAAATT
24900
GTGCTGGCTC TGAGGGACTG TTGGGTAAGC TAATGTGGGC TGGGCTGGAG
24950
TTCAAACACA CCTCCCTGGT GCCCACAGGG ACCTCCCAGC ACACGCTCTC
25000
CTATGGGTCT CAATGATTCA GTCTTTCCTG GTGGCATAGA CTGTAGCCTG
25050
GAGAAGCTCT GGGGAATACT TCTTTTCTTA CTTATTTCTG GTTTTAAAAG
25100
TACTACTCTC AGGATGACTA TTGTTAAAAA CAAAATAAAA CAA
A
A
C
C
A
A
A
25150 |
REPEAT
A
A
A
T
A
A
C
A
G
A
A
A
A
C
A
A
C
A
A
G
C
G
T
T
G
G
A
G
A
G
G
C
T
G
T
G
A
A
G
A
A
A
T
T
G
G
A
A
C
C
25200
C
T
G
G
G
C
G
T
C
A
C
C
A
G
T
G
G
G
A
A
T
G
T
G
A
A
A
T
G
G
T
G
C
C
A
C
T
A
C
T
A
T
A
A
A
A
A
A
A
C
25250
A
G
T
T
T
G
G
C
G
G
T
T
C
C
T
C
C
A
A
A
A
T
T
T
A
A
A
T
A
T
A
G
A
A
T
T
A
C
C
G
T
G
T
G
A
C
C
C
A
G
25300
C
A
A
T
T
C
C
A
C
T
T
C
T
G
G
A
T
A
T
T
T
G
C
T
C
A
G
A
A
G
A
A
T
T
G
A
G
A
G
C
A
A
G
G
A
C
T
G
G
A
25350
A
A
G
A
T
A
T
G
T
G
T
A
C
A
C
C
C
C
C
A
T
T
C
A
T
A
G
C
G
G
C
A
G
T
T
T
T
C
A
C
A
A
T
T
G
C
T
T
A
A
25400 |
REPEAT
T
G
G
A
T
A
G
A
G
T
T
T
T
C
T
G
T
T
T
G
G
G
G
T
G
A
T
G
A
A
A
A
G
A
T
T
T
T
C
A
G
A
A
A
T
A
G
T
G
G
25450
T
G
G
C
G
G
T
G
G
T
T
G
C
A
C
A
A
C
A
T
T
G
T
A
A
A
T
G
T
A
A
T
T
A
A
T
G
C
C
A
C
T
G
A
C
T
T
G
A
A
25500
C
A
C
C
T
A
T
A
A
A
T
A
G
T
T
T
A
A
A
T
G
G
T
A
A
A
T
T
T
T
A
T
G
T
T
A
T
AAA AGTAATACTA
25550
AAGTGGTGGC ATTTTTTTCT
T
T
T
C
T
T
C
T
T
C
T
T
C
T
T
T
T
T
T
T
T
T
T
T
T
T
T
C
T
T
25600 |
REPEAT
G
A
G
A
C
A
G
G
G
T
C
T
T
C
C
T
C
T
G
T
T
G
C
C
C
A
G
G
C
T
G
G
A
G
T
G
C
A
G
T
G
G
C
A
T
G
A
T
C
T
25650
C
A
A
C
T
C
A
C
C
C
C
A
G
C
C
T
C
A
A
C
C
T
C
C
T
G
G
G
C
T
C
A
A
G
C
A
A
T
C
C
T
C
C
C
A
C
C
T
C
A
25700
G
T
C
T
C
T
C
G
A
G
T
A
G
C
T
T
G
G
A
C
C
A
C
A
G
G
T
G
T
G
C
A
C
C
A
C
C
A
C
A
C
T
T
G
G
C
T
A
A
T
25750
T
T
T
T
G
T
A
T
T
T
T
T
T
G
T
A
G
A
G
A
C
A
A
G
T
T
T
T
C
G
C
C
A
T
G
T
T
G
C
C
A
G
G
C
T
G
G
T
C
T
25800
C
G
A
A
C
T
C
C
T
A
G
G
C
T
C
A
A
G
C
A
A
T
C
C
A
C
C
T
G
T
C
T
C
A
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
25850
G
G
A
T
T
A
C
A
G
G
T
G
T
G
A
G
C
C
A
C
C
A
C
A
C
C
T
G
G
C
C
TAAAATGGC ATATTT
T
T
T
T
25900 |
REPEAT
C
T
T
T
C
T
C
T
T
C
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
G
G
A
G
A
C
A
G
A
G
T
C
T
C
A
C
T
C
T
G
T
C
A
C
C
25950
C
A
G
G
C
T
G
G
C
A
G
T
G
G
C
A
T
G
A
T
C
T
C
G
G
C
T
C
A
C
T
G
A
A
A
T
C
T
C
C
G
T
C
T
C
C
C
G
A
G
26000
T
T
C
A
G
G
C
G
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
C
T
A
G
T
A
G
C
T
G
A
G
A
T
T
G
C
A
A
G
C
A
26050
C
G
C
A
C
C
A
C
A
A
A
G
C
C
C
A
G
C
C
A
A
T
T
T
T
T
G
T
A
T
T
T
T
T
A
A
T
A
G
A
G
G
A
G
G
G
G
G
T
T
26100
T
C
G
C
C
A
T
G
T
T
G
G
C
C
A
G
G
C
T
G
G
T
C
T
T
G
A
A
C
T
C
C
T
G
A
C
C
T
C
A
A
G
T
G
A
T
C
C
A
C
26150
C
C
G
C
C
T
C
G
G
C
C
T
C
C
C
A
A
A
G
C
G
C
T
G
G
TAAAA AAAAAAAAAA AAAAAAAAAA
26200
AATCCCATTC CACCACTTGC TAGGTAATGT ACCTCACCAT TCTGAAGTGC
26250
CTCTTCCTCA TCTATTGTGT CTTTGATGTG ATGGTCGATC ATGAGAATTA
26300
TATGAGCCAA TATGGTAATA TAAGGTGGTT TTTAAAAGCA CCTCACACTA
26350
CCAGCATTGC AGTCAGCAAT GAATTGCCAT CCTTGCATTG AAAAGCTCTA
26400
CAACCGTTGC CACTGGGCAT TTCTTATTGG CCATCATTTG TAGGGGGCTT
26450
TCACCTCCTC ATGTATTCCC TGTTTTCCCT TCTGTGACCT CCCCTCCTTT
26500
CCTACTCTTC ATTACCACCT CAAATGCCAG TGCAGGGAAT GCACGGAACA
26550
ATTGACTCCA TCTCTCTGCC TTGTGGTCTC TTTCCAGCCA GTTCCTCTGG
26600
GTGTCCTGGT CAACAGCTAG CATGCACTGT CAATGACATT CTCTCTTTCA
26650
GAGGTATTCT AGGTCTAACT GTTAAGCTAA CAGGATTTTC ACTTCCTTAC
26700
ACGTTCACAT ACCAATATGT ATGTGTGTTC CTTGACCGGG TTGTAAAGTA
26750
CATAATAGTT GAAACCTTGC TTTTTTTTTT TTTCATAAAA TATCTTGGAC
26800
ATCTTCCCAT AAATAAACAG ATTCCCCAAA CCAAACCAGA GTGTTTTGCT
26850
TGAACTCAGG ACCTCATGAG AGATCTGAGA CCAGCTGTTA GAGAATCTCT
26900
GAGAAGGGCA GGTAGAGGCT GTTGCCCAGG TCCCCAGTCA GCTCAGCCGC
26950
TGGAGAGGAG CAGAGGACTG ACTGGCCACA GGGGGACTAA GTCCACCTGC
27000
TAGCTCAGAA CAGGAGGAAG GCTGCTTTCC TTGGAAGATG GGACCAACCC
27050
TAATGAGGAC TAATAAGTGT ATCTCAGACA AACCAATTGA CCAACTGGAA
27100
CAGGCCAGAG ATGAGGGTGT GTCTTCCAAG GCCCAAAGCA AGAGAAACAT
27150
CCCTAAGAGG GAAGGGGTGT GTGGGGTGTG TCAATCAAAG TGATGTGTGT
27200
GTG
T
T
T
G
T
T
T
G
T
T
T
T
T
G
A
G
A
C
A
G
A
G
T
C
T
T
G
C
T
C
T
T
T
C
A
C
C
C
A
G
G
C
T
G
G
A
G
27250 |
REPEAT
T
G
C
A
A
T
G
G
C
C
T
G
A
T
C
T
C
A
G
C
T
C
A
C
T
G
C
A
A
C
C
T
C
C
A
C
C
T
C
C
T
G
G
G
T
T
C
A
A
G
27300
C
A
A
T
T
T
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
T
G
C
C
27350
A
C
C
A
C
A
C
C
T
A
G
T
T
A
A
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
G
G
G
G
G
T
T
T
C
A
C
C
A
27400
T
G
T
T
G
G
T
C
A
G
G
C
T
G
G
T
C
T
C
G
A
A
C
T
C
C
C
T
A
T
C
T
C
A
G
G
T
G
A
T
T
C
A
C
C
C
A
C
C
T
27450
C
A
G
T
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
G
T
G
A
T
C
C
A
T
C
A
C
G
C
C
T
G
G
C
C
T
27500 |
REPEAT
C
A
A
T
C
A
A
G
G
T
T
T
T
T
A
A
T
T
G
C
T
A
G
C
C
A
C
A
G
A
A
A
C
C
T
A
C
T
C
T
G
G
A
A
G
A
C
A
T
A
27550
A
T
C
A
G
A
A
G
A
T
G
A
A
C
T
T
A
C
T
G
A
C
A
T
A
T
T
T
C
C
A
G
G
A
A
T
T
T
C
A
A
G
G
A
A
A
T
G
T
C
27600
T
A
G
G
A
A
A
G
C
T
G
G
A
G
T
A
C
T
T
G
G
T
T
T
G
G
A
A
A
A
T
A
A
A
C
A
A
G
A
A
T
A
A
G
G
G
G
A
G
C
27650
T
G
A
G
G
G
A
G
A
A
A
C
A
T
G
G
C
C
A
T
G
C
T
C
A
T
G
C
C
A
C
A
G
A
A
C
T
A
G
A
G
T
G
A
T
G
G
G
G
A
27700
C
G
G
C
T
G
T
G
C
T
G
T
C
A
C
C
A
C
T
G
A
G
C
CAGGATG GGGACAGCCC AGTTTGCCCA
27750
CCAGCCACTG CCACGCCTGG TCCCTGGCCT GAGGCTCTAG CCTGGCCGCA
27800
AGCTGCCTGT TGCTGCCGCA CTGTTCCTGC CCACCCCAGA AAAGATTCTC
27850
CACTGTCTCT GTTTCTTTAC ACCAGTCTGG GGCAATTACA TCTGATTTGC
27900
TGAGCATAGG TCCCATGTCC CATGCTGGAG GTAAGCGAGG CTAGAAGGCA
27950
GGCGACTGGA GTTGTTAGTG TCTGTAATGG AGGCAGCCTC TGACTCCCAC
28000
CAAAACTCAT CAGCGAGACA CTTTCGAAAC AAAACAGGAG AGCTTCAGAG
28050
GCTGGGAAAG
T
G
T
C
T
T
A
G
A
T
C
T
T
T
T
G
G
G
C
T
T
C
T
A
T
A
A
C
G
A
G
A
C
A
C
C
A
A
A
A
28100 |
REPEAT
A
C
T
G
G
G
T
G
G
A
T
T
A
C
A
A
A
C
A
A
C
A
G
A
C
A
T
T
T
A
T
T
C
C
T
C
A
T
A
G
T
C
C
T
G
G
A
G
G
C
28150
T
G
G
G
A
A
G
T
C
T
A
A
G
A
G
C
A
A
G
G
C
A
C
C
A
G
C
A
T
A
T
C
C
C
G
T
G
T
C
T
G
G
T
G
A
G
G
G
C
C
28200
C
A
C
T
T
C
C
T
G
G
T
T
C
A
T
A
A
A
T
G
G
C
A
C
C
T
T
C
T
C
G
C
T
G
T
G
T
C
C
T
C
G
C
A
C
G
T
A
G
A
28250
A
G
G
G
G
C
A
A
G
G
G
A
T
T
T
C
T
G
T
G
G
G
G
C
C
T
C
T
T
T
T
A
T
A
A
G
G
A
C
A
C
T
A
A
T
C
T
C
A
T
28300
T
C
A
T
G
A
G
G
G
C
T
C
C
C
C
C
T
T
C
A
T
G
A
C
C
T
G
A
T
C
A
C
C
T
C
C
C
A
A
A
T
G
C
C
T
T
C
T
C
T
28350
C
C
T
A
A
T
G
C
C
A
T
C
A
C
C
T
T
A
C
A
G
A
T
T
A
G
A
A
T
T
G
C
A
A
C
A
T
A
G
G
A
A
T
T
T
A
G
G
G
G
28400
G
A
C
G
C
A
A
A
C
A
T
T
C
A
G
T
T
C
A
C
T
G
C
A
GCAGCC AAAAGCAAAG TTACTACCAA
28450
GAGACCTCTG CCAGACCCAG GGGATTGATG AGACATCTTG GTACAGGAAA
28500
GAGAAGGCAT TCCTCATCCA GGAGGACGAG GACTGTGGCA GGACCCTCAG
28550
AGACCTGCAG GATGGCCCTA AGGGTGGTGG GAGAGGTGCA CGGGTCTAAT
28600
CCAGGATTCA TATATAGATA TATTCATATA TATATTCATA TAGATATGAA
28650
TATATCTATA TGAATATCTA TATATGAATA TATATATGAA TATCTATACA
28700
TGAATATCTA TATGAATATC TATATATGAA TATCTATATG AATATCTATA
28750
TATGAATATC TATGAATATC TATATATGAA TATCTATA
T
G
A
A
T
A
T
A
C
A
T
G
28800 |
REPEAT
A
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
G
A
A
T
A
T
C
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
A
A
28850
T
A
T
C
T
A
T
G
A
A
T
A
T
C
T
A
T
A
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
A
T
C
T
A
T
A
T
G
A
A
T
A
T
A
T
28900
A
T
A
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
T
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
28950
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
A
A
T
29000
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
29050
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
29100
A
T
G
A
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
AATAT ATATGAATAT ATATATGAAT
29150
TATATATGAA TATATATATG AATATATATA TGAATATATA TGAATATATA
29200
TATGAAT
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
A
29250 |
REPEAT
A
T
A
T
A
T
A
T
A
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
GAAT
A
T
A
29300 |
REPEAT
T
A
T
A
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
A
T
G
A
A
T
T
A
T
A
T
A
T
G
A
A
T
A
T
A
T
A
T
G
29350
T
G
T
G
T
G
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
A
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
29400 |
REPEAT
T
T
T
T
T
T
T
T
T
T
G
A
G
A
C
A
G
A
G
T
T
T
C
T
C
T
C
T
T
T
T
T
G
C
C
C
A
G
G
C
T
G
G
A
G
T
G
G
T
G
29450
C
A
A
T
G
G
C
G
C
G
A
T
C
T
T
G
G
C
T
C
A
C
T
G
C
A
A
C
C
T
C
C
G
C
C
T
C
C
T
G
G
G
T
T
C
A
A
G
C
G
29500
A
T
T
C
T
C
C
T
G
C
T
T
C
A
G
C
C
T
C
C
C
A
A
G
T
A
G
C
T
G
G
G
A
T
T
A
C
A
G
G
A
G
C
C
C
A
C
C
A
C
29550
C
A
C
G
C
C
C
A
G
C
T
A
A
T
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
G
G
A
G
T
T
T
C
A
C
C
A
T
G
29600
T
T
G
G
C
C
A
G
G
A
T
G
G
T
C
T
T
G
A
T
C
T
C
T
T
C
A
C
C
T
C
A
T
G
A
T
C
C
A
C
C
C
A
C
C
T
C
G
G
C
29650
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
T
G
A
G
C
C
A
C
C
G
C
G
C
C
C
G
G
C
C
CAGGG
29700
TTCATATTAA GAACCAAGTT CTGGGAGACA CTGAGGCCCC CAGGGGTTCT
29750
GGAGTCAGCA CTAAACCAGG CAGCCCTCAG GTGAGGTCTG AGAGCATCTC
29800
AGAAGCTTCC AGACCCTCTT GGAACCCGGC CCACCCTGCA TCTCACTTCC
29850
TTTTGGTGAC TCGTTGCTCT GTTTGGCCTC ATTGGCCCTT TCCTAAAGCC
29900
ATCCAGTAAA CCTCCAGGTG CCTGGTGGGT TTGTCAAGGT GTTTCCCCAT
29950
AGGTTCCTGT GCTTCTAGTG GGGGTAGGTA TCAAAGTGGC CGCAGCAGCC
30000
TTACTGGCCT GGTATGCAGG AGCGAGAGCA GCTGGGGCAG GGGAGACCCG
30050
GGGGTCTACC CTTGGCCAGC TGTGGCGGTC TCATTAATAG AGTGTGGCCT
30100
GGAGGTGAGC GGGATGAGGA AGTCACATTT ACTGCAGCAC CAGCTATTGC
30150
ACGGTTATTC CGTGCCTGAT GT
T
A
A
G
T
G
C
T
T
G
A
C
T
C
C
G
T
T
G
C
T
C
C
A
T
C
A
A
30200 |
REPEAT
A
C
C
T
T
C
A
G
C
A
C
A
A
C
T
C
C
T
C
T
C
T
A
A
G
C
A
G
A
G
G
A
T
T
G
T
C
A
T
T
A
C
C
C
A
T
G
T
C
T
30250
T
C
C
A
A
A
A
G
A
A
G
C
T
C
A
A
T
T
A
T
T
C
G
C
C
C
C
A
C
A
T
C
A
C
C
C
G
G
T
A
G
T
C
G
G
T
C
A
A
A
30300
A
A
G
C
C
A
G
G
A
T
T
C
A
A
A
C
C
T
A
G
G
C
T
T
G
T
C
T
G
A
T
T
C
C
C
A
A
G
CA TACATCCCTC
30350
CCACCAGGCC AGCACCAGGG GCCAGAGGGG TATAGGCCTG GGCCAGACCA
30400
TGCTGGGGGC CACTACCCTG AGGGCACCTT GAGAGGGAAG GAGGAGGCCC
30450
CAGGACACCC TCCAAGGGGA CTTCCCCCTC CTCCTTAGCA TGTTCCAGGC
30500
CTCTTCCAGG AAAGCCCTGA AGGGCCTGCA TCCCCCTGTG GCGAGGAAAG
30550
GGAGGTGGCT AGACTCGCAG GGCTGGGGTC CACGGGTAAC AGGCTGTGGA
30600
GTCGTTTGAG ACAAATGCTT GAACTGGCTC TTTCCTTGTG AGGTTTTCGA
30650
GTCTCTAAGG GTGGGGTGAA AGACCACTCC AAGGCTGCAG CTGCCCACAG
30700
CTGGCTGTGC TGATGAGACT GCTCAGGCCC CCTGACTCTC TGCCCCTAAT
30750
CCCAGTGGCT GCAAACTCAG CCCCAAACTG GCAGGCCACC TGGCAGGAGG
30800
CCCCTGGACA GAGCAGACCA GGGCCCCGCA GGCTCCAGCA CCTGCTCTGC
30850
CCCTCCTCCC CAGACTCGTC TCGGTCCCCA AGCCTCAATC CCCAATTCTG
30900
GATGGAGACC ACAGGTTTAG CAGTCAAGAA AGTTCTGAAA TAAGCATGAA
30950
TCGCTTTCAT AATGGAACTG GGGTGGGGTG GGTTAAATGA CACTACATGA
31000
ATAAGTTAAA GGCACTGAAG ATCTCGTGCA AGGAGGAA
C
C
A
T
C
T
C
C
C
C
C
A
31050 |
REPEAT
G
T
G
T
A
C
T
G
T
G
A
G
C
C
C
C
C
C
A
G
G
G
C
C
A
G
G
G
A
C
C
C
T
G
T
T
C
T
G
C
T
C
G
C
C
T
C
C
A
G
31100
G
T
G
T
C
C
C
C
C
T
C
A
G
T
G
T
G
A
G
G
C
A
G
A
G
T
C
G
G
G
C
A
C
A
T
G
G
C
T
G
C
A
C
A
A
A
A
G
C
T
31150
G
T
T
T
G
C
T
G
G
T
T
G
T
C
C
G
G
T
G
C
C
T
A
G
A
A
T
A
A
T
G
A
C
T
G
G
C
G
C
A
T
C
G
G
A
G
G
C
A
T
31200
T
C
A
A
T
G
C
A
T
A
T
T
T
G
T
T
G
G
C
T
A
A
A
T
G
A
A
T
AT AAGTAGGTTG CATATTTTGT
31250
CTTGGGATAA TGTGTGACAT CTTTTTCTGC CTCCCCTTAG AGGACTCCCC
31300
ATGTACTCGC CTCCAGTCCA CCAGGCCCTT ACCTCTCCTG CAGTCTCATC
31350
TCCCCAATCC TGGGTGTCTC CTGCCAGGCC GCTGCTAAGC TCTGCCCAGT
31400
CATGGTTACC TCCCCACCTC CCCAGCTGCA AGGAGGATTC ATCCGGCAAC
31450
CAGTTGAGGC TAGATTCTCA GCAGCTTTAT TGAATATTGA GAGGTGGTCT
31500
CACCTCCCAC TGGACAT
G
TG TCCTCAAGTT CATACCAGAA CTTCTTTGGG
31550 |
var(31518):[A:0.01]
ACAGACAGAC AG
A
C
A
G
GTGG CAGGGCAGGG CAGGTGTCCA CGAGGGTGGG
31600 |
var(31563):[:0.37]
GCCAGTGCAC CAGGGGCAGC TCAGCTCTCC AAAGGGGCCA GCATCTGCAC
31650
M F L K A V V L T L 376
CTGCTCCTGC TGCTGCTCCT G
C
TGCTGTTC CTGCTGTTGC TCCAGCTCAG
31700 |
var(31672):[A:0.04]
A L V A V A G A R A E V S A D Q V 393
GGAGGGAGAG AG
T
CTTGTCC TGGCTCTCTT TCTCCTTGAA GGTGCTGAAG
31750 |
var(31713):[A:0.15]
A T V M W D Y F S Q L S N N A K 409
AAGGAGTTGA CCTTGTCCCT CAGGTCCTTC TCCAGGAAGC TCAAGTGGCC
31800
E A V E H L Q K S E L T Q Q L N A 426
TTC
C
ACGTCC CCCGCATGGG GGCCCAGTTT CTGCCTGAGC TGTTCCATCT
31850 |
var(31804):[A:0.04]
L F Q D K L G E V N T Y A G D L Q 443
GCTGCACCAG GGCTTTGTTG AAGTTTTCCC CGTAGGGCTC CACCCGGCGT
31900
K K L V P F A T E L H E R L A K 459
CGGAACTCCT CCACCTGCTG GTCCAGGTGC CCACCCAGCT CTGCCAGTGA
31950
D S E K L K E E I G K E L E E L R 476
CTTCTGCAGC CCCTC
G
GTGT TGCCCCTCAG GTTGCCACGC ACGTCCTCGG
32000 |
var(31966):[A:0.01]
A R L L P H A N E V S Q K I G D N 493
CCAAGGGCGC CAGCCTCTGC CGCAGCTCCT CGGCACTGGC CGAGATCCTG
32050
L R E L Q Q R L E P Y A D Q L R 509
GCCTTGAGCT CCTCGGCGTT CTTCTTCATC TGGAAGGTCA GGCCCTCAAG
32100
T Q V N T Q A E Q L R R Q L T P Y 526
CTGGTGGTTG AGCTTCTCCT GCGTGTCCTG AGCATAGGGA GCCAGGCTGC
32150
A Q R M E R V L R E N A D S L Q A 543
GGCGCAGCTC CTCCACGGTC TGGTCAATCT TGACTTTGAA TTCGTCAGCG
32200
S L R P H A D E L K A K I D Q N 559
TAGGGCGTAA GGCGTCCCTT GAGCTCCTCC ACGTTCTGGT CGATCTTGGC
32250
V E E L K G R L T P Y A D E F K V 576
CTTGAGCTCG TCGGCGTGGG GCCTCAGCGA GGCCTGCAGG CTGTCGGCGT
32300
K I D Q T V E E L R R S L A P Y A 593
TCTCCCGCAG CACTCTCTCC ATGCGCTGTG
C
GTAGGGGGT CAGCTGGCGC
32350 |
var(32331):[A:0.04]
Q D T Q E K L N H Q L E G L T F 609
CGCAGCTGCT C
G
GCCTG
C
G
T
G
T
T
G
A
C
C
T
G
G
G
T
G
C
G
C
A
G
C
T
G
G
T
C
C
G
C
G
T
A
32400 |
var(32362):[A:0.03]
|
REPEAT
|
var(32372):[C:0.14]
Q M K K N A E E L K A R I S A S A 626
G
G
G
C
T
C
C
A
G
G
C
G
C
T
G
C
T
G
A
A
G
C
T
C
T
C
G
C
A
G
G
T
T
G
T
C
C
C
C
G
A
T
C
T
T
C
T
G
G
C
32450
E E L R Q R L A P L A E D V R G N 643
T
C
A
C
C
T
C
A
T
T
G
G
C
A
T
G
G
G
G
C
A
G
C
A
G
C
C
G
G
G
C
C
C
T
C
A
G
C
T
C
C
T
C
C
A
G
C
T
C
C
32500
L R G N T E G L Q K S L A E L G 659
T
T
C
C
C
A
A
T
C
T
C
C
T
C
C
T
T
C
A
G
T
T
T
C
T
C
C
G
A
G
T
C
C
T
T
G
G
C
C
A
G
G
C
G
T
T
C
A
T
G
32550
G H L D Q Q V E E F R R R V E P Y 676
C
A
G
C
T
C
G
G
T
G
G
C
A
A
A
G
G
G
C
A
C
C
A
G
C
T
T
C
T
T
C
T
G
C
A
G
G
T
C
A
C
C
T
G
C
G
T
AAG
32600 |
var(32581):[A:0.01]
|
var(32590):[G:0.05]
|
var(32596):[A:0.08]
G E N F N K A L V Q Q M E Q L R Q 693
TGTTCACTTC TCCAAGTTTG TCCTGGAAGA GGGCACTGTG GGGAAGGGCA
32650
K L G P H A G D V E G 704
CAAGGAGGCC ACGTTACATT TGGCATTTAC ACGGCAAGAC TTTGTCTGCT
32700
TGTATACCAC TCGCCTTATG CACACGTGCT A
A
GACAGGTG AGTGCTCAGA
32750 |
var(32732):[G:0.39]
GGTACCGAGT TCACCCTCCC TATGGTGGTG CAGATTGGAG AGGATGGGTG
32800
TCACAGACTA AGTTACGATG CTGAATTTCT ATCTCAGGAT CTCCCACATA
32850
G
TTTGTCTCA GAATCTACCC ACCATGTCAC CTCCAGCAGT TTGCTTTCCC
32900 |
var(32851):[C:0.04]
CTCCTTGTCT GCAGAGGGTG GGACTTTGAG GTCCTTCCCA GAGGGAGTGT
32950
TCTGGGCCAT AGGATCACAT TGTGTAACAC AATGTTGTCT TCTTGACCTC
33000
TGCTGGGGTC TTCCAGTGGC ACAAGGTGCT GCTTTCCTTG CCTGTGCCCA
33050
GGGGCCTCCC AAATATTCCC ACCTCCCTCC CCTTGATTCC ACTGGGGCCT
33100
GTCTTTCTGA AACGTATTAG AAATCAGCTC AC
A
TGACATT TTCCTCCCTC
33150 |
var(33133):[G:0.39]
CCTCTCTTCC ATGAGCCTGA ATGATAGGAT GTGGCCATTT GTCAAACTC
T
33200 |
REPEAT
A
G
A
A
C
G
T
C
A
A
A
G
C
A
A
A
A
G
G
A
C
A
T
G
G
C
A
G
G
G
A
T
T
A
T
T
C
G
G
T
C
C
A
A
A
C
T
T
C
C
33250 |
var(33201):[G:0.16]
G
G
T
T
T
A
C
A
T
A
T
G
T
G
G
A
T
C
C
T
A
G
G
T
T
C
A
G
A
G
G
C
C
C
G
G
C
C
A
G
T
T
A
G
T
G
G
C
A
G
33300 |
var(33251):[A:0.04]
G
G
C
A
G
T
G
G
G
T
A
T
C
C
T
A
A
G
C
T
C
A
G
G
G
C
T
C
C
T
G
T
C
T
C
T
A
A
G
C
T
C
AACCCTTG
33350
CCAGTACATT GCATGGCCTT TAAGAATTCC CCGTCACCAC
C
G
CACACTGT
33400 |
var(33391):[T:0.08]
|
var(33392):[A:0.04]
AGTCCCTCTT ACTTGAGTTG CTGGGTGAGT TCAGATTTCT GGAGATGTTC
33450
H L S F L E K D L R D K V 717
CACGGCCTCC TTGGCATTGT TGCTCAGCTG GCTGAAGTAG TCCCACATCA
33500
N S F F S T F K E K E S Q D K T L 734
C
T
GTGGCCAC CTGGTCAGCA CTGACCTCAG CCCTGGCTCC TGGGTGGTAA
33550 |
var(33502):[C:0.18]
S L P E L E Q Q Q E Q Q Q 747
CAGAGAGCAA TTCATGAGGC CCCGTCTCCT GTGTGGCCCC TCTGCTCCAG
33600
CTCTGAGCTG CAGACTGGAT GATGGTGGGG GTGCCTCAAC CTGCCATTTT
33650
CCCTGTCTGA GCTTAGCTTT TTGGAAGCCA CAGGGATATG TGAAACTGGT
33700
ATAGCACCAA TGCCAATGGG CCCACCACTG GGACTGGGCC AGCTCTCCTG
33750
TGAGCCACTT GGCAGCCAGG CAGACCTCAT GTCCAGCTGG GGGCTGATGG
33800
GCACTCAGAA GTGTCCTTTT GCTTTGGCTC AGCCTCCATC CTGCACTACT
33850
CAGAGCAGCA GCCCAGGAGT GCCATCCAAA GACAGCTTCT ACTCACCGGC
33900
E 748
GACAGCCA
C
C AGGGCCAGGG TCAGGACCAC GGCCTTCAGG AACATCCTGA
33950 |
var(33909):[T:0.01]
Q Q Q E Q V Q M L A P L E S 762
GCTGCTTGCT GGGCTGGAGG AGTTTCTTGC CACACTGGAT CCTCCCTACA
34000
ATCAGGGGAG CTGACAGAGA GGTCCTCAGG AGAGCTCACC TG
C
GCTGCAG
34050 |
var(34043):[T:0.04]
TGGGAACTGA CTGAAGCTCA GAGCCAGCCA GACATTTAAA CT
C
TCTCCCT
34100 |
var(34093):[G:0.05]
ATCGCCACCC CCCTGGCTGC CCTCCCCTCC TCCTTCCTCA GTGTGACTCC
34150
ACGCTGGAAG GTGACACATT CCCAAGAGGC CTCTTGGACT TTTGTGACCC
34200
TGAGACTACG TGGAAGCTGA CAGCAGATCA GGGCTCAGGC GATAGTTAGA
34250
AGTGGTGGCT GTTCCGTGCG TGGGCACCCC CACCCCCACC CCACCAACCT
34300
CAGCATGGAA GGGAGGAGGG GAACGGAAAA CAGTAGGAGA AACACCTCAG
34350
GGAGCCCATG AGCCCGGGAC TGATGCCT
C
G AGGCCCTGCT GGCGGCTCTG
34400 |
var(34379):[T:0.01]
GGTTTGTGCC GGGCTCATGG GGGGCAAAGT CCACATCTCC TCTGTCTACC
34450
CCAGTGAAGA CAGGTCCCCC CAGTCTCCAC ATCCGCCACT AATGCTGACT
34500
CCATCTCAGA GCACCAGACA CTGGAGTGGG ACCATCCAGG ACTGCAGACA
34550
GCATAGTGAG CTAAAATCTA GGTCTCTTGT CAAGGTACAC CCAAGAGGAG
34600
GGGGCCCTGG AGAGCCTTTG GGCAGATGGG TGTGGGAGAC AGGGCCTTGT
34650
CCTGAAAGGG TTAATTGTAG AGTGCTTCAT GTC
C
C
A
C
T
C
C
C
A
T
G
C
T
G
T
G
C
34700 |
REPEAT
C
T
C
C
T
T
G
G
T
C
T
A
G
T
T
A
T
T
C
A
A
A
G
C
C
C
T
G
A
G
C
T
T
C
A
G
T
T
T
C
T
C
C
T
C
T
G
A
A
C
34750
A
A
T
G
G
G
A
T
G
G
T
T
CAGTACAT TAAAGAAGAC ATTGCTTGCG AAGTGGGTAG
34800
TA
C
G
G
T
C
A
T
C
T
C
T
C
A
G
T
C
T
C
C
A
T
G
G
G
A
A
A
T
T
G
A
T
T
C
C
A
G
G
A
C
C
C
T
C
T
G
C
A
34850 |
REPEAT
G
A
T
A
C
C
A
A
A
A
T
C
C
A
T
G
G
A
T
G
T
T
C
A
A
G
T
C
C
T
T
G
A
T
A
T
A
A
A
A
T
G
G
C
A
T
A
G
T
A
34900
T
T
T
A
C
A
T
A
T
A
A
C
C
T
A
A
G
C
A
C
A
G
C
C
T
C
C
C
A
C
A
T
A
C
T
T
T
A
A
A
T
C
T
C
T
A
G
A
T
T
34950
A
C
T
T
A
T
A
A
T
A
C
C
T
A
A
T
A
C
G
A
C
G
G
A
A
G
T
G
C
T
G
T
G
T
G
A
A
T
G
G
T
T
G
T
T
A
T
A
C
T
35000
G
T
A
T
T
G
T
T
T
A
G
G
A
G
G
T
A
A
T
C
A
C
C
A
A
A
A
A
A
G
T
C
T
G
T
A
T
A
T
G
T
T
T
A
G
T
A
C
A
G
35050
G
T
G
C
A
A
T
T
T
T
T
T
T
T
T
T
T
T
T
T
G
A
G
A
T
G
G
A
G
T
C
T
T
A
C
T
C
T
G
T
C
A
C
C
C
A
G
G
C
T
35100 |
REPEAT
G
G
A
G
T
G
C
A
G
T
G
G
C
A
C
G
A
T
C
T
T
A
G
C
T
C
A
C
T
G
C
A
A
T
C
T
C
T
G
C
C
C
C
C
C
A
G
G
T
T
35150
C
A
A
G
T
G
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
C
C
35200
C
A
C
C
A
C
C
A
C
G
C
C
C
A
G
C
T
A
A
T
T
T
G
T
G
T
A
T
T
T
A
C
A
G
C
A
G
A
G
A
C
G
G
G
G
T
T
T
C
A
35250
C
C
A
T
G
T
T
G
C
T
C
A
G
G
C
T
G
G
T
C
T
C
C
A
A
C
T
C
C
T
G
G
C
C
T
C
A
A
G
C
A
A
T
C
C
T
C
C
C
A
35300
T
G
T
G
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
C
T
T
A
T
A
G
G
C
G
T
G
A
G
C
C
A
C
T
G
C
G
C
C
C
T
G
35350
C
C
C
A
G
A
T
G
C
A
A
T
T
T
T
T
T
TAA AACATTG
T
T
T
T
T
A
A
T
G
G
A
G
A
T
G
G
G
G
T
G
T
C
G
35400 |
REPEAT
|
REPEAT
C
T
A
T
G
T
T
G
C
T
C
A
G
G
C
T
G
G
T
C
T
C
C
A
A
C
T
C
C
T
G
G
C
C
T
T
A
A
G
T
G
A
T
C
C
T
C
C
A
A
35450
T
C
T
C
T
G
C
C
T
C
C
T
A
A
A
A
T
G
C
T
G
G
G
C
T
T
A
C
A
G
G
C
A
T
G
A
G
C
C
A
C
T
G
C
A
C
C
C
G
G
35500
C
C
CACAGATG CAATTTAAAA AAAAAAAAAA AAAATATATA TATATATATA
35550
TAT
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
G
A
G
A
T
G
G
A
G
T
C
T
T
G
C
T
C
T
G
T
C
A
C
C
C
A
G
G
C
T
G
G
35600 |
REPEAT
A
G
T
G
C
A
G
T
G
G
C
A
C
G
A
T
C
T
C
G
G
C
T
C
A
C
T
G
C
A
A
G
C
T
C
T
G
C
C
T
C
C
C
G
G
G
T
T
C
A
35650
C
G
C
C
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
C
A
A
G
T
A
G
C
T
G
G
G
A
C
C
A
T
A
G
G
C
G
C
C
C
G
35700
C
C
A
C
C
A
C
G
C
C
C
G
G
C
T
A
A
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
35750
T
G
G
G
G
T
T
T
C
G
C
C
G
T
G
T
T
A
G
C
C
A
G
G
A
T
G
G
T
C
T
C
G
A
T
C
T
C
C
T
G
A
C
C
T
C
G
T
G
A
35800
T
C
C
G
C
C
C
G
C
C
T
C
A
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
A
G
A
T
T
A
C
A
G
G
C
G
T
G
A
G
C
C
A
C
C
35850
A
C
G
C
C
C
G
G
C
C
TAAAAAAT
T
A
T
T
T
T
C
A
A
T
C
C
A
A
G
G
T
T
G
G
T
T
G
A
A
T
C
C
A
T
G
G
35900 |
REPEAT
A
C
A
C
A
G
A
A
A
C
C
A
A
G
G
A
T
A
C
A
G
A
G
G
G
C
T
G
C
C
T
ATAGAGCTC CTGGCACAGA
35950
GTAAGCTCTG AAAAATAGTT GCTGTTATTA TAATAATCAT ACTGTATAAT
36000
AATCATACTG TGTTCCTGAT TTGGAGCAAA GCAGGGAGGG TAGCAGAGTG
36050
CTCCCTCCTC TAGTGAGAGT GGCAGAATGA GACTCAGCCC TCTGAGGGGC
36100
GCCAGGTGGG CCAGAGGCAG GGTGATGGGT GGGACCAGCC TGAGGCCTGA
36150
CTCCTGCCCT TCTTGCCCCC CAGCCACACT CCCTGGGCAG GAGCAGCTGG
36200
CTTGAGCAGA ATCTTGGGAC CTGAGGCTCT CAGGGGACCT CCCATTGGGG
36250
GATGGAGGGC AATGGTGGTG GTGCCCAGGA GGTCTTCTAC TTAGATGTCT
36300
ATTGGATCTC TAAATGAGGC TGCATGCATA ATCACACACA AACATCCACT
36350
GAGAAGGTGA CACACCACGT CAGCATGGGT CCCTCTGCCG GACCACACCA
36400
CTCCTAGTGA CTATGAGGTG ACATCCAGGC ACGTTGCACT ATTGGCTCCT
36450
GTCGGTGAGT GCAGTGCCTG ACAACAGTGA GCTACATTTA TTTGTAAAAA
36500
TGAACGCCAT CAGAGTAGAC CACAATTGTA CTAACTCTAA TTTGCTTTGT
36550
GTTCATTTTT TCAGTTTCCA GAAGTGGCTT AATGTTTCCT AGGGTCAAAG
36600
GCAGTCAAAT GACCTCCTGA CTCTGGCACC CCTTCTGCTG GGTCCCCACT
36650
GCCCTGTAGT GGTCCCCACG CTACCATGCT GCCTCCTTTT TGATGCAGCC
36700
TGTGCCATCT CTCTGTGATT GTTGGGGT
C
A
G
A
T
G
C
T
G
G
A
G
T
C
C
A
C
T
C
C
C
T
36750 |
REPEAT
G
G
C
T
T
G
G
C
A
T
C
C
A
G
G
C
T
C
C
A
A
C
A
C
G
T
A
C
T
G
C
C
T
G
T
G
T
G
T
C
C
T
T
G
G
G
C
A
A
G
36800
T
C
T
C
A
T
A
A
C
C
T
C
T
C
T
G
A
G
C
C
T
C
A
G
T
T
A
C
C
T
T
G
G
T
G
A
G
A
C
A
T
A
A
C
C
A
T
T
G
T
36850
A
C
C
T
G
C
C
T
C
C
T
A
G
G
C
T
G
T
G
A
G
G
A
T
T
C
A
C
T
G
A
G
A
T
G
A
T
C
T
T
A
T
A
G
T
G
C
T
T
G
36900
C
A
A
C
A
A
T
G
T
C
T
G
G
C
A
C
A
T
A
G
T
A
A
A
A
G
T
G
A
T
C
A
C
T
A
A
A
T
G
T
T
A
G
CCACGTC
36950
TTACCCCTGC AAGGCTCACC TCCCTGGAAC CCATCGGTCC CAACCCTGCT
37000
CCTGAATCAG GCACAGTCCA GCTTGCAGCG GGAGCAAAGG TCAGTACTCA
37050
GTGCCCCTGT CCCTTCCCCA GGCCAGAGGG GAGGAGGAGA CTGAGTCACG
37100
AATGACACCT CAGCCGCAGT TTGACCTCCA GGACTTACAG TCCTAGCAGC
37150
AGGTGCCACT AGCATGTGAG AGGTCCAGAG GCGCTTCTGT CTCACCCGCC
37200
CGCCTGGGTG CACCCATGCT GGGAGCGCCT GCACCATTTG AGCATGTCCG
37250
AGAGCATCCA CCAGAGTGTG TGTGGATTCA CAGAAGTGTG CAAATCACTA
37300
AGAACCAAGG GACTGGCACA GCCCATGCGT GCACCCACGC TCGCGAGGGG
37350
ACCTCCTGCC TTTCAACGTG GCGGGGATGT GACCTGTTAA TGAATGTATT
37400
TACTTCCCAA AGTCTGAGGG TACGTTTTGC ATCAATCTGT AGATGGATTT
37450
GTTTTGGGGA GCAGGGAGAG AATGAGAGCC CCCTGTGCTC AGTCTTAGAG
37500
GGTGCAAGTA GCTGATGGGA AGAGCAGACT GCCTTCCAGC CAGGCCTGGT
37550
CCTGTGAGTC AGGGACGTCC ACCTTAGTGG GCATGAAAGG C
C
T
G
T
G
T
G
A
T
37600 |
REPEAT
C
T
C
G
A
G
G
G
A
G
A
C
A
T
C
G
C
C
T
C
T
C
C
A
A
G
C
C
T
C
T
C
C
T
T
A
T
C
T
G
T
G
C
A
A
C
A
G
G
C
37650
A
G
A
C
T
T
A
A
T
G
A
T
T
G
G
T
G
A
G
G
CAATGAGGCT GATAGCTCAG CATTAGCTAC
37700
AGCCACCCCT CCTGGCCAAC CACACAGGGA TCAAACCAGG GGTCAGTCCA
37750
GAGGTCAGAG TCAGGAGCAG AAAACTCAGA TCCAGCCAGG GACAGGCAGG
37800
TCACACGGAC ATGTGCCTCA CGTATGCTTC AAGGGGCCCT CCCCCGGGCA
37850
GAACTGAAGG ACAGCTCCTG TTGCCATAGG AAGGAGCTGG GTGAGATACT
37900
AGGAGGAACT TCCGGCATGA TGATGTGTGA TGAACAAGGG CCTCTGGCCA
37950
ACAGGTCTGA ATCAGGGCTG CCCAGCCCAG CCTGGTGGGA AGGGCATGGA
38000
GCATGGGGGC TCATGTACTA AACCTCACCT GGACACAAGG TGAAACAGCC
38050
CAACCCCAGA GGACCATTTT TGGCCCCGGA TGGTCAAATC CCCTCTTCCT
38100
CCCATCTACC ACTGGCTTCT CCCTGGAGCA GTCTTCATCC CAGGGGAGCC
38150
ATGATGGGAG AGAGGGGCAG CGCAGGCTGG CCACCAAGAG ATCCCCTGCC
38200
GGGGTGCAGG TTGGACTGTT GGTGAGGGGC CACAGGTATT CTCAGGTACC
38250
AAGCCCTTGG AAGGAGACAA GGTACCAGGC TTCCTGGAGG TGTGCTACAT
38300
CTAGCTCAGC ACCCTGCCAG GTCTCTCTAC CCACATGTCC TGACCTCCCT
38350
GGGTCCGTTG CCATGCGGGA GAGAGAGGCC AGGCTCCTCC AGACCCTCTG
38400
CAGAGATGGA AAGGCTTGGA GGGTCTGGGG CCACGGGACC CCGCCAGCCC
38450
ATTCTAGCAC ACCCGGGCCC ATAGACCTTG TTGCCTGCCC CTGCCTGGAT
38500
CTGGGTCCCC ACTGTGCCTT TGCCTCTGGG GCTATGGAGC AGGCCGCAGC
38550
AGAAGAGGAA AGGGCATCCC CAATACCAAA TCCTCCAGTG ACCACTTCTT
38600
CACCTTCTAC CCCACCACCA AAGTCTGCAG GAGACTTGAG ACAGGTTTGT
38650
TCTGGGCGTG TGACTGATGC CTCTATAGGG GTCTCAGTGC TCTAAGCCGT
38700
CTGGTATTTG CCTG
G
G
G
T
G
T
G
T
G
A
A
G
A
C
C
T
G
G
A
T
T
A
A
G
G
T
T
C
C
C
A
G
C
C
T
T
38750 |
REPEAT
A
C
T
A
C
T
A
A
T
G
G
G
C
T
G
T
G
C
A
C
T
T
G
G
A
G
C
C
C
T
T
A
G
A
G
C
C
T
T
A
G
G
T
T
T
C
T
A
A
C
38800
C
T
A
T
A
A
A
A
T
G
G
A
C
T
T
A
A
C
G
T
C
T
A
C
T
T
C
A
C
A
G
G
G
T
T
CTATT TGCATTTTAA
38850
CAGAAAACAA AGTCTTAAGT CAAAGGAATG AATCTCTCTC TCTCTCTCTC
38900
TCTCTCTCT
T
T
T
T
T
A
G
A
C
C
A
A
G
T
C
T
A
G
C
T
C
T
G
T
C
A
C
T
G
G
A
G
T
G
C
A
A
T
G
G
T
38950 |
REPEAT
G
C
G
A
T
C
T
C
T
G
C
T
C
A
C
T
G
C
A
A
C
C
T
C
C
A
C
C
T
C
C
G
G
G
G
T
T
C
A
A
G
C
A
A
T
T
C
T
C
G
39000
T
G
C
C
T
C
A
G
C
C
T
C
C
T
G
A
G
T
A
G
C
T
G
G
G
A
C
T
A
C
A
G
G
C
G
T
G
C
A
T
C
A
C
C
A
T
G
C
T
C
39050
G
G
C
T
A
A
T
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
T
G
G
G
T
T
T
C
G
C
C
A
T
G
T
T
G
C
C
C
A
39100
G
G
C
T
G
G
T
C
T
C
G
A
A
C
T
C
C
T
G
G
C
C
T
C
A
G
A
T
A
T
C
C
G
T
C
C
G
T
C
C
C
A
G
C
C
T
C
C
C
A
39150
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
T
G
A
G
C
C
A
C
T
G
T
G
C
C
C
A
G
C
C
A GGAATGGATC
39200
GCTAATAGAG GAATTCCAAG TCTCACCCAC CGATAAAGAA TTCTGAGGGC
39250
AGAGCCGGGC CACTTTCTCA GGCCTCTGAT TTCATACTGT GGTGTTAGTT
39300
ACTTCTGAGA GGACAGCTTG CTGCCAGAGC TCTATTTTTT ATGTTAGAGG
39350
CTCCTTCTGC CTGCAGACTC TGCTGTCTGG GAAGGGCACA GCGTTAGGAG
39400 |
var(39372):[T:0.02]
GGAGAGGGAG GTGTGAGTCC CTCC
G
TGGAC
C
CGCT
G
C
T
T
T
G
T
A
C
T
T
C
T
C
T
39450 |
var(39425):[A:0.02]
|
var(39431):[A:0.05]
|
REPEAT
A
T
C
T
C
A
T
T
T
C
C
T
T
T
T
C
A
G
C
A
C
C
A
C
T
C
T
G
G
G
A
A
A
T
C
A
G
T
A
T
T
C
C
A
G
C
C
C
C
A
39500 |
var(39481):[C:0.46]
T
T
T
T
A
T
C
C
T
C
A
G
A
A
A
A
T
T
G
A
G
G
C
T
C
T
G
A
G
A
T
G
T
T
A
T
C
T
C
T
G
T
G
A
C
C
T
G
G
G
39550
T
C
C
T
A
T
T
A
C
G
T
G
C
C
A
A
A
G
G
C
A
T
C
A
T
T
T
A
A
G
C
C
T
A
A
G
A
T
G
T
C
C
T
G
G
C
T
C
C
A
39600
A
G
G
T
GTCAGC ATCTGGAAGA CAGGCGCCCT CATCCTGCCA TCCCTGCTGC
39650
GGCTTCACTG TGGGCCCAGG GGACATCTCA GCCCCGAGAA GGGTCAGCGG
39700 |
var(39684):[T:0.04]
CCCCTCCTGG ACCACCGACT CCCCGCAGAA CTCCTCTGTG CCCTCTCCTC
39750 |
var(39716):[T:0.01]
|
var(39717):[A:0.09]
ACCAGACCTT GTTCCTCCCA GTTGCTCCCA CAGCCAGGGG GCAGTGAGGG
39800
CTGCTCTTCC CCCAGCCCCA CTGAGGAACC CAGGAAGGTG AACGAGAGAA
39850 |
var(39830):[T:0.01]
TCAGTCCTGG TGGGGGCTGG GGAGGGCCCC AGACATGAGA CCAGCTCCTC
39900
CCCCAGGGGA TGTTATCAGT GGGTCCAGAG GGCAAAATAG GGAGCCTGGT
39950
GGAGGGAGGG GCAAAGGCCT CGGGCTCTGA GCGGCCTTGG CCCTTCTCCA
40000
CCAACCCCTG CCCTACACTC AGGGGGAGGC GGCGGGGGGC ACACAGGGTG
40050 |
var(40020):[C:0.44]
|
var(40031):[G:0.44]
|
var(40035):[:0.56]
GGGGCGGGTG GGGGGCTGCT GGGTGAGCAG CACTCGCCTG CCTGGATTGA
40100
AACCCAGAGA TGGAGGTGCT GGGAGGGGCT GTGAGAGCTC AGCCCTGTAA
40150
CCAGGCCTTG CCGGAGCCAC TGATGCCCGG TCTTCTGTGC CTTTACTCCA
40200 |
var(40178):[T:0.49]
AACACCCCCC AGCCCAAGCC ACCCACTTGT TCTCAAGTCT GAAGAAGCCC
40250 |
var(40205):[T:0.45]
|
var(40230):[C:0.04]
CTCACCCCTC TACTCCAGGC TGTGTTCAGG GCTTGGGGCT GGTGGAGGGA
40300
GGGGCCTGAA ATTCCAGTGT GAAAGGCTGA GATGGGCCCG AGGCCCCTGG
40350
CCTATGTCCA AGCCATTTCC CCTCTCACCA GCCTCTCCCT GGGGAGCCAG
40400
TCAGCTAGGA AGGAATGAGG GCTCCCCAGG CCCACCCCCA GTTCCTGAGC
40450
TCATCTGGGC TGCAGGGCTG GCGGGACAGC AGCGTGGACT CAGTCTCCTA
40500
GGGATTTCCC AACTCTCCCG CCCGCTTGCT GCATCTGGAC ACCCTGCCTC
40550 |
var(40524):[A:0.01]
AGGCCCTCAT CTCCACTGGT CAGCAGGTGA CCTTTGCCCA GCGCCCTGGG
40600
TCCTCAGTGC CTGCTGCCCT GGAGATGATA TAAAACAGGT CAGAACCCTC
40650
CTGCCTGTCT GCTCAGTTCA TCCCTAGAGG CAGCTGCTCC AGGTAATGCC
40700 |
var(40652):[G:0.02]
CTCTGGGGAG GGGAAAGAGG AGGGGAGGAG GATGAAGAGG GGCAAGAGGA
40750
GCTCCCTGCC CAGCCCAGCC AGCAAGCCTG GAGAAGCACT TGCTAGAGCT
40800
AAGGAAGCCT
C
G
G
AGCTGGA
C
GGGTGCCCC CCA
C
C
C
C
T
C
A
T
C
A
T
A
A
C
C
T
G
40850 |
var(40811):[T:0.04]
|
var(40813):[C:0.26]
|
var(40821):[T:0.10]
|
REPEAT
A
A
G
A
A
C
A
T
G
G
A
G
G
C
C
C
G
G
G
A
G
G
G
G
T
G
T
C
A
C
T
T
G
C
C
C
A
A
A
G
C
T
A
C
A
T
A
G
GG
40900 |
var(40896):[C:0.20]
GGTGGGGCTG GAAGTGGCTC CAAGTGCAGG TTCCCCCCTC ATTCTTCAGG
40950
C
T
T
A
G
G
G
C
T
G
G
A
G
G
A
A
G
C
C
T
T
A
G
A
C
A
G
C
C
C
A
G
T
C
C
T
A
C
C
C
C
A
G
A
C
A
G
G
G
A
41000 |
REPEAT
A
A
C
T
G
A
G
G
C
C
T
G
G
A
G
A
G
G
G
C
C
A
G
A
A
A
T
C
A
C
C
C
A
A
A
G
A
C
A
C
A
C
A
G
C
A
T
G
T
T
41050
G
G
C
T
G
G
A
C
T
G
G
A
C
G
G
A
G
A
T
C
A
G
T
C
C
A
G
A
C
C
G
C
A
G
G
T
G
C
C
T
T
G
A
T
G
T
T
C
A
G
41100
T
C
T
G
G
T
G
G
G
T
T
T
T
C
T
G
C
T
C
C ATCCCACCCA CCTCCCTTTG GGCCTCGATC
41150
CCTCGCCGCT CACCAGTCCC CCTTCTGAGA GCCCGTATGA GCAGGGAGCC
41200 |
var(41158):[C:0.49]
|
var(41189):[G:0.40]
GGCCCCTACT CCTTCTGGCA GACCCAGCTA AGGTTCTACC TTAGGGGCCA
41250
CGCCACCTCC CCAGGGAGGG GTCCAGAGGC ATGGGGACCT GGGGTGCCCC
41300 |
var(41252):[A:0.07]
|
var(41260):[T:0.03]
TCACAGGACA CTTCCTTGCA GGAACAGAGG TGCCATGCAG CCCCGGGTAC
41350
M Q P R V 767
TCCTTGTTGT TGCCCTCCTG GCGCTCCTGG CCTCTGCCCG TAAGCACTTG
41400
L L V V A L L A L L A S A 780
GTGGGACTGG GCTGGGGGCA GGGTGGAGGC AACTTGGGGA TCCCAGTCCC
41450
AATGGGTGGT CAAGCAGGAG CCCAGGGCTC GTCCAGAGGC CGATCCACCC
41500 |
var(41452):[G:0.01]
CACTCAGCCC TGCTCTTTCC TCAGGAGCTT CAGAGGCCGA GGATGCCTCC
41550
R A S E A E D A S 789
CTTCTCAGCT TCATGCAGGG CTACATGAAG CACGCCACCA AGACCGCCAA
41600 |
var(41571):[T:0.32]
L L S F M Q G Y M K H A T K T A K 806
GGATGCACTG AGCAGCGTGC AGGAGTCCCA GGTGGCCCAG CAGGCCAGGT
41650
D A L S S V Q E S Q V A Q Q A R 822
ACACCCGCTG GCCTCCCTCC CCATCCCCCC TGCCAGCTGC CTCCATTCCC
41700
ACCCGCCCCT GCCCTGGTGA GATCCCAACA ATGGAATGGA GGTGCTCCAG
41750 |
var(41705):[A:0.24]
|
var(41710):[A:0.15]
CCTCCCCTGG GCCTGTGCCT CTTCAGCCTC CTCTTTCCTC ACAGGGCCTT
41800
TGTCAGGCTG CTGCGGGAGA GATGACAGAG TTGAGACTGC ATTCCTCCCA
41850
GGTCCCTCCT TTCTCCCCGG AGCAGTCCTA GGGCGCGCCG TTTTAGCCCT
41900 |
var(41869):[A:0.22]
|
var(41886):[T:0.14]
CATTTCCATT TTCCTTTCCT TTCCC
T
T
T
C
T
T
T
C
T
C
T
T
T
C
T
A
T
T
T
C
T
T
T
C
T
41950 |
REPEAT
|
var(41934):[C:0.19]
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
42000
C
T
T
T
C
T
T
T
C
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
T
C
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
C
T
T
T
C
T
T
T
42050
C
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
T
T
T
C
C
T
T
T
T
T
C
T
T
T
C
T
T
T
C
C
C
T
C
T
C
T
T
C
C
T
T
42100
T
C
T
C
T
C
T
T
T
C
T
T
T
C
T
T
C
T
T
C
T
T
T
T
T
T
T
T
T
T
A
A
T
G
G
A
G
T
C
T
C
C
C
T
C
T
G
T
C
A
42150
C
C
C
A
G
G
C
T
G
G
A
G
T
G
C
A
G
T
G
G
T
G
C
C
A
T
C
T
C
G
G
C
T
C
A
C
T
G
C
A
A
C
C
T
C
C
G
T
C
T
42200 |
var(42153):[T:0.09]
|
var(42197):[A:0.01]
C
C
C
G
G
G
T
T
C
A
A
C
C
C
A
T
T
C
T
C
C
T
G
C
C
T
C
A
G
C
C
T
C
C
C
A
A
G
T
A
G
C
T
G
G
G
A
T
T
A
42250
C
A
G
G
C
A
C
G
C
G
C
C
A
C
C
A
C
A
C
C
C
A
G
C
T
A
A
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
C
A
G
A
G
A
T
G
42300 |
var(42259):[T:0.02]
G
G
G
T
T
T
C
A
C
C
A
T
G
T
T
G
G
C
C
A
G
G
T
T
G
G
T
C
T
T
G
A
A
T
T
C
C
T
G
A
C
C
T
C
A
G
G
G
G
A
42350
T
C
C
T
C
C
T
G
C
C
T
C
G
G
C
C
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
A
C
G
A
G
C
C
A
C
T
42400 |
var(42375):[C:0.24]
|
var(42383):[C:0.01]
|
var(42392):[T:0.37]
G
C
G
C
C
T
G
G
C
C
CCATTTTCCT TTTCTGAAGG TCTGGCTAGA G
C
A
G
T
G
G
T
C
C
42450 |
REPEAT
T
C
A
G
C
C
T
T
T
T
T
G
G
C
A
C
C
A
G
G
G
A
C
C
A
G
T
T
T
T
G
T
G
G
T
G
G
A
C
A
A
T
T
T
T
T
C
C
A
T
42500
G
G
G
C
C
A
G
C
G
G
G
G
A
T
G
G
T
T
T
T
G
G
G
A
T
G
A
A
G
C
T
G
T
T
C
C
A
C
C
T
C
A
G
A
T
C
A
T
C
A
42550
G
G
C
A
T
T
A
G
A
T
T
C
T
C
A
T
A
A
G
G
A
G
C
C
C
T
C
C
A
C
C
T
A
G
A
T
C
C
C
T
G
G
C
A
T
G
T
G
C
A
42600
G
T
T
C
A
C
A
A
T
A
G
G
G
T
T
C
A
C
A
C
T
C
C
T
A
T
G
A
G
A
A
T
G
T
A
A
G
G
C
C
A
C
T
T
G
A
T
C
T
G
42650 |
var(42609):[C:0.24]
A
C
A
G
G
A
G
G
C
G
G
A
G
C
T
C
A
G
G
C
G
G
T
A
T
T
G
C
T
C
A
C
T
C
A
C
C
C
A
C
C
A
C
T
C
A
C
T
T
C
42700
G
T
G
C
T
G
T
G
C
A
G
C
C
C
G
G
C
T
C
C
T
A
A
C
A
G
T
C
C
A
T
G
G
A
C
C
A
G
T
A
C
C
T
A
T
C
T
A
T
G
42750
A
C
T
T
G
G
G
G
G
T
T
G
G
G
G
A
C
C
C
C
T
G
GGCTAGGG GTTTGCCTTG GGAGGCCCCA
42800
CCTGACCCAA TTCAAGCCCG TGAGTGCTTC TGCTTTGTTC TAAGACCTGG
42850 |
var(42808):[T:0.19]
GGCCAGTGTG AGCAGAAGTG TGTCCTTCCT CTCCCATCCT GCCCCTGCCC
42900
ATCAGTACTC TCCTCTCCCC TACTCCCTTC TCCACCTCAC CCTGACTGGC
42950
ATTAGCTGGC ATAGCAGAGG TGTTCA
T
AAA CATTC
T
T
A
G
T
C
C
C
C
A
G
A
A
C
C
43000 |
var(42977):[C:0.01]
|
REPEAT
G
G
C
T
T
T
G
G
G
G
T
A
G
G
T
G
T
T
A
T
T
T
T
C
T
C
A
C
T
T
T
G
C
A
G
A
T
G
A
G
A
A
A
A
T
T
G
A
G
G
43050 |
var(43010):[:0.01]
C
T
C
A
G
A
G
C
G
A
T
T
A
G
G
T
G
A
C
C
T
G
C
C
C
C
A
G
A
T
C
A
C
A
C
A
A
C
T
A
A
T
C
A
A
T
CCTC
43100
CAATGACTTT CCAAATGAGA GGCTGCCTCC CTCTGTCCTA CCCTGCTCAG
43150
AGCCACCAGG TTGTGCAACT CCAGGTGGTG CTGTTTGCAC AGAAAACAAT
43200 |
var(43176):[C:0.37]
GACAGCCTTG ACCTTTCACA TCTCCCCACC CTGTCACTTT GTGCCTCAGG
43250
CCCAGGGGCA TAAACATCTG AGGTGACCTG GAGATGGCAG GGTTTGACTT
43300
GTGCTGGGGT TCCTGCAAGG ATATCTCTTC TCCCAGGGTG GCAGCTGTGG
43350
GGGATTCCTG CCTGAGGTCT CAGGGCTGTC GTCCAGTGAA GTTGAGAGGG
43400
TGGTGTGGTC CTGACTGGTG TCGTCCAGTG GGGACATGGG TGTGGGTCCC
43450 |
var(43428):[A:0.04]
ATGGTTGCCT ACAGAGGAGT TCTCATGCCC TGCTCTGTTG CTTCCCCTGA
43500
CTGATTTAGG GGCTGGGTGA CCGATGGCTT CAGTTCCCTG AAAGACTACT
43550
G W V T D G F S S L K D Y 835
GGAGCACCGT TAAGGACAAG TTCTCTGAGT TCTGGGATTT GGACCCTGAG
43600
W S T V K D K F S E F W D L D P E 852
GTCAGACCAA CTTCAGCCGT GGCTGCCTGA GACCTCAATA CCCCAAGTCC
43650
V R P T S A V A A 861
ACCTGCCTAT CCATCCTGCC AGCTCCTTGG GTCCTGCAAT CTCCAGGGCT
43700 |
var(43670):[G:0.13]
GCCCCTGTAG GTTGCTTAAA AGGGACAGTA TTCTCAGTGC TCTCCTACCC
43750 |
var(43701):[T:0.40]
CACCTCATGC CTGGCCCCCC TCCAGGCATG CTGGCCTCCC AATAAAGCTG
43800
GACAAGAAGC TGCTATGAGT GGGCCGTCGC AAGTGTGCCA TCTGTGTCTG
43850
GGCATGGGAA AGGGCCGAGG CTGTTCTGTG GGTGGGCACT GGACAGACTC
43900
CAGGTCAGGC AGGCATGGAG GCCAGCGCTC TATCCACCTT CTGGTAGCTG
43950 |
var(43926):[T:0.01]
GGCAG
T
C
T
C
T
G
G
G
C
C
T
C
A
G
T
T
T
C
T
T
C
A
T
C
T
C
T
A
A
G
G
T
A
G
G
A
A
T
CACCCTC
44000 |
REPEAT
CGTACCCTGC CTTCCTTGAC AGCTTTGTGC GGAAGGTCAA ACAGGACAAT
44050
AAGTTTGCTG ATACTTTGAT AAACTGTTAG GTGCTGCACA ACATGACTTG
44100
AGTGTGTGCC CCATGCCAGC CACTATGCCT GGCACTTAAG TTGTCATCAG
44150
AGTTGAGACT GTGTGTGTTT ACTCAAAACT GTGGAGCTGA CCTCCCCTAT
44200
CCAGGCCACC TAGCCCTCTT AGGCGCACGT GAAGGGAGGA GGCCGGATGG
44250 |
var(44208):[C:0.28]
GCTAGAGGTT GGAGTAAGAT GCAACGAGGC ACTATTCTTG GCTCCACCAC
44300
TTGATATCA
G
C
C
T
C
A
G
T
T
T
C
T
T
A
C
A
T
G
T
A
A
A
G
T
G
G
A
T
A
C
A
A
C
C
G
T
A
C
C
C
C
44350 |
REPEAT
C
T
C
C
A
C
C
G
T
A
G
G
T
T
T
G
C
C
G
T
G
A
G
A
T
T
G
A
A
A
T
G
A
G
A
G
A
G
C
G
T
T
C
G
A
A
C
C
G
T
44400
T
T
G
G
C
A
C
A
G
C
A
C
C
T
G
C
A
C
G
T
A
A
A
G
A
T
G
C
T
T
G
A
T
C
A
A
T
G
T
T
G
T
C
A
T
G
A
T
T
A
44450
CAGTTGAGCT GACTGGGCCC TTGGGACCCG GACTGGAGTG GTGGGGGGCA
44500
GTGTCCTGGG ACCAAAAAGA AGCACAAGGT CTCCCAATAG AGGCTGCTTC
44550 |
var(44535):[T:0.03]
CTTTGTGTCC CCACCACCCG AAAGATGTCA GGTCAGAGAG CCCGAGAGCT
44600
GCAGATGGCT TGAGTAGGG
C
T
C
C
A
C
T
C
T
T
C
A
G
A
T
C
A
A
A
A
A
A
C
T
G
T
G
G
C
C
C
44650 |
REPEAT
|
var(44633):[G:0.11]
G
G
A
G
A
G
G
C
G
A
A
G
G
C
A
C
T
T
G
G
C
C
A
G
C
A
T
C
A
C
A
G
A
G
C
C
A
G
C
A
C
G
T
G
G
C
A
G
G
G
44700
C
C
A
G
A
C
C
T
T
G
A
G
C
C
C
A
G
G
T
C
AGCTGCGTGT ATTCTGCTCA GTTGGTGCAG
44750
AAAACAGTTT TGTCACTCCT ATGTCAGGTG TTAGGGACTC CTTTACAGAT
44800
CTCAGTGGCA TCAGTACATC CAGCCCCACC TGGAGACTGC TTTCTCTCTG
44850
AAAATTCCCC AGGGCTTCTC TCTGGGCTGA GAGATCTCAG CACCCGTATC
44900
TAGAAAATGT TCCCACCCAG ACCTGGCTGG ATGACTGCTG TTGTAGCTCT
44950
GGAAGGTTAG GAACTAAAAA GCCCACTCCT TTACCTAGGG TAGCTAAGAT
45000
ACACTGGAGA TGGGGACATG GGGATGGGGC CGATTATCCA GGGGCCTGCA
45050
TGAGGGGGCA AAAGGCCCTG CAGAGAGAGG GTAGGGAAGG CACTGCCCAG
45100 |
var(45051):[A:0.07]
ATCTGTGAAG CCATGTGCGT GCACGCGGGG ACATTCAGAC ATGAGTGCAA
45150 |
var(45130):[T:0.20]
GGAGGGACCG TGAGCAGGGA GGTCATGTGA GAATACACAG GCATGCCTGC
45200
ACACCCATGT GAACTTGAGT GCCAGGCCAC ACACTC
T
T
T
T
T
T
T
T
T
T
T
T
T
T
45250 |
REPEAT
T
T
T
T
T
T
T
T
T
T
A
G
C
T
G
G
A
G
T
C
T
T
G
C
T
C
C
G
T
C
G
C
C
C
A
G
G
C
T
G
G
A
G
T
G
C
A
G
T
G
45300
G
C
A
T
G
A
T
T
T
C
G
G
C
T
C
A
C
T
G
T
G
A
C
C
T
C
T
G
C
C
T
C
C
C
A
G
G
T
T
C
A
A
G
C
G
A
T
T
C
T
45350 |
var(45310):[T:0.19]
C
C
T
G
C
C
T
C
A
G
C
C
T
T
C
C
T
A
G
T
A
G
C
T
G
G
G
A
T
T
A
C
G
G
G
T
G
C
A
A
G
C
C
A
C
C
A
T
G
C
45400
C
C
A
G
C
T
G
A
T
T
T
T
T
T
T
T
G
T
A
T
T
T
T
T
A
G
T
A
G
A
G
A
C
A
G
G
G
T
T
T
C
A
C
C
A
T
A
T
T
G
45450 |
var(45409):[:0.99]
G
C
C
A
G
G
C
T
G
G
T
C
T
C
A
A
A
C
T
C
C
T
G
G
C
C
T
G
A
A
G
T
G
A
T
A
C
G
C
C
C
A
C
C
T
C
A
G
C
C
45500
T
C
C
C
A
A
A
G
T
G
C
T
G
G
G
A
T
T
A
C
A
G
G
C
T
T
G
A
G
C
C
A
C
C
G
C
A
C
C
C
G
A
C
C
CGC
A
CA
45550 |
var(45543):[T:0.01]
|
var(45548):[G:0.49]
CTCTTTTCAA TAATCATGGA TGGCCAGGGG TGCAGGGTCT AAAAAGCGCT
45600 |
var(45600):[G:0.08]
GCCTAGCCCA TCCTGCTGTT CACTGGGCAA GCGACGTCAC AGGTCCAGGC
45650 |
var(45622):[G:0.30]
TTCAGTGTCC TCATCCATGC TCTGCGTCTG ATGGCAATCT AGCCAGGATG
45700
TGGGGAAGGG AGGATGCAGT GAGAGCACAG ATATGAGAGC ATCTTGGAAA
45750 |
var(45719):[C:0.01]
TAAAAATGTA CCTGCAAGAG GTGGTGGTGA ATTTTCTTAC TCAGGCCAGC
45800 |
var(45751):[C:0.05]
TTCTGCCAGG GCTGGCAGAA AGAGGGGGTG GCATGGCATG GAGCCGCAGG
45850
GGGTGGAGGA CTGGCTTCCA CTGCTGTGCC TGAGGAAGCC GCGGCTGTTT
45900
CTGGGCGGGA TGGGAGTAGT GGGAGGGGGA TACTGGCCTT GTGAGAAGAA
45950
AAGGGAAGTG TCTGTTTGAG AGGTTTTTGA ATTAGTAAAG GAGGACAGGC
46000
GCAAACTCCA AGCGCTTCAC TTGCACCCGG GACCAAACCC CAATCCCAGT
46050
GGCTGGCTCC CTGAGGCCGC CCCGCTCCGT CCCGCCCCGC TGACAGCGGC
46100 |
var(46079):[A:0.05]
TGGGCTGGAG AAGGCTCTAT ACGGACACAC CTCTGGGGAC GGGGAACCCA
46150
CTGCTCCCAG CTAAAGCAAC CCTGTTTCCT GGCCCGCCTC AGACAGGGCT
46200 |
var(46200):[G:0.05]
GCAGGCCTTG TTTGAGCCCC TTTCAGGGCA CCTGGCCTTG GATTGTCTGT
46250
GGCTTTGCCT GGTCCGCTGT GACTTCCTTT CTACTTGAGC CTTGCTAAGG
46300
CAGACTCTAC TCCCTCACTC GTAAGCAGCC AGGCGTCCAG CAGGTCCTCC
46350
AACGTCGATC TTGGCCCTAA GACGTCCTGT CTGGGCACGG AGTTGTTGAG
46400 |
var(46375):[C:0.16]
|
var(46378):[T:0.08]
ATCCGGCAGG AAGTCCCTGC TCCAGGGCCA AAGGCCCCAC CCGGGCTCCC
46450
CCGGATGTCC CCGCACCCCC CTCTATTCTC CCAAAAGAAA GAAGCTGCTT
46500
CCCACTTTGG AAACGTTTAT TCTGAGCACC GGGAAGGGGG GCGGCGGCGG
46550
GCGCCTCACT GGGTGTTGAG CTTCTTAGTG TACTCCTCGA GAGCGCTCAG
46600
M K A A V L T L A V L F L T G 876
GAAGCTGACC TTGAAGCTCT CCAGCACGGG CAGCAGGCCT TGGCGGAGGT
46650
S Q A R H F W Q Q D E P P Q S P W 893
CCTCGAGCGC GGGCTTGGCC TTCTCGCTGA GCGTGCTCAG ATGCTCGGTG
46700
D R V K D L A T V Y V D V L K D 909
GCCTTGGCGT GGTACTCGGC CAGTCTGGCG CCGCCGTTCT CCTTGAGAGC
46750
S G R D Y V S Q F E G S A L G K Q 926
CTCAAGGCGC GCGGCCAAGC GCTGGCGCAG CTCGTCGCTG TAGGGGGCCA
46800
L N L K L L D N W D S V T S T F S 943
GATGCGTGCG CAGCGCGTCC ACATGGGCGC GCGCGCGGTC GCGCATCTCC
46850
K L R E Q L G P V T Q E F W D N 959
TCGCCCAGTG GGCTCAGCTT CTCTTGCAGC TCGTGCAGCT TCTGGCGCGC
46900
L E K E T E G L R Q E M S K D L E 976
GCCCTCTTGG AGCTCTGCGC GCAGCGGCTC CACCTTCTGG CGGTAGAGCT
46950
E V K A K V Q P Y L D D F Q K K W 993
CCATCTCCTC CTGCCACTTC TTCTGGAAGT CGTCCAGGTA GGGCTGCACC
47000
Q E E M E L Y R Q K V E P L R A 1009
TTGGCCTTCA CCTCCTCCAG ATCCTTGCTC ATCTCCTGCC TCAGGCCCTC
47050
E L Q E G A R Q K L H E L Q E K L 1026
TGTCTCCTTT TCCAGGTTAT CCCAGAACTC CTGGGTCACA GGGCCGAGCT
47100
S P L G E E M R D R A R A H V D A 1043
GTTCGCGCAG CTTGCTGAAG GTGGAGGTCA CGCTGTCCCA GTTGTCAAGG
47150
L R T H L A P Y S D E L R Q R L 1059
AGCTTTAGGC TGGAGGGTGA GACAGAAGGG TTGAGGGCTG GCCTCCCAGC
47200
A A R 1062
GCCCCAGCCT ATCAGGGGTG AGCCCTGGGT GACACCTGTC CGCGGAGGTG
47250 |
var(47222):[A:0.12]
CAGTGGCCTA GCATTTCCAG TACACTCTCA ACCAACACCC CTGCCCCGCC
47300
AGGCCATGCC CCGTTGTGCA GCTGGACCGA GGCACAGAGA GGAGCTAAAA
47350
AGGAGACAGA GCTGGGACTA GTGCCCAGTT ACGTTGGTCT CCAAAGTGGG
47400 |
var(47370):[A:0.21]
CCCTTGAAGG ACCTCGCTTC TATGCCTCCA GCGCATTCTC CTCTCCGAAG
47450 |
var(47433):[A:0.06]
ACAGCCCACA GTCTTGCTGG GCAGGAGCAG AGGAGGTGGT GAAGAAGGGA
47500
AAGGGGCTTG CTACACTTGC AGGCACAATG TGGCTCTGTG ATCACGCATC
47550 |
var(47545):[T:0.04]
GGGCAGCCCT GGTCTGTCCT TTGCAGGGGA CATGGGCTGT GACCCTGCCT
47600
GGAGATCCCA TTCCGGTTTC TCCATCCAGA CCATCTGTGG GGCCCAGCTC
47650 |
var(47615):[A:0.08]
ATCAGATATT AGGTGAGGAC TCGGCCAGTC TGGCTTCAAC ATCATCCCAC
47700
AGGCCTCTGC CCCCTGCCCC TGCCCTCAAC CCCAGGCTGG GTCCTTACTT
47750 |
var(47716):[A:0.08]
L 1063
TAGCTGTTTT CCCAAGGCGG AGCCTTCAAA CTGGGACACA TAGTCTCTGC
47800 |
var(47768):[T:0.06]
E A L K E N G G A R L A E Y H A K 1080
CGCTGTCTTT GAGCACATCC ACGTACACAG TGGCCAGGTC CTTCACTCGA
47850
A T E H L S T L S E K A K P A L 1096
TCCCAGGGGC TCTGGGGGGG TTCATCTTGC TGCCAGAAAT GCCGAGCCTG
47900
E D L R Q G L L P V L E S F K V S 1113
GCTCCCTGAG GGTGGGAGGG GAGACCCAGA TCAGGCCAGC TGTGGGCTGA
47950
F L 1115
GATCTGAGCC GAAAGGCCAA GCTTGGAGGT GGGGGAGAGG GGGCCAGTGA
48000
GAAACCTGCT GCCTCTGCCC AGGAGGGTGG GCCACGGGGA TTTAGGGAGA
48050
AAGCCCCCCG ATGGTTGGCT CCCTAGGTTA GGGGACACCT ACCCGTCAGG
48100 |
var(48052):[A:0.43]
S A 1117
AAGAGCACGG CCAAGGTCAG CACCGCAGCT TTCATCCTGA AGGGCCGTGG
48150
L E E Y T K K L N T Q 1128
GGGACCTGGA GGAGAAGAAG GGCCTGGCTG AGTGGGGTGC CTTCAGCATG
48200
CAGAAGCCCC GTGCTCCCCC ACTCATTGCA GCCAGGTGAG GAGAAGGGCA
48250 |
var(48247):[A:0.01]
CAGAGCGGGA GAAGACCTCA GGTACCCAGA GGCCCGGCCT GGGGCAAGGC
48300 |
var(48286):[A:0.12]
CTGAACCTTG AGCTGGGGAG CCAGAGTGAC CGGGGCAGGC AGCAGGACGC
48350
ACCTCCTTCT CGCAGTCTCT AAGCAGCCAG CTCTTGCAGG GCCTATTTAT
48400
GTCTGCAGCC AGGGTCTGGG CTGGGAGGCT GATAAGCCCA GCCCCGGCCC
48450 |
var(48445):[T:0.19]
TGTTGCTGCT CACTGGTCCT GGCAATGTGG AACTTAAGAG TTCAAGGATC
48500
AGCTCTGTCC CTGGGGCTGG GCAAATAGAG TGGGCAAACA GCAAGCTGCG
48550
GGGGCTGCAG GGCAGGGGTC AAGGGTTCAG TGGGGGCGGG AGGGGAGTGT
48600
CTGCAGGCTT GCAGGTCTCC CGGGTGGGGT CGGGGTTCCC TGCACTCATC
48650
CCCTTCCCCT CCATGGGAGT GTGTGGGCAG TTGCCATTGT CCATTGTGTT
48700 |
var(48695):[G:0.03]
GGCAGAGGAG GGGAGGGGAG GGACGCTGGG ACTCCTCCAC CAAGGAGACT
48750 |
var(48734):[A:0.01]
GCCTCCCCCA CCACCAGCAT TCCAGGGAGA CTACTTCACT CCCCTCCCCC
48800
TTCCCCCGCC CTGTCCTCCC ACCAGTGCTC TTCTTTAGTC CCCAGCAGGT
48850 |
var(48809):[T:0.04]
CCTCCAGGCC TCTCTCCAAG CCTCCCAAAC TGGTAAACCT GGGGAGAGGG
48900
GAGAGCCCTC CGTGGCTCCC AGACTGAGGT TTCGGAGACC TCTTGCATTT
48950
CAAAACACTC CAGAGATCAA TTCGGAGCTG CCAACTTTTA ATTTTGTCAT
49000
GTAAAGATAT TGTCCGCCTC CAAAAAACCC TCACCATCTA CAGTGACCAT
49050
CACTTCAAAA AGGAAAGGCT TTAACAAAAA AGGGCATAAT CTCAGAATTA
49100
CATTACAGAA TTGAAGCCCC TTAGATTGAA GACGTCTCCC TTTGCATTGT
49150
TCACACTTAT ATTTGATCAC GCACAGGTGT GACTGGATCT CAGCCCTTGG
49200
GAAGCCCTGG TGTGGGGAGA AGACCTCTGG GCAAA
G
G
A
A
G
T
G
G
C
A
G
G
A
G
A
49250 |
REPEAT
C
C
T
G
C
T
T
T
C
T
C
A
T
T
T
C
A
C
T
T
G
G
C
C
T
C
C
A
T
C
T
T
G
C
T
G
G
A
G
G
A
A
G
A
T
C
G
T
G
G
49300
G
C
A
A
G
T
T
A
T
A
T
A
A
T
C
T
C
T
C
T
G
A
G
C
C
T
C
G
G
T
T