KCC000191A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000191A_C01 KCC000191A_c01
GAAGGGGGATTCGGGCAAATTTGCGGCGGGAATTGGAAGACCGAAACTTTCGTGCTTTTC
CGTTTTGGAAGACCAACTCTTTCAGCCAGCCCCGAAACATAAAGAGCAGTCCCTTTGCAG
CCGCTATCCAACAAGATTACCCTGGCATAAAATTGGGGGTCCCGGGGCTGGTACCGATTG
CTGGTACGGTCGGTGAATAAAAACGCGGGGCGCTCGCCGTGTGCCTTCAACACTATCAAA
TTCCACAGCTTATTAACTTAGATTTACAACTTAGCTACTATTGACCTGGGCGCCTGTAAA
CGCTTTACGTTCCTTTCCAAATTCTTGCACGCAACCTCGCTTCGGCGCCCTCGTTTTGCC
TGAACGCTATACAGCGATAGACGAGCATGTCGGTACCTCTGCAGTGCAATGCGGGGCGCC
TGCTCGCGGGCCAGCGGCCCTGCGGCGTCCGCGCCCGGCTGAATCGTCGCGTTTGTGTCC
CAGTCACCGCGCACGGCAAGGCCTCTGCGACCCGCGAATATGCTGGTGACTTCCTTCCCG
GCACTACCATTTCACACGCGTGGAGTGTCGAGCGTGAGACGCACCACAGGTACCGCAACC
CCGCCGAGTGGATCAACGAGGCCGCTATTCACAAGGCGCTGGAGACCTCCAAGGCGGACG
CCCAGGACGCCGGACGGGTGCGCGAGATCCTGGCCAAGGCCAAGGAAAAGGCCTTCGTCA
CCGAGCATGCGCCCGTCAACGCCGAGTCCAAGTCCGAGTTCGTGCAAGGCCTGACGCTGG
AGGAGTGCGCTACGCTCATCAACGTGGACTCGAACAACGTCGAGCTGATGAATGAGATCT
TCGACACGGCCCTGGCCATCAAGGAGCGCATCTACGGGAACCGTGTGGTGCTCTTCGCGC
CGCTTTACATCGCCAATCACTGCATGAACACCTGCACCTACTGCGCCTTCCGCTCCGCCA
ACAAGGGCATGGAGCGCTCCATCCTCACCGACGACGACCTACGCGAGGAGGTAGCGGCGC
TGCAGCGCCAGGGCCACCGCCGCATCCTGGCGCTCACCGGCGAGCACCCCAAGTACACCT
TTGACAACTTCCTGCACGCCGTGAACGTGATCGCATCTGTCAAGACGGAGCCGGAGGGCA
GCATCCGCCGCATCAATGTGGAGATTCCGCCCCTATCGGTGTCGGACATGCGCCGCCTGA
AGAACACGGACAGCGTGGGCACGTTCGTGCTGTTCCAGGAGACCTACCACCGCGACACCT
TCAAGGTCATGCACCCCTCCGGCCCAAAGTCCGACTTCGACTTCCGCGTGCTGACGCAGG
ACCGGGCCATGCGCGCCGGCCTTGACGACGTGGGCATCGGCGCCCTGTTCGGACTGTACG
ACTACCGCTACGAGGTGTGCGCGATGTTGATGCACAGCGAGCACCTGGAGCGCGAGTACA
ACGCCGGCCCGCACACCATCAGCGTGCCTCGCATGCGCC


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000191A_C01 KCC000191A_c01
         (1479 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_229072.1| thiH protein, putative [Thermotoga maritima] gi...   258  2e-67
ref|ZP_00059905.1| COG1060: Thiamine biosynthesis enzyme ThiH an...   256  7e-67
ref|NP_781953.1| thiH protein [Clostridium tetani E88] gi|282034...   251  3e-65
ref|NP_719454.1| thiH protein, putative [Shewanella oneidensis M...   244  2e-63
ref|ZP_00129808.1| COG1060: Thiamine biosynthesis enzyme ThiH an...   243  6e-63

>ref|NP_229072.1| thiH protein, putative [Thermotoga maritima] gi|7462818|pir||B72274
            hypothetical protein TM1267 - Thermotoga maritima (strain
            MSB8) gi|4981824|gb|AAD36342.1|AE001782_3 thiH protein,
            putative [Thermotoga maritima]
          Length = 473

 Score =  258 bits (659), Expect = 2e-67
 Identities = 134/290 (46%), Positives = 189/290 (64%)
 Frame = +3

Query: 609  WINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEEC 788
            +I E  I + LE +K    D  RVREI+ K+ +K                    L  EE 
Sbjct: 16   FIPEEKIFELLEKTKNP--DPARVREIIQKSLDK------------------NRLEPEET 55

Query: 789  ATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANKG 968
            ATL+NV+  + EL+ EIF+ A  +KERIYGNR+VLFAPLYI N C+N C YC FR +NK 
Sbjct: 56   ATLLNVE--DPELLEEIFEAARTLKERIYGNRIVLFAPLYIGNDCINDCVYCGFRVSNKV 113

Query: 969  MERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSIR 1148
            +ER  LT++ L+EEV AL  QGH+R++ + GEHP Y+ +     ++++ + K    G IR
Sbjct: 114  VERRTLTEEQLKEEVKALVSQGHKRLIVVYGEHPNYSPEFIARTIDIVYNTKYG-NGEIR 172

Query: 1149 RINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDRA 1328
            R+NV   P ++   + +K+   +GTF +FQETYHR+T+  +HP GPKS++++R+   DRA
Sbjct: 173  RVNVNAAPQTIEGYKIIKSV-GIGTFQIFQETYHRETYLKLHPRGPKSNYNWRLYGLDRA 231

Query: 1329 MRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
            M AG+DDVGIGALFGLYD+++EV  +L H+ HLE  +  GPHTIS PR++
Sbjct: 232  MMAGIDDVGIGALFGLYDWKFEVMGLLYHTIHLEERFGVGPHTISFPRIK 281

>ref|ZP_00059905.1| COG1060: Thiamine biosynthesis enzyme ThiH and related
            uncharacterized enzymes [Clostridium thermocellum ATCC
            27405]
          Length = 473

 Score =  256 bits (654), Expect = 7e-67
 Identities = 133/291 (45%), Positives = 184/291 (62%)
 Frame = +3

Query: 606  EWINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEE 785
            ++I E  I   LE  K    D   +REILAKA+E                   +G++L E
Sbjct: 21   DFIKEDLIFSLLEKGKIT--DRNEIREILAKARE------------------CKGISLGE 60

Query: 786  CATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANK 965
             A L+ ++    EL+ E++D A  IK +IYG RVVLFAPLY +N C N C YC FR  NK
Sbjct: 61   VAKLLYLEDE--ELLEELYDVAKYIKNKIYGKRVVLFAPLYTSNECTNNCLYCGFRHDNK 118

Query: 966  GMERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSI 1145
             + R  L+ +++ EE  A++RQGH+R+L + GE P+ T  N  H  + + ++    +  I
Sbjct: 119  ELHRKTLSLEEIVEEAKAIERQGHKRLLLICGEDPRKT--NVKHFTDAMEAIYKSTD--I 174

Query: 1146 RRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDR 1325
            RRINVE  P++V D R LK    +GT+V+FQETYHR+T+++MHP G K+++D+R+   DR
Sbjct: 175  RRINVEAAPMTVEDYRELKKA-GIGTYVIFQETYHRETYRIMHPVGKKANYDWRITAIDR 233

Query: 1326 AMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
            A   G+DDVG+GALFGLYDYR+EV  +LMH  H E +Y  GPHTISVPR+R
Sbjct: 234  AFEGGIDDVGVGALFGLYDYRFEVLGLLMHCMHFEEKYGVGPHTISVPRLR 284

>ref|NP_781953.1| thiH protein [Clostridium tetani E88] gi|28203448|gb|AAO35890.1| thiH
            protein [Clostridium tetani E88]
          Length = 478

 Score =  251 bits (640), Expect = 3e-65
 Identities = 130/292 (44%), Positives = 188/292 (63%), Gaps = 1/292 (0%)
 Frame = +3

Query: 606  EWINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEE 785
            E+I  + I KAL+  +  A++   VRE+L KA E                   +GLT EE
Sbjct: 15   EFIIHSDIEKALDKGREKAKNKDYVRELLNKALE------------------CKGLTYEE 56

Query: 786  CATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANK 965
             A L+NV+  ++  + +I+  A  IKE+IYG R+VLFAPLYI+++C+N C YC ++ +N 
Sbjct: 57   GAVLLNVEDEHI--LEDIYKAAKIIKEKIYGKRIVLFAPLYISSYCVNNCKYCGYKCSNN 114

Query: 966  GMERSILTDDDLREEVAALQRQGHRRILALTGEHP-KYTFDNFLHAVNVIASVKTEPEGS 1142
              +R+ LT D++ EEV  L+  GH+R+    GE     + D  L ++  I S+K    GS
Sbjct: 115  TFKRNKLTMDEIAEEVKILESLGHKRLALEVGEDDVNCSIDYVLKSIKKIYSLKFN-NGS 173

Query: 1143 IRRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQD 1322
            IRRINV I   ++ + ++LK  + +GT++LFQETYH++T++ MHP+GPKSD+++     D
Sbjct: 174  IRRINVNIAATTIENYKKLKEAE-IGTYILFQETYHKETYEKMHPTGPKSDYNYHTTAMD 232

Query: 1323 RAMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
            RA  AG+DDVGIG L+GLYDY+Y+  AMLMH EHLE+    GPHTISVPR+R
Sbjct: 233  RARMAGIDDVGIGVLYGLYDYKYDTVAMLMHGEHLEKATGVGPHTISVPRLR 284

>ref|NP_719454.1| thiH protein, putative [Shewanella oneidensis MR-1]
            gi|24350249|gb|AAN56898.1|AE015824_9 thiH protein,
            putative [Shewanella oneidensis MR-1]
          Length = 479

 Score =  244 bits (624), Expect = 2e-63
 Identities = 129/297 (43%), Positives = 183/297 (61%), Gaps = 2/297 (0%)
 Frame = +3

Query: 591  YRNPAEWINEAAIHKALETSKADAQDAGR--VREILAKAKEKAFVTEHAPVNAESKSEFV 764
            Y     +I++ AI + +E    DA D  R  V  IL KA++                   
Sbjct: 14   YNPNVNFIDDKAIWQTIE----DASDPSREQVLAILDKARQ------------------C 51

Query: 765  QGLTLEECATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYC 944
            +GL++ E A L+      ++ M  +F  A  IK  IYGNR+V+FAPLY++NHC N+C+YC
Sbjct: 52   EGLSISETALLLQNQDKTLDEM--LFSVAREIKNTIYGNRIVMFAPLYVSNHCANSCSYC 109

Query: 945  AFRSANKGMERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVK 1124
             F + N  ++R  L  D++R+EVA L+  GH+RILA+ GEHP+      + ++  + SVK
Sbjct: 110  GFNADNHELKRKTLKQDEIRQEVAILEEMGHKRILAVYGEHPRNNVQAIVESIQTMYSVK 169

Query: 1125 TEPEGSIRRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDF 1304
                G IRRINV   P+SV D ++LK T ++GT+  FQETYH+DT+  +H  G K+DF +
Sbjct: 170  QGKGGEIRRINVNCAPMSVEDFKQLK-TAAIGTYQCFQETYHQDTYSQVHLKGKKTDFLY 228

Query: 1305 RVLTQDRAMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRM 1475
            R+    RAM AG+DDVGIGALFGLYD+R+E+ AML H + LE++   GPHTIS PR+
Sbjct: 229  RLYAMHRAMEAGIDDVGIGALFGLYDHRFELLAMLTHVQQLEKDCGVGPHTISFPRI 285

>ref|ZP_00129808.1| COG1060: Thiamine biosynthesis enzyme ThiH and related
            uncharacterized enzymes [Desulfovibrio desulfuricans G20]
          Length = 469

 Score =  243 bits (620), Expect = 6e-63
 Identities = 122/270 (45%), Positives = 167/270 (61%)
 Frame = +3

Query: 666  DAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEECATLINVDSNNVELMNEIFD 845
            DA RVREILAKA+E                   +GL  EE ATL+ +D  N EL  E+F 
Sbjct: 28   DAVRVREILAKARE------------------AKGLDAEETATLLQLD--NEELDAELFA 67

Query: 846  TALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANKGMERSILTDDDLREEVAALQ 1025
            TA  +K+ IYGNR+VLFAPLYI N C N C YC F + N  ++R  L++D++R EV  L+
Sbjct: 68   TAKKVKQTIYGNRLVLFAPLYITNECYNRCAYCGFNATNSDLKRRTLSEDEIRAEVEVLE 127

Query: 1026 RQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSIRRINVEIPPLSVSDMRRLKN 1205
            R GH+R+L + GEHP+   D     + V+    +E  G IRR+N+   P +V   R+L +
Sbjct: 128  RLGHKRLLLVYGEHPRLDADWMARTIQVVYDTVSEKSGEIRRVNINCAPQTVDGFRKLHD 187

Query: 1206 TDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDRAMRAGLDDVGIGALFGLYDY 1385
               +GT+  FQETYH+ T+   H  GPK D+ +R+    RAM AG+DDVG+G L GLYDY
Sbjct: 188  V-GIGTYQCFQETYHKATYDKAHLGGPKKDYLWRLYAMHRAMEAGIDDVGMGPLLGLYDY 246

Query: 1386 RYEVCAMLMHSEHLEREYNAGPHTISVPRM 1475
            R+E+ A++ H+  LE+ +  GPHTIS PR+
Sbjct: 247  RFEILALMQHAADLEKHFGVGPHTISFPRL 276



EST assemble image


clone accession position
1 HCL071f01_r AV643553 1 486
2 CL53a05_r AV395967 1 543
3 HCL098d08_r AV645114 67 550
4 HCL044g12_r AV642058 151 472
5 MXL079f08_r BP097685 155 612
6 HCL089c03_r AV644480 155 742
7 LCL068c10_r AV629886 240 575
8 LCL034c04_r AV627943 241 578
9 LCL014g09_r AV626747 241 574
10 HCL005h11_r AV639838 243 516
11 HCL047c09_r AV642198 243 525
12 LCL027g08_r AV627508 243 556
13 CL80a11_r AV397346 243 801
14 HCL016a03_r AV640421 244 425
15 LCL021h05_r AV627149 246 456
16 HCL030b06_r AV641229 246 689
17 MXL056a10_r BP096290 246 625
18 HCL047c04_r AV642194 248 535
19 HCL083g02_r AV644196 248 721
20 CL66e12_r AV396771 248 779
21 HCL023g07_r AV640867 253 619
22 HCL065a06_r AV643172 253 727
23 CL73d04_r AV396976 253 703
24 MX062a03_r BP088499 254 593
25 LCL031a12_r AV627721 254 444
26 HCL023b05_r AV640834 254 689
27 HC085c01_r AV638345 254 675
28 MXL072b08_r BP097230 257 631
29 HCL012f08_r AV640233 257 603
30 HCL041e08_r AV641865 257 742
31 CL66b06_r AV396726 257 767
32 HCL070c01_r AV643469 257 732
33 CM013c12_r AV387106 257 842
34 CL45a09_r AV397787 257 726
35 HCL099a07_r AV645162 258 685
36 HCL064c07_r AV643130 264 755
37 CL35f02_r AV395053 264 802
38 CL41f02_r AV395437 265 682
39 HCL038a12_r AV641650 266 755
40 HCL085e02_r AV644284 266 754
41 MXL047e03_r BP095792 267 647
42 HCL061b12_r AV642957 275 769
43 CL12g04_r AV393849 279 814
44 LCL073e10_r AV630121 290 797
45 CL09e11_r AV393704 305 803
46 HC086a12_r AV638414 574 1081
47 HCL078f03_r AV643929 922 1405
48 HCL044b01_r AV642022 956 1376
49 LCL100a01_r AV631794 970 1480




Chlamydomonas reinhardtii
Kazusa DNA Research Institute
Department of Plant Gene Research