KCC000438A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000438A_C01 KCC000438A_c01
ccgaaatcgaaaatagtcaaccgctctggGTCCCTACACTACCGAAACCGCTCTAGAAGC
CACATGTGGGGCCTGAAGGGCCCGCGGAAACGGGCCCGAAACATGATTGACTACCTCGTT
TCGCTGCTAGTAGTTGCCTCCGCACTCGCTGTAATCCTTCTATTGATGGGTAGGCACAAA
GCACCAACATCTAGGGCTGGCAGGGCCATAGTGGAGAAGGAAATTGAGGCTCGCAAACAG
TCGCGGATCACTGACTTGCTGAGCGGAGCTACGCGCCCTAGACCTCACACCATGGACCCG
GGGACAATTATGCATACTGGAGCCTACGGAGCTGACGAACCGGCGACAGCGGCTGGTGAT
CACGCTGGCGGGCAAAACGTGACGCAAGGTCTCCCCGGGGGGCTTGCAGTCCCCGGTCCG
GCGCCTGACCCCGTCCTGGCCGCTTCCCCTCGCGGCACTTCACCCGCAGCTGCCCCGCCC
GCACCGACAACAGCGCAACCGCCGGCTGCTGCACCAGCTCCAACTCTGCCAGGCGTGTCG
CCGTCCAACTCGCCCCTCAAGAAGAAGCAGCGACAACG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000438A_C01 KCC000438A_c01
         (578 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_491194.1| COLlagen structural gene (col-50) [Caenorhabdit...    54  2e-06
ref|NP_739264.1| hypothetical protein [Corynebacterium efficiens...    49  5e-05
gb|AAM61027.1| unknown [Arabidopsis thaliana]                          49  5e-05
ref|NP_569011.1| arabinogalactan-protein (AGP7) [Arabidopsis tha...    49  5e-05
ref|NP_630576.1| putative membrane protein [Streptomyces coelico...    49  7e-05

>ref|NP_491194.1| COLlagen structural gene (col-50) [Caenorhabditis elegans]
           gi|7508684|pir||T15142 hypothetical protein T28F2.6 -
           Caenorhabditis elegans gi|2047346|gb|AAB53052.1|
           Collagen protein 50 [Caenorhabditis elegans]
          Length = 418

 Score = 53.9 bits (128), Expect = 2e-06
 Identities = 41/111 (36%), Positives = 48/111 (42%), Gaps = 18/111 (16%)
 Frame = +1

Query: 298 PGTIMHTGAY--GADEPATA-AGDHAGGQNVT-QGLPGGLAVPGPAPDPVLAASPRG--- 456
           P   +  GAY  G D  A A AG + GG     +  P   A P PAP P  AA+P G   
Sbjct: 301 PPRTLGAGAYPEGGDAAAAAPAGGYDGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQ 360

Query: 457 ----TSPAAAPPAPTTAQPPAAAPAPT-------LPGVSPSNSPLKKKQRQ 576
                  AA PPAP  A  P  APAP          G SP+    +KK R+
Sbjct: 361 GGAAAGAAAPPPAPAAAAAPEPAPAPAAAPPPAPAAGGSPTGGYRRKKVRR 411

 Score = 32.0 bits (71), Expect = 6.4
 Identities = 25/75 (33%), Positives = 28/75 (37%), Gaps = 10/75 (13%)
 Frame = -1

Query: 542 GDTPGRVGAGAAAGGCAVVGAGGAAAGEVPR------GEAARTG---SGAGPGTASPPGR 390
           GD      AG   GG         AA   P+        AA  G    GA  G A+PP  
Sbjct: 314 GDAAAAAPAGGYDGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAPPPA 373

Query: 389 PCVTFCP-PA*SPAA 348
           P     P PA +PAA
Sbjct: 374 PAAAAAPEPAPAPAA 388

 Score = 31.6 bits (70), Expect = 8.3
 Identities = 29/84 (34%), Positives = 30/84 (35%), Gaps = 5/84 (5%)
 Frame = +1

Query: 298 PGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPA----PDPVLAASPRGTSP 465
           PG     GA G D    A G  +  Q    G PG    PGPA     D     SP GT  
Sbjct: 209 PGPDGQPGAPGPDGQPGAGGTTSTNQ---PGPPGPAGPPGPAGPAGEDAYAQPSPAGTPG 265

Query: 466 AAAPPAPT-TAQPPAAAPAPTLPG 534
              PP     A P   A AP   G
Sbjct: 266 PPGPPGKDGEAGPDGPAGAPGTDG 289

>ref|NP_739264.1| hypothetical protein [Corynebacterium efficiens YS-314]
           gi|23494498|dbj|BAC19464.1| hypothetical protein
           [Corynebacterium efficiens YS-314]
          Length = 609

 Score = 48.9 bits (115), Expect = 5e-05
 Identities = 37/104 (35%), Positives = 41/104 (38%), Gaps = 7/104 (6%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEP-ATAAGDHAGGQNVTQGLPGGLAVPG------PA 423
           GA+ P P    P     T A GA  P AT  G            PG  A PG      PA
Sbjct: 152 GASIPTPGAAMPTPGTATPAPGAAAPGATIPGSAVPAPGGAPAAPGAPAAPGAAAPRTPA 211

Query: 424 PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
           P    AA P   +P +A P P     P AAP P LP   P  +P
Sbjct: 212 PG---AAIPGAVAPGSAVPTPGAISAPGAAPPPGLPAPGPPGAP 252

 Score = 39.3 bits (90), Expect = 0.040
 Identities = 33/93 (35%), Positives = 37/93 (39%), Gaps = 3/93 (3%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPG-GLAVPGPAPDPVLA 441
           GA  P P T  P       A G+  PA  A            +P  G AVP P      A
Sbjct: 96  GAAVPAPATPTP-----PAAPGSAIPAPGAATPTAVPTPGSAIPTPGAAVPAPGVATPSA 150

Query: 442 ASPRGTSPAAAPPAPTTAQPP--AAAPAPTLPG 534
                 +P AA P P TA P   AAAP  T+PG
Sbjct: 151 PGASIPTPGAAMPTPGTATPAPGAAAPGATIPG 183

 Score = 38.9 bits (89), Expect = 0.052
 Identities = 32/83 (38%), Positives = 37/83 (44%), Gaps = 2/83 (2%)
 Frame = -1

Query: 536 TPGRV--GAGAAAGGCAVVGAGGAAAGEVPRGEAARTGSGAGPGTASPPGRPCVTFCPPA 363
           TPG      GAAA G  + G+   A G  P    A  G+ A PG A+P      T  P A
Sbjct: 164 TPGTATPAPGAAAPGATIPGSAVPAPGGAP----AAPGAPAAPGAAAPR-----TPAPGA 214

Query: 362 *SPAAVAGSSAP*APVCIIVPGS 294
             P AVA  SA   P  I  PG+
Sbjct: 215 AIPGAVAPGSAVPTPGAISAPGA 237

 Score = 36.6 bits (83), Expect = 0.26
 Identities = 33/84 (39%), Positives = 37/84 (43%), Gaps = 8/84 (9%)
 Frame = +1

Query: 298 PGTIMHTGAYGADEPATAA---GDHAGGQNVTQGLPGGLAVPGPAPDPVLAA-----SPR 453
           PG     GA     PA  A   G  A G  V    PG ++ PG AP P L A     +P 
Sbjct: 196 PGAPAAPGAAAPRTPAPGAAIPGAVAPGSAVPT--PGAISAPGAAPPPGLPAPGPPGAPG 253

Query: 454 GTSPAAAPPAPTTAQPPAAAPAPT 525
                AAP AP +    AAAPAPT
Sbjct: 254 APGIPAAPGAPGS----AAAPAPT 273

 Score = 35.4 bits (80), Expect = 0.58
 Identities = 30/90 (33%), Positives = 38/90 (41%), Gaps = 1/90 (1%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
           G+  P P    P T+ + G      PA  A   A    +    PG  A+P P      A 
Sbjct: 35  GSAVPAPGGAVPPTVTN-GPTPQAPPAPGAAVPAPATPIPPAAPGS-AIPAPG-----AV 87

Query: 445 SPRGT-SPAAAPPAPTTAQPPAAAPAPTLP 531
           +P    +P AA PAP T  PP AAP   +P
Sbjct: 88  TPTAVPTPGAAVPAPATPTPP-AAPGSAIP 116

 Score = 35.0 bits (79), Expect = 0.75
 Identities = 32/94 (34%), Positives = 38/94 (40%), Gaps = 9/94 (9%)
 Frame = +1

Query: 265 GATRPR--------PHTMDPGTIMHT-GAYGADEPATAAGDHAGGQNVTQGLPGGLAVPG 417
           GA  PR        P  + PG+ + T GA  A   A   G  A G     G PG  A PG
Sbjct: 203 GAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGAAPPPGLPAPGPPGAPGAPGIPAAPG 262

Query: 418 PAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPA 519
            AP    A +P     +AAP A  T   P  + A
Sbjct: 263 -APGSAAAPAPTSVPRSAAPVAADTDTRPKGSTA 295

 Score = 34.3 bits (77), Expect = 1.3
 Identities = 31/105 (29%), Positives = 37/105 (34%), Gaps = 10/105 (9%)
 Frame = -1

Query: 548 LDGDTPGRVGAGAAAGGCAVVGAGGAAA-----GEVPRGEAARTGSGAGPGTASPPGRPC 384
           +DG+ P          G AV   GGA       G  P+   A   +   P T  PP  P 
Sbjct: 19  MDGNQPPNPTTSPPPPGSAVPAPGGAVPPTVTNGPTPQAPPAPGAAVPAPATPIPPAAPG 78

Query: 383 VTF-CPPA*SPAAV----AGSSAP*APVCIIVPGSMV*GLGRVAP 264
                P A +P AV    A   AP  P     PGS +   G   P
Sbjct: 79  SAIPAPGAVTPTAVPTPGAAVPAPATPTPPAAPGSAIPAPGAATP 123

 Score = 33.5 bits (75), Expect = 2.2
 Identities = 29/97 (29%), Positives = 35/97 (35%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
           GAT P      PG         A   A A    A G  +   +  G AVP P       A
Sbjct: 178 GATIPGSAVPAPGGAPAAPGAPAAPGAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGA 237

Query: 445 SPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
           +P    PA  PP    A  P    AP  PG + + +P
Sbjct: 238 APPPGLPAPGPPGAPGA--PGIPAAPGAPGSAAAPAP 272

 Score = 33.1 bits (74), Expect = 2.9
 Identities = 28/92 (30%), Positives = 35/92 (37%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
           G+  P P  + P  +   GA     PAT     A G  +      G A P   P P  A 
Sbjct: 78  GSAIPAPGAVTPTAVPTPGA-AVPAPATPTPPAAPGSAIPAP---GAATPTAVPTPGSAI 133

Query: 445 SPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVS 540
                +P AA PAP  A P A   +   PG +
Sbjct: 134 P----TPGAAVPAPGVATPSAPGASIPTPGAA 161

 Score = 32.3 bits (72), Expect = 4.9
 Identities = 23/72 (31%), Positives = 27/72 (36%)
 Frame = +1

Query: 259 LSGATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVL 438
           + GA  P      PG I   GA  A  P   A    G      G+P     PG A  P  
Sbjct: 216 IPGAVAPGSAVPTPGAISAPGA--APPPGLPAPGPPGAPGAP-GIPAAPGAPGSAAAPAP 272

Query: 439 AASPRGTSPAAA 474
            + PR  +P AA
Sbjct: 273 TSVPRSAAPVAA 284

 Score = 31.6 bits (70), Expect = 8.3
 Identities = 34/90 (37%), Positives = 38/90 (41%), Gaps = 12/90 (13%)
 Frame = -1

Query: 533 PGRVGAGAAAGGCAVVGAGGAAA-----------GEVPRGEAART-GSGAGPGTASPPGR 390
           P   GA AA G  A   A GAAA           G V  G A  T G+ + PG A PPG 
Sbjct: 187 PAPGGAPAAPGAPA---APGAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGAAPPPGL 243

Query: 389 PCVTFCPPA*SPAAVAGSSAP*APVCIIVP 300
           P     PP  +P A    +AP AP     P
Sbjct: 244 PAPG--PPG-APGAPGIPAAPGAPGSAAAP 270

 Score = 31.6 bits (70), Expect = 8.3
 Identities = 33/102 (32%), Positives = 39/102 (37%), Gaps = 5/102 (4%)
 Frame = +1

Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
           GA  P P T  P       A G+  PA       G    T     G AVP PA  P   A
Sbjct: 62  GAAVPAPATPIP-----PAAPGSAIPAP------GAVTPTAVPTPGAAVPAPAT-PTPPA 109

Query: 445 SPRGTSPAAAPPAPTTAQPPAAA-PAP----TLPGVSPSNSP 555
           +P    PA     PT    P +A P P      PGV+  ++P
Sbjct: 110 APGSAIPAPGAATPTAVPTPGSAIPTPGAAVPAPGVATPSAP 151

>gb|AAM61027.1| unknown [Arabidopsis thaliana]
          Length = 130

 Score = 48.9 bits (115), Expect = 5e-05
 Identities = 23/48 (47%), Positives = 27/48 (55%)
 Frame = +1

Query: 412 PGPAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
           P P+P   +   P  T P AA PAPTT  PPA +PAPT    S + SP
Sbjct: 24  PAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSAPSP 71

 Score = 40.4 bits (93), Expect = 0.018
 Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
 Frame = +1

Query: 394 PGGLAVPGPA--------PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSN 549
           P  +A P PA        P P ++ +P  + P++A P+P++  P A+ PAP  PGVSP +
Sbjct: 34  PPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSA-PSPSSDAPTASPPAPEGPGVSPGD 92

 Score = 33.5 bits (75), Expect = 2.2
 Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 5/55 (9%)
 Frame = +1

Query: 418 PAPD---PVLAASPRGTSPAAAPPAP--TTAQPPAAAPAPTLPGVSPSNSPLKKK 567
           PAP    P  A SP   +P A+PPAP      P   AP P+     P N+ L  K
Sbjct: 58  PAPTSSPPSSAPSPSSDAPTASPPAPEGPGVSPGDLAPTPSDASAPPPNAALTNK 112

 Score = 33.5 bits (75), Expect = 2.2
 Identities = 17/40 (42%), Positives = 20/40 (49%), Gaps = 3/40 (7%)
 Frame = +1

Query: 436 LAASPRGTSPAAAPPAPTTAQPPAAAPAPTL---PGVSPS 546
           LA +P  +      P P    PPAA PAPT    P VSP+
Sbjct: 20  LAQAPAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPA 59

>ref|NP_569011.1| arabinogalactan-protein (AGP7) [Arabidopsis thaliana]
           gi|9759619|dbj|BAB11561.1| gene_id:MNA5.12~unknown
           protein [Arabidopsis thaliana]
           gi|15215666|gb|AAK91378.1| AT5g65390/MNA5_12
           [Arabidopsis thaliana] gi|20334898|gb|AAM16205.1|
           AT5g65390/MNA5_12 [Arabidopsis thaliana]
          Length = 130

 Score = 48.9 bits (115), Expect = 5e-05
 Identities = 23/48 (47%), Positives = 27/48 (55%)
 Frame = +1

Query: 412 PGPAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
           P P+P   +   P  T P AA PAPTT  PPA +PAPT    S + SP
Sbjct: 24  PAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSAPSP 71

 Score = 40.0 bits (92), Expect = 0.023
 Identities = 22/58 (37%), Positives = 33/58 (55%), Gaps = 8/58 (13%)
 Frame = +1

Query: 394 PGGLAVPGPA--------PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSP 543
           P  +A P PA        P P ++ +P  + P++A P+P++  P A+ PAP  PGVSP
Sbjct: 34  PPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSA-PSPSSDAPTASPPAPEGPGVSP 90

 Score = 33.9 bits (76), Expect = 1.7
 Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 5/55 (9%)
 Frame = +1

Query: 418 PAPD---PVLAASPRGTSPAAAPPAP--TTAQPPAAAPAPTLPGVSPSNSPLKKK 567
           PAP    P  A SP   +P A+PPAP      P   AP P+     P N+ L  K
Sbjct: 58  PAPTSSPPSSAPSPSSDAPTASPPAPEGPGVSPGELAPTPSDASAPPPNAALTNK 112

 Score = 33.5 bits (75), Expect = 2.2
 Identities = 17/40 (42%), Positives = 20/40 (49%), Gaps = 3/40 (7%)
 Frame = +1

Query: 436 LAASPRGTSPAAAPPAPTTAQPPAAAPAPTL---PGVSPS 546
           LA +P  +      P P    PPAA PAPT    P VSP+
Sbjct: 20  LAQAPAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPA 59

>ref|NP_630576.1| putative membrane protein [Streptomyces coelicolor A3(2)]
           gi|7480977|pir||T34724 probable membrane protein -
           Streptomyces coelicolor gi|3861426|emb|CAA22031.1|
           putative membrane protein [Streptomyces coelicolor
           A3(2)]
          Length = 205

 Score = 48.5 bits (114), Expect = 7e-05
 Identities = 46/147 (31%), Positives = 54/147 (36%), Gaps = 1/147 (0%)
 Frame = +1

Query: 112 YLVSLLVVASALAVILLLMGRHKAPTSRAGRAIVEKEIEARKQSRITDLLSGATRPRPHT 291
           Y  + + +A  LA+ L L G   A  + A  A+V              LL GA RP P  
Sbjct: 75  YAGAAVALAVGLALALALPGWAAALITAALLAVVAY------------LLRGAARPHPSR 122

Query: 292 MDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAASPRGTSPAA 471
             P      G  G D    A  DH  G       PGGL VP P   PV   +P G   A 
Sbjct: 123 PGPAP----GTAGHDH--VAGHDHVAGGGAPAAAPGGLGVPYPPMPPV---APGGVGGAT 173

Query: 472 APPAPTTAQPPAAAP-APTLPGVSPSN 549
             P   T  P    P AP    + P N
Sbjct: 174 GAPGAGTPAPGGTGPAAPRQDDLDPEN 200



EST assemble image


clone accession position
1 MXL011h08_r BP093676 1 496
2 LCL078f04_r AV630421 30 576
3 CL29c06_r AV394654 42 504
4 LCL033h08_r AV627921 84 578




Chlamydomonas reinhardtii
Kazusa DNA Research Institute
Department of Plant Gene Research