KCC000272A_c01

Fasta Sequence
>KCC000272A_C01 KCC000272A_c01
gcaccaggTCTCGAGTTTTTTTTTTTTTTTTTTCAACTCCAAATTGTTACATGAGCTCAT
GGCGTGGGTCTGTGCGCCAGTGGTGGGGTGCGCTTGCGTTCGGTCACCGTTCACGCGTTC
CATCATTTTATCCCCAAGGGTTCCACATCGGCGGAGTTCGTCATGCACTGCTGCGCTCCC
CAGTCGGCTAGGGCACCGAGCTGAACTCCACACGACAACTCCTCCCGCAAACTCCCAGGC
GCCAGTACGAGTGCGTTCAACCTCAACCCGTGGACACATTCCATCAACAAAACCCGAGTC
TTGCACGTTGCTACCGGACCGGAACCCGACATGCGCTCACGGTCGAAAACGTGCTAGCCA
CAAGACATCGGAGCCTGACGAAATCGCGACGCAGTGCCCAGCGACCCACGCCCTATTGCG
CCTAGCCCCAACACTGACCAGCAGACCCACACCAGGCCACGGGTTGCAACAACCCCAATA
TGAGCCGCAGCCCAGTAGGTTCTGAGCGCGC


Nr Search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000272A_C01 KCC000272A_c01
         (511 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAH32208.1| Gtf3c1 protein [Mus musculus]                           40  0.017
ref|NP_191218.2| proline-rich protein family [Arabidopsis thaliana]    39  0.029
ref|NP_523475.2| Salivary gland secretion 1 CG3047-PA [Drosophil...    39  0.038
ref|XP_299843.2| similar to mucin 10 [Homo sapiens]                    38  0.084
ref|XP_310043.1| ENSANGP00000015192 [Anopheles gambiae] gi|21293...    37  0.14

>gb|AAH32208.1| Gtf3c1 protein [Mus musculus]
          Length = 1373

 Score = 40.0 bits (92), Expect = 0.017
 Identities = 24/74 (32%), Positives = 30/74 (40%)
 Frame = +2

Query: 182  SRLGHRAELHTTTPPANSQAPVRVRSTSTRGHIPSTKPESCTLLPDRNPTCAHGRKRASH 361
            SRL     +   T    +  PV   ST  R H P T  E  T LP + PT    R  AS 
Sbjct: 1104 SRLPEGPSIEDHTSEGAAVPPVSSHSTKKRPHCPETDAEEATRLPAKKPTLQDVRVAASP 1163

Query: 362  KTSEPDEIATQCPA 403
            +    ++   Q PA
Sbjct: 1164 RPGAEEQAEAQAPA 1177

>ref|NP_191218.2| proline-rich protein family [Arabidopsis thaliana]
          Length = 477

 Score = 39.3 bits (90), Expect = 0.029
 Identities = 33/135 (24%), Positives = 50/135 (36%), Gaps = 9/135 (6%)
 Frame = +2

Query: 116 RSIILSPRVPHRRSSSCTAA---------LPSRLGHRAELHTTTPPANSQAPVRVRSTST 268
           + + LS  +PH  ++S T +          P    H    H    P  S +P       T
Sbjct: 304 KQVRLSSILPHSPATSSTPSPSPQPETHQYPHHHPHHHHHHHELAPEPSLSP------PT 357

Query: 269 RGHIPSTKPESCTLLPDRNPTCAHGRKRASHKTSEPDEIATQCPATHALLRLAPTLTSRP 448
           +G  P++ P   + LP RNP C + ++R    ++     A   PA H      P     P
Sbjct: 358 KGFAPASAPTKHSPLPPRNPPCPYEQRRPKGNSALNHHTAPPTPAPHRSQPHPPAPNPAP 417

Query: 449 TPGHGLQQPQYEPQP 493
              H +  P   P P
Sbjct: 418 PRHHAI--PVSSPLP 430

>ref|NP_523475.2| Salivary gland secretion 1 CG3047-PA [Drosophila melanogaster]
            gi|28380257|gb|AAF50957.3| CG3047-PA [Drosophila
            melanogaster]
          Length = 1225

 Score = 38.9 bits (89), Expect = 0.038
 Identities = 38/137 (27%), Positives = 50/137 (35%), Gaps = 6/137 (4%)
 Frame = +2

Query: 101  RSPFTRSIILSPRVPHRRSSSCTAALPSRLGHRAELHTTTPPANSQAPVRVRSTSTRGHI 280
            RS  T S         R ++  +   P+    R+   TTT    +  P R  +T+T    
Sbjct: 996  RSTTTTSTSRPTTTTPRSTTKTSTCAPTTTTPRSTTTTTTSRPTTTTP-RSTTTTTTSRP 1054

Query: 281  PSTKPESCTLLPDRNPTCAHGRKRASHKTSEPD-----EIATQCP-ATHALLRLAPTLTS 442
             +T P S T      PT    R   +  TS P         T CP  T +      T T 
Sbjct: 1055 TTTTPRSTTTPCTSRPTTTTPRSTTTTTTSRPTTTTPRSTTTPCPTTTPSASPTRTTTTR 1114

Query: 443  RPTPGHGLQQPQYEPQP 493
            RP P H   QP Y+  P
Sbjct: 1115 RPCPCH--PQPPYQIPP 1129

 Score = 33.1 bits (74), Expect = 2.1
 Identities = 28/109 (25%), Positives = 41/109 (36%), Gaps = 8/109 (7%)
 Frame = +2

Query: 149  RRSSSCTAALPSRLGHRAELHTTTPPANSQAPVRVRSTSTRGHIPSTKPESCTLLPDRNP 328
            R ++  +   P+    R+   T+T    +  P R  +T+T     +T P S T      P
Sbjct: 932  RSTTKTSTCAPTTTTPRSTTTTSTSRPTTTTP-RSTTTTTTSRPTTTTPRSTTTPSTSRP 990

Query: 329  TCAHGRKRASHKTSEPD--------EIATQCPATHALLRLAPTLTSRPT 451
            T    R   +  TS P         + +T  P T        T TSRPT
Sbjct: 991  TTTTPRSTTTTSTSRPTTTTPRSTTKTSTCAPTTTTPRSTTTTTTSRPT 1039

 Score = 32.0 bits (71), Expect = 4.6
 Identities = 37/137 (27%), Positives = 48/137 (35%), Gaps = 10/137 (7%)
 Frame = +2

Query: 71   CAPVVGCACVRSPFTRSIILSPRVPHRRSSSCTAALPSRLGHRAELHTTTPPANSQAPVR 250
            CAP       RS  T S         R +++ T + P+    R+   TTTP  +      
Sbjct: 940  CAPTT--TTPRSTTTTSTSRPTTTTPRSTTTTTTSRPTTTTPRS---TTTPSTSRPTTTT 994

Query: 251  VRSTSTRG--HIPSTKPESCTLLPDRNPTCAHGRKRASHKTSEPDEIATQCPATHALLRL 424
             RST+T       +T P S T      PT    R   +  TS P     +   T    R 
Sbjct: 995  PRSTTTTSTSRPTTTTPRSTTKTSTCAPTTTTPRSTTTTTTSRPTTTTPRSTTTTTTSRP 1054

Query: 425  APTL--------TSRPT 451
              T         TSRPT
Sbjct: 1055 TTTTPRSTTTPCTSRPT 1071

 Score = 31.2 bits (69), Expect = 7.9
 Identities = 26/89 (29%), Positives = 31/89 (34%), Gaps = 10/89 (11%)
 Frame = +2

Query: 215 TTPPANSQAPVRVRSTSTRG--HIPSTKPESCTLLPDRNPTCAHGRKRASHKTSEPDEIA 388
           TTP   +Q     RST+T       +T P S T      PT    R   +  T  P    
Sbjct: 212 TTPCTCAQTTTTPRSTTTTSTSRPTTTTPRSTTTTTTSRPTTTTPRSTTTTTTRRPTTTT 271

Query: 389 TQC--------PATHALLRLAPTLTSRPT 451
            +C        P T        T TSRPT
Sbjct: 272 PRCTTTTSTCAPTTTTPRSTTTTTTSRPT 300

 Score = 31.2 bits (69), Expect = 7.9
 Identities = 33/127 (25%), Positives = 44/127 (33%), Gaps = 6/127 (4%)
 Frame = +2

Query: 89  CACV------RSPFTRSIILSPRVPHRRSSSCTAALPSRLGHRAELHTTTPPANSQAPVR 250
           C C       RS  T S         R +++ T + P+    R+   TTT    +  P  
Sbjct: 215 CTCAQTTTTPRSTTTTSTSRPTTTTPRSTTTTTTSRPTTTTPRSTTTTTTRRPTTTTPRC 274

Query: 251 VRSTSTRGHIPSTKPESCTLLPDRNPTCAHGRKRASHKTSEPDEIATQCPATHALLRLAP 430
             +TST     +T P S T      PT    R   +  T  P     +   T        
Sbjct: 275 TTTTSTCAP-TTTTPRSTTTTTTSRPTTTTPRCTTTTSTCSPTRTTPRSTTT-------- 325

Query: 431 TLTSRPT 451
           T TSRPT
Sbjct: 326 TSTSRPT 332

>ref|XP_299843.2| similar to mucin 10 [Homo sapiens]
          Length = 566

 Score = 37.7 bits (86), Expect = 0.084
 Identities = 40/136 (29%), Positives = 50/136 (36%), Gaps = 18/136 (13%)
 Frame = +2

Query: 134 PRVPHRRSSSCTAA-LPSRLGHRAE----LHTTTPPANS--QAPVRVRSTSTRGHIPSTK 292
           P  P  R+   TA+  PS    R E    LH   PP N+  Q P            P T+
Sbjct: 387 PTTPKTRTQPLTASPTPSPQHQRPEPTLSLHPRRPPHNTKDQNPAPHCIPDALPTTPKTR 446

Query: 293 PESCTLLPDRNPTCAHGRKRAS---HKTSEPDEIATQCPATHALLRLAPT--------LT 439
            +  T  P  +P   H R   S   H    P     Q PA H +    PT        LT
Sbjct: 447 TQPLTASPTPSPQ--HQRPEPSPSLHPRRPPHNTKDQNPAPHCIPDALPTSPKTRTQPLT 504

Query: 440 SRPTPGHGLQQPQYEP 487
           + PTP    Q+P+  P
Sbjct: 505 ASPTPSPQHQRPERSP 520

>ref|XP_310043.1| ENSANGP00000015192 [Anopheles gambiae] gi|21293583|gb|EAA05728.1|
           ENSANGP00000015192 [Anopheles gambiae str. PEST]
          Length = 576

 Score = 37.0 bits (84), Expect = 0.14
 Identities = 41/138 (29%), Positives = 55/138 (39%), Gaps = 19/138 (13%)
 Frame = +2

Query: 140 VPHRRSSSCTAALPSRLGHRAELHTTTPPANSQAPVRVRSTSTRGHIPSTKPESCTLLPD 319
           VP   S    AA P      A+L T+ PPA++++P    S ST  +  +  P S    P 
Sbjct: 9   VPSSSSPGSPAAYP------ADLSTSKPPASTKSP---SSGSTAAYPANPSPRS----PA 55

Query: 320 RNPTCAHGRKRASHKTSEPDEIATQCPATHALLRLAPTLTSRPT---------------- 451
            NP        A+  TS+P    T  P+  +    A + TS+PT                
Sbjct: 56  ANP--------AASPTSKPPASTTANPSPRSSANPAASPTSKPTSINKIPPQDHQPHIPQ 107

Query: 452 -PGHGLQQ--PQYEPQPS 496
            P H  QQ  PQ  P PS
Sbjct: 108 IPPHDHQQQIPQLHPLPS 125

 Score = 35.4 bits (80), Expect = 0.42
 Identities = 33/120 (27%), Positives = 46/120 (37%), Gaps = 6/120 (5%)
 Frame = +2

Query: 155 SSSCTAALPSRLGHR------AELHTTTPPANSQAPVRVRSTSTRGHIPSTKPESCTLLP 316
           SS  TAA P+    R      A   T+ PPA++ A    RS++     P++KP S   +P
Sbjct: 38  SSGSTAAYPANPSPRSPAANPAASPTSKPPASTTANPSPRSSANPAASPTSKPTSINKIP 97

Query: 317 DRNPTCAHGRKRASHKTSEPDEIATQCPATHALLRLAPTLTSRPTPGHGLQQPQYEPQPS 496
            ++        +       P +   Q P  H L          P   H  Q PQ  P PS
Sbjct: 98  PQD-------HQPHIPQIPPHDHQQQIPQLHPLPSHQHQQPQIPLHDHQQQIPQLHPLPS 150



EST assembly


No.  clone       accession  position (start, end)
1    CL15d03_r   AV393955     1  346
2    CM088a06_r  AV393075     9  511
3    MX226g06_r  BP090632     9  300




Chlamydomonas reinhardtii
Kazusa DNA Research Institute
Department of Plant Gene Research