



           BLASTP 2.2.24+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
           164,496 sequences; 82,071,388 total letters



Query=  Eace_3218_orf1
Length=84
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

  tgo:TGME49_089620  cathepsin C (EC:3.4.14.1); K01275 cathepsin ...  82.4    4e-16
  pfa:PF11_0174  cathepsin C, homolog; K01275 cathepsin C [EC:3.4...  80.1    2e-15
  cpv:cgd4_2110  preprocathepsin c precursor ; K01275 cathepsin C...  79.3    3e-15
  pfa:PFL2290w  preprocathepsin c precursor, putative (EC:3.4.14....  76.3    3e-14
  tgo:TGME49_067490  papain family cysteine protease domain-conta...  74.7    6e-14
  bbo:BBOV_I000540  16.m00694; preprocathepsin c precursor; K0127...  74.3    9e-14
  tpv:TP03_0357  cathepsin C; K01275 cathepsin C [EC:3.4.14.1]        70.5    1e-12
  tpv:TP02_0883  cathepsin C; K01275 cathepsin C [EC:3.4.14.1]        69.3    3e-12
  bbo:BBOV_II000170  18.m05995; cathepsin C precursor (EC:3.4.22....  68.9    4e-12
  cel:F41E6.6  tag-196; Temporarily Assigned Gene name family mem...  55.1    6e-08
  cel:R09F10.1  hypothetical protein                                  54.7    7e-08
  mmu:13036  Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366 c...  53.5    2e-07
  tgo:TGME49_076130  cathepsin C2 (TgCPC2) (EC:3.4.14.1)              51.6    5e-07
  cel:W07B8.5  cpr-5; Cysteine PRotease related family member (cp...  51.6    6e-07
  hsa:1512  CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519, mi...  50.1    2e-06
  mmu:13032  Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1...  49.7    2e-06
  mmu:26944  Tinag, AI452335, TIN-ag; tubulointerstitial nephriti...  49.7    3e-06
  cel:W07B8.4  hypothetical protein; K01363 cathepsin B [EC:3.4.2...  49.3    3e-06
  dre:368704  ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12, w...  48.5    4e-06
  hsa:27283  TINAG, TIN-AG; tubulointerstitial nephritis antigen      48.5    5e-06
  xla:380203  ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275 c...  48.5    6e-06
  cpv:cgd2_3320  secreted papain like protease, signal peptide        48.1    6e-06
  dre:550475  ctsk, wu:fa95f03, wu:fb08b05, zgc:110367; cathepsin...  48.1    7e-06
  cel:K02E7.10  hypothetical protein                                  47.8    8e-06
  cel:R07E3.1  hypothetical protein                                   47.4    1e-05
  dre:324818  ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H (...  47.4    1e-05
  hsa:1521  CTSW, LYPN; cathepsin W; K08569 cathepsin W [EC:3.4.2...  47.4    1e-05
  mmu:56464  Ctsf, AI481912; cathepsin F (EC:3.4.22.41); K01373 c...  47.4    1e-05
  hsa:64129  TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointersti...  47.4    1e-05
  ath:AT3G48350  cysteine proteinase, putative                        47.0    1e-05
  dre:562116  tinagl1, si:dkey-158b13.1; tubulointerstitial nephr...  46.6    2e-05
  ath:AT3G19390  cysteine proteinase, putative / thiol protease, ...  46.2    3e-05
  cel:F36D3.9  cpr-2; Cysteine PRotease related family member (cp...  45.8    3e-05
  ath:AT3G54940  cysteine-type endopeptidase/ cysteine-type pepti...  45.4    4e-05
  cel:F26E4.3  hypothetical protein                                   45.4    5e-05
  ath:AT2G21430  cysteine proteinase A494, putative / thiol prote...  45.1    5e-05
  cel:F44C4.3  cpr-4; Cysteine PRotease related family member (cp...  45.1    5e-05
  xla:100127265  ctsk, cts02, ctso, ctso1, ctso2, pknd, pycd; cat...  45.1    5e-05
  ath:AT4G39090  RD19; RD19 (RESPONSIVE TO DEHYDRATION 19); cyste...  45.1    6e-05
  mmu:13041  Ctsw, lymphopain; cathepsin W; K08569 cathepsin W [E...  45.1    6e-05
  xla:444163  ctsl1, MGC80629, catl, cpl-1, ctsl, mep; cathepsin ...  45.1    6e-05
  mmu:13038  Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K (E...  45.1    6e-05
  cel:Y51A2D.1  hypothetical protein                                  44.7    6e-05
  pfa:PFD0230c  protease, putative                                    44.7    7e-05
  hsa:1519  CTSO, CTSO1; cathepsin O (EC:3.4.22.42); K01374 cathe...  44.7    7e-05
  dre:569298  ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4....  44.3    8e-05
  cel:Y71H2AR.2  hypothetical protein                                 44.3    1e-04
  dre:567046  ctskl, si:dkey-121a11.6; cathepsin K, like              43.9    1e-04
  cel:C25B8.3  cpr-6; Cysteine PRotease related family member (cp...  43.9    1e-04
  cel:Y113G7B.15  hypothetical protein                                43.9    1e-04


> tgo:TGME49_089620  cathepsin C (EC:3.4.14.1); K01275 cathepsin 
C [EC:3.4.14.1]
Length=733

 Score = 82.4 bits (202),  Expect = 4e-16, Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 54/83 (65%), Gaps = 13/83 (15%)

Query  1    GWGETDAAAATTATPAAAGDSGK-LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            GWGETD            G++GK  KYWIVRNTWG +WG  GY  + RG N GGIESQA+
Sbjct  660  GWGETD------------GENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIESQAT  707

Query  60   FIDPDLTRGKGKQLFDSLLAAHS  82
            FIDPD +RG+G ++  ++ A  S
Sbjct  708  FIDPDFSRGQGLKVAKAIEALKS  730


> pfa:PF11_0174  cathepsin C, homolog; K01275 cathepsin C [EC:3.4.14.1]
Length=700

 Score = 80.1 bits (196),  Expect = 2e-15, Method: Composition-based stats.
 Identities = 38/78 (48%), Positives = 49/78 (62%), Gaps = 14/78 (17%)

Query  1    GWGETDAAAATTATPAAAGDSGKL-KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            GWGE +              +GKL KYWI RN+WG+ WGK GYF ++RG N  GIESQ+ 
Sbjct  630  GWGEEEI-------------NGKLYKYWIGRNSWGNGWGKEGYFKILRGQNFSGIESQSL  676

Query  60   FIDPDLTRGKGKQLFDSL  77
            FI+PD +RG GK L + +
Sbjct  677  FIEPDFSRGAGKILLEKM  694


> cpv:cgd4_2110  preprocathepsin c precursor ; K01275 cathepsin 
C [EC:3.4.14.1]
Length=635

 Score = 79.3 bits (194),  Expect = 3e-15, Method: Composition-based stats.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 4/75 (5%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GW  T+ A A        G+   + YWI+RN+WG++WGK GY  + RG N GGIE+QA F
Sbjct  529  GWEYTNHAIAIVGW----GEENGIPYWIIRNSWGANWGKKGYAKIRRGKNIGGIENQAVF  584

Query  61   IDPDLTRGKGKQLFD  75
            IDPD TRG G  L +
Sbjct  585  IDPDFTRGMGLSLLN  599


> pfa:PFL2290w  preprocathepsin c precursor, putative (EC:3.4.14.1); 
K01275 cathepsin C [EC:3.4.14.1]
Length=590

 Score = 76.3 bits (186),  Expect = 3e-14, Method: Composition-based stats.
 Identities = 30/53 (56%), Positives = 37/53 (69%), Gaps = 0/53 (0%)

Query  24   LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTRGKGKQLFDS  76
            +KYWI+RNTWG +WG  GY    RG+N  GIESQA +IDPD +RG  K +  S
Sbjct  534  VKYWIIRNTWGKNWGYKGYLKFQRGINLAGIESQAVYIDPDFSRGYPKNILQS  586


> tgo:TGME49_067490  papain family cysteine protease domain-containing 
protein (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=622

 Score = 74.7 bits (182),  Expect = 6e-14, Method: Composition-based stats.
 Identities = 37/73 (50%), Positives = 43/73 (58%), Gaps = 9/73 (12%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GWGE D     T  P         K+W+VRNTWGS+WG  GY  + RG N   IESQA +
Sbjct  549  GWGE-DEPDNATGKPK--------KFWVVRNTWGSNWGTHGYVKIPRGENMAAIESQAVY  599

Query  61   IDPDLTRGKGKQL  73
             DPDLTRG+  QL
Sbjct  600  FDPDLTRGRAAQL  612


> bbo:BBOV_I000540  16.m00694; preprocathepsin c precursor; K01275 
cathepsin C [EC:3.4.14.1]
Length=546

 Score = 74.3 bits (181),  Expect = 9e-14, Method: Composition-based stats.
 Identities = 36/68 (52%), Positives = 43/68 (63%), Gaps = 0/68 (0%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GW  T+ A A         D    KYWI +NTWG+ WG GG+F + RGVN  GIE+QA +
Sbjct  468  GWEYTNHAIAIVGWGEDEIDGIITKYWICKNTWGNDWGVGGFFKIKRGVNQCGIETQAVY  527

Query  61   IDPDLTRG  68
            IDPDLTRG
Sbjct  528  IDPDLTRG  535


> tpv:TP03_0357  cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=501

 Score = 70.5 bits (171),  Expect = 1e-12, Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 39/68 (57%), Gaps = 15/68 (22%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GWGETD                  KYW+ RN+WG  WG  G+F ++RG+NA GIES+A  
Sbjct  430  GWGETDEG---------------FKYWVARNSWGKDWGDNGFFKIVRGINAFGIESEAVV  474

Query  61   IDPDLTRG  68
            +DPD+ + 
Sbjct  475  LDPDIEKA  482


> tpv:TP02_0883  cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=365

 Score = 69.3 bits (168),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 40/68 (58%), Gaps = 0/68 (0%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GW  T+ A           +   +KYWI +NTWG++WG  GYF + +GVN  GIESQA F
Sbjct  282  GWEYTNHAIVVVGWGEELVNGENVKYWICKNTWGTNWGVQGYFKIKKGVNLCGIESQAVF  341

Query  61   IDPDLTRG  68
             DP L +G
Sbjct  342  FDPSLNKG  349


> bbo:BBOV_II000170  18.m05995; cathepsin C precursor (EC:3.4.22.-); 
K01275 cathepsin C [EC:3.4.14.1]
Length=530

 Score = 68.9 bits (167),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 40/67 (59%), Gaps = 0/67 (0%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF  60
            GW  T  A A          +  +KYWI RN+WG +WG  G+F + RG NA GIES+A F
Sbjct  448  GWEYTSHAVAIVGWGQEKVGARMIKYWICRNSWGQNWGINGHFKIERGKNAYGIESEAVF  507

Query  61   IDPDLTR  67
            IDPD ++
Sbjct  508  IDPDFSK  514


> cel:F41E6.6  tag-196; Temporarily Assigned Gene name family member 
(tag-196); K01373 cathepsin F [EC:3.4.22.41]
Length=477

 Score = 55.1 bits (131),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 0/41 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            G  G+  YWIV+N+WG +WG+ GYF L RG N  G++  A+
Sbjct  432  GKDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQEMAT  472


> cel:R09F10.1  hypothetical protein
Length=383

 Score = 54.7 bits (130),  Expect = 7e-08, Method: Composition-based stats.
 Identities = 21/36 (58%), Positives = 27/36 (75%), Gaps = 0/36 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGI  54
            G  G+  YWIV+N+WG+SWG  GYF L RGVN+ G+
Sbjct  338  GGEGESAYWIVKNSWGTSWGASGYFRLARGVNSCGL  373


> mmu:13036  Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366 
cathepsin H [EC:3.4.22.16]
Length=333

 Score = 53.5 bits (127),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 0/47 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDL  65
            G+   L YWIV+N+WGS WG+ GYFL+ RG N  G+ + AS+  P +
Sbjct  287  GEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACASYPIPQV  333


> tgo:TGME49_076130  cathepsin C2 (TgCPC2) (EC:3.4.14.1)
Length=753

 Score = 51.6 bits (122),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 35/67 (52%), Gaps = 0/67 (0%)

Query  2    WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI  61
            W + D A   +    A      L YW VRN+WG+ WG+GGY  ++RGVN   IE  A   
Sbjct  572  WEKVDHAVVISGWGWAKHGDSWLPYWKVRNSWGTKWGEGGYARVLRGVNEMAIERVAVVG  631

Query  62   DPDLTRG  68
            +  L RG
Sbjct  632  EVSLFRG  638


> cel:W07B8.5  cpr-5; Cysteine PRotease related family member (cpr-5); 
K01363 cathepsin B [EC:3.4.22.1]
Length=344

 Score = 51.6 bits (122),  Expect = 6e-07, Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 28/42 (66%), Gaps = 0/42 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTR  67
            YW+V N+W  +WG+ GYF +IRG+N  GIE  A    PDL R
Sbjct  301  YWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIPDLAR  342


> hsa:1512  CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519, 
minichain; cathepsin H (EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=335

 Score = 50.1 bits (118),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 0/45 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP  63
            G+   + YWIV+N+WG  WG  GYFL+ RG N  G+ + AS+  P
Sbjct  289  GEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP  333


> mmu:13032  Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1); 
K01275 cathepsin C [EC:3.4.14.1]
Length=462

 Score = 49.7 bits (117),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 18/35 (51%), Positives = 27/35 (77%), Gaps = 0/35 (0%)

Query  24   LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQA  58
            ++YWI++N+WGS+WG+ GYF + RG +   IES A
Sbjct  419  IEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA  453


> mmu:26944  Tinag, AI452335, TIN-ag; tubulointerstitial nephritis 
antigen
Length=475

 Score = 49.7 bits (117),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 19/33 (57%), Positives = 24/33 (72%), Gaps = 0/33 (0%)

Query  23   KLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE  55
            K K+WI  N+WG SWG+ GYF ++RGVN   IE
Sbjct  427  KEKFWIAANSWGKSWGENGYFRILRGVNESDIE  459


> cel:W07B8.4  hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=335

 Score = 49.3 bits (116),  Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 0/42 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTR  67
            YW+  N+W + WG+ GYF ++RGV+  GIES A    PDL R
Sbjct  292  YWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMPDLNR  333


> dre:368704  ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12, 
wu:fj58d01; cathepsin C (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=455

 Score = 48.5 bits (114),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 20/39 (51%), Positives = 26/39 (66%), Gaps = 0/39 (0%)

Query  25   KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP  63
            KYWIV+N+WGS WG+ G+F + RG +   IES A    P
Sbjct  413  KYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATP  451


> hsa:27283  TINAG, TIN-AG; tubulointerstitial nephritis antigen
Length=476

 Score = 48.5 bits (114),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 19/33 (57%), Positives = 24/33 (72%), Gaps = 0/33 (0%)

Query  23   KLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE  55
            K K+WI  N+WG SWG+ GYF ++RGVN   IE
Sbjct  428  KEKFWIAANSWGKSWGENGYFRILRGVNESDIE  460


> xla:380203  ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275 
cathepsin C [EC:3.4.14.1]
Length=458

 Score = 48.5 bits (114),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 20/39 (51%), Positives = 27/39 (69%), Gaps = 0/39 (0%)

Query  25   KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP  63
            KYWIV+N+WG SWG+ G+F + RG +   IES A   +P
Sbjct  416  KYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANP  454


> cpv:cgd2_3320  secreted papain like protease, signal peptide
Length=819

 Score = 48.1 bits (113),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 0/57 (0%)

Query  2    WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQA  58
            W + D A   T        + ++ YWIV+N+WG  WG+ G+  +IRGVN   IE  A
Sbjct  732  WHKVDHAMVITGWGWETYGNERIPYWIVQNSWGKRWGEKGFCRIIRGVNELSIEHAA  788


> dre:550475  ctsk, wu:fa95f03, wu:fb08b05, zgc:110367; cathepsin 
K (EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=333

 Score = 48.1 bits (113),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query  6    DAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF  60
            D   A  A    A   GK KYWIV+N+WG  WGK GY L+ R   NA GI + ASF
Sbjct  276  DVNHAVLAVGYGATPRGK-KYWIVKNSWGEEWGKKGYVLMARNRNNACGIANLASF  330


> cel:K02E7.10  hypothetical protein
Length=299

 Score = 47.8 bits (112),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 0/54 (0%)

Query  6    DAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            +A  A +      G  G  KYWIV+ ++G+SWG+ GY  L R VNA G+    S
Sbjct  238  NANEARSLAIVGYGKDGAEKYWIVKGSFGTSWGEHGYMKLARNVNACGMAESIS  291


> cel:R07E3.1  hypothetical protein
Length=402

 Score = 47.4 bits (111),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 19/34 (55%), Positives = 26/34 (76%), Gaps = 1/34 (2%)

Query  25   KYWIVRNTWGSSWG-KGGYFLLIRGVNAGGIESQ  57
            KYWIV+N+WG++WG + GY    RG+NA GIE +
Sbjct  363  KYWIVKNSWGNTWGVEHGYIYFARGINACGIEDE  396


> dre:324818  ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H 
(EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=330

 Score = 47.4 bits (111),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 0/38 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP  63
            YWIV+N+WG++WG  GYF + RG N  G+ + +S+  P
Sbjct  291  YWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSYPIP  328


> hsa:1521  CTSW, LYPN; cathepsin W; K08569 cathepsin W [EC:3.4.22.-]
Length=376

 Score = 47.4 bits (111),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 6/53 (11%)

Query  2    WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGI  54
            W ET ++ +    P          YWI++N+WG+ WG+ GYF L RG N  GI
Sbjct  308  WAETVSSQSQPQPPHPT------PYWILKNSWGAQWGEKGYFRLHRGSNTCGI  354


> mmu:56464  Ctsf, AI481912; cathepsin F (EC:3.4.22.41); K01373 
cathepsin F [EC:3.4.22.41]
Length=462

 Score = 47.4 bits (111),  Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 0/41 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            G+   + YW ++N+WGS WG+ GY+ L RG  A G+ + AS
Sbjct  417  GNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGACGVNTMAS  457


> hsa:64129  TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointerstitial 
nephritis antigen-like 1
Length=436

 Score = 47.4 bits (111),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 11/56 (19%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES  56
            GWGE               D   LKYW   N+WG +WG+ G+F ++RGVN   IES
Sbjct  375  GWGEETLP-----------DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIES  419


> ath:AT3G48350  cysteine proteinase, putative
Length=364

 Score = 47.0 bits (110),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 4/40 (10%)

Query  25   KYWIVRNTWGSSWGKGGYFLLIRGV--NAG--GIESQASF  60
            KYWIVRN+WG  WG+GGY  + RG+  N G  GI  +AS+
Sbjct  302  KYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASY  341


> dre:562116  tinagl1, si:dkey-158b13.1; tubulointerstitial nephritis 
antigen-like 1
Length=471

 Score = 46.6 bits (109),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 11/56 (19%)

Query  1    GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES  56
            GWGE    +  T            KYWI  N+WG +WG+ GYF + RGVN   IE+
Sbjct  402  GWGEERDYSGRTR-----------KYWIGANSWGKNWGEDGYFRIARGVNECDIET  446


> ath:AT3G19390  cysteine proteinase, putative / thiol protease, 
putative
Length=452

 Score = 46.2 bits (108),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 4/46 (8%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAG----GIESQASF  60
            G  G   YWIVRN+WGS+WG+ GYF L R +       G+   AS+
Sbjct  298  GSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASY  343


> cel:F36D3.9  cpr-2; Cysteine PRotease related family member (cpr-2); 
K01363 cathepsin B [EC:3.4.22.1]
Length=344

 Score = 45.8 bits (107),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 17/32 (53%), Positives = 24/32 (75%), Gaps = 0/32 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQ  57
            YW+  N+WGS WG+ G F ++RGV+  GIES+
Sbjct  306  YWLAVNSWGSQWGESGTFRILRGVDECGIESR  337


> ath:AT3G54940  cysteine-type endopeptidase/ cysteine-type peptidase
Length=367

 Score = 45.4 bits (106),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 17/36 (47%), Positives = 24/36 (66%), Gaps = 0/36 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI  61
            YWI++N+WG  WG+ GY+ L RG +  GI S  S +
Sbjct  326  YWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAV  361


> cel:F26E4.3  hypothetical protein
Length=452

 Score = 45.4 bits (106),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 14/69 (20%)

Query  2    WGETDAAAATTATPAAAG-------------DSGK-LKYWIVRNTWGSSWGKGGYFLLIR  47
            +  +D AA   A+  A G              +GK +KYW+  N+WG+ WG+ GYF ++R
Sbjct  355  YQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLR  414

Query  48   GVNAGGIES  56
            G N   IES
Sbjct  415  GENHCEIES  423


> ath:AT2G21430  cysteine proteinase A494, putative / thiol protease, 
putative
Length=361

 Score = 45.1 bits (105),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 15/36 (41%), Positives = 26/36 (72%), Gaps = 0/36 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI  61
            YWI++N+WG SWG+ G++ + +G N  G++S  S +
Sbjct  321  YWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTV  356


> cel:F44C4.3  cpr-4; Cysteine PRotease related family member (cpr-4); 
K01363 cathepsin B [EC:3.4.22.1]
Length=335

 Score = 45.1 bits (105),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 0/37 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE  55
            G      YW+V N+W  +WG+ GYF +IRG N  GIE
Sbjct  289  GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE  325


> xla:100127265  ctsk, cts02, ctso, ctso1, ctso2, pknd, pycd; cathepsin 
K (EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=331

 Score = 45.1 bits (105),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF  60
            G   K KYWIV+N+WG  WG  GY L+ +   NA GI + AS+
Sbjct  286  GTQKKAKYWIVKNSWGEEWGDKGYILMAKDKGNACGIANLASY  328


> ath:AT4G39090  RD19; RD19 (RESPONSIVE TO DEHYDRATION 19); cysteine-type 
endopeptidase/ cysteine-type peptidase
Length=368

 Score = 45.1 bits (105),  Expect = 6e-05, Method: Compositional matrix adjust.
 Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 0/36 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI  61
            YWI++N+WG +WG+ G++ + +G N  G++S  S +
Sbjct  324  YWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTV  359


> mmu:13041  Ctsw, lymphopain; cathepsin W; K08569 cathepsin W 
[EC:3.4.22.-]
Length=371

 Score = 45.1 bits (105),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 16/29 (55%), Positives = 22/29 (75%), Gaps = 0/29 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGI  54
            YWI++N+WG+ WG+ GYF L RG N  G+
Sbjct  321  YWILKNSWGAHWGEKGYFRLYRGNNTCGV  349


> xla:444163  ctsl1, MGC80629, catl, cpl-1, ctsl, mep; cathepsin 
L1 (EC:3.4.22.15)
Length=256

 Score = 45.1 bits (105),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 0/37 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFID  62
            YWI++N+WG  WG+ GY  + R VN   I + A+ +D
Sbjct  219  YWIIKNSWGKDWGENGYIRMKRNVNQCDIATAAATVD  255


> mmu:13038  Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K 
(EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=329

 Score = 45.1 bits (105),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 21/37 (56%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query  25   KYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF  60
            K+WI++N+WG SWG  GY LL R   NA GI + ASF
Sbjct  290  KHWIIKNSWGESWGNKGYALLARNKNNACGITNMASF  326


> cel:Y51A2D.1  hypothetical protein
Length=411

 Score = 44.7 bits (104),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query  25   KYWIVRNTWG-SSWGKGGYFLLIRGVNAGGIESQA  58
            ++WI++N+WG S WG GGY  LIRG N  GIE  A
Sbjct  366  RFWIMKNSWGVSGWGTGGYVKLIRGKNWCGIERGA  400


> pfa:PFD0230c  protease, putative
Length=939

 Score = 44.7 bits (104),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 16/32 (50%), Positives = 23/32 (71%), Gaps = 0/32 (0%)

Query  25   KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES  56
            KYW V N+WG++WG  GYF ++R  N+  I+S
Sbjct  892  KYWKVLNSWGTNWGNSGYFYILRNNNSFNIKS  923


> hsa:1519  CTSO, CTSO1; cathepsin O (EC:3.4.22.42); K01374 cathepsin 
O [EC:3.4.22.42]
Length=321

 Score = 44.7 bits (104),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 0/41 (0%)

Query  21   SGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI  61
            +G   YWIVRN+WGSSWG  GY  +  G N  GI    S I
Sbjct  279  TGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSI  319


> dre:569298  ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.22.1]
Length=326

 Score = 44.3 bits (103),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 0/39 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQ  57
            G+     +W+V N+W S WG  GYF ++RG +  GIES+
Sbjct  280  GEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESE  318


> cel:Y71H2AR.2  hypothetical protein
Length=345

 Score = 44.3 bits (103),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 18/33 (54%), Positives = 24/33 (72%), Gaps = 0/33 (0%)

Query  19   GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNA  51
            G  G+ KYWIV+ ++G+SWG+ GY  L R VNA
Sbjct  253  GIEGEQKYWIVKGSFGTSWGEQGYMKLARDVNA  285


> dre:567046  ctskl, si:dkey-121a11.6; cathepsin K, like
Length=349

 Score = 43.9 bits (102),  Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query  26   YWIVRNTWGSSWGKGGYFLLIR-GVNAGGIESQASF  60
            YWI++N+WG+ WG+GGY  +IR G N  GI S A +
Sbjct  311  YWIIKNSWGTGWGEGGYMRMIRNGKNTCGIASYALY  346


> cel:C25B8.3  cpr-6; Cysteine PRotease related family member (cpr-6)
Length=379

 Score = 43.9 bits (102),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 24/33 (72%), Gaps = 0/33 (0%)

Query  24   LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES  56
            + YW V N+W + WG+ G+F ++RGV+  GIES
Sbjct  318  IPYWTVANSWNTDWGEDGFFRILRGVDECGIES  350


> cel:Y113G7B.15  hypothetical protein
Length=328

 Score = 43.9 bits (102),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 19/34 (55%), Positives = 23/34 (67%), Gaps = 0/34 (0%)

Query  26   YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS  59
            YW+VRN+W S WG  GY  + RGVN   IES A+
Sbjct  289  YWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAA  322



Lambda     K      H
   0.314    0.132    0.416 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 2040069136


  Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
    Posted date:  Sep 16, 2011  8:45 PM
  Number of letters in database: 82,071,388
  Number of sequences in database:  164,496



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
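
Note on reading the scores: each hit reports a bit score followed by the raw score in parentheses, for example "82.4 bits (202)". The sketch below shows how the two are commonly related through the gapped Lambda and K values printed above, using the standard Karlin-Altschul conversion. The function names are hypothetical and are not part of BLAST. The printed Lambda and K are rounded, and composition-based adjustments may rescale scores per hit, so the E-value estimate is only approximate.

```python
import math

# Values copied from the "Gapped" statistics block and the
# "Effective search space used" line of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 2040069136


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score:
    S' = (lambda * S - ln K) / ln 2
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_evalue(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value from a bit score: E = search_space * 2^(-S').
    Reported E-values may differ somewhat because of composition-based
    adjustments applied by BLAST itself.
    """
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores taken from three hits in this report.
    for raw in (202, 196, 102):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{approx_evalue(bits):.0e}")
```

For the top hit (raw score 202) this conversion gives 82.4 bits, matching the reported value. The estimated E-value lands in the same order of magnitude as the reported 4e-16 but is not expected to match it exactly.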