bitscore colors: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
164,496 sequences; 82,071,388 total letters
Query= Eace_3218_orf1
Length=84
Score E
Sequences producing significant alignments: (Bits) Value
tgo:TGME49_089620 cathepsin C (EC:3.4.14.1); K01275 cathepsin ... 82.4 4e-16
pfa:PF11_0174 cathepsin C, homolog; K01275 cathepsin C [EC:3.4... 80.1 2e-15
cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin C... 79.3 3e-15
pfa:PFL2290w preprocathepsin c precursor, putative (EC:3.4.14.... 76.3 3e-14
tgo:TGME49_067490 papain family cysteine protease domain-conta... 74.7 6e-14
bbo:BBOV_I000540 16.m00694; preprocathepsin c precursor; K0127... 74.3 9e-14
tpv:TP03_0357 cathepsin C; K01275 cathepsin C [EC:3.4.14.1] 70.5 1e-12
tpv:TP02_0883 cathepsin C; K01275 cathepsin C [EC:3.4.14.1] 69.3 3e-12
bbo:BBOV_II000170 18.m05995; cathepsin C precursor (EC:3.4.22.... 68.9 4e-12
cel:F41E6.6 tag-196; Temporarily Assigned Gene name family mem... 55.1 6e-08
cel:R09F10.1 hypothetical protein 54.7 7e-08
mmu:13036 Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366 c... 53.5 2e-07
tgo:TGME49_076130 cathepsin C2 (TgCPC2) (EC:3.4.14.1) 51.6 5e-07
cel:W07B8.5 cpr-5; Cysteine PRotease related family member (cp... 51.6 6e-07
hsa:1512 CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519, mi... 50.1 2e-06
mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1... 49.7 2e-06
mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephriti... 49.7 3e-06
cel:W07B8.4 hypothetical protein; K01363 cathepsin B [EC:3.4.2... 49.3 3e-06
dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12, w... 48.5 4e-06
hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen 48.5 5e-06
xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275 c... 48.5 6e-06
cpv:cgd2_3320 secreted papain like protease, signal peptide 48.1 6e-06
dre:550475 ctsk, wu:fa95f03, wu:fb08b05, zgc:110367; cathepsin... 48.1 7e-06
cel:K02E7.10 hypothetical protein 47.8 8e-06
cel:R07E3.1 hypothetical protein 47.4 1e-05
dre:324818 ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H (... 47.4 1e-05
hsa:1521 CTSW, LYPN; cathepsin W; K08569 cathepsin W [EC:3.4.2... 47.4 1e-05
mmu:56464 Ctsf, AI481912; cathepsin F (EC:3.4.22.41); K01373 c... 47.4 1e-05
hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointersti... 47.4 1e-05
ath:AT3G48350 cysteine proteinase, putative 47.0 1e-05
dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephr... 46.6 2e-05
ath:AT3G19390 cysteine proteinase, putative / thiol protease, ... 46.2 3e-05
cel:F36D3.9 cpr-2; Cysteine PRotease related family member (cp... 45.8 3e-05
ath:AT3G54940 cysteine-type endopeptidase/ cysteine-type pepti... 45.4 4e-05
cel:F26E4.3 hypothetical protein 45.4 5e-05
ath:AT2G21430 cysteine proteinase A494, putative / thiol prote... 45.1 5e-05
cel:F44C4.3 cpr-4; Cysteine PRotease related family member (cp... 45.1 5e-05
xla:100127265 ctsk, cts02, ctso, ctso1, ctso2, pknd, pycd; cat... 45.1 5e-05
ath:AT4G39090 RD19; RD19 (RESPONSIVE TO DEHYDRATION 19); cyste... 45.1 6e-05
mmu:13041 Ctsw, lymphopain; cathepsin W; K08569 cathepsin W [E... 45.1 6e-05
xla:444163 ctsl1, MGC80629, catl, cpl-1, ctsl, mep; cathepsin ... 45.1 6e-05
mmu:13038 Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K (E... 45.1 6e-05
cel:Y51A2D.1 hypothetical protein 44.7 6e-05
pfa:PFD0230c protease, putative 44.7 7e-05
hsa:1519 CTSO, CTSO1; cathepsin O (EC:3.4.22.42); K01374 cathe... 44.7 7e-05
dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.... 44.3 8e-05
cel:Y71H2AR.2 hypothetical protein 44.3 1e-04
dre:567046 ctskl, si:dkey-121a11.6; cathepsin K, like 43.9 1e-04
cel:C25B8.3 cpr-6; Cysteine PRotease related family member (cp... 43.9 1e-04
cel:Y113G7B.15 hypothetical protein 43.9 1e-04
> tgo:TGME49_089620 cathepsin C (EC:3.4.14.1); K01275 cathepsin
C [EC:3.4.14.1]
Length=733
Score = 82.4 bits (202), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 54/83 (65%), Gaps = 13/83 (15%)
Query 1 GWGETDAAAATTATPAAAGDSGK-LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
GWGETD G++GK KYWIVRNTWG +WG GY + RG N GGIESQA+
Sbjct 660 GWGETD------------GENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIESQAT 707
Query 60 FIDPDLTRGKGKQLFDSLLAAHS 82
FIDPD +RG+G ++ ++ A S
Sbjct 708 FIDPDFSRGQGLKVAKAIEALKS 730
> pfa:PF11_0174 cathepsin C, homolog; K01275 cathepsin C [EC:3.4.14.1]
Length=700
Score = 80.1 bits (196), Expect = 2e-15, Method: Composition-based stats.
Identities = 38/78 (48%), Positives = 49/78 (62%), Gaps = 14/78 (17%)
Query 1 GWGETDAAAATTATPAAAGDSGKL-KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
GWGE + +GKL KYWI RN+WG+ WGK GYF ++RG N GIESQ+
Sbjct 630 GWGEEEI-------------NGKLYKYWIGRNSWGNGWGKEGYFKILRGQNFSGIESQSL 676
Query 60 FIDPDLTRGKGKQLFDSL 77
FI+PD +RG GK L + +
Sbjct 677 FIEPDFSRGAGKILLEKM 694
> cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin
C [EC:3.4.14.1]
Length=635
Score = 79.3 bits (194), Expect = 3e-15, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 4/75 (5%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GW T+ A A G+ + YWI+RN+WG++WGK GY + RG N GGIE+QA F
Sbjct 529 GWEYTNHAIAIVGW----GEENGIPYWIIRNSWGANWGKKGYAKIRRGKNIGGIENQAVF 584
Query 61 IDPDLTRGKGKQLFD 75
IDPD TRG G L +
Sbjct 585 IDPDFTRGMGLSLLN 599
> pfa:PFL2290w preprocathepsin c precursor, putative (EC:3.4.14.1);
K01275 cathepsin C [EC:3.4.14.1]
Length=590
Score = 76.3 bits (186), Expect = 3e-14, Method: Composition-based stats.
Identities = 30/53 (56%), Positives = 37/53 (69%), Gaps = 0/53 (0%)
Query 24 LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTRGKGKQLFDS 76
+KYWI+RNTWG +WG GY RG+N GIESQA +IDPD +RG K + S
Sbjct 534 VKYWIIRNTWGKNWGYKGYLKFQRGINLAGIESQAVYIDPDFSRGYPKNILQS 586
> tgo:TGME49_067490 papain family cysteine protease domain-containing
protein (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=622
Score = 74.7 bits (182), Expect = 6e-14, Method: Composition-based stats.
Identities = 37/73 (50%), Positives = 43/73 (58%), Gaps = 9/73 (12%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GWGE D T P K+W+VRNTWGS+WG GY + RG N IESQA +
Sbjct 549 GWGE-DEPDNATGKPK--------KFWVVRNTWGSNWGTHGYVKIPRGENMAAIESQAVY 599
Query 61 IDPDLTRGKGKQL 73
DPDLTRG+ QL
Sbjct 600 FDPDLTRGRAAQL 612
> bbo:BBOV_I000540 16.m00694; preprocathepsin c precursor; K01275
cathepsin C [EC:3.4.14.1]
Length=546
Score = 74.3 bits (181), Expect = 9e-14, Method: Composition-based stats.
Identities = 36/68 (52%), Positives = 43/68 (63%), Gaps = 0/68 (0%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GW T+ A A D KYWI +NTWG+ WG GG+F + RGVN GIE+QA +
Sbjct 468 GWEYTNHAIAIVGWGEDEIDGIITKYWICKNTWGNDWGVGGFFKIKRGVNQCGIETQAVY 527
Query 61 IDPDLTRG 68
IDPDLTRG
Sbjct 528 IDPDLTRG 535
> tpv:TP03_0357 cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=501
Score = 70.5 bits (171), Expect = 1e-12, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 39/68 (57%), Gaps = 15/68 (22%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GWGETD KYW+ RN+WG WG G+F ++RG+NA GIES+A
Sbjct 430 GWGETDEG---------------FKYWVARNSWGKDWGDNGFFKIVRGINAFGIESEAVV 474
Query 61 IDPDLTRG 68
+DPD+ +
Sbjct 475 LDPDIEKA 482
> tpv:TP02_0883 cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=365
Score = 69.3 bits (168), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 40/68 (58%), Gaps = 0/68 (0%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GW T+ A + +KYWI +NTWG++WG GYF + +GVN GIESQA F
Sbjct 282 GWEYTNHAIVVVGWGEELVNGENVKYWICKNTWGTNWGVQGYFKIKKGVNLCGIESQAVF 341
Query 61 IDPDLTRG 68
DP L +G
Sbjct 342 FDPSLNKG 349
> bbo:BBOV_II000170 18.m05995; cathepsin C precursor (EC:3.4.22.-);
K01275 cathepsin C [EC:3.4.14.1]
Length=530
Score = 68.9 bits (167), Expect = 4e-12, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 40/67 (59%), Gaps = 0/67 (0%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASF 60
GW T A A + +KYWI RN+WG +WG G+F + RG NA GIES+A F
Sbjct 448 GWEYTSHAVAIVGWGQEKVGARMIKYWICRNSWGQNWGINGHFKIERGKNAYGIESEAVF 507
Query 61 IDPDLTR 67
IDPD ++
Sbjct 508 IDPDFSK 514
> cel:F41E6.6 tag-196; Temporarily Assigned Gene name family member
(tag-196); K01373 cathepsin F [EC:3.4.22.41]
Length=477
Score = 55.1 bits (131), Expect = 6e-08, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 0/41 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
G G+ YWIV+N+WG +WG+ GYF L RG N G++ A+
Sbjct 432 GKDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQEMAT 472
> cel:R09F10.1 hypothetical protein
Length=383
Score = 54.7 bits (130), Expect = 7e-08, Method: Composition-based stats.
Identities = 21/36 (58%), Positives = 27/36 (75%), Gaps = 0/36 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGI 54
G G+ YWIV+N+WG+SWG GYF L RGVN+ G+
Sbjct 338 GGEGESAYWIVKNSWGTSWGASGYFRLARGVNSCGL 373
> mmu:13036 Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366
cathepsin H [EC:3.4.22.16]
Length=333
Score = 53.5 bits (127), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 0/47 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDL 65
G+ L YWIV+N+WGS WG+ GYFL+ RG N G+ + AS+ P +
Sbjct 287 GEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACASYPIPQV 333
> tgo:TGME49_076130 cathepsin C2 (TgCPC2) (EC:3.4.14.1)
Length=753
Score = 51.6 bits (122), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 35/67 (52%), Gaps = 0/67 (0%)
Query 2 WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI 61
W + D A + A L YW VRN+WG+ WG+GGY ++RGVN IE A
Sbjct 572 WEKVDHAVVISGWGWAKHGDSWLPYWKVRNSWGTKWGEGGYARVLRGVNEMAIERVAVVG 631
Query 62 DPDLTRG 68
+ L RG
Sbjct 632 EVSLFRG 638
> cel:W07B8.5 cpr-5; Cysteine PRotease related family member (cpr-5);
K01363 cathepsin B [EC:3.4.22.1]
Length=344
Score = 51.6 bits (122), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%), Gaps = 0/42 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTR 67
YW+V N+W +WG+ GYF +IRG+N GIE A PDL R
Sbjct 301 YWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIPDLAR 342
> hsa:1512 CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519,
minichain; cathepsin H (EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=335
Score = 50.1 bits (118), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 0/45 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP 63
G+ + YWIV+N+WG WG GYFL+ RG N G+ + AS+ P
Sbjct 289 GEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 333
> mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1);
K01275 cathepsin C [EC:3.4.14.1]
Length=462
Score = 49.7 bits (117), Expect = 2e-06, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 27/35 (77%), Gaps = 0/35 (0%)
Query 24 LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQA 58
++YWI++N+WGS+WG+ GYF + RG + IES A
Sbjct 419 IEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA 453
> mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephritis
antigen
Length=475
Score = 49.7 bits (117), Expect = 3e-06, Method: Composition-based stats.
Identities = 19/33 (57%), Positives = 24/33 (72%), Gaps = 0/33 (0%)
Query 23 KLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE 55
K K+WI N+WG SWG+ GYF ++RGVN IE
Sbjct 427 KEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
> cel:W07B8.4 hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=335
Score = 49.3 bits (116), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 0/42 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDPDLTR 67
YW+ N+W + WG+ GYF ++RGV+ GIES A PDL R
Sbjct 292 YWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMPDLNR 333
> dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12,
wu:fj58d01; cathepsin C (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=455
Score = 48.5 bits (114), Expect = 4e-06, Method: Composition-based stats.
Identities = 20/39 (51%), Positives = 26/39 (66%), Gaps = 0/39 (0%)
Query 25 KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP 63
KYWIV+N+WGS WG+ G+F + RG + IES A P
Sbjct 413 KYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATP 451
> hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen
Length=476
Score = 48.5 bits (114), Expect = 5e-06, Method: Composition-based stats.
Identities = 19/33 (57%), Positives = 24/33 (72%), Gaps = 0/33 (0%)
Query 23 KLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE 55
K K+WI N+WG SWG+ GYF ++RGVN IE
Sbjct 428 KEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
> xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275
cathepsin C [EC:3.4.14.1]
Length=458
Score = 48.5 bits (114), Expect = 6e-06, Method: Composition-based stats.
Identities = 20/39 (51%), Positives = 27/39 (69%), Gaps = 0/39 (0%)
Query 25 KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP 63
KYWIV+N+WG SWG+ G+F + RG + IES A +P
Sbjct 416 KYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANP 454
> cpv:cgd2_3320 secreted papain like protease, signal peptide
Length=819
Score = 48.1 bits (113), Expect = 6e-06, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 0/57 (0%)
Query 2 WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQA 58
W + D A T + ++ YWIV+N+WG WG+ G+ +IRGVN IE A
Sbjct 732 WHKVDHAMVITGWGWETYGNERIPYWIVQNSWGKRWGEKGFCRIIRGVNELSIEHAA 788
> dre:550475 ctsk, wu:fa95f03, wu:fb08b05, zgc:110367; cathepsin
K (EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=333
Score = 48.1 bits (113), Expect = 7e-06, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query 6 DAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF 60
D A A A GK KYWIV+N+WG WGK GY L+ R NA GI + ASF
Sbjct 276 DVNHAVLAVGYGATPRGK-KYWIVKNSWGEEWGKKGYVLMARNRNNACGIANLASF 330
> cel:K02E7.10 hypothetical protein
Length=299
Score = 47.8 bits (112), Expect = 8e-06, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 0/54 (0%)
Query 6 DAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
+A A + G G KYWIV+ ++G+SWG+ GY L R VNA G+ S
Sbjct 238 NANEARSLAIVGYGKDGAEKYWIVKGSFGTSWGEHGYMKLARNVNACGMAESIS 291
> cel:R07E3.1 hypothetical protein
Length=402
Score = 47.4 bits (111), Expect = 1e-05, Method: Composition-based stats.
Identities = 19/34 (55%), Positives = 26/34 (76%), Gaps = 1/34 (2%)
Query 25 KYWIVRNTWGSSWG-KGGYFLLIRGVNAGGIESQ 57
KYWIV+N+WG++WG + GY RG+NA GIE +
Sbjct 363 KYWIVKNSWGNTWGVEHGYIYFARGINACGIEDE 396
> dre:324818 ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H
(EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=330
Score = 47.4 bits (111), Expect = 1e-05, Method: Composition-based stats.
Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 0/38 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFIDP 63
YWIV+N+WG++WG GYF + RG N G+ + +S+ P
Sbjct 291 YWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSYPIP 328
> hsa:1521 CTSW, LYPN; cathepsin W; K08569 cathepsin W [EC:3.4.22.-]
Length=376
Score = 47.4 bits (111), Expect = 1e-05, Method: Composition-based stats.
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 6/53 (11%)
Query 2 WGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGI 54
W ET ++ + P YWI++N+WG+ WG+ GYF L RG N GI
Sbjct 308 WAETVSSQSQPQPPHPT------PYWILKNSWGAQWGEKGYFRLHRGSNTCGI 354
> mmu:56464 Ctsf, AI481912; cathepsin F (EC:3.4.22.41); K01373
cathepsin F [EC:3.4.22.41]
Length=462
Score = 47.4 bits (111), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 0/41 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
G+ + YW ++N+WGS WG+ GY+ L RG A G+ + AS
Sbjct 417 GNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGACGVNTMAS 457
> hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointerstitial
nephritis antigen-like 1
Length=436
Score = 47.4 bits (111), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 11/56 (19%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES 56
GWGE D LKYW N+WG +WG+ G+F ++RGVN IES
Sbjct 375 GWGEETLP-----------DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIES 419
> ath:AT3G48350 cysteine proteinase, putative
Length=364
Score = 47.0 bits (110), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 4/40 (10%)
Query 25 KYWIVRNTWGSSWGKGGYFLLIRGV--NAG--GIESQASF 60
KYWIVRN+WG WG+GGY + RG+ N G GI +AS+
Sbjct 302 KYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASY 341
> dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephritis
antigen-like 1
Length=471
Score = 46.6 bits (109), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 11/56 (19%)
Query 1 GWGETDAAAATTATPAAAGDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES 56
GWGE + T KYWI N+WG +WG+ GYF + RGVN IE+
Sbjct 402 GWGEERDYSGRTR-----------KYWIGANSWGKNWGEDGYFRIARGVNECDIET 446
> ath:AT3G19390 cysteine proteinase, putative / thiol protease,
putative
Length=452
Score = 46.2 bits (108), Expect = 3e-05, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAG----GIESQASF 60
G G YWIVRN+WGS+WG+ GYF L R + G+ AS+
Sbjct 298 GSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASY 343
> cel:F36D3.9 cpr-2; Cysteine PRotease related family member (cpr-2);
K01363 cathepsin B [EC:3.4.22.1]
Length=344
Score = 45.8 bits (107), Expect = 3e-05, Method: Composition-based stats.
Identities = 17/32 (53%), Positives = 24/32 (75%), Gaps = 0/32 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQ 57
YW+ N+WGS WG+ G F ++RGV+ GIES+
Sbjct 306 YWLAVNSWGSQWGESGTFRILRGVDECGIESR 337
> ath:AT3G54940 cysteine-type endopeptidase/ cysteine-type peptidase
Length=367
Score = 45.4 bits (106), Expect = 4e-05, Method: Composition-based stats.
Identities = 17/36 (47%), Positives = 24/36 (66%), Gaps = 0/36 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI 61
YWI++N+WG WG+ GY+ L RG + GI S S +
Sbjct 326 YWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAV 361
> cel:F26E4.3 hypothetical protein
Length=452
Score = 45.4 bits (106), Expect = 5e-05, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 14/69 (20%)
Query 2 WGETDAAAATTATPAAAG-------------DSGK-LKYWIVRNTWGSSWGKGGYFLLIR 47
+ +D AA A+ A G +GK +KYW+ N+WG+ WG+ GYF ++R
Sbjct 355 YQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLR 414
Query 48 GVNAGGIES 56
G N IES
Sbjct 415 GENHCEIES 423
> ath:AT2G21430 cysteine proteinase A494, putative / thiol protease,
putative
Length=361
Score = 45.1 bits (105), Expect = 5e-05, Method: Composition-based stats.
Identities = 15/36 (41%), Positives = 26/36 (72%), Gaps = 0/36 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI 61
YWI++N+WG SWG+ G++ + +G N G++S S +
Sbjct 321 YWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTV 356
> cel:F44C4.3 cpr-4; Cysteine PRotease related family member (cpr-4);
K01363 cathepsin B [EC:3.4.22.1]
Length=335
Score = 45.1 bits (105), Expect = 5e-05, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 0/37 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIE 55
G YW+V N+W +WG+ GYF +IRG N GIE
Sbjct 289 GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325
> xla:100127265 ctsk, cts02, ctso, ctso1, ctso2, pknd, pycd; cathepsin
K (EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=331
Score = 45.1 bits (105), Expect = 5e-05, Method: Composition-based stats.
Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF 60
G K KYWIV+N+WG WG GY L+ + NA GI + AS+
Sbjct 286 GTQKKAKYWIVKNSWGEEWGDKGYILMAKDKGNACGIANLASY 328
> ath:AT4G39090 RD19; RD19 (RESPONSIVE TO DEHYDRATION 19); cysteine-type
endopeptidase/ cysteine-type peptidase
Length=368
Score = 45.1 bits (105), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 0/36 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI 61
YWI++N+WG +WG+ G++ + +G N G++S S +
Sbjct 324 YWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTV 359
> mmu:13041 Ctsw, lymphopain; cathepsin W; K08569 cathepsin W
[EC:3.4.22.-]
Length=371
Score = 45.1 bits (105), Expect = 6e-05, Method: Composition-based stats.
Identities = 16/29 (55%), Positives = 22/29 (75%), Gaps = 0/29 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGI 54
YWI++N+WG+ WG+ GYF L RG N G+
Sbjct 321 YWILKNSWGAHWGEKGYFRLYRGNNTCGV 349
> xla:444163 ctsl1, MGC80629, catl, cpl-1, ctsl, mep; cathepsin
L1 (EC:3.4.22.15)
Length=256
Score = 45.1 bits (105), Expect = 6e-05, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 0/37 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFID 62
YWI++N+WG WG+ GY + R VN I + A+ +D
Sbjct 219 YWIIKNSWGKDWGENGYIRMKRNVNQCDIATAAATVD 255
> mmu:13038 Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K
(EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=329
Score = 45.1 bits (105), Expect = 6e-05, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query 25 KYWIVRNTWGSSWGKGGYFLLIRGV-NAGGIESQASF 60
K+WI++N+WG SWG GY LL R NA GI + ASF
Sbjct 290 KHWIIKNSWGESWGNKGYALLARNKNNACGITNMASF 326
> cel:Y51A2D.1 hypothetical protein
Length=411
Score = 44.7 bits (104), Expect = 6e-05, Method: Composition-based stats.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query 25 KYWIVRNTWG-SSWGKGGYFLLIRGVNAGGIESQA 58
++WI++N+WG S WG GGY LIRG N GIE A
Sbjct 366 RFWIMKNSWGVSGWGTGGYVKLIRGKNWCGIERGA 400
> pfa:PFD0230c protease, putative
Length=939
Score = 44.7 bits (104), Expect = 7e-05, Method: Composition-based stats.
Identities = 16/32 (50%), Positives = 23/32 (71%), Gaps = 0/32 (0%)
Query 25 KYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES 56
KYW V N+WG++WG GYF ++R N+ I+S
Sbjct 892 KYWKVLNSWGTNWGNSGYFYILRNNNSFNIKS 923
> hsa:1519 CTSO, CTSO1; cathepsin O (EC:3.4.22.42); K01374 cathepsin
O [EC:3.4.22.42]
Length=321
Score = 44.7 bits (104), Expect = 7e-05, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 0/41 (0%)
Query 21 SGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQASFI 61
+G YWIVRN+WGSSWG GY + G N GI S I
Sbjct 279 TGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSI 319
> dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.22.1]
Length=326
Score = 44.3 bits (103), Expect = 8e-05, Method: Composition-based stats.
Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 0/39 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQ 57
G+ +W+V N+W S WG GYF ++RG + GIES+
Sbjct 280 GEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESE 318
> cel:Y71H2AR.2 hypothetical protein
Length=345
Score = 44.3 bits (103), Expect = 1e-04, Method: Composition-based stats.
Identities = 18/33 (54%), Positives = 24/33 (72%), Gaps = 0/33 (0%)
Query 19 GDSGKLKYWIVRNTWGSSWGKGGYFLLIRGVNA 51
G G+ KYWIV+ ++G+SWG+ GY L R VNA
Sbjct 253 GIEGEQKYWIVKGSFGTSWGEQGYMKLARDVNA 285
> dre:567046 ctskl, si:dkey-121a11.6; cathepsin K, like
Length=349
Score = 43.9 bits (102), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query 26 YWIVRNTWGSSWGKGGYFLLIR-GVNAGGIESQASF 60
YWI++N+WG+ WG+GGY +IR G N GI S A +
Sbjct 311 YWIIKNSWGTGWGEGGYMRMIRNGKNTCGIASYALY 346
> cel:C25B8.3 cpr-6; Cysteine PRotease related family member (cpr-6)
Length=379
Score = 43.9 bits (102), Expect = 1e-04, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 24/33 (72%), Gaps = 0/33 (0%)
Query 24 LKYWIVRNTWGSSWGKGGYFLLIRGVNAGGIES 56
+ YW V N+W + WG+ G+F ++RGV+ GIES
Sbjct 318 IPYWTVANSWNTDWGEDGFFRILRGVDECGIES 350
> cel:Y113G7B.15 hypothetical protein
Length=328
Score = 43.9 bits (102), Expect = 1e-04, Method: Composition-based stats.
Identities = 19/34 (55%), Positives = 23/34 (67%), Gaps = 0/34 (0%)
Query 26 YWIVRNTWGSSWGKGGYFLLIRGVNAGGIESQAS 59
YW+VRN+W S WG GY + RGVN IES A+
Sbjct 289 YWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAA 322
Lambda K H
0.314 0.132 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 2040069136
Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
Posted date: Sep 16, 2011 8:45 PM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
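
Note on the statistics above: the bit scores in this report can be reproduced from the raw scores (the numbers in parentheses) with the gapped Lambda and K listed above, using the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. An approximate E-value then follows as search_space * 2^(-S'). The sketch below is illustrative only: the function names are hypothetical and not part of BLAST, and the E-values printed in the report may differ slightly from this approximation because of composition-based adjustments and rounding.

```python
import math

# Gapped statistical parameters and search space copied from this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 2040069136


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_evalue(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value for a bit score; ignores per-hit adjustments."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores taken from the first four alignments in this report.
    for raw in (202, 196, 194, 186):
        bits = bit_score(raw)
        print(f"raw={raw} bits={bits:.1f} E~{approx_evalue(bits):.0e}")
```

Running this on the raw scores 202, 196, 194, and 186 gives bit scores that agree with the reported 82.4, 80.1, 79.3, and 76.3 to one decimal place; the approximate E-values land in the same order of magnitude as the reported ones.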