bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_866
164,496 sequences; 82,071,388 total letters
Query= Eten_2892_orf1
Length=193
Score E
Sequences producing significant alignments: (Bits) Value
tgo:TGME49_089620 cathepsin C (EC:3.4.14.1); K01275 cathepsin ... 147 3e-35
pfa:PF11_0174 cathepsin C, homolog; K01275 cathepsin C [EC:3.4... 122 1e-27
bbo:BBOV_I000540 16.m00694; preprocathepsin c precursor; K0127... 113 5e-25
cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin C... 107 2e-23
tgo:TGME49_067490 papain family cysteine protease domain-conta... 98.6 1e-20
pfa:PFL2290w preprocathepsin c precursor, putative (EC:3.4.14.... 87.8 2e-17
tpv:TP03_0357 cathepsin C; K01275 cathepsin C [EC:3.4.14.1] 81.6 2e-15
dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12, w... 79.3 8e-15
bbo:BBOV_II000170 18.m05995; cathepsin C precursor (EC:3.4.22.... 77.8 2e-14
tpv:TP02_0883 cathepsin C; K01275 cathepsin C [EC:3.4.14.1] 77.0 3e-14
xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275 c... 63.9 3e-10
mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1... 63.2 5e-10
mmu:94242 Tinagl1, 1110021J17Rik, AZ-1, AZ1, Arg1, Lcn7, TARP,... 47.8 2e-05
dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephr... 44.3 3e-04
tgo:TGME49_076130 cathepsin C2 (TgCPC2) (EC:3.4.14.1) 42.4 0.001
cel:F26E4.3 hypothetical protein 40.0 0.005
hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointersti... 40.0 0.005
hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen 40.0 0.005
bbo:BBOV_IV007730 23.m06535; cysteine protease 2 38.1
cpv:cgd2_3320 secreted papain like protease, signal peptide 37.7 0.024
mmu:70202 Ctsll3, 2310051M13Rik; cathepsin L-like 3 (EC:3.4.22... 37.4 0.035
mmu:13039 Ctsl, 1190035F06Rik, Ctsl1, MEP, fs, nkt; cathepsin ... 37.4 0.037
mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephriti... 37.0 0.041
xla:447313 ctsl2, MGC81823; cathepsin L2; K01365 cathepsin L [... 37.0 0.043
ath:AT3G45310 cysteine proteinase, putative; K01366 cathepsin ... 36.2 0.067
ath:AT3G19390 cysteine proteinase, putative / thiol protease, ... 35.8 0.10
dre:567623 zgc:174153 35.4 0.11
hsa:1515 CTSL2, CATL2, CTSU, CTSV, MGC125957; cathepsin L2 (EC... 34.7 0.20
dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.... 34.7 0.21
ath:AT5G60360 AALP; AALP (Arabidopsis aleurain-like protease);... 34.7 0.23
cel:R09F10.1 hypothetical protein 34.7 0.23
dre:567333 ctso; cathepsin O; K01374 cathepsin O [EC:3.4.22.42] 34.3 0.29
ath:AT1G02305 cathepsin B-like cysteine protease, putative 33.9 0.32
pfa:PF11_0165 falcipain-2A 33.5 0.41
xla:380102 cg10992; hypothetical protein MGC52983; K01363 cath... 33.5 0.42
cel:F41E6.6 tag-196; Temporarily Assigned Gene name family mem... 33.5 0.46
cel:F15D4.4 hypothetical protein 33.5 0.47
ath:AT5G50260 cysteine proteinase, putative; K01376 [EC:3.4.2... 33.5 0.50
tpv:TP03_0283 cysteine proteinase (EC:3.4.22.-); K01376 [EC:3... 33.1 0.66
mmu:13038 Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K (E... 32.7 0.78
ath:AT3G49340 cysteine proteinase, putative; K01376 [EC:3.4.2... 31.2 2.1
> tgo:TGME49_089620 cathepsin C (EC:3.4.14.1); K01275 cathepsin
C [EC:3.4.14.1]
Length=733
Score = 147 bits (370), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/165 (47%), Positives = 96/165 (58%), Gaps = 25/165 (15%)
Query 29 KLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
+LS QS+LSCSFYNQGC GG PYLVGKHA +IG+ +CM Y A CP
Sbjct 495 ELSAQSILSCSFYNQGCDGGFPYLVGKHARDIGVPQARCMEYQADHTQGCP--------- 545
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLS 148
F K+ +S + + L + G C + RW+AK YGY+GGCYEC
Sbjct 546 -FQKTAS-------------ASESQSMLQADANAGACSE-HARWYAKDYGYIGGCYECNH 590
Query 149 CSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGAPHARV 193
CS E++IM EI NGPV A DAPPSLF+Y SGVY + HARV
Sbjct 591 CSGEKQIMLEIYNNGPVPVAFDAPPSLFSYRSGVYDAN-SNHARV 634
> pfa:PF11_0174 cathepsin C, homolog; K01275 cathepsin C [EC:3.4.14.1]
Length=700
Score = 122 bits (305), Expect = 1e-27, Method: Composition-based stats.
Identities = 70/178 (39%), Positives = 100/178 (56%), Gaps = 19/178 (10%)
Query 29 KLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
+LS Q+VLSCSFY+QGC+GG PYLV K A GI PY+A + +CP N +K+
Sbjct 430 QLSIQTVLSCSFYDQGCNGGFPYLVSKLAKLQGIPLNVYFPYSATE-ETCP-YNISKHPN 487
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAF-----------LFSGEATGTCHQG---ENRWFA 134
D S K L+ EIN +++ + + +++ A+ G ENRW+A
Sbjct 488 DMNGSAK--LR-EINAIFNSNNNMSTYNNINNDHHQLGVYANTASSQEQHGISEENRWYA 544
Query 135 KGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGAPHAR 192
K + YVGGCY C C+ E+ +M EI NGP+ ++ +A P + Y+ GVY PHAR
Sbjct 545 KDFNYVGGCYGCNQCNGEKIMMNEIYRNGPIVSSFEASPDFYDYADGVYFVEDFPHAR 602
> bbo:BBOV_I000540 16.m00694; preprocathepsin c precursor; K01275
cathepsin C [EC:3.4.14.1]
Length=546
Score = 113 bits (282), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/166 (40%), Positives = 82/166 (49%), Gaps = 53/166 (31%)
Query 31 SPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYT-----AMDLSSCPVLNRNK 85
S Q +L CS +NQGC+GG P+LVGKH TE G+L E PY A+D S VL+ ++
Sbjct 339 SVQDMLECSPFNQGCYGGFPFLVGKHLTEFGVLSEDKSPYRMSNGGAVDTCSVDVLDPSE 398
Query 86 NEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYE 145
RW+A GYGYVGGCYE
Sbjct 399 ---------------------------------------------RWYASGYGYVGGCYE 413
Query 146 CLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGAPHA 191
C S E +IM+E+ NGPVA ALDAP SLF YSSG+Y + H
Sbjct 414 CTS---ELEIMREVYHNGPVAVALDAPQSLFQYSSGIYDDNPSNHG 456
> cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin
C [EC:3.4.14.1]
Length=635
Score = 107 bits (268), Expect = 2e-23, Method: Composition-based stats.
Identities = 65/169 (38%), Positives = 84/169 (49%), Gaps = 35/169 (20%)
Query 24 QEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNR 83
+E + LSPQSVLSCS +NQGC GG P+LVG+ A EIGI E+CM Y A C
Sbjct 385 REEKILLSPQSVLSCSPFNQGCEGGYPFLVGRQAEEIGISSEKCMGYYADSNQEC----- 439
Query 84 NKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGC 143
+ F+ EI E C +GE R +A+ YGYVGGC
Sbjct 440 ---------NFSPFITPEI-----------------EDRIYCEEGE-RMYAEEYGYVGGC 472
Query 144 YECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGAPHAR 192
Y C E ++ +EI NGP+A A+ SL Y +GVY + H +
Sbjct 473 Y---GCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYDNGVYDSIPNDHTK 518
> tgo:TGME49_067490 papain family cysteine protease domain-containing
protein (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=622
Score = 98.6 bits (244), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 89/164 (54%), Gaps = 36/164 (21%)
Query 30 LSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEKD 89
LS QS+LSCS YNQGC GG P+LVGKHA E G EQC +S+C + +
Sbjct 400 LSAQSILSCSPYNQGCDGGYPFLVGKHAKEFGFGTEQC------QVSACKMFH------- 446
Query 90 FLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLSC 149
+++ +IN E +F + +C +AK Y YVGG YE C
Sbjct 447 -------YIQHKINSVAE--------IFCQPRSPSCFL-----YAKDYNYVGGFYE--GC 484
Query 150 SAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGAPHARV 193
+ E+K+M E+ +GPV A+DAP +LF Y SG++ ++ + H ++
Sbjct 485 N-EEKMMNEMYHHGPVVVAIDAPDTLFMYQSGLFDSQPSEHGKI 527
> pfa:PFL2290w preprocathepsin c precursor, putative (EC:3.4.14.1);
K01275 cathepsin C [EC:3.4.14.1]
Length=590
Score = 87.8 bits (216), Expect = 2e-17, Method: Composition-based stats.
Identities = 54/150 (36%), Positives = 74/150 (49%), Gaps = 36/150 (24%)
Query 29 KLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
+LS QS+LSCS YNQGC GG P+LVGKH E GI+ EQ M Y D ++C + N N
Sbjct 351 RLSHQSILSCSPYNQGCDGGYPFLVGKHMYEYGIIPEQYMHYENNDYNNCIMDMGNYNH- 409
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLS 148
L + +K EI ++ Y Y+ GCYE
Sbjct 410 --LNKQNRNIK-EI-----------------------------YYVSDYNYINGCYE--- 434
Query 149 CSAEQKIMKEIMTNGPVAAALDAPPSLFAY 178
C+ E ++M EI+ NGP+ AA++A L +
Sbjct 435 CTNEYEMMNEIILNGPIVAAINATSELLNF 464
> tpv:TP03_0357 cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=501
Score = 81.6 bits (200), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 73/174 (41%), Gaps = 52/174 (29%)
Query 1 RLQQQQQEGEEQQQQQLQQLQQ---QQEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHA 57
R+++ ++ E + +L L++ + + LS LSC YNQGC GG P VGK A
Sbjct 260 RIRELLKKPEYKSDARLLHLEKVLSDKNFNINLS----LSCIPYNQGCKGGFPVNVGKFA 315
Query 58 TEIGILDEQCMPYTAMDLSSCPVLNRNKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLF 117
E G++ + P +D CP E++F
Sbjct 316 EEFGLILDNEKPEEVVDNLKCP------------PKEENF-------------------- 343
Query 118 SGEATGTCHQGENRWFAKGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDA 171
R FA GYVGGCYEC CS E IM EIM NGPV A +D
Sbjct 344 -------------RLFASNVGYVGGCYECTRCSGETLIMNEIMLNGPVVAGIDG 384
> dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12,
wu:fj58d01; cathepsin C (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=455
Score = 79.3 bits (194), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 76/179 (42%), Gaps = 52/179 (29%)
Query 9 GEEQQQQQLQQLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCM 68
G + + ++Q QQ SPQ V+SCS Y+QGC GG PYL+GK+ + GI++E C
Sbjct 258 GMLEARVRIQTNNTQQPV---FSPQQVVSCSQYSQGCDGGFPYLIGKYIQDFGIVEEDCF 314
Query 69 PYTAMDLSSCPVLNRNKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQG 128
PYT G DS N C
Sbjct 315 PYT----------------------------------GSDSPCNLP--------AKC--- 329
Query 129 ENRWFAKGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRG 187
+++A Y YVGG Y CS E +M E++ NGP+ AL+ P Y G+Y G
Sbjct 330 -TKYYASDYHYVGGFYG--GCS-ESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTG 384
> bbo:BBOV_II000170 18.m05995; cathepsin C precursor (EC:3.4.22.-);
K01275 cathepsin C [EC:3.4.14.1]
Length=530
Score = 77.8 bits (190), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 65/157 (41%), Gaps = 53/157 (33%)
Query 30 LSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK- 88
P+ V CS YNQGC GG PYL+GK E GIL +N ++
Sbjct 324 FDPKDVTDCSMYNQGCDGGYPYLMGKQMREFGILT-----------------TKNAGQQC 366
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLS 148
L +E+ + FA+ YGYVGGC++C +
Sbjct 367 TLLSTERRY-----------------------------------FARDYGYVGGCHQCTA 391
Query 149 CSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTT 185
C + IM+EI+ NGPV A+DA Y + T+
Sbjct 392 CQGDALIMREILANGPVVTAIDAAVLTADYDGHIITS 428
> tpv:TP02_0883 cathepsin C; K01275 cathepsin C [EC:3.4.14.1]
Length=365
Score = 77.0 bits (188), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 70/160 (43%), Gaps = 51/160 (31%)
Query 29 KLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
KLS +LS S Y+QGC GG LVGKH E+GI + + +
Sbjct 181 KLSIHDLLS-SAYSQGCFGGFLMLVGKHIKELGI---------------------HSDSE 218
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLS 148
FL + + ED + +W+ YGYVGGCYEC
Sbjct 219 TFLNTLRTL---------EDIKFDM-----------------KWYIDSYGYVGGCYEC-- 250
Query 149 CSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRGA 188
+ E +M EI+TNGP+A A+ +PP LF Y G T A
Sbjct 251 -TNEMNMMNEIITNGPIAVAIYSPPQLFYYKHGWEYTNHA 289
> xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275
cathepsin C [EC:3.4.14.1]
Length=458
Score = 63.9 bits (154), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 67/170 (39%), Gaps = 50/170 (29%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLV-GKHATEIGILDEQCMPYTAMDLSS 77
Q+Q Q LSPQ V+SCS Y+QGC GG PYL+ GK+ + GI++E PY D S
Sbjct 267 QIQSQLSQKPILSPQQVVSCSNYSQGCDGGFPYLIAGKYLNDFGIVEESDFPYIGSD-SP 325
Query 78 CPVLNRNKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGY 137
C T R++ Y
Sbjct 326 C---------------------------------------------TLKDSYQRYYTAEY 340
Query 138 GYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYTTRG 187
YVGG Y C+ E + E++ GP++ A + Y SGVY G
Sbjct 341 HYVGGFYG--GCN-EAYMKLELVLGGPLSVAFEVYDDFIHYRSGVYHHTG 387
> mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1);
K01275 cathepsin C [EC:3.4.14.1]
Length=462
Score = 63.2 bits (152), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 42/62 (67%), Gaps = 2/62 (3%)
Query 14 QQQLQQLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLV-GKHATEIGILDEQCMPYTA 72
+ +++ L + + LSPQ V+SCS Y QGC GG PYL+ GK+A + G+++E C PYTA
Sbjct 267 EARIRILTNNSQTPI-LSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTA 325
Query 73 MD 74
D
Sbjct 326 KD 327
> mmu:94242 Tinagl1, 1110021J17Rik, AZ-1, AZ1, Arg1, Lcn7, TARP,
Tinagl; tubulointerstitial nephritis antigen-like 1
Length=466
Score = 47.8 bits (112), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query 30 LSPQSVLSC-SFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
LSPQ++LSC + + QGC GG G++ + C P++ R +NE
Sbjct 253 LSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSG----------REQNEA 302
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGE---NRWFAKGYGYVGGCYE 145
+ + + G G+ +AT C G+ N + Y G
Sbjct 303 S--PTPRCMMHSRAMGRGKR-----------QATSRCPNGQVDSNDIYQVTPAYRLG--- 346
Query 146 CLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYT 184
S E++IMKE+M NGPV A ++ F Y G+Y+
Sbjct 347 ----SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYS 381
> dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephritis
antigen-like 1
Length=471
Score = 44.3 bits (103), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/169 (23%), Positives = 66/169 (39%), Gaps = 37/169 (21%)
Query 20 LQQQQEAAVKLSPQSVLSCSFYNQ-GCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSC 78
+Q +LSPQ+++SC +Q GC GG G++ + C P++ + S+
Sbjct 241 IQSMGHMTPQLSPQNLISCDTRHQDGCAGGRIDGAWWFMRRRGVVTQDCYPFSPPEQSAV 300
Query 79 PVLNRNKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYG 138
V + ++ G G+ +AT C +
Sbjct 301 EV-------------ARCMMQSRAVGRGKR-----------QATAHC--------PNSHS 328
Query 139 YVGGCYECLS----CSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
Y Y+ + E +IMKEIM NGPV A ++ F Y SG++
Sbjct 329 YHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIF 377
> tgo:TGME49_076130 cathepsin C2 (TgCPC2) (EC:3.4.14.1)
Length=753
Score = 42.4 bits (98), Expect = 0.001, Method: Composition-based stats.
Identities = 42/148 (28%), Positives = 65/148 (43%), Gaps = 17/148 (11%)
Query 37 SCSFYNQGCHGGLPYLVGKHATEIGILDEQCMP-YTAMDLSSCPVLNRNKNEKDFLKSEK 95
SC+ YNQGC GG +L K E G E C+ Y AM L+ N L++
Sbjct 425 SCNVYNQGCGGGYVFLALKFGQEHGFRTEDCVSEYHAMADKHKGSLSPN------LQTCF 478
Query 96 DFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLSCSAEQKI 155
D L G++ S + A +C+ + YVGG Y CS E +
Sbjct 479 D-LGGQLGTSAFGCQAPPA---RASLPDSCNLSVK---VTSWHYVGGVYG--GCS-EDDM 528
Query 156 MKEIMTNGPVAAALDAPPSLFAYSSGVY 183
++ + +GP+AA+++ + Y GV+
Sbjct 529 LRTLWEHGPMAASIEPTIAFTVYKKGVF 556
> cel:F26E4.3 hypothetical protein
Length=452
Score = 40.0 bits (92), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 59/155 (38%), Gaps = 35/155 (22%)
Query 30 LSPQSVLSCSFYNQ-GCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
LS Q +LSC+ + Q GC GG + ++G++ + C PY V +++
Sbjct 235 LSSQQLLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGDHCYPY---------VSGQSREPG 285
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRWFAKGYGYVGGCYECLS 148
L ++D+ + S + AF + +
Sbjct 286 HCLIPKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVS------------------------ 321
Query 149 CSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
S E+ I E+MTNGPV A F Y+ GVY
Sbjct 322 -SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355
> hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointerstitial
nephritis antigen-like 1
Length=436
Score = 40.0 bits (92), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query 30 LSPQSVLSCSFYNQ-GCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSCPVLNRNKNEK 88
LSPQ++LSC + Q GC GG G++ + C P++ R ++E
Sbjct 223 LSPQNLLSCDTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSG----------RERDEA 272
Query 89 DFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTC---HQGENRWFAKGYGYVGGCYE 145
+ + G G+ +AT C + N + Y G
Sbjct 273 G--PAPPCMMHSRAMGRGKR-----------QATAHCPNSYVNNNDIYQVTPVYRLG--- 316
Query 146 CLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVYT 184
S +++IMKE+M NGPV A ++ F Y G+Y+
Sbjct 317 ----SNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYS 351
> hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen
Length=476
Score = 40.0 bits (92), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 0/34 (0%)
Query 150 SAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
S E +IMKEIM NGPV A + F Y +G+Y
Sbjct 359 SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY 392
> bbo:BBOV_IV007730 23.m06535; cysteine protease 2
Length=445
Score = 38.1 bits (87), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 0/58 (0%)
Query 17 LQQLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMD 74
++ L ++Q+ V+LS Q ++SC NQGC+GG + GI + PY A D
Sbjct 269 VESLLKRQKTDVRLSEQELVSCQLGNQGCNGGYSDYALNYIKFNGIHRSEEWPYLAAD 326
> cpv:cgd2_3320 secreted papain like protease, signal peptide
Length=819
Score = 37.7 bits (86), Expect = 0.024, Method: Composition-based stats.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 0/38 (0%)
Query 38 CSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDL 75
C+ YNQGC GGL L K A E+G+ + C+ A +L
Sbjct 437 CNVYNQGCGGGLITLAFKFAQEVGVRTQDCVDDYAKNL 474
Score = 29.3 bits (64), Expect = 8.0, Method: Composition-based stats.
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 3/49 (6%)
Query 135 KGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
K Y YV Y + + IM+ + GPVA +L+ Y+SGV+
Sbjct 669 KEYSYVNNVY---GKTTARDIMESLWNEGPVAVSLEPTLEFSLYNSGVF 714
> mmu:70202 Ctsll3, 2310051M13Rik; cathepsin L-like 3 (EC:3.4.22.15)
Length=331
Score = 37.4 bits (85), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILDEQC-MPYTAMD 74
Q+ ++ V LS Q+++ CS+ NQGC GGLP L ++ + G LD PY A++
Sbjct 151 QMFRKTGKLVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALN 209
> mmu:13039 Ctsl, 1190035F06Rik, Ctsl1, MEP, fs, nkt; cathepsin
L (EC:3.4.22.15); K01365 cathepsin L [EC:3.4.22.15]
Length=334
Score = 37.4 bits (85), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 1/33 (3%)
Query 152 EQKIMKEIMTNGPVAAALDAP-PSLFAYSSGVY 183
E+ +MK + T GP++ A+DA PSL YSSG+Y
Sbjct 232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIY 264
Score = 34.7 bits (78), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query 28 VKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMD 74
+ LS Q+++ CS NQGC+GGL ++ E G LD E+ PY A D
Sbjct 159 ISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD 208
> mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephritis
antigen
Length=475
Score = 37.0 bits (84), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 0/34 (0%)
Query 150 SAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
S E +IM+EI+ NGPV A + F Y +G+Y
Sbjct 358 SNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIY 391
> xla:447313 ctsl2, MGC81823; cathepsin L2; K01365 cathepsin L
[EC:3.4.22.15]
Length=335
Score = 37.0 bits (84), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 11/82 (13%)
Query 28 VKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMDLSSC---PVL 81
+ LS Q+++ CS NQGC+GGL ++ + G +D E PYTA D C P
Sbjct 159 ISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNY 218
Query 82 NRNKNEKDFLK----SEKDFLK 99
N + N+ F+ SEKD +K
Sbjct 219 N-SANDTGFVDVPSGSEKDLMK 239
> ath:AT3G45310 cysteine proteinase, putative; K01366 cathepsin
H [EC:3.4.22.16]
Length=357
Score = 36.2 bits (82), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query 28 VKLSPQSVLSC--SFYNQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMD 74
+ LS Q ++ C +F N GCHGGLP ++ G LD E+ PYT D
Sbjct 186 ISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 235
> ath:AT3G19390 cysteine proteinase, putative / thiol protease,
putative
Length=452
Score = 35.8 bits (81), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 56/131 (42%), Gaps = 18/131 (13%)
Query 28 VKLSPQSVLSC-SFYNQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMDLSSCPVLNRN- 84
+ LS Q ++ C + YN GC GGL K E G +D E+ PY A D++ C +N
Sbjct 174 ISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNT 233
Query 85 ------------KNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQG-ENR 131
+N++ LK K I+ + E SG TGTC ++
Sbjct 234 RVVTIDGYEDVPQNDEKSLK--KALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHG 291
Query 132 WFAKGYGYVGG 142
A GYG GG
Sbjct 292 VVAVGYGSEGG 302
> dre:567623 zgc:174153
Length=336
Score = 35.4 bits (80), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 65/182 (35%), Gaps = 62/182 (34%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMDL 75
QL ++ + +S Q+++ CS NQGC+GGL L ++ E LD EQ PY A D
Sbjct 151 QLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDD 210
Query 76 SSC---PVLNRNKNEKDFLKSEKDFLKGEINGSGEDSSSNAAFLFSGEATGTCHQGENRW 132
C P N
Sbjct 211 LPCRYDPRFN-------------------------------------------------- 220
Query 133 FAKGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPP-SLFAYSSGVYTTRGAPHA 191
AK G+V + E +M + GPV+ A+DA SL Y SG+Y R +
Sbjct 221 VAKSTGFVD-----IPSGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSS 275
Query 192 RV 193
R+
Sbjct 276 RL 277
> hsa:1515 CTSL2, CATL2, CTSU, CTSV, MGC125957; cathepsin L2 (EC:3.4.22.43);
K01375 cathepsin V [EC:3.4.22.43]
Length=334
Score = 34.7 bits (78), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMD 74
Q+ ++ V LS Q+++ CS NQGC+GG ++ E G LD E+ PY A+D
Sbjct 150 QMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVD 208
Score = 32.7 bits (73), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query 144 YECLSCSAEQKIMKEIMTNGPVAAALDAPPSLF-AYSSGVY 183
+ ++ E+ +MK + T GP++ A+DA S F Y SG+Y
Sbjct 225 FTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIY 265
> dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.22.1]
Length=326
Score = 34.7 bits (78), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 20/34 (58%), Gaps = 0/34 (0%)
Query 150 SAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
S +Q+IM E+ TNGPV AA Y SGVY
Sbjct 228 SDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVY 261
> ath:AT5G60360 AALP; AALP (Arabidopsis aleurain-like protease);
cysteine-type peptidase; K01366 cathepsin H [EC:3.4.22.16]
Length=357
Score = 34.7 bits (78), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query 28 VKLSPQSVLSCS--FYNQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMD 74
+ LS Q ++ C+ F N GC+GGLP ++ G LD E+ PYT D
Sbjct 186 ISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD 235
> cel:R09F10.1 hypothetical protein
Length=383
Score = 34.7 bits (78), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 0/51 (0%)
Query 28 VKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMDLSSC 78
V LS Q ++ C N GC GG K E G+ E+ PY+A+ C
Sbjct 213 VSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKENGLESEKEYPYSALKHDQC 263
> dre:567333 ctso; cathepsin O; K01374 cathepsin O [EC:3.4.22.42]
Length=334
Score = 34.3 bits (77), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 56/150 (37%), Gaps = 38/150 (25%)
Query 29 KLSPQSVLSCSFYNQGCHGGLPY--LVGKHATEIGILDEQCMPYTAMD-------LSSCP 79
+LS Q V+ CS+ NQGC+GG P L +++ ++ E P+ D +
Sbjct 167 QLSVQQVIDCSYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAG 226
Query 80 VLNRNKNEKDFLKSE----------------------KDFLKGEINGSGEDSSSNAAFLF 117
V RN + DF E +D+L G I +N A L
Sbjct 227 VAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHHCSSHKANHAVLI 286
Query 118 SG-EATGTCHQGENR------WFAKGYGYV 140
+G + TG R W GY Y+
Sbjct 287 TGYDTTGEVPYWIVRNSWGTSWGDDGYAYI 316
> ath:AT1G02305 cathepsin B-like cysteine protease, putative
Length=362
Score = 33.9 bits (76), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query 28 VKLSPQSVLSCSFY--NQGCHGGLPYLVGKHATEIGILDEQCMPY 70
V LS +L+C + QGC+GG P ++ G++ E+C PY
Sbjct 155 VSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPY 199
> pfa:PF11_0165 falcipain-2A
Length=484
Score = 33.5 bits (75), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEI-GILDEQCMPYTA 72
Q ++ + LS Q ++ CSF N GC+GGL + E+ GI + PY +
Sbjct 297 QYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICTDDDYPYVS 351
> xla:380102 cg10992; hypothetical protein MGC52983; K01363 cathepsin
B [EC:3.4.22.1]
Length=333
Score = 33.5 bits (75), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/178 (23%), Positives = 66/178 (37%), Gaps = 46/178 (25%)
Query 20 LQQQQEAAVKLSPQSVLSCSFYN--QGCHGGLPYLVGKHATEIGIL-----DEQ--CMPY 70
+ + V++S + +LSC + GC+GG P + TE G++ D C PY
Sbjct 123 VHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPY 182
Query 71 TAMDLSSCPVLNRNKNEKDFLKSEKDFLKGEINGS-----GEDSSSNAAFLFSGEATGTC 125
+ + C + +NGS GE+ + E
Sbjct 183 S---IPPC--------------------EHHVNGSRPSCKGEEGDTPKCMKTCEEGYTPA 219
Query 126 HQGENRWFAKGYGYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
+ + + A YG S+E++IM +I NGPV A Y SGVY
Sbjct 220 YGSDKHFGATSYGVP---------SSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVY 268
> cel:F41E6.6 tag-196; Temporarily Assigned Gene name family member
(tag-196); K01373 cathepsin F [EC:3.4.22.41]
Length=477
Score = 33.5 bits (75), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query 28 VKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILD-EQCMPY 70
V LS Q ++ C +QGC+GGLP K +G L+ E PY
Sbjct 309 VSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY 352
> cel:F15D4.4 hypothetical protein
Length=608
Score = 33.5 bits (75), Expect = 0.47, Method: Composition-based stats.
Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 0/46 (0%)
Query 138 GYVGGCYECLSCSAEQKIMKEIMTNGPVAAALDAPPSLFAYSSGVY 183
GY+ G + ++ +++ + GP+A + A P ++ YS GVY
Sbjct 341 GYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYKYSEGVY 386
> ath:AT5G50260 cysteine proteinase, putative; K01376 [EC:3.4.22.-]
Length=361
Score = 33.5 bits (75), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 52/117 (44%), Gaps = 12/117 (10%)
Query 21 QQQQEAAVKLSPQSVLSC-SFYNQGCHGGLPYLVGKHATEI-GILDEQCMPYTAMDLS-- 76
Q + + LS Q ++ C + NQGC+GGL L + E G+ E PY A D +
Sbjct 164 QIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCD 223
Query 77 ----SCPVLNRNKNEKDFLKSEKDFLKGEIN----GSGEDSSSNAAFLFSGEATGTC 125
+ PV++ + +E SE D +K N + + S+ F G TG C
Sbjct 224 TNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRC 280
> tpv:TP03_0283 cysteine proteinase (EC:3.4.22.-); K01376 [EC:3.4.22.-]
Length=441
Score = 33.1 bits (74), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 0/45 (0%)
Query 30 LSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILDEQCMPYTAMD 74
LS Q +++C + GC GGLP ++ GI E +PY +D
Sbjct 275 LSEQELVNCDKSSMGCSGGLPITAMEYIHSKGISFESEIPYIGID 319
> mmu:13038 Ctsk, AI323530, MMS10-Q, Ms10q, catK; cathepsin K
(EC:3.4.22.38); K01371 cathepsin K [EC:3.4.22.38]
Length=329
Score = 32.7 bits (73), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query 19 QLQQQQEAAVKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEIGILD-EQCMPYTAMDLS 76
QL+++ + LSPQ+++ C N GC GG ++ + G +D E PY D S
Sbjct 151 QLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDES 209
> ath:AT3G49340 cysteine proteinase, putative; K01376 [EC:3.4.22.-]
Length=341
Score = 31.2 bits (69), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query 28 VKLSPQSVLSCSFYNQGCHGGLPYLVGKHATEI-GILDEQCMPY 70
V LS Q +L CS N GC GG+ + + E GI E PY
Sbjct 172 VSLSEQQLLDCSTENNGCGGGIMWKAFDYIKENQGITTEDNYPY 215
Lambda K H
0.314 0.130 0.387
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 5623228644
Database: egene_temp_file_orthology_annotation_similarity_blast_database_866
Posted date: Sep 17, 2011 2:57 PM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40