bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.24+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: egene_temp_file_orthology_annotation_similarity_blast_database_866 164,496 sequences; 82,071,388 total letters Query= Eten_8027_orf1 Length=103 Score E Sequences producing significant alignments: (Bits) Value tgo:TGME49_109290 HD domain-containing protein ; K07023 putati... 69.3 3e-12 hsa:51020 HDDC2, C6orf74, MGC87330, NS5ATP2, dJ167O5.2; HD dom... 64.7 7e-11 xla:447085 hddc2, MGC85244; HD domain containing 2; K07023 put... 62.0 4e-10 mmu:69692 Hddc2, 2310057G13Rik, MGC129436, MGC129437; HD domai... 61.2 7e-10 ath:AT2G23820 metal-dependent phosphohydrolase HD domain-conta... 61.2 8e-10 dre:554129 hddc2, fb71h08, wu:fb71h08, zgc:112330; HD domain c... 61.2 8e-10 ath:AT1G26160 metal-dependent phosphohydrolase HD domain-conta... 60.5 1e-09 sce:YGL101W Putative protein of unknown function; non-essentia... 60.1 2e-09 sce:YBR242W Putative protein of unknown function; green fluore... 57.4 1e-08 pfa:PFL1890c HD superfamily phosphohydrolase protein 34.3 0.100 ath:AT1G30600 subtilase family protein 28.9 3.8 ath:AT2G40890 CYP98A3 (cytochrome P450, family 98, subfamily A... 28.5 5.3 hsa:8881 CDC16, ANAPC6, APC6, CUT9; cell division cycle 16 hom... 28.1 6.6 eco:b2291 yfbR, ECK2285, JW2288; 5'-nucleotidase; K08722 5'-nu... 28.1 6.8 eco:b2230 yfaA, ECK2222, JW2224, pufY; conserved protein, DUF2... 27.7 9.3 > tgo:TGME49_109290 HD domain-containing protein ; K07023 putative hydrolases of HD superfamily Length=305 Score = 69.3 bits (168), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 41/84 (48%), Positives = 51/84 (60%), Gaps = 4/84 (4%) Query 20 LESFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSF--GVAFHFLGL-PDSDLVVNGQLA 76 L +FL G+LKKLKR GW + ESVA+HSF G+ +G P S ++ Sbjct 32 LLNFLLMVGELKKLKRTGW-KLSGVRGPESVAEHSFRAGICAFLIGTDPQSSKLIRENKL 90 Query 77 DRNKCAAMALAHDLAESVVGDITP 100 DRNKC MAL HDLAE++ GDITP Sbjct 91 DRNKCIKMALVHDLAEALAGDITP 114 > hsa:51020 HDDC2, C6orf74, MGC87330, NS5ATP2, dJ167O5.2; HD domain containing 2; K07023 putative hydrolases of HD superfamily Length=204 Score = 64.7 bits (156), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 34/93 (36%), Positives = 52/93 (55%), Gaps = 8/93 (8%) Query 11 TMSGEERKGLESFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLV 70 T SG + L FL G+LK++ R GW+ Y + ESV+DH + +A + + D L Sbjct 8 TFSGHGARSLLQFLRLVGQLKRVPRTGWV-YRNVQRPESVSDHMYRMAVMAMVIKDDRL- 65 Query 71 VNGQLADRNKCAAMALAHDLAESVVGDITPLDG 103 ++++C +AL HD+AE +VGDI P D Sbjct 66 ------NKDRCVRLALVHDMAECIVGDIAPADN 92 > xla:447085 hddc2, MGC85244; HD domain containing 2; K07023 putative hydrolases of HD superfamily Length=201 Score = 62.0 bits (149), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 8/86 (9%) Query 18 KGLESFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLAD 77 K L F+ G+LK++ R GW+ Y ESV+DH + +A + D L + Sbjct 12 KSLLQFMKLVGQLKRVPRTGWI-YRQVEKPESVSDHMYRMAVMAMLTEDRKL-------N 63 Query 78 RNKCAAMALAHDLAESVVGDITPLDG 103 +++C +AL HD+AE +VGDI P D Sbjct 64 KDRCIRLALVHDMAECIVGDIAPADN 89 > mmu:69692 Hddc2, 2310057G13Rik, MGC129436, MGC129437; HD domain containing 2; K07023 putative hydrolases of HD superfamily Length=199 Score = 61.2 bits (147), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 8/85 (9%) Query 19 GLESFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADR 78 GL FL G+LK++ R GW+ Y + ESV+DH + +A + D L ++ Sbjct 11 GLLRFLRLVGQLKRVPRTGWV-YRNVEKPESVSDHMYRMAVMAMVTRDDRL-------NK 62 Query 79 NKCAAMALAHDLAESVVGDITPLDG 103 ++C +AL HD+AE +VGDI P D Sbjct 63 DRCIRLALVHDMAECIVGDIAPADN 87 > ath:AT2G23820 metal-dependent phosphohydrolase HD domain-containing protein; K07023 putative hydrolases of HD superfamily Length=245 Score = 61.2 bits (147), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 35/79 (44%), Positives = 48/79 (60%), Gaps = 9/79 (11%) Query 23 FLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLV-VNGQLADRNKC 81 FLS +LK RAGW++ D ES+ADH + + + L SD+ VN R+KC Sbjct 82 FLSLCTRLKTTPRAGWIKR-DVKDPESIADHMYRMGL--MALISSDIPGVN-----RDKC 133 Query 82 AAMALAHDLAESVVGDITP 100 MA+ HD+AE++VGDITP Sbjct 134 MKMAIVHDIAEAIVGDITP 152 > dre:554129 hddc2, fb71h08, wu:fb71h08, zgc:112330; HD domain containing 2; K07023 putative hydrolases of HD superfamily Length=200 Score = 61.2 bits (147), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 8/82 (9%) Query 22 SFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADRNKC 81 F+ G+LK++ R GW+ Y + ESV+DH + ++ + L D+ VN + +C Sbjct 6 QFMKLVGQLKRVPRTGWV-YRNIKQPESVSDHMYRMSM--MALTIQDISVN-----KERC 57 Query 82 AAMALAHDLAESVVGDITPLDG 103 +AL HDLAE +VGDI P D Sbjct 58 MKLALVHDLAECIVGDIAPADN 79 > ath:AT1G26160 metal-dependent phosphohydrolase HD domain-containing protein; K07023 putative hydrolases of HD superfamily Length=258 Score = 60.5 bits (145), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 11/83 (13%) Query 23 FLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLA--DRNK 80 FL+ +LK KR GW+ + ES+ADH + +A L + G L DR + Sbjct 78 FLTLCHRLKTTKRKGWINQ-GINGPESIADHMYRMALMAL--------IAGDLTGVDRER 128 Query 81 CAAMALAHDLAESVVGDITPLDG 103 C MA+ HD+AE++VGDITP DG Sbjct 129 CIKMAIVHDIAEAIVGDITPSDG 151 > sce:YGL101W Putative protein of unknown function; non-essential gene with similarity to YBR242W; interacts with the DNA helicase Hpr5p; K07023 putative hydrolases of HD superfamily Length=215 Score = 60.1 bits (144), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 33/82 (40%), Positives = 50/82 (60%), Gaps = 10/82 (12%) Query 22 SFLSFGGKLKKLKRAGWLRY-VHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADRNK 80 +FL+ LK +R GW+ + + P ES++DH + +GL + +++ + DRNK Sbjct 31 AFLNIIQLLKTQRRTGWVDHGIDP--CESISDHMYR-----MGL--TTMLITDKNVDRNK 81 Query 81 CAAMALAHDLAESVVGDITPLD 102 C +AL HD AES+VGDITP D Sbjct 82 CIRIALVHDFAESLVGDITPND 103 > sce:YBR242W Putative protein of unknown function; green fluorescent protein (GFP)-fusion protein localizes to the cytoplasm and nucleus; YBR242W is not an essential gene; K07023 putative hydrolases of HD superfamily Length=238 Score = 57.4 bits (137), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 30/81 (37%), Positives = 51/81 (62%), Gaps = 8/81 (9%) Query 22 SFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADRNKC 81 +FL+ +LK +R G+L + + ES++DH + ++ + + DS + +R+KC Sbjct 50 AFLNVVQQLKIQRRTGYLD-LGIKECESISDHMYRLSIITMLIKDSRV-------NRDKC 101 Query 82 AAMALAHDLAESVVGDITPLD 102 +AL HD+AES+VGDITP+D Sbjct 102 VRIALVHDIAESLVGDITPVD 122 > pfa:PFL1890c HD superfamily phosphohydrolase protein Length=822 Score = 34.3 bits (77), Expect = 0.100, Method: Composition-based stats. Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 7/73 (9%) Query 26 FGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAF----HFLGLPD-SDLVVN-GQLADRN 79 F +L+ L + G +YV+P S +HS GV F +F L + S+L N G+L Sbjct 384 FFQRLRSLSQLGACQYVYPGATHSRFEHSLGVGFLSAKYFTHLCNRSNLSPNHGELKRML 443 Query 80 KCAAMA-LAHDLA 91 +C +A L HDL Sbjct 444 RCVQIAGLCHDLG 456 > ath:AT1G30600 subtilase family protein Length=832 Score = 28.9 bits (63), Expect = 3.8, Method: Composition-based stats. Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 5/66 (7%) Query 43 HPSDVESVADHSFGVAFHFLGLPDSDL-----VVNGQLADRNKCAAMALAHDLAESVVGD 97 HPS + ++ H++ V HF G+ + + N +L A AL+ + S D Sbjct 179 HPSFSDKISGHTYSVPPHFTGVCEVTIGFPPGSCNRKLIGARHFAESALSRGVLNSSQDD 238 Query 98 ITPLDG 103 +P DG Sbjct 239 ASPFDG 244 > ath:AT2G40890 CYP98A3 (cytochrome P450, family 98, subfamily A, polypeptide 3); monooxygenase/ p-coumarate 3-hydroxylase; K09754 p-coumarate 3-hydroxylase [EC:1.14.13.-] Length=508 Score = 28.5 bits (62), Expect = 5.3, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 0/49 (0%) Query 5 GSGIVKTMSGEERKGLESFLSFGGKLKKLKRAGWLRYVHPSDVESVADH 53 G+V E + + + L G L + WLR++ P+D ++ A+H Sbjct 194 AEGVVDEQGLEFKAIVSNGLKLGASLSIAEHIPWLRWMFPADEKAFAEH 242 > hsa:8881 CDC16, ANAPC6, APC6, CUT9; cell division cycle 16 homolog (S. cerevisiae); K03353 anaphase-promoting complex subunit 6 Length=620 Score = 28.1 bits (61), Expect = 6.6, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 38/94 (40%), Gaps = 14/94 (14%) Query 7 GIVKTMSGEERKGLESFLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFH------ 60 G M G + + +LS L+K W+ Y H VES D + F Sbjct 306 GCYYLMVGHKNEHARRYLSKATTLEKTYGPAWIAYGHSFAVESEHDQAMAAYFTAAQLMK 365 Query 61 -------FLGLPDSDLVVNGQLADRNKCAAMALA 87 ++GL + L N +LA+R A+++A Sbjct 366 GCHLPMLYIGL-EYGLTNNSKLAERFFSQALSIA 398 > eco:b2291 yfbR, ECK2285, JW2288; 5'-nucleotidase; K08722 5'-nucleotidase [EC:3.1.3.5] Length=199 Score = 28.1 bits (61), Expect = 6.8, Method: Compositional matrix adjust. Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 4/80 (5%) Query 23 FLSFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADRNKCA 82 F + +LK + R +R V E+V++HS VA L G + + A Sbjct 6 FFAHLSRLKLINRWPLMRNVR---TENVSEHSLQVAMVAHALAAIKNRKFGGNVNAERIA 62 Query 83 AMALAHDLAESVVGDI-TPL 101 +A+ HD +E + GD+ TP+ Sbjct 63 LLAMYHDASEVLTGDLPTPV 82 > eco:b2230 yfaA, ECK2222, JW2224, pufY; conserved protein, DUF2138 family Length=562 Score = 27.7 bits (60), Expect = 9.3, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 10/75 (13%) Query 25 SFGGKLKKLKRAGWLRYVHPSDVESVADHSFGVAFHFLGLPDSDLVVNGQLADRNKCAAM 84 SF G ++ GW +V +D + D SF D V N A + C A+ Sbjct 277 SFAGVRFEMGNDGWHSFVALNDESASVDASF----------DFTPVWNSMPAGASFCVAV 326 Query 85 ALAHDLAESVVGDIT 99 +H +AE ++ I+ Sbjct 327 PYSHGIAEEMLSHIS 341 Lambda K H 0.319 0.138 0.416 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 2027061836 Database: egene_temp_file_orthology_annotation_similarity_blast_database_866 Posted date: Sep 17, 2011 2:57 PM Number of letters in database: 82,071,388 Number of sequences in database: 164,496 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40