bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.24+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: egene_temp_file_orthology_annotation_similarity_blast_database_966 164,496 sequences; 82,071,388 total letters Query= Eace_2100_orf1 Length=96 Score E Sequences producing significant alignments: (Bits) Value tgo:TGME49_066930 hypothetical protein ; K03143 transcription ... 120 8e-28 tpv:TP04_0823 hypothetical protein; K03143 transcription initi... 94.4 8e-20 pfa:PF13_0279 conserved Plasmodium protein, unknown function; ... 92.4 3e-19 cpv:cgd6_5040 transcription factor ; K03143 transcription init... 50.1 2e-06 dre:436837 gtf2h3, fc26a07, wu:fc26a07, zgc:92759; general tra... 47.0 1e-05 dre:100151595 gtf2h3-2; general transcription factor IIH, poly... 47.0 2e-05 hsa:2967 GTF2H3, BTF2, TFB4, TFIIH; general transcription fact... 45.4 4e-05 mmu:209357 Gtf2h3, 34kDa, 5033417D07Rik, BTF2, BTF2_p34, C7300... 43.1 2e-04 ath:AT1G18340 basal transcription factor complex subunit-relat... 38.5 0.006 cel:ZK1128.4 hypothetical protein; K03143 transcription initia... 36.2 0.028 mmu:19824 Trim10, AI324236, BB139825, Herf1, Rnf9; tripartite ... 30.8 1.0 xla:432234 uhrf1; ubiquitin-like with PHD and ring finger doma... 30.8 1.2 xla:398154 gatad2a; GATA zinc finger domain containing 2A 30.0 2.1 cel:K02B9.4 elt-3; Erythroid-Like Transcription factor family ... 29.6 2.4 sce:YPR056W TFB4; Subunit of TFIIH complex, involved in transc... 29.6 2.7 xla:100158394 hypothetical protein LOC100158394 28.9 4.0 ath:AT4G29530 2,3-diketo-5-methylthio-1-phosphopentane phospha... 28.5 5.1 hsa:10107 TRIM10, HERF1, MGC141979, RFB30, RNF9; tripartite mo... 28.5 5.4 sce:YGR184C UBR1, PTR1; E3 ubiquitin ligase (N-recognin), form... 28.1 7.9 dre:553955 zbtb24, zfp450, zgc:194556; zinc finger and BTB dom... 27.7 8.2 cel:ZC328.2 hypothetical protein 27.7 8.7 dre:100332569 zinc finger protein 333-like 27.7 8.9 > tgo:TGME49_066930 hypothetical protein ; K03143 transcription initiation factor TFIIH subunit 3 Length=547 Score = 120 bits (302), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 56/88 (63%), Positives = 68/88 (77%), Gaps = 2/88 (2%) Query 11 FWFLPPPSLRPSVAAVSIHH-KTNAGVCFCHHNPVDLCFICSCCLAIYCSET-SDNGKER 68 FW P SLRP++AA+S+H ++N VCFCHH PV++C ICSCCLAIYCSE + GKER Sbjct 460 FWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQTGKER 519 Query 69 ISCDVCKSRFAKGLLKAKLAASVDASKY 96 ISCDVCKSRF++GLLK K+A VD Y Sbjct 520 ISCDVCKSRFSRGLLKNKMAGGVDLPNY 547 > tpv:TP04_0823 hypothetical protein; K03143 transcription initiation factor TFIIH subunit 3 Length=348 Score = 94.4 bits (233), Expect = 8e-20, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 54/91 (59%), Gaps = 0/91 (0%) Query 5 LLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETSDN 64 L QL+ FW+LP + ++ N G+C+CH+ VD+ ++C CC A+YCSE D Sbjct 257 LSQLITFWYLPSEGMSELLSTNLSFDFGNVGICYCHYKSVDVSYLCPCCFAVYCSEIDDK 316 Query 65 GKERISCDVCKSRFAKGLLKAKLAASVDASK 95 GK RI C VC SR + L+K KLA D SK Sbjct 317 GKYRIICMVCGSRLTRKLIKQKLATEADFSK 347 > pfa:PF13_0279 conserved Plasmodium protein, unknown function; K03143 transcription initiation factor TFIIH subunit 3 Length=326 Score = 92.4 bits (228), Expect = 3e-19, Method: Composition-based stats. Identities = 41/91 (45%), Positives = 59/91 (64%), Gaps = 1/91 (1%) Query 5 LLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETS-D 63 L Q ++FWFLP + R + ++ TN VC CH+ +D+ +ICSCCLAIYCSE + Sbjct 234 LTQTIIFWFLPSVNTRKYFSNTYLNEDTNIAVCTCHNKQIDIAYICSCCLAIYCSEKNLQ 293 Query 64 NGKERISCDVCKSRFAKGLLKAKLAASVDAS 94 K+R+SC +CK+RF K LL+ K + +D S Sbjct 294 TNKDRLSCAICKTRFTKSLLRNKHVSDLDFS 324 > cpv:cgd6_5040 transcription factor ; K03143 transcription initiation factor TFIIH subunit 3 Length=286 Score = 50.1 bits (118), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 21/59 (35%), Positives = 34/59 (57%), Gaps = 1/59 (1%) Query 2 DIALLQLLVFWFLPPPSLRPSV-AAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCS 59 D L L+F LP R + +++ +T VCFCHH +++ ++CS CL+I+CS Sbjct 217 DQGLTPFLIFHLLPSIQAREEIFISINKTKQTGLAVCFCHHQKIEIGYVCSSCLSIFCS 275 > dre:436837 gtf2h3, fc26a07, wu:fc26a07, zgc:92759; general transcription factor IIH, polypeptide 3; K03143 transcription initiation factor TFIIH subunit 3 Length=296 Score = 47.0 bits (110), Expect = 1e-05, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 7/76 (9%) Query 3 IALLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETS 62 +AL Q L++ FLP R + H CFCH N +++ ++CS CL+I+C+ + Sbjct 209 VALTQYLLWAFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSP 268 Query 63 DNGKERISCDVCKSRF 78 C C++ F Sbjct 269 -------ICTTCETAF 277 > dre:100151595 gtf2h3-2; general transcription factor IIH, polypeptide 3; K03143 transcription initiation factor TFIIH subunit 3 Length=296 Score = 47.0 bits (110), Expect = 2e-05, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 7/76 (9%) Query 3 IALLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETS 62 +AL Q L++ FLP R + H CFCH N +++ ++CS CL+I+C+ + Sbjct 209 VALTQYLLWVFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSP 268 Query 63 DNGKERISCDVCKSRF 78 C C++ F Sbjct 269 -------ICTTCETAF 277 > hsa:2967 GTF2H3, BTF2, TFB4, TFIIH; general transcription factor IIH, polypeptide 3, 34kDa; K03143 transcription initiation factor TFIIH subunit 3 Length=308 Score = 45.4 bits (106), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 0/56 (0%) Query 4 ALLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCS 59 +LLQ L++ FLP R + H CFCH N +++ ++CS CL+I+C+ Sbjct 222 SLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCN 277 > mmu:209357 Gtf2h3, 34kDa, 5033417D07Rik, BTF2, BTF2_p34, C730029A10, D5Ertd679e, TFIIH; general transcription factor IIH, polypeptide 3; K03143 transcription initiation factor TFIIH subunit 3 Length=309 Score = 43.1 bits (100), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 0/56 (0%) Query 4 ALLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCS 59 +LLQ L++ FLP R + H CFCH + +++ ++CS CL+I+C+ Sbjct 223 SLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCN 278 > ath:AT1G18340 basal transcription factor complex subunit-related; K03143 transcription initiation factor TFIIH subunit 3 Length=301 Score = 38.5 bits (88), Expect = 0.006, Method: Compositional matrix adjust. Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 7/54 (12%) Query 37 CFCHHNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLKAKLAAS 90 CFCH +D+ +ICS CL+I+C + C C S F + L +AS Sbjct 246 CFCHKKTIDMGYICSVCLSIFC-------EHHKKCSTCGSVFGQSKLDDASSAS 292 > cel:ZK1128.4 hypothetical protein; K03143 transcription initiation factor TFIIH subunit 3 Length=345 Score = 36.2 bits (82), Expect = 0.028, Method: Compositional matrix adjust. Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 7/74 (9%) Query 5 LLQLLVFWFLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETSDN 64 LL++L+ L P+ R + +S + C CHH V ++CS CL++ C T Sbjct 262 LLKILMTNMLTDPTHRAVFSKLSHNSVDYRASCACHHQLVSSGWVCSICLSVLCQYTP-- 319 Query 65 GKERISCDVCKSRF 78 C VCK+ F Sbjct 320 -----ICKVCKAAF 328 > mmu:19824 Trim10, AI324236, BB139825, Herf1, Rnf9; tripartite motif-containing 10; K12002 tripartite motif-containing protein 10 Length=489 Score = 30.8 bits (68), Expect = 1.0, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 6/46 (13%) Query 39 CHHNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLK 84 C HN C CL YC +E +SC +CK F G + Sbjct 31 CGHN------FCRGCLTRYCEIPGPESEESLSCPLCKEPFRPGSFR 70 > xla:432234 uhrf1; ubiquitin-like with PHD and ring finger domains 1 (EC:6.3.2.19); K10638 E3 ubiquitin-protein ligase UHRF1 [EC:6.3.2.19] Length=772 Score = 30.8 bits (68), Expect = 1.2, Method: Composition-based stats. Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 4/50 (8%) Query 39 CHHNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLKAKLA 88 C NP C +C+CC+ C D K+ + CD C F LK L+ Sbjct 303 CKDNPKRACRMCACCI---CGGKQDPEKQLL-CDECDLAFHIYCLKPPLS 348 > xla:398154 gatad2a; GATA zinc finger domain containing 2A Length=420 Score = 30.0 bits (66), Expect = 2.1, Method: Composition-based stats. Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 4/54 (7%) Query 41 HNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLKAKLAASVDAS 94 H P +IC+ C + S I CDVC S K +LKA+ + A+ Sbjct 231 HEP----YICAKCKTDFTSRWRQEKNGTIMCDVCMSSNQKKVLKAEHTTRLKAA 280 > cel:K02B9.4 elt-3; Erythroid-Like Transcription factor family member (elt-3) Length=226 Score = 29.6 bits (65), Expect = 2.4, Method: Compositional matrix adjust. Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 7/81 (8%) Query 13 FLPPPSLRPSVAAVSIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSETSDNGKERISCD 72 F PPP P VA K A V CH N ICS C + NG+ + C+ Sbjct 123 FTPPPQ-DPLVAEPKPMKKRMAAVQ-CHQN-----SICSNCKTRETTLWRRNGEGGVECN 175 Query 73 VCKSRFAKGLLKAKLAASVDA 93 C F K K L+ D Sbjct 176 ACNLYFRKNNRKRPLSLRKDG 196 > sce:YPR056W TFB4; Subunit of TFIIH complex, involved in transcription initiation, similar to 34 kDa subunit of human TFIIH; interacts with Ssl1p; K03143 transcription initiation factor TFIIH subunit 3 Length=338 Score = 29.6 bits (65), Expect = 2.7, Method: Composition-based stats. Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 12/87 (13%) Query 4 ALLQLLVFWFLPPPSLRPSVAAV---SIHHKTNAGVCFCHHNPVDLCFICSCCLAIYCSE 60 L+Q L PSLRP + S+ +T+ C+ V + FICS CL + Sbjct 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTS---CYLTGRVVAVGFICSVCLCVLSII 299 Query 61 TSDNGKERISCDVCKSRFAKGLLKAKL 87 N C C S+F + ++ AKL Sbjct 300 PPGN-----KCPACDSQFDEHVI-AKL 320 > xla:100158394 hypothetical protein LOC100158394 Length=555 Score = 28.9 bits (63), Expect = 4.0, Method: Composition-based stats. Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 4/54 (7%) Query 41 HNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLKAKLAASVDAS 94 H P +IC+ C + S I CDVC S K LKA+ + A+ Sbjct 368 HEP----YICAKCKTDFTSRWRQEKNGTIMCDVCMSSNQKKALKAEHTTRLKAA 417 > ath:AT4G29530 2,3-diketo-5-methylthio-1-phosphopentane phosphatase family Length=245 Score = 28.5 bits (62), Expect = 5.1, Method: Compositional matrix adjust. Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 19/64 (29%) Query 40 HHNPVDLCFICSCCLAIYCSETS--DNGKERI-----------SCDVCKSRFAKGLLKAK 86 HH+ VD C IY + TS DNG RI SC++C S KGL+ Sbjct 107 HHDLVD------CFSEIYTNPTSLDDNGNLRILPYHSDALPPHSCNLCPSNLCKGLVMDH 160 Query 87 LAAS 90 L AS Sbjct 161 LRAS 164 > hsa:10107 TRIM10, HERF1, MGC141979, RFB30, RNF9; tripartite motif containing 10; K12002 tripartite motif-containing protein 10 Length=481 Score = 28.5 bits (62), Expect = 5.4, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 19/46 (41%), Gaps = 6/46 (13%) Query 39 CHHNPVDLCFICSCCLAIYCSETSDNGKERISCDVCKSRFAKGLLK 84 C HN C CL YC + +E +C +CK F G + Sbjct 31 CGHN------FCRACLTRYCEIPGPDLEESPTCPLCKEPFRPGSFR 70 > sce:YGR184C UBR1, PTR1; E3 ubiquitin ligase (N-recognin), forms heterodimer with Rad6p to ubiquitinate substrates in the N-end rule pathway; regulates peptide transport via Cup9p ubiquitination; mutation in human UBR1 causes Johansson-Blizzard Syndrome (JBS) (EC:6.3.2.-); K10625 E3 ubiquitin-protein ligase UBR1 [EC:6.3.2.19] Length=1950 Score = 28.1 bits (61), Expect = 7.9, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 5/64 (7%) Query 23 VAAVSIHHKTNAGVCFCHHNPVDLCFIC----SCCLAIYCSETSDNGKERISCDVCKSRF 78 + V H N G F P+ C C +C L I+C D+ + D+C + F Sbjct 112 IGDVHKHTGRNCGRKFKIGEPLYRCHECGCDDTCVLCIHCFNPKDHVNHHVCTDIC-TEF 170 Query 79 AKGL 82 G+ Sbjct 171 TSGI 174 > dre:553955 zbtb24, zfp450, zgc:194556; zinc finger and BTB domain containing 24; K10503 zinc finger and BTB domain-containing protein 24 Length=672 Score = 27.7 bits (60), Expect = 8.2, Method: Composition-based stats. Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 7/40 (17%) Query 48 FICSCCLAIYCS-------ETSDNGKERISCDVCKSRFAK 80 ++C C + E S GK+ SC +CK FA+ Sbjct 405 YVCKVCDKTFSDPSARRRHEVSHTGKKTFSCSICKVSFAR 444 > cel:ZC328.2 hypothetical protein Length=386 Score = 27.7 bits (60), Expect = 8.7, Method: Composition-based stats. Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%) Query 16 PPSLRPSVAAVSIHHKTNA-GVCFCHHNPVD 45 PP L P AA++I H+ +A + CHH + Sbjct 167 PPGLSPGGAALNIEHEKSAFEIVSCHHQSAE 197 > dre:100332569 zinc finger protein 333-like Length=678 Score = 27.7 bits (60), Expect = 8.9, Method: Composition-based stats. Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 7/40 (17%) Query 48 FICSCCLAIYCS-------ETSDNGKERISCDVCKSRFAK 80 ++C C + E S GK+ SC +CK FA+ Sbjct 408 YVCKVCDKTFSDPSARRRHEVSHTGKKTFSCSICKVSFAR 447 Lambda K H 0.327 0.138 0.462 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 2060454524 Database: egene_temp_file_orthology_annotation_similarity_blast_database_966 Posted date: Sep 16, 2011 8:45 PM Number of letters in database: 82,071,388 Number of sequences in database: 164,496 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40