bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.24+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: egene_temp_file_orthology_annotation_similarity_blast_database_866 164,496 sequences; 82,071,388 total letters Query= Eten_0397_orf1 Length=169 Score E Sequences producing significant alignments: (Bits) Value tgo:TGME49_114740 splicing factor 3B subunit 2, putative ; K12... 146 3e-35 bbo:BBOV_IV002140 21.m02783; splicing factor 3B subunit 2; K12... 38.5 0.012 tpv:TP03_0436 hypothetical protein; K12829 splicing factor 3B ... 36.2 0.047 xla:734465 sf3b2, MGC115052, cus1, sap145, sf3b145, sf3b150; s... 34.3 0.19 mmu:319387 Lphn3, 5430402I23Rik, CIRL-3, D130075K09Rik, Gm1379... 34.3 0.20 mmu:75956 Srrm2, 5033413A03Rik, AA410130, SRm300, mKIAA0324; s... 32.3 0.74 mmu:654498 Hhla1, F930104E18Rik; HERV-H LTR-associating 1 32.0 mmu:229927 Clca4; chloride channel calcium activated 4; K05030... 31.2 1.6 hsa:94025 MUC16, CA125, FLJ14303; mucin 16, cell surface assoc... 30.8 2.2 hsa:54815 GATAD2A, FLJ20085, FLJ21017, p66alpha; GATA zinc fin... 30.0 4.1 cel:Y57G11C.42 hypothetical protein 28.9 8.9 > tgo:TGME49_114740 splicing factor 3B subunit 2, putative ; K12829 splicing factor 3B subunit 2 Length=743 Score = 146 bits (369), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 98/171 (57%), Positives = 111/171 (64%), Gaps = 34/171 (19%) Query 1 QQGQLFGVRHTYRIPAGALAGQQQQQQQQLLQQGGASGVQTPAGIATPR-LLSGRASP-- 57 QQGQLFGV+HTY+IP+ L G +G TP+G TPR LLSG A+P Sbjct 576 QQGQLFGVKHTYKIPS-TLPGN--------------AGTATPSGHLTPRNLLSGAATPSG 620 Query 58 ----FVAGLRSPFA-PAAAAAAAAA---------SGANTPLMAQRGGAATPAGVTVSLNP 103 F G R+P P +A+ +A SGA TP A+ G TPAGVTVSLNP Sbjct 621 VRTPFTGGARTPLVGPGGSASFGSAPGTPFLGGASGAQTP--ARGAGFRTPAGVTVSLNP 678 Query 104 NEMEQEGAFTADVIRQQLRQHEEAAARAKAAAGQIDPADQRKRKGDELRGV 154 NEMEQEG FTADVIRQQLRQHEEAAARAK AAGQ+DPAD RKRKGDE+RG Sbjct 679 NEMEQEGVFTADVIRQQLRQHEEAAARAKQAAGQVDPADTRKRKGDEIRGT 729 > bbo:BBOV_IV002140 21.m02783; splicing factor 3B subunit 2; K12829 splicing factor 3B subunit 2 Length=552 Score = 38.5 bits (88), Expect = 0.012, Method: Compositional matrix adjust. Identities = 47/145 (32%), Positives = 63/145 (43%), Gaps = 29/145 (20%) Query 36 ASGVQTP----AGIATPRLLSGRASPFV---AGLRSPFAPAAAAAAAAASGANTPLMAQR 88 A+GV TP + I PR L +A + AG S + + + TP+ Sbjct 426 ATGVDTPLEVRSNIDPPRTLPRKAYTVLEPKAGKGSTGSLFGSHVTYSMPPVATPITPIT 485 Query 89 G--GAATPAG--VTVSLNPNEMEQEGAFTADVIRQQLRQHEEAAARAKAAAGQIDPADQR 144 G G ATP G VT SL + + TA + QQL+ HE A AAGQ+ P Sbjct 486 GKPGTATPLGGLVTPSL-----QIDSNITAHGVMQQLKFHETKAKLVHEAAGQVVP---- 536 Query 145 KRKGDELRGVSAAAAKSKRKKQFKF 169 + E R V K+KK+FKF Sbjct 537 --EATETRRV-------KKKKEFKF 552 > tpv:TP03_0436 hypothetical protein; K12829 splicing factor 3B subunit 2 Length=552 Score = 36.2 bits (82), Expect = 0.047, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 32/71 (45%), Gaps = 11/71 (15%) Query 68 PAAAAAAAAASGANTPLMAQRGGAATPAGVTVSLNPNEMEQEGAFTADVIRQQLRQHEEA 127 P A G TPL GG ATP+ T E +G T D I +QL+ HEE Sbjct 474 PPPVATPLGMGGLTTPL----GGMATPSLTT-------EEVDGTSTTDQILRQLKYHEEK 522 Query 128 AARAKAAAGQI 138 A + AGQI Sbjct 523 AKKVHEEAGQI 533 > xla:734465 sf3b2, MGC115052, cus1, sap145, sf3b145, sf3b150; splicing factor 3b, subunit 2, 145kDa; K12829 splicing factor 3B subunit 2 Length=764 Score = 34.3 bits (77), Expect = 0.19, Method: Compositional matrix adjust. Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 16/111 (14%) Query 30 LLQQGGASGVQTPAGIATPRLLSGRASPF---VAGLRSP--FAPAAAAAAAAASGA---N 81 L+ GG S V PAG+ TP L+ R + G +P F A GA + Sbjct 614 LITPGGFSSV--PAGMETPELIELRKKKIEEAMDGTETPQLFTVLPEKRTATVGGAMMGS 671 Query 82 TPL------MAQRGGAATPAGVTVSLNPNEMEQEGAFTADVIRQQLRQHEE 126 T + M++RG A P GV ++L P E+E + + +++R+ +E Sbjct 672 THIYEMATAMSRRGIATEPQGVEIALAPEELELDPSAMTQKYEERVREAQE 722 > mmu:319387 Lphn3, 5430402I23Rik, CIRL-3, D130075K09Rik, Gm1379, LEC3, MGC99439, mKIAA0768; latrophilin 3; K04594 latrophilin 3 Length=1543 Score = 34.3 bits (77), Expect = 0.20, Method: Composition-based stats. Identities = 27/81 (33%), Positives = 32/81 (39%), Gaps = 14/81 (17%) Query 8 VRHTYRIPAGALAGQQQQQQQQLLQQGGASGVQTPAGIATPRLLSGRASPFVAGLRSPFA 67 +RH R P GAL + QQ + G Q P G A G+R P A Sbjct 33 LRHAERSPGGALPPRHLLQQPAAERSTAHRG-QGPRGAAR-------------GVRGPGA 78 Query 68 PAAAAAAAAASGANTPLMAQR 88 P A AA A S A P+ R Sbjct 79 PGAQIAAQAFSRAPIPMAVVR 99 > mmu:75956 Srrm2, 5033413A03Rik, AA410130, SRm300, mKIAA0324; serine/arginine repetitive matrix 2; K13172 serine/arginine repetitive matrix protein 2 Length=2607 Score = 32.3 bits (72), Expect = 0.74, Method: Compositional matrix adjust. Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 4/59 (6%) Query 38 GVQTPAGIATPRLLSGRASPFVAG--LRSPFAPAAAAAAAAASGANTPLMAQRGGAATP 94 G +TPAG+A L S R +P ++G L SP P +A SG +PLM R + TP Sbjct 2211 GSRTPAGLAPTNLSSSRMAPALSGANLTSPRVP--LSAYDRVSGRTSPLMLDRARSRTP 2267 > mmu:654498 Hhla1, F930104E18Rik; HERV-H LTR-associating 1 Length=514 Score = 32.0 bits (71), Expect = 1.0, Method: Composition-based stats. Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 2/93 (2%) Query 21 GQQQQQQQQLLQQGGASGVQTPAGIATPRLLSGRASPFVAGLRSPFAPAAAAAAAAASGA 80 QQ + LL G + TP+ +A P SG P APA A Sbjct 369 AQQLRSTGNLLHPTGI--LTTPSRLAQPSRASGTLMPGTQTTNPTQAPAPRVPQTDGIPA 426 Query 81 NTPLMAQRGGAATPAGVTVSLNPNEMEQEGAFT 113 P + ++ A PA VS P + QE A T Sbjct 427 EWPFIPEKEPARDPAAHQVSKCPRPLLQEEAIT 459 > mmu:229927 Clca4; chloride channel calcium activated 4; K05030 calcium-activated chloride channel family member 4 Length=1044 Score = 31.2 bits (69), Expect = 1.6, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 9/70 (12%) Query 35 GASGVQTPAGIATPRLLSGRASPFV-AGLRSPFAPAAAAAAAAASGANTPLMAQRGGAAT 93 G S TP G++TP G ++P GL +P P + + G +TP +T Sbjct 887 GLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTP--------ST 938 Query 94 PAGVTVSLNP 103 P G++ P Sbjct 939 PPGLSTPSTP 948 Score = 31.2 bits (69), Expect = 1.6, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 9/70 (12%) Query 35 GASGVQTPAGIATPRLLSGRASPFV-AGLRSPFAPAAAAAAAAASGANTPLMAQRGGAAT 93 G S TP G++TP G ++P GL +P P + + G +TP +T Sbjct 932 GLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTPSTPPGLSTP--------ST 983 Query 94 PAGVTVSLNP 103 P G++ P Sbjct 984 PPGLSTPSTP 993 > hsa:94025 MUC16, CA125, FLJ14303; mucin 16, cell surface associated Length=14507 Score = 30.8 bits (68), Expect = 2.2, Method: Composition-based stats. Identities = 22/66 (33%), Positives = 28/66 (42%), Gaps = 0/66 (0%) Query 35 GASGVQTPAGIATPRLLSGRASPFVAGLRSPFAPAAAAAAAAASGANTPLMAQRGGAATP 94 G SGV P+ T L AS A R+P AAS NT + G + TP Sbjct 2076 GHSGVSNPSSTTTEFPLFSAASTSAAKQRNPETETHGPQNTAASTLNTDASSVTGLSETP 2135 Query 95 AGVTVS 100 G ++S Sbjct 2136 VGASIS 2141 > hsa:54815 GATAD2A, FLJ20085, FLJ21017, p66alpha; GATA zinc finger domain containing 2A Length=633 Score = 30.0 bits (66), Expect = 4.1, Method: Compositional matrix adjust. Identities = 37/142 (26%), Positives = 57/142 (40%), Gaps = 14/142 (9%) Query 1 QQGQLFGVRHTYRIPAGALAGQQQQQQ--QQLLQQGGASGVQTPAGIATPRLLSGRASPF 58 Q + V HT R+ A + QQ+Q+ Q+LLQQG A A P P Sbjct 445 NQKKALKVEHTSRLKAAFVKALQQEQEIEQRLLQQGTAPAQAKAEPTAAPH-------PV 497 Query 59 VAGLRSPFAPAAAAAAAAASGANTPLM-----AQRGGAATPAGVTVSLNPNEMEQEGAFT 113 + + P A + A +N ++ RG A TP GV + +P+ Q A Sbjct 498 LKQVIKPRRKLAFRSGEARDWSNGAVLQASSQLSRGSATTPRGVLHTFSPSPKLQNSASA 557 Query 114 ADVIRQQLRQHEEAAARAKAAA 135 ++ + R E + K +A Sbjct 558 TALVSRTGRHSERTVSAGKGSA 579 > cel:Y57G11C.42 hypothetical protein Length=661 Score = 28.9 bits (63), Expect = 8.9, Method: Compositional matrix adjust. Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 0/41 (0%) Query 34 GGASGVQTPAGIATPRLLSGRASPFVAGLRSPFAPAAAAAA 74 G TPA P L+ + P+ G ++P APAA++A Sbjct 118 GAQKATTTPAPTGFPTLIPFSSEPWTTGTQTPLAPAASSAV 158 Lambda K H 0.313 0.125 0.344 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 4222647260 Database: egene_temp_file_orthology_annotation_similarity_blast_database_866 Posted date: Sep 17, 2011 2:57 PM Number of letters in database: 82,071,388 Number of sequences in database: 164,496 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40