bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
164,496 sequences; 82,071,388 total letters
Query= Emax_2910_orf1
Length=176
Score E
Sequences producing significant alignments: (Bits) Value
tgo:TGME49_118140 hypothetical protein ; K11984 U4/U6.U5 tri-s... 165 7e-41
pfa:PFC1060c conserved Plasmodium protein, unknown function; K... 113 4e-25
tpv:TP04_0696 hypothetical protein 106 3e-23
bbo:BBOV_III007920 17.m07694; hypothetical protein 65.1 1e-10
cel:F19F10.9 hypothetical protein; K11984 U4/U6.U5 tri-snRNP-a... 54.7 2e-07
ath:AT5G16780 DOT2; DOT2 (DEFECTIVELY ORGANIZED TRIBUTARIES 2)... 52.0 1e-06
dre:436946 sart1, zgc:91927; squamous cell carcinoma antigen r... 48.9 8e-06
hsa:9092 SART1, Ara1, HOMS1, MGC2038, SART1259, SNRNP110, Snu6... 48.9 9e-06
mmu:20227 Sart1, U5-110K; squamous cell carcinoma antigen reco... 48.9 9e-06
xla:379183 sart1, MGC132129, MGC53679; squamous cell carcinoma... 46.2 6e-05
cpv:cgd4_1570 hypothetical protein 42.4 8e-04
ath:AT3G14700 hypothetical protein 40.8 0.002
sce:YOR308C SNU66; Component of the U4/U6.U5 snRNP complex inv... 39.7 0.005
dre:338237 id:ibd1338; si:ch211-266d19.3 32.0 1.1
> tgo:TGME49_118140 hypothetical protein ; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=861
Score = 165 bits (418), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 75/127 (59%), Positives = 101/127 (79%), Gaps = 0/127 (0%)
Query 50 FMAEQPIGDSVSGALQYLQSKDFFSLDKIRRRRGHHLEQPLHNADNDKEVNIDYRDAFGN 109
FM E P+G ++ AL+YLQSK+ +SLDK+R+RR E PLH +K++ ID+RD +GN
Sbjct 729 FMNENPMGHGLAEALKYLQSKNHYSLDKMRQRRHRPDELPLHKPLGEKDIKIDHRDQYGN 788
Query 110 VMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIMDCLPTLKALKKVQEGES 169
VMT KDAFR ISW FHGK+PSLRKQEKK+KK+++ER+L +N M+ LPTL AL+++QE E
Sbjct 789 VMTPKDAFREISWRFHGKYPSLRKQEKKMKKMDIERKLLQNPMEALPTLSALQRLQEKEK 848
Query 170 SAHLVLT 176
++HLVLT
Sbjct 849 ASHLVLT 855
> pfa:PFC1060c conserved Plasmodium protein, unknown function;
K11984 U4/U6.U5 tri-snRNP-associated protein 1
Length=693
Score = 113 bits (282), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 104/169 (61%), Gaps = 8/169 (4%)
Query 13 SSSSKKQREETEEGEVVENVPKEDEEEEEEEGDSE----PEFMAEQPIGDSVSGALQYLQ 68
+SS+K EE ++++N E+EE+ ++ SE E E + + + GAL+YL+
Sbjct 525 NSSNKNILEENINEDILKNTFLENEEDHNDDNSSELHGVSEIFNEVKLDEGLFGALEYLK 584
Query 69 SKDFFSL-DKIRRRRGHHLEQPLHNADNDKEVNIDYRDAFGNVMTAKDAFRRISWHFHGK 127
+K ++ DKI R + +PLH + + ++ +DY++ FG VMT K++FR ISW FHGK
Sbjct 585 TKGELNMEDKIYRNPEN---KPLHMSTDKDDIKLDYKNEFGKVMTPKESFRYISWIFHGK 641
Query 128 FPSLRKQEKKIKKLELERRLQENIMDCLPTLKALKKVQEGESSAHLVLT 176
K EKKIK+LE+ERR +EN +D LPTL LKK Q+ + ++ L+
Sbjct 642 KQGKNKLEKKIKRLEIERRYKENPIDSLPTLNVLKKYQQTQKKSYFTLS 690
> tpv:TP04_0696 hypothetical protein
Length=554
Score = 106 bits (265), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 81/131 (61%), Gaps = 14/131 (10%)
Query 45 DSEPEFMAEQPIGDSVSGALQYLQSKDFFSLDKIRRRRGHHLEQPLHNADNDKEVNIDYR 104
D +P ++EQP+GD ++ AL Y+ ++RG ++++ KEV ++Y
Sbjct 431 DDDPNTLSEQPLGDGIAAALSYI------------KQRGDYIDEKAETRS--KEVQLNYL 476
Query 105 DAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIMDCLPTLKALKKV 164
D +GN MT K+AF++ISW FHGK PS +KQEK +K+ELER L N + LPT+KAL
Sbjct 477 DEYGNEMTPKEAFKKISWIFHGKRPSKKKQEKMRRKIELERALNSNPVGGLPTMKALYSH 536
Query 165 QEGESSAHLVL 175
QE E + ++ L
Sbjct 537 QEKEQTPYITL 547
> bbo:BBOV_III007920 17.m07694; hypothetical protein
Length=528
Score = 65.1 bits (157), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 12/90 (13%)
Query 53 EQPIGDSVSGALQYLQSKDFFSLDKIRRRRGHHLEQPLHNADNDKEVNIDYRDAFGNVMT 112
++P+ ++GAL YL+ K D I +++ L ND + + Y D +G MT
Sbjct 439 DEPMTTGIAGALAYLKDKG----DIIEKKKD------LEGVGND--ITLQYFDEYGRKMT 486
Query 113 AKDAFRRISWHFHGKFPSLRKQEKKIKKLE 142
K+AFR++SW FHGK P L K+E+ IK++E
Sbjct 487 PKEAFRQLSWKFHGKGPGLNKRERIIKRIE 516
> cel:F19F10.9 hypothetical protein; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=829
Score = 54.7 bits (130), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 2/81 (2%)
Query 98 EVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKI-KKLELERRLQENIMDC-L 155
+VNI Y D G M AKDA+R +S+ FHG+ P ++ EK+ +K + ER L+ N D L
Sbjct 738 DVNISYVDRKGREMDAKDAYRELSYKFHGRNPGKKQLEKRANRKDKEERMLKTNSYDTPL 797
Query 156 PTLKALKKVQEGESSAHLVLT 176
TL +K Q+ S+ +LVL+
Sbjct 798 GTLDKQRKKQKQLSTPYLVLS 818
> ath:AT5G16780 DOT2; DOT2 (DEFECTIVELY ORGANIZED TRIBUTARIES
2); K11984 U4/U6.U5 tri-snRNP-associated protein 1
Length=820
Score = 52.0 bits (123), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 74/146 (50%), Gaps = 18/146 (12%)
Query 49 EFMAEQPIGDSVSGALQYLQSKDFF---------SLDKIRRR-------RGHHLEQPLHN 92
E + E +G +SGAL+ L+ + ++DK + + G + +
Sbjct 616 ENIHEVAVGKGLSGALKLLKDRGTLKEKVEWGGRNMDKKKSKLVGIVDDDGGKESKDKES 675
Query 93 ADNDKEVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIM 152
D K++ I+ D FG +T K+AFR +S FHGK P K+EK++K+ + E +L++
Sbjct 676 KDRFKDIRIERTDEFGRTLTPKEAFRLLSHKFHGKGPGKMKEEKRMKQYQEELKLKQMKN 735
Query 153 DCLP--TLKALKKVQEGESSAHLVLT 176
P +++ +++ Q + +LVL+
Sbjct 736 SDTPSQSVQRMREAQAQLKTPYLVLS 761
> dre:436946 sart1, zgc:91927; squamous cell carcinoma antigen
recognised by T cells; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=777
Score = 48.9 bits (115), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 57/105 (54%), Gaps = 4/105 (3%)
Query 76 DKIRRRRGHH-LEQPLHNADNDK-EVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRK 133
DK RR + Q DN K +V I+Y D G + K+AFR++S FHGK K
Sbjct 660 DKYSRREEYRGFTQDFKEKDNYKPDVKIEYVDESGRKLCPKEAFRQLSHRFHGKGSGKMK 719
Query 134 QEKKIKKLELERRLQENIMDCLP--TLKALKKVQEGESSAHLVLT 176
E+++KKLE E L++ P T+ L++ Q+ + + ++VL+
Sbjct 720 TERRMKKLEEEALLKKMSSSDTPLGTVALLQEKQKSQKTPYIVLS 764
> hsa:9092 SART1, Ara1, HOMS1, MGC2038, SART1259, SNRNP110, Snu66;
squamous cell carcinoma antigen recognized by T cells;
K11984 U4/U6.U5 tri-snRNP-associated protein 1
Length=800
Score = 48.9 bits (115), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query 98 EVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIMDCLP- 156
+V I+Y D G +T K+AFR++S FHGK K E+++KKL+ E L++ P
Sbjct 707 DVKIEYVDETGRKLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPL 766
Query 157 -TLKALKKVQEGESSAHLVLT 176
T+ L++ Q+ + + ++VL+
Sbjct 767 GTVALLQEKQKAQKTPYIVLS 787
> mmu:20227 Sart1, U5-110K; squamous cell carcinoma antigen recognized
by T-cells 1; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=806
Score = 48.9 bits (115), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query 98 EVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIMDCLP- 156
+V I+Y D G +T K+AFR++S FHGK K E+++KKL+ E L++ P
Sbjct 713 DVKIEYVDETGRKLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPL 772
Query 157 -TLKALKKVQEGESSAHLVLT 176
T+ L++ Q+ + + ++VL+
Sbjct 773 GTVALLQEKQKAQKTPYIVLS 793
> xla:379183 sart1, MGC132129, MGC53679; squamous cell carcinoma
antigen recognized by T cells; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=765
Score = 46.2 bits (108), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query 98 EVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQENIMDCLP- 156
+V I+Y D G + K+AFR++S FHGK K E+++KKL+ E L++ P
Sbjct 672 DVKIEYVDETGRKLCPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPL 731
Query 157 -TLKALKKVQEGESSAHLVLT 176
T+ L++ Q+ + + ++VL+
Sbjct 732 GTVALLQEKQKAQKTPYIVLS 752
> cpv:cgd4_1570 hypothetical protein
Length=407
Score = 42.4 bits (98), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/141 (22%), Positives = 69/141 (48%), Gaps = 20/141 (14%)
Query 10 SSSSSSSKKQREETEEGEVVENVPKEDEEEEEEEGDSEPEFMAEQPIGDSVSGALQYLQS 69
SS+ ++ +K + TEE V +E+ + + + E+P+ +S L+ L+
Sbjct 269 SSNDNNRQKTSKNTEETRV---------SDEKTQNNLNNNILYEEPLDFGISSTLELLKK 319
Query 70 KDFFSL-----------DKIRRRRGHHLEQPLHNADNDKEVNIDYRDAFGNVMTAKDAFR 118
+ S ++ ++ ++ N+++D +V+I + D GN++ K+AF+
Sbjct 320 RGNISSSNKKDPITSNNNEFGQKNENYSTDSALNSESDFQVSILHTDDNGNILNPKEAFK 379
Query 119 RISWHFHGKFPSLRKQEKKIK 139
R+ W FHG+ + K EK ++
Sbjct 380 RLCWKFHGQKVNKNKIEKMLR 400
> ath:AT3G14700 hypothetical protein
Length=204
Score = 40.8 bits (94), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 17/122 (13%)
Query 15 SSKKQREETEEGEVVENVPKEDEEEEEEEGDSEPEFMAEQPIGDSVSGALQYLQSKDFFS 74
SS+++RE + E + V K + GD M E +G +SGAL L+ + F
Sbjct 52 SSERRREVCSKAEDI--VDKAIDNHSRVRGDG---IMREADVGTGLSGALNRLREQGTF- 105
Query 75 LDKIRRRRGHHLEQPLHNADND------KEVNIDYRDAFGNVMTAKDAFRRISWHFHGKF 128
+ G + +N ++D K++ I + +G +MT K+A+R + FHGK
Sbjct 106 -----KEEGKVVGVKDNNHEDDRFKDRFKDIQIQRVNKWGRIMTEKEAYRSLCHGFHGKG 160
Query 129 PS 130
P
Sbjct 161 PG 162
> sce:YOR308C SNU66; Component of the U4/U6.U5 snRNP complex involved
in pre-mRNA splicing via spliceosome; also required
for pre-5S rRNA processing and may act in concert with Rnh70p;
has homology to human SART-1; K11984 U4/U6.U5 tri-snRNP-associated
protein 1
Length=587
Score = 39.7 bits (91), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 0/55 (0%)
Query 96 DKEVNIDYRDAFGNVMTAKDAFRRISWHFHGKFPSLRKQEKKIKKLELERRLQEN 150
D ++ + YRD GN +T K+A++++S FHG + +K+ K ++E + EN
Sbjct 524 DPDIKLVYRDEKGNRLTTKEAYKKLSQKFHGTKSNKKKRAKMKSRIEARKNTPEN 578
> dre:338237 id:ibd1338; si:ch211-266d19.3
Length=1677
Score = 32.0 bits (71), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query 15 SSKKQREETE-EGEVVENVPKEDEEE--EEEEGDSEPEFMAEQPIGDS-VSGALQYLQSK 70
S +K + ETE G VVE+ P ED+EE +EEE +EP+ +P + + +K
Sbjct 1446 SVRKGKAETEGNGSVVESGPDEDKEERSDEEEPATEPKSAGREPGSKPDKRKKVCSICNK 1505
Query 71 DFFSL-DKIRRRRGHHLEQP 89
F+SL D R R H E+P
Sbjct 1506 RFWSLQDLTRHMRSHTGERP 1525
Lambda K H
0.308 0.127 0.348
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 4600750868
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
Posted date: Sep 17, 2011 11:19 AM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40