bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
164,496 sequences; 82,071,388 total letters
Query= Emax_2737_orf2
Length=126
Score E
Sequences producing significant alignments: (Bits) Value
hsa:56949 XAB2, DKFZp762C1015, HCNP, HCRN, NTC90, SYF1; XPA bi... 150 8e-37
mmu:67439 Xab2, 0610041O14Rik, AV025587; XPA binding protein 2... 150 8e-37
xla:100036801 xab2; XPA binding protein 2; K12867 pre-mRNA-spl... 140 1e-33
dre:387600 xab2, MGC198247, zgc:63498, zgc:63949; XPA binding ... 137 9e-33
cel:C50F2.3 hypothetical protein; K12867 pre-mRNA-splicing fac... 131 5e-31
ath:AT5G28740 transcription-coupled DNA repair protein-related... 113 2e-25
tgo:TGME49_105240 XPA-binding protein, putative ; K12867 pre-m... 106 2e-23
bbo:BBOV_IV000660 21.m02991; XBA-binding protein 2; K12867 pre... 86.3 2e-17
tpv:TP01_0087 adapter protein; K12867 pre-mRNA-splicing factor... 81.3 8e-16
sce:YDR416W SYF1, NTC90; Member of the NineTeen Complex (NTC) ... 43.9 1e-04
ath:AT4G37170 pentatricopeptide (PPR) repeat-containing protein 37.0 0.017
pfa:PFL1735c RNA-processing protein, putative; K12867 pre-mRNA... 36.2 0.023
ath:AT1G07740 pentatricopeptide (PPR) repeat-containing protein 33.9 0.14
ath:AT1G06140 pentatricopeptide (PPR) repeat-containing protein 33.1 0.22
mmu:98999 Znfx1, AI481105; zinc finger, NFX1-type containing 1 32.3
ath:AT1G26900 pentatricopeptide (PPR) repeat-containing protein 32.0 0.49
ath:AT4G19191 hypothetical protein 31.6 0.59
ath:AT3G49140 hypothetical protein 30.8 1.0
bbo:BBOV_III001340 hypothetical protein 30.8 1.2
tgo:TGME49_069200 crooked neck-like protein 1, putative ; K128... 30.8 1.3
ath:AT1G25360 pentatricopeptide (PPR) repeat-containing protein 30.0 1.9
ath:AT5G06540 pentatricopeptide (PPR) repeat-containing protein 30.0 2.1
cel:M03F8.3 hypothetical protein; K12869 crooked neck 29.3 3.0
ath:AT2G37310 pentatricopeptide (PPR) repeat-containing protein 29.3 3.1
ath:AT3G22690 pentatricopeptide (PPR) repeat-containing protein 29.3 3.2
ath:AT4G15720 pentatricopeptide (PPR) repeat-containing protein 29.3 3.4
ath:AT3G12770 pentatricopeptide (PPR) repeat-containing protein 29.3 3.5
ath:AT5G55740 CRR21; CRR21 (chlororespiratory reduction 21) 29.3 3.6
ath:AT2G33760 pentatricopeptide (PPR) repeat-containing protein 29.3 3.6
ath:AT3G50420 pentatricopeptide (PPR) repeat-containing protein 28.9 3.7
ath:AT2G22410 pentatricopeptide (PPR) repeat-containing protein 28.9 3.9
ath:AT2G18520 pentatricopeptide (PPR) repeat-containing protein 28.9 4.0
ath:AT1G74400 pentatricopeptide (PPR) repeat-containing protein 28.9 4.0
tgo:TGME49_093260 pyruvate dehydrogenase (lipoamide) kinase, p... 28.9 4.2
ath:AT4G14820 pentatricopeptide (PPR) repeat-containing protein 28.5 5.7
cel:T19C4.5 hypothetical protein 28.1 6.6
ath:AT2G45350 CRR4; CRR4 (CHLORORESPIRATORY REDUCTION 4) 28.1 7.1
ath:AT4G18840 pentatricopeptide (PPR) repeat-containing protein 28.1 7.3
ath:AT5G21482 CKX7; CKX7 (CYTOKININ OXIDASE 7); cytokinin dehy... 28.1 7.9
sce:YPR189W SKI3, SKI5; Ski complex component and TPR protein,... 27.7 8.4
tgo:TGME49_011370 hypothetical protein 27.7 9.4
> hsa:56949 XAB2, DKFZp762C1015, HCNP, HCRN, NTC90, SYF1; XPA
binding protein 2; K12867 pre-mRNA-splicing factor SYF1
Length=855
Score = 150 bits (380), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 72/125 (57%), Positives = 97/125 (77%), Gaps = 3/125 (2%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
+L+VDAI+R G+ R+ DQ G+LW SLA+YY R G F+KARDV+EEA+ TV T+RDF+QVF
Sbjct 237 SLNVDAIIRGGLTRFTDQLGKLWCSLADYYIRSGHFEKARDVYEEAIRTVMTVRDFTQVF 296
Query 62 DAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALRQN 121
D+Y Q EE +I KM +TA + E+D+++L+LR+ R+E +I RRPLLLNSV LRQN
Sbjct 297 DSYAQFEESMIAAKM---ETASELGREEEDDVDLELRLARFEQLISRRPLLLNSVLLRQN 353
Query 122 PHNVE 126
PH+V
Sbjct 354 PHHVH 358
> mmu:67439 Xab2, 0610041O14Rik, AV025587; XPA binding protein
2; K12867 pre-mRNA-splicing factor SYF1
Length=855
Score = 150 bits (380), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 72/125 (57%), Positives = 97/125 (77%), Gaps = 3/125 (2%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
+L+VDAI+R G+ R+ DQ G+LW SLA+YY R G F+KARDV+EEA+ TV T+RDF+QVF
Sbjct 237 SLNVDAIIRGGLTRFTDQLGKLWCSLADYYIRSGHFEKARDVYEEAIRTVMTVRDFTQVF 296
Query 62 DAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALRQN 121
D+Y Q EE +I KM +TA + E+D+++L+LR+ R+E +I RRPLLLNSV LRQN
Sbjct 297 DSYAQFEESMIAAKM---ETASELGREEEDDVDLELRLARFEQLISRRPLLLNSVLLRQN 353
Query 122 PHNVE 126
PH+V
Sbjct 354 PHHVH 358
> xla:100036801 xab2; XPA binding protein 2; K12867 pre-mRNA-splicing
factor SYF1
Length=838
Score = 140 bits (353), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/125 (57%), Positives = 97/125 (77%), Gaps = 3/125 (2%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
+LD AI+R G+ R+ DQ+G+LW +LAEY+TR G F+KARDV+EEA+ TV T+RDF+QVF
Sbjct 228 SLDAAAIIRGGLTRFTDQRGKLWCALAEYHTRSGHFEKARDVYEEAIQTVTTVRDFTQVF 287
Query 62 DAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALRQN 121
D+Y Q EE +I KM +T D ++D+L+L+LR+ R+E +IERRPLLLN+V LRQN
Sbjct 288 DSYAQFEESVIAAKM---ETVSDLGKEDEDDLDLELRLARFEQLIERRPLLLNAVLLRQN 344
Query 122 PHNVE 126
PHNV
Sbjct 345 PHNVH 349
> dre:387600 xab2, MGC198247, zgc:63498, zgc:63949; XPA binding
protein 2; K12867 pre-mRNA-splicing factor SYF1
Length=851
Score = 137 bits (345), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 71/125 (56%), Positives = 96/125 (76%), Gaps = 3/125 (2%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
+L+V AI+R G+ R+ DQ G+LW S+A+YY R G F+KARDV+EEA+ TV T+RDF+QVF
Sbjct 232 SLNVGAIIRGGLTRFTDQLGKLWCSMADYYIRSGHFEKARDVYEEAILTVVTVRDFTQVF 291
Query 62 DAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALRQN 121
D+Y Q EE +I KM T+E G+ + D+++L+LR+ R+E +I RRPLLLNSV LRQN
Sbjct 292 DSYAQFEESMIAAKMET--TSELGQDED-DDIDLELRLARFESLITRRPLLLNSVLLRQN 348
Query 122 PHNVE 126
PHNV
Sbjct 349 PHNVH 353
Score = 29.3 bits (64), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query 23 LWISLAEYYTRCGLFDKARDVFEEAV-ATVRTIRDFSQVFDAYTQSEER 70
LW+S A++Y D AR +FE+A + + D + V+ Y + E R
Sbjct 392 LWVSFAKFYEDNEQIDDARTIFEKATKVNYKQVDDLAAVWCEYGEMELR 440
> cel:C50F2.3 hypothetical protein; K12867 pre-mRNA-splicing factor
SYF1
Length=855
Score = 131 bits (330), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 64/125 (51%), Positives = 90/125 (72%), Gaps = 4/125 (3%)
Query 1 WTLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQV 60
++L+VDAI+R+GI RY DQ G LW SLA+YY R F++ARDV+EEA+A V T+RDF+QV
Sbjct 243 FSLNVDAIIRQGIYRYTDQVGFLWCSLADYYIRSAEFERARDVYEEAIAKVSTVRDFAQV 302
Query 61 FDAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALRQ 120
+DAY EER + + M + + D E +E++L+ RY+ ++ER+ L+NSV LRQ
Sbjct 303 YDAYAAFEEREVSIMMQEVEQSGDPE----EEVDLEWMFQRYQHLMERKNELMNSVLLRQ 358
Query 121 NPHNV 125
NPHNV
Sbjct 359 NPHNV 363
Score = 32.3 bits (72), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query 23 LWISLAEYYTRCGLFDKARDVFEEAV-ATVRTIRDFSQVFDAYTQSEERLIQLKMA 77
LWI LA+ Y G D AR FE AV + + + + V+ AY + E + + K A
Sbjct 403 LWIGLAKLYEDNGDLDAARKTFETAVISQFGGVSELANVWCAYAEMEMKHKRAKAA 458
> ath:AT5G28740 transcription-coupled DNA repair protein-related;
K12867 pre-mRNA-splicing factor SYF1
Length=917
Score = 113 bits (282), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 87/156 (55%), Gaps = 32/156 (20%)
Query 3 LDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFD 62
L+VDAI+R GI ++ D+ G LW SLA+YY R L +KARD++EE + V T+RDFS +FD
Sbjct 231 LNVDAIIRGGIRKFTDEVGMLWTSLADYYIRKNLLEKARDIYEEGMMKVVTVRDFSVIFD 290
Query 63 AYTQSEERLIQLKM---------------AAADTAEDGELTED----------------- 90
Y++ EE + KM D ED L +
Sbjct 291 VYSRFEESTVAKKMEMMSSSDEEDENEENGVEDDEEDVRLNFNLSVKELQRKILNGFWLN 350
Query 91 DELELDLRIVRYEDIIERRPLLLNSVALRQNPHNVE 126
D+ ++DLR+ R E+++ RRP L NSV LRQNPHNVE
Sbjct 351 DDNDVDLRLARLEELMNRRPALANSVLLRQNPHNVE 386
> tgo:TGME49_105240 XPA-binding protein, putative ; K12867 pre-mRNA-splicing
factor SYF1
Length=966
Score = 106 bits (265), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 83/145 (57%), Gaps = 21/145 (14%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
+L +A+LR GI+R+ DQ G+LW +LA ++ R G +K RDVFEEA+ V T+ D + V+
Sbjct 262 SLRAEAVLRSGISRFSDQVGKLWCALASHFVRLGQLEKTRDVFEEALCGVGTLHDLALVY 321
Query 62 DAYTQSEERLIQLKM-------------AAADTAEDGELTEDDEL--------ELDLRIV 100
DA+ Q EE L+ KM AA++ A D + E+D +
Sbjct 322 DAFVQFEESLLAAKMKELEDEENAGPDCAASEDAADRRERKRRRKEKKKQQSEEVDFLMT 381
Query 101 RYEDIIERRPLLLNSVALRQNPHNV 125
R E + ERRPLL++S LRQNPHNV
Sbjct 382 RLEFLTERRPLLVSSCKLRQNPHNV 406
Score = 31.2 bits (69), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 6/57 (10%)
Query 11 EGIARYVDQQ--GR---LWISLAEYYTRCGLFDKARDVFEEAV-ATVRTIRDFSQVF 61
E +A QQ GR LWI+ A YY G AR +FE+A A VRT+ + + ++
Sbjct 429 EAVATVDPQQAVGRVSVLWIAFARYYEDRGDLPNARLIFEKATKARVRTVDELASIW 485
> bbo:BBOV_IV000660 21.m02991; XBA-binding protein 2; K12867 pre-mRNA-splicing
factor SYF1
Length=796
Score = 86.3 bits (212), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 76/126 (60%), Gaps = 14/126 (11%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
++ ++AI++EGIA+Y DQ +LWI LA+ Y G ARDV+EEA+ +V T++DFS +F
Sbjct 287 SVPIEAIIKEGIAKYSDQVAQLWIILADIYILRGQMLNARDVYEEALKSVTTVQDFSTIF 346
Query 62 DAYTQSEERLIQ--LKMAAADTAEDGELTEDDELELDLRIVRYEDIIERRPLLLNSVALR 119
D Y + E+ + K+ AD L++ + + R E++I R +L+ V L+
Sbjct 347 DVYAKFLEKYAKQRKKLRGAD------------LDVVMTVDRLENLINTRAMLMAKVKLK 394
Query 120 QNPHNV 125
QN HNV
Sbjct 395 QNAHNV 400
> tpv:TP01_0087 adapter protein; K12867 pre-mRNA-splicing factor
SYF1
Length=839
Score = 81.3 bits (199), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 68/127 (53%), Gaps = 3/127 (2%)
Query 2 TLDVDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
++ V+ +++EGI +Y DQ LWI LA+ Y G + ARD +EEA+ V T++DFS +F
Sbjct 307 SIPVETLIKEGIGKYTDQVATLWIILADIYIVRGQLNIARDTYEEALDRVTTVQDFSIIF 366
Query 62 DAYTQSEERLIQLKMAAADTAEDG---ELTEDDELELDLRIVRYEDIIERRPLLLNSVAL 118
D Y + E + + D LE + + R E ++ R LLL SV L
Sbjct 367 DVYAKFLENYAKQSNKLGYVLINNLHLHYIRTDHLETLMTVERLESLVNNRALLLASVKL 426
Query 119 RQNPHNV 125
+QN HNV
Sbjct 427 KQNIHNV 433
> sce:YDR416W SYF1, NTC90; Member of the NineTeen Complex (NTC)
that contains Prp19p and stabilizes U6 snRNA in catalytic
forms of the spliceosome containing U2, U5, and U6 snRNAs; null
mutant has splicing defect and arrests in G2/M; homologs
in human and C. elegans; K12867 pre-mRNA-splicing factor SYF1
Length=859
Score = 43.9 bits (102), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 1/120 (0%)
Query 8 ILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQS 67
+R+ Y D+ L +SLA+YY G D D+ ++++ DF ++++ Y
Sbjct 290 FMRQMNGIYPDKWLFLILSLAKYYISRGRLDSCGDLLKKSLQQTLRYSDFDRIYNFYLLF 349
Query 68 EERLIQLKMAAADTAEDGELTEDDELE-LDLRIVRYEDIIERRPLLLNSVALRQNPHNVE 126
E+ Q + + + D E L + +E +I + LN VALRQ+ + VE
Sbjct 350 EQECSQFILGKLKENDSKFFNQKDWTEKLQAHMATFESLINLYDIYLNDVALRQDSNLVE 409
> ath:AT4G37170 pentatricopeptide (PPR) repeat-containing protein
Length=691
Score = 37.0 bits (84), Expect = 0.017, Method: Composition-based stats.
Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 6/60 (10%)
Query 8 ILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQS 67
I+R G+ D LW SL + Y +CG D+AR++F++ V + + ++ + D Y +S
Sbjct 244 IVRAGL----DSDEVLWSSLMDMYGKCGCIDEARNIFDKIVE--KDVVSWTSMIDRYFKS 297
> pfa:PFL1735c RNA-processing protein, putative; K12867 pre-mRNA-splicing
factor SYF1
Length=1031
Score = 36.2 bits (82), Expect = 0.023, Method: Composition-based stats.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 0/61 (0%)
Query 23 LWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTA 82
++I LA + G ++KA D +EE ++ T+ DF +FD Y + + LI L + +
Sbjct 397 IYILLANNFIYDGRWNKAMDSYEEGISECYTVNDFITLFDNYIEMLKMLIDLNIYEQEER 456
Query 83 E 83
E
Sbjct 457 E 457
> ath:AT1G07740 pentatricopeptide (PPR) repeat-containing protein
Length=459
Score = 33.9 bits (76), Expect = 0.14, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query 5 VDAILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVA--TVRTIRDFSQVFD 62
VD ILR R V + L++ L ++Y + G DKA DVF + + VRTI+ + + +
Sbjct 100 VDQILRLVRYRNVRCRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLIN 159
Query 63 AYTQSEE 69
+ E
Sbjct 160 VLVDNGE 166
> ath:AT1G06140 pentatricopeptide (PPR) repeat-containing protein
Length=558
Score = 33.1 bits (74), Expect = 0.22, Method: Composition-based stats.
Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 0/33 (0%)
Query 16 YVDQQGRLWISLAEYYTRCGLFDKARDVFEEAV 48
++DQ L S+ + Y +C L D AR +FE +V
Sbjct 241 FIDQSDYLQASIIDMYVKCRLLDNARKLFETSV 273
> mmu:98999 Znfx1, AI481105; zinc finger, NFX1-type containing
1
Length=1909
Score = 32.3 bits (72), Expect = 0.33, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query 27 LAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADT 81
L RC + +K + E V+++R I + + F TQ +E+L+Q KM A T
Sbjct 1758 LVNLLMRCKMAEKVKGSIAEEVSSIRNILEKTSKF---TQEDEQLVQKKMDALKT 1809
> ath:AT1G26900 pentatricopeptide (PPR) repeat-containing protein
Length=572
Score = 32.0 bits (71), Expect = 0.49, Method: Composition-based stats.
Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 5/68 (7%)
Query 9 LREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSE 68
LR G + D + +L +Y CG AR VF+E +V + FS + + Y Q
Sbjct 152 LRSGFMVFTDLRN----ALIHFYCVCGKISDARKVFDEMPQSVDAV-TFSTLMNGYLQVS 206
Query 69 ERLIQLKM 76
++ + L +
Sbjct 207 KKALALDL 214
> ath:AT4G19191 hypothetical protein
Length=654
Score = 31.6 bits (70), Expect = 0.59, Method: Composition-based stats.
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 6/64 (9%)
Query 5 VDAILREGIARYVDQQ---GRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVF 61
++A+ GI VD Q WIS Y +CG D A+ VFE RT+ ++ +F
Sbjct 172 LEAMHAVGIRLGVDVQVTVANTWIST---YGKCGDLDSAKLVFEAIDRGDRTVVSWNSMF 228
Query 62 DAYT 65
AY+
Sbjct 229 KAYS 232
> ath:AT3G49140 hypothetical protein
Length=896
Score = 30.8 bits (68), Expect = 1.0, Method: Composition-based stats.
Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAY 64
+L + Y +CG +KARDVFE + R + ++ + AY
Sbjct 349 ALIDMYAKCGCLEKARDVFENMKS--RDVVSWTAMISAY 385
> bbo:BBOV_III001340 hypothetical protein
Length=240
Score = 30.8 bits (68), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 13/91 (14%)
Query 5 VDAI---LREGIARYVDQQ------GRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIR 55
+DAI LRE + +Y+D Q L L++ RCG FDK D + + +
Sbjct 154 IDAIPTKLREAVNKYLDIQHFDDLVANLPKELSDEVKRCGSFDKIHDELKHKIHDHYS-- 211
Query 56 DFSQVFDAYTQSEERLIQLKMAAADTAEDGE 86
FD + + K + T +D E
Sbjct 212 --KHCFDGTEVKHHKRLSKKRTSTRTQDDFE 240
> tgo:TGME49_069200 crooked neck-like protein 1, putative ; K12869
crooked neck
Length=794
Score = 30.8 bits (68), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 6/83 (7%)
Query 24 WISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAE 83
W+ + RC D+AR VFE ++ R + F + + EER Q+ A A +
Sbjct 309 WMLYIHFEERCKELDRARKVFERYLSN----RPSQESFLRFCKFEERHRQIPRARAGFEK 364
Query 84 DGELTEDDELE--LDLRIVRYED 104
EL +D L+ L+ ++E+
Sbjct 365 AIELLPEDMLDEHFFLKFAQFEE 387
> ath:AT1G25360 pentatricopeptide (PPR) repeat-containing protein
Length=790
Score = 30.0 bits (66), Expect = 1.9, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQS----EERLIQLKM 76
SL Y +CG FD+AR +FE+ A + + ++ + Y S E +LI +M
Sbjct 325 SLVSLYYKCGKFDEARAIFEKMPA--KDLVSWNALLSGYVSSGHIGEAKLIFKEM 377
> ath:AT5G06540 pentatricopeptide (PPR) repeat-containing protein
Length=622
Score = 30.0 bits (66), Expect = 2.1, Method: Composition-based stats.
Identities = 9/23 (39%), Positives = 16/23 (69%), Gaps = 0/23 (0%)
Query 24 WISLAEYYTRCGLFDKARDVFEE 46
W S+ Y +CG+ + AR++F+E
Sbjct 186 WTSMVAGYCKCGMVENAREMFDE 208
> cel:M03F8.3 hypothetical protein; K12869 crooked neck
Length=747
Score = 29.3 bits (64), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 0/26 (0%)
Query 22 RLWISLAEYYTRCGLFDKARDVFEEA 47
++WIS+AE+ G F+ AR FE A
Sbjct 559 KVWISMAEFEQTIGNFEGARKAFERA 584
> ath:AT2G37310 pentatricopeptide (PPR) repeat-containing protein
Length=657
Score = 29.3 bits (64), Expect = 3.1, Method: Composition-based stats.
Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 7/61 (11%)
Query 30 YYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSE-----ERLIQLKMAAADTAED 84
YYT+C + AR VF+E + R + ++ + Y+QS +++ + +A +D +
Sbjct 176 YYTKCDNIESARKVFDE--MSERDVVSWNSMISGYSQSGSFEDCKKMYKAMLACSDFKPN 233
Query 85 G 85
G
Sbjct 234 G 234
> ath:AT3G22690 pentatricopeptide (PPR) repeat-containing protein
Length=978
Score = 29.3 bits (64), Expect = 3.2, Method: Composition-based stats.
Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 0/21 (0%)
Query 26 SLAEYYTRCGLFDKARDVFEE 46
SL +Y CG D AR VF+E
Sbjct 174 SLVHFYAECGELDSARKVFDE 194
> ath:AT4G15720 pentatricopeptide (PPR) repeat-containing protein
Length=616
Score = 29.3 bits (64), Expect = 3.4, Method: Composition-based stats.
Identities = 12/42 (28%), Positives = 22/42 (52%), Gaps = 0/42 (0%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQS 67
SL + Y +C + AR VF+ + R + ++ + AY Q+
Sbjct 171 SLVDMYGKCNDVETARRVFDSMIGYGRNVVSWTSMITAYAQN 212
> ath:AT3G12770 pentatricopeptide (PPR) repeat-containing protein
Length=719
Score = 29.3 bits (64), Expect = 3.5, Method: Composition-based stats.
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 0/50 (0%)
Query 27 LAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKM 76
L Y +C AR VFE RTI ++ + AY Q+ E + L++
Sbjct 160 LIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEI 209
> ath:AT5G55740 CRR21; CRR21 (chlororespiratory reduction 21)
Length=830
Score = 29.3 bits (64), Expect = 3.6, Method: Composition-based stats.
Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 0/21 (0%)
Query 26 SLAEYYTRCGLFDKARDVFEE 46
SLA+ Y +CG+ D A VF+E
Sbjct 213 SLADMYGKCGVLDDASKVFDE 233
> ath:AT2G33760 pentatricopeptide (PPR) repeat-containing protein
Length=583
Score = 29.3 bits (64), Expect = 3.6, Method: Composition-based stats.
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query 8 ILREGIARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEAVAT 50
I+ EG+ D +L +L Y+RCG KAR+VF++ T
Sbjct 234 IISEGL----DLNVKLGTALINLYSRCGDVGKAREVFDKMKET 272
> ath:AT3G50420 pentatricopeptide (PPR) repeat-containing protein
Length=794
Score = 28.9 bits (63), Expect = 3.7, Method: Composition-based stats.
Identities = 12/44 (27%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEE 69
+L Y RCG ++AR VF++ R + ++ ++ AY+++ +
Sbjct 137 NLISMYVRCGSLEQARKVFDKMPH--RNVVSYNALYSAYSRNPD 178
> ath:AT2G22410 pentatricopeptide (PPR) repeat-containing protein
Length=681
Score = 28.9 bits (63), Expect = 3.9, Method: Composition-based stats.
Identities = 9/23 (39%), Positives = 15/23 (65%), Gaps = 0/23 (0%)
Query 24 WISLAEYYTRCGLFDKARDVFEE 46
W ++ Y RCGL D +R +F++
Sbjct 326 WTTMISGYARCGLLDVSRKLFDD 348
> ath:AT2G18520 pentatricopeptide (PPR) repeat-containing protein
Length=418
Score = 28.9 bits (63), Expect = 4.0, Method: Composition-based stats.
Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query 26 SLAEYYTRCGLFDKARDVFEE--AVATVRTIRDFSQVFDAYTQSE--ERLIQL 74
+L Y R +FD A +FEE + T RT+ F+ + A S+ ER+ QL
Sbjct 107 TLIRSYGRASMFDHAMKMFEEMDKLGTPRTVVSFNALLAACLHSDLFERVPQL 159
> ath:AT1G74400 pentatricopeptide (PPR) repeat-containing protein
Length=462
Score = 28.9 bits (63), Expect = 4.0, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 4/69 (5%)
Query 26 SLAEYYTRCGLFDKARDVFEEAV---ATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTA 82
SL Y + G +KAR +F+E++ T T F + Q L + KM D +
Sbjct 209 SLLNMYVKSGETEKARKLFDESMRKDVTTYTSMIFGYALNGQAQESLELFK-KMKTIDQS 267
Query 83 EDGELTEDD 91
+D +T +D
Sbjct 268 QDTVITPND 276
Score = 28.1 bits (61), Expect = 7.6, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAEDG 85
SL +Y+ G D AR VF+E + I ++ + AYT++E + +++ AE
Sbjct 105 SLVGFYSSVGDVDYARQVFDETPEK-QNIVLWTAMISAYTENENSVEAIELFKRMEAEKI 163
Query 86 EL 87
EL
Sbjct 164 EL 165
> tgo:TGME49_093260 pyruvate dehydrogenase (lipoamide) kinase,
putative (EC:2.7.11.2)
Length=1105
Score = 28.9 bits (63), Expect = 4.2, Method: Composition-based stats.
Identities = 20/64 (31%), Positives = 24/64 (37%), Gaps = 12/64 (18%)
Query 29 EYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAEDGELT 88
E YTRC R ++ E+ V R EE + QL A E E
Sbjct 475 ELYTRCSRLAAVRAIYVESCDRVTRCR------------EEMMTQLVEEATVQREKAEAA 522
Query 89 EDDE 92
EDDE
Sbjct 523 EDDE 526
> ath:AT4G14820 pentatricopeptide (PPR) repeat-containing protein
Length=722
Score = 28.5 bits (62), Expect = 5.7, Method: Composition-based stats.
Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAEDG 85
+L Y +CG D RDVFE+ R + +S + +A + E L + A E+
Sbjct 384 ALINMYAKCGGLDATRDVFEK--MPRRNVVSWSSMINALSMHGEASDALSLFARMKQENV 441
Query 86 ELTE 89
E E
Sbjct 442 EPNE 445
> cel:T19C4.5 hypothetical protein
Length=360
Score = 28.1 bits (61), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 15/63 (23%), Positives = 30/63 (47%), Gaps = 0/63 (0%)
Query 44 FEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAEDGELTEDDELELDLRIVRYE 103
+E ++ +RDF ++ Y + L ++A + G+L E D L+++ R E
Sbjct 147 LKEPSTSIDILRDFGAIYAKYPKRVSALYFERLANSMNEAAGQLQESDHLDVEKTQSRME 206
Query 104 DII 106
D +
Sbjct 207 DAV 209
> ath:AT2G45350 CRR4; CRR4 (CHLORORESPIRATORY REDUCTION 4)
Length=606
Score = 28.1 bits (61), Expect = 7.1, Method: Composition-based stats.
Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 0/79 (0%)
Query 26 SLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLKMAAADTAEDG 85
S+ + Y +CGL AR++F+ ++ + ++ + Y Q+ + + AD E
Sbjct 185 SMIDGYVKCGLIVSARELFDLMPMEMKNLISWNSMISGYAQTSDGVDIASKLFADMPEKD 244
Query 86 ELTEDDELELDLRIVRYED 104
++ + ++ ++ R ED
Sbjct 245 LISWNSMIDGYVKHGRIED 263
> ath:AT4G18840 pentatricopeptide (PPR) repeat-containing protein
Length=545
Score = 28.1 bits (61), Expect = 7.3, Method: Composition-based stats.
Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 5/39 (12%)
Query 11 EGIARYVDQ-----QGRLWISLAEYYTRCGLFDKARDVF 44
E + Y+D+ +G L +L + Y++CG DKA +VF
Sbjct 324 EWVHVYIDKHGIEIEGFLATALVDMYSKCGKIDKALEVF 362
> ath:AT5G21482 CKX7; CKX7 (CYTOKININ OXIDASE 7); cytokinin dehydrogenase/
oxidoreductase (EC:1.5.99.12); K00279 cytokinin
dehydrogenase [EC:1.5.99.12]
Length=524
Score = 28.1 bits (61), Expect = 7.9, Method: Composition-based stats.
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 0/57 (0%)
Query 19 QQGRLWISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYTQSEERLIQLK 75
+ L+ S+ + G+ +AR + + A VR IR FD +TQ E L+ K
Sbjct 210 ENSELFFSVLGGLGQFGIITRARVLLQPAPDMVRWIRVVYTEFDEFTQDAEWLVSQK 266
> sce:YPR189W SKI3, SKI5; Ski complex component and TPR protein,
mediates 3'-5' RNA degradation by the cytoplasmic exosome;
null mutants have superkiller phenotype of increased viral
dsRNAs and are synthetic lethal with mutations in 5'-3' mRNA
decay; K12600 superkiller protein 3
Length=1432
Score = 27.7 bits (60), Expect = 8.4, Method: Composition-based stats.
Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query 24 WISLAEYYTRCGLFDKARDVFEEAVATVRTIRDFSQVFDAYT 65
W+ L + Y CG + + VF++A+ +R F+Q F A +
Sbjct 739 WVGLGQAYHACGRIEASIKVFDKAI-QLRPSHTFAQYFKAIS 779
> tgo:TGME49_011370 hypothetical protein
Length=697
Score = 27.7 bits (60), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 19/89 (21%)
Query 13 IARYVDQQGRLWISLAEYYTRCGLFDKARDVFEEA-----------------VATVRTIR 55
IA Y ++Q L + A +Y CG +KA D++ A RT++
Sbjct 403 IAVYYEKQN-LPVQAARHYANCGNAEKALDLYLNADVPAYDAAIELVSRFRQQPLTRTLQ 461
Query 56 DFSQ-VFDAYTQSEERLIQLKMAAADTAE 83
DF + V D ++ L +L MA D E
Sbjct 462 DFLEGVSDGVSKDPSFLHKLHMALGDHVE 490
Lambda K H
0.321 0.137 0.390
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 2059772308
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
Posted date: Sep 17, 2011 11:19 AM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40