bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-34_CDS_annotation_glimmer3.pl_2_4
Length=371
Score E
Sequences producing significant alignments: (Bits) Value
gi|575096060|emb|CDL66943.1| unnamed protein product 114 2e-25
gi|575094546|emb|CDL65906.1| unnamed protein product 112 1e-24
gi|575094487|emb|CDL65854.1| unnamed protein product 108 3e-23
gi|575094494|emb|CDL65868.1| unnamed protein product 105 3e-22
gi|530695371|gb|AGT39925.1| replication initiator 96.7 2e-19
gi|575094436|emb|CDL65809.1| unnamed protein product 96.7 2e-19
gi|585369477|ref|WP_024251053.1| hypothetical protein 88.6 4e-17
gi|575094418|emb|CDL65793.1| unnamed protein product 89.4 1e-16
gi|313766924|gb|ADR80651.1| putative replication initiation protein 84.7 2e-15
gi|575094569|emb|CDL65925.1| unnamed protein product 82.4 3e-14
>gi|575096060|emb|CDL66943.1| unnamed protein product [uncultured bacterium]
Length=339
Score = 114 bits (285), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/236 (37%), Positives = 117/236 (50%), Gaps = 30/236 (13%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCR+D SR WA+R +LEL+D+D A F T TY++ +P +++ +
Sbjct 56 IGCRIDYSRQWANRCMLELQDHD-SAFFCTFTYDNDHVPISYYADKETGEA--------- 105
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHAII 120
TL RD QL MKR+RK F D +RFF AGEYG +T RPHYHAII
Sbjct 106 ------------KPSLTLRKRDFQLLMKRIRKHFSDDHIRFFAAGEYGGQTLRPHYHAII 153
Query 121 YGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGN------GYCVLAPVNWNTCAYVSRY 174
YGL L+D + + Y S S ++ W + G+ V+ V W +CAY +RY
Sbjct 154 YGLHLNDLVPYKTVKEGGVLYTYYNSPSLQKCWLDSDGKPIGFVVVGAVTWESCAYTARY 213
Query 175 TMKKVYKSENSHAYASGQL-PPFCTMSRRPGIGLLHADDLLKKGDKTFIRDIDLNG 229
+KK K E S Y L P F MSR+PGI + D FI L G
Sbjct 214 VLKK-QKGEASTVYQEFNLEPEFTLMSRKPGIARNYYDTHPDLFQSDFINISTLKG 268
>gi|575094546|emb|CDL65906.1| unnamed protein product [uncultured bacterium]
Length=351
Score = 112 bits (280), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/218 (39%), Positives = 115/218 (53%), Gaps = 35/218 (16%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCRLD SR WADR++LEL+ + A+FVTLTY++ ++P + DG
Sbjct 62 IGCRLDYSRRWADRLMLELQYHT-AAIFVTLTYSELNVPKHHYQTP---DG--------- 108
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHAII 120
+L RD QLF KRLRK + D ++RFFL+GEYGPKT RPHYHAII
Sbjct 109 ----------DVNTSYSLDKRDVQLFFKRLRKMYPDTKIRFFLSGEYGPKTFRPHYHAII 158
Query 121 YGLTLS-DFKDCRIKDFNKLGQPRYISKSFERIWGN-----------GYCVLAPVNWNTC 168
+G+ + D R++ + + Y S S ER W G + V+W+TC
Sbjct 159 FGVDFAHDRYVWRVRRADNMFVNYYRSPSLERAWSVYNNDVGDYVPIGNVEFSDVSWHTC 218
Query 169 AYVSRYTMKKVYKSENSHAYASGQLPPFCTMSRRPGIG 206
AYV+RY KK+ + PPF MSR+PGI
Sbjct 219 AYVARYVTKKLTGNLAQFYTTFNLTPPFSLMSRKPGIA 256
>gi|575094487|emb|CDL65854.1| unnamed protein product [uncultured bacterium]
Length=332
Score = 108 bits (269), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 104/213 (49%), Gaps = 26/213 (12%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCRL +SR WA+R+++E + ++ F+TLTYND LP ++ V
Sbjct 57 VGCRLSKSREWANRVVME-QLYHVESWFLTLTYNDEHLPRSFPVDEA------------- 102
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHAII 120
TL D Q F+KRLRK + F AGEYG RPHYH +I
Sbjct 103 -------TGEILSVHGTLVKEDLQKFLKRLRKNSGQKLRFFA-AGEYGSLNMRPHYHLLI 154
Query 121 YGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVNWNTCAYVSRYTMKKVY 180
+GL L D + R + LG Y S E+ W G+ +L V W + AYV+RYTMKK
Sbjct 155 FGLHLEDLQLLRK---SPLGDEYYTSSLLEKCWPFGFHILGRVTWQSAAYVARYTMKKAS 211
Query 181 KSENSHAYASGQL-PPFCTMSRRPGIGLLHADD 212
K + Y L P F MS RPG+ + +D
Sbjct 212 KGYDKDLYKKAALQPEFQVMSNRPGLARQYYED 244
>gi|575094494|emb|CDL65868.1| unnamed protein product [uncultured bacterium]
Length=348
Score = 105 bits (261), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/256 (32%), Positives = 124/256 (48%), Gaps = 34/256 (13%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCRL SR WADR +LE + + + F+TLTY+D +LP + + + + Y
Sbjct 68 VGCRLAYSRQWADRCMLESSYHTH-SYFLTLTYDDDNLPLSESINQDTGEINYNA----- 121
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKr-----lrktfrdrrlrfflAGEYGPKTHRPH 115
TL +D Q F+KR + +++F AGEYG +T RPH
Sbjct 122 ----------------TLVKKDIQDFIKRLRRFCEYNIDDNLHIKYFCAGEYGSQTFRPH 165
Query 116 YHAIIYGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVNWNTCAYVSRYT 175
YH I+YG ++D K + + G Y S + +++W G+ V+ V W+TCAY +RY
Sbjct 166 YHMILYGFPINDLK---LYKMSLDGYNYYNSATIDKLWKKGFVVIGEVTWDTCAYTARYI 222
Query 176 MKKVYKSENSHAYASGQLPPFCTMSRRPGIGLLHADDLLKKGDKTFIRD-IDLNGKECTR 234
+KK Y S LP F MS +P I + +D DK F D I L KE +
Sbjct 223 LKKQYGSGAQIYKDYNILPEFTCMSTKPAIAREYYED---NKDKIFDSDYIFLGTKEKSI 279
Query 235 EVYLGRAFIRSAAREH 250
++ + F + +E+
Sbjct 280 QMKPPKYFEKLLEKEN 295
>gi|530695371|gb|AGT39925.1| replication initiator [Marine gokushovirus]
Length=316
Score = 96.7 bits (239), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/213 (34%), Positives = 111/213 (52%), Gaps = 43/213 (20%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCRL++SR WA R E K + F+TLTYN LP
Sbjct 55 IGCRLEKSRQWALRCTHEAKLYKNNS-FITLTYNSDHLP--------------------- 92
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHAII 120
+ TL++R QLF+KRLRK + ++ +RF+ GEYG HRPHYHA++
Sbjct 93 ---------LTNNSLPTLNLRHFQLFLKRLRKKYSNKTIRFYHCGEYGDMNHRPHYHALL 143
Query 121 YGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGN-------GYCVLAPVNWNTCAYVSR 173
+ DF+D ++ +K Q Y S+ + +W + G+ + + +++ AYV+R
Sbjct 144 FN---HDFEDKKLWKIHK-DQNYYTSEVLDGLWTDPKTKSNMGFSTIGDLTFDSAAYVAR 199
Query 174 YTMKKVYKSENSHAYASGQLPPFCTMSRRPGIG 206
Y +KK+ +N+ Y G++P + TMSRRPGIG
Sbjct 200 YCLKKI-TGKNAEDYYQGRVPEYATMSRRPGIG 231
>gi|575094436|emb|CDL65809.1| unnamed protein product [uncultured bacterium]
Length=340
Score = 96.7 bits (239), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 109/236 (46%), Gaps = 24/236 (10%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvplv 60
+GCR+ + WA R+ LE + +A FVTLTY D ++P +G
Sbjct 55 IGCRIRAKQDWATRLELEARAYKGRAWFVTLTYRDDTIPLLIRNTGELIEGGVSMWSRGA 114
Query 61 ldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrffl-----AGEYGPKTHRPH 115
++ TL++ D F KRLRK AGEYG +T RPH
Sbjct 115 DVPEQI---------NTLNMDDVTKFWKRLRKYQTTEPDMGKELRYFYAGEYGEQTGRPH 165
Query 116 YHAIIYGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVNWNTCAYVSRYT 175
YHAII+GL + D K ++ N+ Y S E+IWG G +A T YV+ Y
Sbjct 166 YHAIIFGLEIPDLK--KVPGRNQY----YKSAILEKIWGKGNVTIAYSEPGTYNYVAGYV 219
Query 176 MKKVYKSENSHAYASGQLPPFCTMSRRPGIGLLHADDLLKKGDKTFIRD-IDLNGK 230
KK+Y ++ G P+ MSR+PGIG+ + L DK + +D I L GK
Sbjct 220 TKKMYGNDTKEYQNLGLTAPYACMSRKPGIGMPWLEQNL---DKLWEQDYIQLAGK 272
>gi|585369477|ref|WP_024251053.1| hypothetical protein [Escherichia coli]
Length=243
Score = 88.6 bits (218), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/240 (33%), Positives = 101/240 (42%), Gaps = 42/240 (18%)
Query 77 TLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHAIIYGLTLSDFKDCRIKDF 136
+L RD QLF KRLRK F D +R+F GEYG T RPHYHAI++GL L D +
Sbjct 5 SLCKRDLQLFWKRLRKAFPDDHIRYFACGEYGSTTFRPHYHAIVFGLHLHDLIPVQDIRR 64
Query 137 NKLGQPRYISKSFERIWGN----------------GYCVLAPVNWNTCAYVSRYTMKKVY 180
+G + S+S +R W GY ++ VNW TCAYV+RY +KK
Sbjct 65 GDVGYQYFYSESLQRAWSVVEQKGEYDTPCIRKPIGYVLVGQVNWETCAYVARYVLKKAC 124
Query 181 KSENSHAYASGQLPPFCTMSRRPGIGLLHADDLLK------------------KGDKTFI 222
E P + MSRRPGIG DD + + K F
Sbjct 125 GPEADVYQTFNIQPEYVDMSRRPGIGRQWYDDHPECMEYDTISISTPDGGRKIRPPKYFD 184
Query 223 RDIDLNGKECTREVYLGRAFIRSAAREHMKPVFAAADLVESVQTCINELEAESCTEHEDV 282
+ DL E E+ A R+H A L +S T LE + H +
Sbjct 185 KLFDLEQPELMAEI--------KAKRKHFAEEGKKAKLAQSTMTYEEILETQERVLHNRI 236
>gi|575094418|emb|CDL65793.1| unnamed protein product [uncultured bacterium]
Length=367
Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 103/211 (49%), Gaps = 20/211 (9%)
Query 1 MGCRLDRSRVWADRMLLELKDNDYK-ALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvpl 59
+ CR+ + WA R EL+ N +K ++F+TLTY++ +P + G
Sbjct 50 LACRIQYAANWAAR--CELETNYHKQSIFLTLTYDEEHVPVLNKETGEIYRGV------- 100
Query 60 vldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrffl----AGEYGPKTHRPH 115
+ E++A T+ D Q F+KRLRK L + +GEYG KT RPH
Sbjct 101 --RNPAEYVAGVTLERMTVYKPDVQKFIKRLRKAAEKEGLTDHIMYYLSGEYGDKTGRPH 158
Query 116 YHAIIYGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVNWNTCAYVSRYT 175
YH I+YGL + D + ++ G R+ S+ + IWG G + V + +C YV+RY
Sbjct 159 YHLIVYGLEVPDAEHI----GSRRGYDRFTSEWLKGIWGMGLIEIGSVTYESCQYVARYV 214
Query 176 MKKVYKSENSHAYASGQLPPFCTMSRRPGIG 206
+KK E +G +P F MS +P IG
Sbjct 215 IKKRKGKEAKEYKDAGIMPEFVQMSLKPAIG 245
>gi|313766924|gb|ADR80651.1| putative replication initiation protein [Uncultured Microviridae]
Length=285
Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/217 (29%), Positives = 93/217 (43%), Gaps = 36/217 (17%)
Query 1 MGCRLDRSRVWADRMLLE--LKDNDYKALFVTLTYNDRSLPSAWHVGSNYFDGFYedvvp 58
+GCRLD + +WA R+ E L D+ F+TLTY++ LP W + ++F F + +
Sbjct 9 IGCRLDHAGMWASRIEHESSLYDDSNGNCFITLTYDEEHLPQDWSLDKSHFQKFMKRLRK 68
Query 59 lvldddeeWIaaaagapaTLSIRDTQLFMKrlrktfrdrrlrfflAGEYGPKTHRPHYHA 118
+ G I T G RPHYHA
Sbjct 69 RYPQKIRYYHCGEYGENCRHGIHTTLCP---------------------GCNVGRPHYHA 107
Query 119 IIYGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVNWNTCAYVSRYTMKK 178
I++ + DF D R+ G P + S + IWG+G+ + + + YV+RY +KK
Sbjct 108 ILFNI---DFHD-RVLVGQSKGIPHFTSDTLTEIWGHGFTQVGDLTAQSAGYVARYALKK 163
Query 179 VYKSENSHAYASGQL---------PPFCTMSRRPGIG 206
V ++ Y S L P + TMSR+PGIG
Sbjct 164 VTGTQAEDHYRSIDLTTGEVTYVRPEYATMSRKPGIG 200
>gi|575094569|emb|CDL65925.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/108 (42%), Positives = 60/108 (56%), Gaps = 3/108 (3%)
Query 105 GEYGPKTHRPHYHAIIYGLTLSDFKDCRIKDFNKLGQPRYISKSFERIWGNGYCVLAPVN 164
GEYG T RPHYHAI++G +D + K+F Y+SKS IW NG ++ V
Sbjct 160 GEYGDTTFRPHYHAILFGWRPTDLIQFK-KNFQ--NDTLYLSKSLASIWQNGNVMVGDVT 216
Query 165 WNTCAYVSRYTMKKVYKSENSHAYASGQLPPFCTMSRRPGIGLLHADD 212
+C YV+RY +KK ++ G LP F TMSR+PGI + DD
Sbjct 217 PESCRYVARYCLKKATGFDSEIYERLGVLPEFVTMSRKPGIARKYFDD 264
Lambda K H a alpha
0.324 0.139 0.438 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2277859962564