bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-15_CDS_annotation_glimmer3.pl_2_1
Length=291
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312922|ref|WP_022044634.1| putative replication initiation... 108 7e-24
gi|609718275|emb|CDN73649.1| conserved hypothetical protein 86.3 2e-16
gi|547920048|ref|WP_022322419.1| putative replication protein 77.8 1e-13
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 77.8 2e-13
gi|492501778|ref|WP_005867316.1| hypothetical protein 77.0 3e-13
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 65.1 4e-09
gi|575094374|emb|CDL65755.1| unnamed protein product 63.9 3e-08
gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 62.4 8e-08
gi|313766930|gb|ADR80656.1| putative replication initiation protein 55.5 2e-05
gi|530695361|gb|AGT39916.1| replication initiator 53.1 7e-05
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 108 bits (270), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/221 (33%), Positives = 119/221 (54%), Gaps = 30/221 (14%)
Query 2 ECRQQKQRQWLVRMSEELRQTP--NAYFMTLTISDENYEILKNICKSEDPNTIATKAIRL 59
C++ Q+ +R+ ELR+ P F+TLT +D++ E S+D N KA+RL
Sbjct 49 SCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFNDDSLEKF-----SKDTN----KAVRL 99
Query 60 MLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGI-------------GIDKLIEEK 106
L+R RK GK I+HWF+ E G R H HGI++ + G L+
Sbjct 100 FLDRFRKVYGKQIRHWFVCEFG-TLHGRPHYHGILFNVPQALIDGYDSDMPGHHPLLASC 158
Query 107 WNYGITFTGYFVNEQTINYITKYMTKEDLDHPGFIGKVLCSKGIGMGYAKREDAKNHVYK 166
W YG F GY V+++T +YITKY+TK ++ +V+ S GIG Y E++ H K
Sbjct 159 WKYGFVFVGY-VSDETCSYITKYVTKS-INGDKVRPRVISSFGIGSNYLNTEESSLH--K 214
Query 167 KG-KTIEYYRLRNGSKINLPIYYRNQLFTEEEREMLFLDKI 206
G + + + + NG + +P YY N++F++ +++ + +D++
Sbjct 215 LGNQRYQPFMVLNGFQQAMPRYYYNKIFSDVDKQNMVVDRL 255
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 86.3 bits (212), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 101/206 (49%), Gaps = 19/206 (9%)
Query 1 MECRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEILKNICKSEDPNTIATKAIRLM 60
+ECR+ + W R++EEL+ + +A+F+TLT SD N S D + +L
Sbjct 29 LECRKARTNSWFARLTEELKVSKSAHFVTLTYSDVYLPYSDNGLISLD-----YRDFQLF 83
Query 61 LERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGI-GIDKLIEEKWNYGITFTGYFVN 119
++R RK IK++ + E G + T R H H IV+G+ ID + E W G G V
Sbjct 84 MKRARKLQKSKIKYFLVGEYGAQ-TYRPHYHAIVFGVENIDAFLGE-WRMGNVHAGT-VT 140
Query 120 EQTINYITKYMTKEDLDHPGFIG-------KVLCSKGIGMGYAKREDAKNHVYKKGKTIE 172
++I Y KY TK + P K L SKG+G+ + K Y K
Sbjct 141 AKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLTESMIK---YYKDDVSR 197
Query 173 YYRLRNGSKINLPIYYRNQLFTEEER 198
+ L G+ I LP YYR+++F++ E+
Sbjct 198 SFSLLGGTTIALPRYYRDKVFSDIEK 223
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 77.8 bits (190), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 102/212 (48%), Gaps = 15/212 (7%)
Query 3 CRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEI--LKNICKSEDPNTIATKAIRLM 60
CRQ K++ W+ R+ E ++ P + F+TLT DE+ I + + + ++ + ++L
Sbjct 18 CRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAVVSKRDVQLF 77
Query 61 LERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIGID-----KLIEEKWNYGITFTG 115
++R+RKK +F+T K R H H I++G L+ E W G
Sbjct 78 MKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAGDLLAECWQNGFV-QA 136
Query 116 YFVNEQTINYITKYMTKEDL------DHPGFIGKVLCSKGIGMGYAKREDAKNHVYKKGK 169
+ + + I Y+ KYM ++ + D + +LCS+ G+G+ + Y++
Sbjct 137 HPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPGIGFGFMKADIIEFYRRHP 196
Query 170 TIEYYRLRNGSKINLPIYYRNQLFTEEEREML 201
+Y R G K+ +P YY ++L+ ++ + L
Sbjct 197 R-DYVRAWAGHKMAMPRYYADKLYDDDMKAFL 227
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/216 (27%), Positives = 108/216 (50%), Gaps = 19/216 (9%)
Query 1 MECRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYE---ILKNICKSEDPNTIATKAI 57
+ CR+ K++ W+ R+ E + P + F+TLT DE+ I +++ K+ ++ + I
Sbjct 21 VNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTT-VGVVSKRDI 79
Query 58 RLMLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIGID-----KLIEEKWNYGIT 112
+L ++R+RKK + +F+T + R H H I++G L+ E W G
Sbjct 80 QLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAECWKNGFV 139
Query 113 FTGYFVNEQTINYITKYMTKEDLDHPGFIGKV-------LCSKGIGMGYAKREDAKNHVY 165
+ + + I+Y+TKYM ++ + P + V LCSK G+GY + Y
Sbjct 140 -QAHPLTTKEISYVTKYMYEKSMI-PDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFY 197
Query 166 KKGKTIEYYRLRNGSKINLPIYYRNQLFTEEEREML 201
+ +Y R NG ++ +P YY ++L+ ++ +E L
Sbjct 198 RLHPR-DYVRAFNGMRMAMPRYYADKLYDDDMKEYL 232
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 77.0 bits (188), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 58/216 (27%), Positives = 107/216 (50%), Gaps = 19/216 (9%)
Query 1 MECRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYE---ILKNICKSEDPNTIATKAI 57
+ CR+ K++ W+ R+ E + P + F+TLT DE+ I +++ KS ++ + I
Sbjct 21 VNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHMPTAMIGEDLFKST-VGVVSKRDI 79
Query 58 RLMLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIGID-----KLIEEKWNYGIT 112
+L ++R+RKK + +F+T + R H H I++G L+ E W G
Sbjct 80 QLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAECWKNGFV 139
Query 113 FTGYFVNEQTINYITKYMTKEDLDHPGFIGKV-------LCSKGIGMGYAKREDAKNHVY 165
+ + + I Y+TKYM ++ + P + V LCS+ G+GY + Y
Sbjct 140 -QAHPLTTKEIAYVTKYMYEKSMV-PDILKDVKEYQPFMLCSRIPGIGYHFLREQILDFY 197
Query 166 KKGKTIEYYRLRNGSKINLPIYYRNQLFTEEEREML 201
+ +Y R NG ++ +P YY ++L+ ++ +E L
Sbjct 198 RLHPR-DYVRAFNGMRMAMPRYYADKLYDDDMKEYL 232
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/202 (27%), Positives = 98/202 (49%), Gaps = 19/202 (9%)
Query 15 MSEELRQTPNAYFMTLTISDENYE---ILKNICKSEDPNTIATKAIRLMLERIRKKTGKS 71
M E + P + F+TLT DE+ I +++ K+ ++ + I+L ++R+RKK +
Sbjct 1 MQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTT-VGVVSKRDIQLFMKRLRKKYAQY 59
Query 72 IKHWFITELGHEKTERLHLHGIVWGIGID-----KLIEEKWNYGITFTGYFVNEQTINYI 126
+F+T + R H H I++G L+ E W G + + + I+Y+
Sbjct 60 RLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAECWKNGFV-QAHPLTTKEISYV 118
Query 127 TKYMTKEDLDHPGFIGKV-------LCSKGIGMGYAKREDAKNHVYKKGKTIEYYRLRNG 179
TKYM ++ + P + V LCSK G+GY + Y+ +Y R NG
Sbjct 119 TKYMYEKSMI-PDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFYRLHPR-DYVRAFNG 176
Query 180 SKINLPIYYRNQLFTEEEREML 201
++ +P YY ++L+ ++ +E L
Sbjct 177 MRMAMPRYYADKLYDDDMKEYL 198
>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/146 (29%), Positives = 64/146 (44%), Gaps = 14/146 (10%)
Query 2 ECRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEILKNICKSEDPNTIATKAIRLML 61
+C+ K W VR SEEL +YF TLT+ + + + I+L L
Sbjct 33 DCKSAKTTDWQVRCSEELNNNSQSYFYTLTLDPRFIDTYGTLPDGSPRYVFNKRHIQLFL 92
Query 62 ERIRKKTGK---SIKHWFITELGHEKTERLHLHGIVW------GIGIDKLIEEKWNYGIT 112
+R+RK K S+K+ + ELG E T R H H I + ++ W+ G
Sbjct 93 KRLRKALSKYNISLKYVIVGELG-ETTHRPHYHAIFYLSSSVNPFKFRIMVRNSWSLGFI 151
Query 113 FT----GYFVNEQTINYITKYMTKED 134
+ G +N ++Y+ KYM K D
Sbjct 152 KSGDNNGIILNNDAVSYVIKYMHKTD 177
>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 54/208 (26%), Positives = 95/208 (46%), Gaps = 24/208 (12%)
Query 3 CRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEILKNICKSEDPNTIATKAIRLMLE 62
C++++ W+ R+ +E Q NA F+TLT I KN + D ++
Sbjct 28 CKRRRVDSWVFRLLQEELQHENASFVTLTYDTRFVPISKNGFMTLDRGEFPR-----YMK 82
Query 63 RIRK-KTGKSIKHWFITELGHEKTERLHLHGIVWGIGIDKLIEEKWNYG----ITFTGYF 117
R+RK G+ +K++ E G ++ R H H I++G+ D L + W
Sbjct 83 RLRKLVPGRKLKYYMCGEYGSQRF-RPHYHAIIFGVPQDSLFADAWTLNGDSLGGVVVGT 141
Query 118 VNEQTINYITKYMTKE--------DLDHPGFIGKVLCSKGIGMGYAKREDAKNHVYKKGK 169
V ++I Y KY+ K D P F L SKG+G+ Y + + H K+
Sbjct 142 VTGKSIAYTMKYIDKSTWKQKHGRDDRVPEF---SLMSKGMGVSYLTPQMVEYH--KEDI 196
Query 170 TIEYYRLRNGSKINLPIYYRNQLFTEEE 197
+ + GS+I +P YYR +++++++
Sbjct 197 SRLFCTREGGSRIAMPRYYRQKIYSDDD 224
>gi|313766930|gb|ADR80656.1| putative replication initiation protein [Uncultured Microviridae]
Length=402
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 24/146 (16%)
Query 3 CRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEILKNICKSEDPNTIATKAIRLMLE 62
CR Q R+W +R E + + F+TLTI+ E E + P ++ K + +
Sbjct 136 CRLQHSREWAIRCMHEAQMHDHNCFITLTINPETLE------RRPRPWSLEKKEFQEFVH 189
Query 63 RIRKKTGKSIKHWFITELGHEKTERLHLHGIVWG----------------IGIDKLIEEK 106
R+R+K GK IK++ E G E +R H H I++G + I +E
Sbjct 190 RLRRKIGKKIKYFHCGEYGDE-NKRPHYHAIIFGYDFPDKQLWERKLGNELYISPELENL 248
Query 107 WNYGITFTGYFVNEQTINYITKYMTK 132
W +G G E + +Y+ +Y+ K
Sbjct 249 WPHGYHRIGACTYE-SAHYVARYVMK 273
>gi|530695361|gb|AGT39916.1| replication initiator [Marine gokushovirus]
Length=289
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 66/147 (45%), Gaps = 24/147 (16%)
Query 3 CRQQKQRQWLVRMSEELRQTPNAYFMTLTISDENYEILKNICKSEDPNTIATKAIRLMLE 62
CR RQW +R E + + F+TLT +E +I K ++P ++ + ++
Sbjct 38 CRLDYSRQWAIRCVHEAQTHEDNCFITLTFDNE------HIAKRKNPESLDNTEFQRFMK 91
Query 63 RIRKKTGKSIKHWFITELGHEKTERLHLHGIVWG----------------IGIDKLIEEK 106
R+RKK I+ + E G ++ +R H H +++G + + + + E
Sbjct 92 RLRKKYPHKIRFFHCGEYG-DQNKRPHYHALLFGHDFKDKKLWSNKGDFKLFVSQELAEL 150
Query 107 WNYGITFTGYFVNEQTINYITKYMTKE 133
W YG G V+ T Y +Y+ K+
Sbjct 151 WPYGFHTIGA-VSFDTAAYCARYVMKK 176
Lambda K H a alpha
0.319 0.137 0.414 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1509111294645