bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-17_CDS_annotation_glimmer3.pl_2_3
Length=332
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312922|ref|WP_022044634.1| putative replication initiation... 122 2e-28
gi|609718275|emb|CDN73649.1| conserved hypothetical protein 106 4e-23
gi|492501778|ref|WP_005867316.1| hypothetical protein 91.7 4e-18
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 91.7 5e-18
gi|547920048|ref|WP_022322419.1| putative replication protein 85.5 6e-16
gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 85.9 8e-16
gi|575094374|emb|CDL65755.1| unnamed protein product 80.5 2e-13
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 65.9 3e-09
gi|410493159|ref|YP_006908225.1| replication-associated protein 63.5 3e-08
gi|575096096|emb|CDL66976.1| unnamed protein product 61.6 1e-07
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 122 bits (306), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/273 (32%), Positives = 141/273 (52%), Gaps = 47/273 (17%)
Query 2 CLYPKLIRNKKYLP---TKKNNYN--------PPKMVDPRTAYITAACGKCLECRKQKQR 50
C PK+I N++Y T+ NY PP + + CG C C+K
Sbjct 3 CEQPKVIVNRRYANMTNTEIVNYAKVYYGCFWPPDYI------LEVPCGYCHSCQKSYNN 56
Query 51 EWLVRMSEELRTEP--NAYFMTLTISDENYEILKNTCKSEDKNTIATKAIRLTLERIRKK 108
++ +R+ ELR P F+TLT +D++ E S+D N KA+RL L+R RK
Sbjct 57 QYRIRLLYELRKYPPGTCLFVTLTFNDDSLEKF-----SKDTN----KAVRLFLDRFRKV 107
Query 109 TGKSIKHWFITELGHEKTERLHLHGIVWGI-------------GTDQLIKEKWNYGITYT 155
GK I+HWF+ E G R H HGI++ + G L+ W YG +
Sbjct 108 YGKQIRHWFVCEFG-TLHGRPHYHGILFNVPQALIDGYDSDMPGHHPLLASCWKYGFVFV 166
Query 156 GNFVNEKTINYITKYMTK-IDEDHPEFVGKVLCSKGIGAGYIKRADASKHKYEKGKTIET 214
G +V+++T +YITKY+TK I+ D + +V+ S GIG+ Y+ ++S HK + +
Sbjct 167 G-YVSDETCSYITKYVTKSINGD--KVRPRVISSFGIGSNYLNTEESSLHKL-GNQRYQP 222
Query 215 YRLRNGAKINLPIYYRNKLFTEEERELLFIDKI 247
+ + NG + +P YY NK+F++ +++ + +D++
Sbjct 223 FMVLNGFQQAMPRYYYNKIFSDVDKQNMVVDRL 255
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/209 (35%), Positives = 104/209 (50%), Gaps = 17/209 (8%)
Query 38 CGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENYEILKNTCKSEDKNTIATKA 97
CGKCLECRK + W R++EEL+ +A+F+TLT SD N S D +
Sbjct 25 CGKCLECRKARTNSWFARLTEELKVSKSAHFVTLTYSDVYLPYSDNGLISLDY-----RD 79
Query 98 IRLTLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIGTDQLIKEKWNYGITYTGN 157
+L ++R RK IK++ + E G +T R H H IV+G+ +W G + G
Sbjct 80 FQLFMKRARKLQKSKIKYFLVGEYG-AQTYRPHYHAIVFGVENIDAFLGEWRMGNVHAGT 138
Query 158 FVNEKTINYITKYMTK-IDEDHPEFVG------KVLCSKGIGAGYIKRADASKHKYEKGK 210
V K+I Y KY TK I E + K L SKG+G ++ S KY K
Sbjct 139 -VTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLTE---SMIKYYKDD 194
Query 211 TIETYRLRNGAKINLPIYYRNKLFTEEER 239
++ L G I LP YYR+K+F++ E+
Sbjct 195 VSRSFSLLGGTTIALPRYYRDKVFSDIEK 223
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 91.7 bits (226), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 122/257 (47%), Gaps = 50/257 (19%)
Query 35 TAACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENY-------EILKNTCKS 87
CG+C+ CRK K++ W+ R+ E P + F+TLT DE+ ++ K+T
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHMPTAMIGEDLFKSTV-- 71
Query 88 EDKNTIATKAIRLTLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGI------GTD 141
++ + I+L ++R+RKK + +F+T + R H H I++G G D
Sbjct 72 ---GVVSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGD 128
Query 142 QLIKEKWNYGITYTGNFVNEKTINYITKYM------TKIDEDHPEFVGKVLCSKGIGAGY 195
L+ E W G + + K I Y+TKYM I +D E+ +LCS+ G GY
Sbjct 129 -LLAECWKNGFV-QAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIPGIGY 186
Query 196 IKRADASKHKYEKGKTIETYRLR--------NGAKINLPIYYRNKLFTE-------EERE 240
+ + + ++ YRL NG ++ +P YY +KL+ + E RE
Sbjct 187 ---------HFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELRE 237
Query 241 LLFIDKIEKGFIYVLGT 257
FI+++++ + + + T
Sbjct 238 AFFINQMQQEWHHYINT 254
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 91.7 bits (226), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 121/257 (47%), Gaps = 50/257 (19%)
Query 35 TAACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENY-------EILKNTCKS 87
CG+C+ CRK K++ W+ R+ E P + F+TLT DE+ ++ K T
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTV-- 71
Query 88 EDKNTIATKAIRLTLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGI------GTD 141
++ + I+L ++R+RKK + +F+T + R H H I++G G D
Sbjct 72 ---GVVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGD 128
Query 142 QLIKEKWNYGITYTGNFVNEKTINYITKYM------TKIDEDHPEFVGKVLCSKGIGAGY 195
L+ E W G + + K I+Y+TKYM I + E+ +LCSK G GY
Sbjct 129 -LLAECWKNGFV-QAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGY 186
Query 196 IKRADASKHKYEKGKTIETYRLR--------NGAKINLPIYYRNKLFTE-------EERE 240
+ + + ++ YRL NG ++ +P YY +KL+ + E RE
Sbjct 187 ---------HFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELRE 237
Query 241 LLFIDKIEKGFIYVLGT 257
FI+++++ + + + T
Sbjct 238 AFFINQMQQEWYHYINT 254
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 85.5 bits (210), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 107/222 (48%), Gaps = 19/222 (9%)
Query 36 AACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENYEI--LKNTCKSEDKNTI 93
CG C+ CR+ K++ W+ R+ E + P + F+TLT DE+ I + + + +
Sbjct 10 VPCGWCVNCRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAVV 69
Query 94 ATKAIRLTLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIG-----TDQLIKEKW 148
+ + ++L ++R+RKK +F+T K R H H I++G L+ E W
Sbjct 70 SKRDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAGDLLAECW 129
Query 149 NYGITYTGNFVNEKTINYITKYM------TKIDEDHPEFVGKVLCSK--GIGAGYIKRAD 200
G + + K I Y+ KYM +I D ++ +LCS+ GIG G++K
Sbjct 130 QNGFV-QAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPGIGFGFMK--- 185
Query 201 ASKHKYEKGKTIETYRLRNGAKINLPIYYRNKLFTEEERELL 242
A ++ + + R G K+ +P YY +KL+ ++ + L
Sbjct 186 ADIIEFYRRHPRDYVRAWAGHKMAMPRYYADKLYDDDMKAFL 227
>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345
Score = 85.9 bits (211), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 109/236 (46%), Gaps = 34/236 (14%)
Query 22 NPPKMVDPRTAY--ITAACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENYE 79
+ P V P+ A + CG+C C++++ W+ R+ +E NA F+TLT
Sbjct 4 DSPFWVLPKAALEKVPVPCGRCPPCKRRRVDSWVFRLLQEELQHENASFVTLTYDTRFVP 63
Query 80 ILKNTCKSEDKNTIATKAIRLTLERIRKKT-GKSIKHWFITELGHEKTERLHLHGIVWGI 138
I KN + D+ ++R+RK G+ +K++ E G ++ R H H I++G+
Sbjct 64 ISKNGFMTLDRGEFPR-----YMKRLRKLVPGRKLKYYMCGEYGSQRF-RPHYHAIIFGV 117
Query 139 GTDQLIKEKWNYGITYTGNFV--------NEKTINYITKYMTKI--------DEDHPEFV 182
D L + W T G+ + K+I Y KY+ K D+ PEF
Sbjct 118 PQDSLFADAW----TLNGDSLGGVVVGTVTGKSIAYTMKYIDKSTWKQKHGRDDRVPEF- 172
Query 183 GKVLCSKGIGAGYIKRADASKHKYEKGKTIETYRLRNGAKINLPIYYRNKLFTEEE 238
L SKG+G Y+ HK + + T G++I +P YYR K++++++
Sbjct 173 --SLMSKGMGVSYLTPQMVEYHKEDISRLFCTR--EGGSRIAMPRYYRQKIYSDDD 224
>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 85/190 (45%), Gaps = 26/190 (14%)
Query 1 MCLYPKLIRN-KKYLPTKKNNYNPPKMVDPRTAYITAACGKCLECRKQKQREWLVRMSEE 59
MC P +IRN Y+ T +Y V P CG C +C+ K +W VR SEE
Sbjct 1 MCFSPIIIRNNSSYIHT---HYTYADYVVP--------CGHCYDCKSAKTTDWQVRCSEE 49
Query 60 LRTEPNAYFMTLTISDENYEILKNTCKSEDKNTIATKAIRLTLERIRKKTGK---SIKHW 116
L +YF TLT+ + + + I+L L+R+RK K S+K+
Sbjct 50 LNNNSQSYFYTLTLDPRFIDTYGTLPDGSPRYVFNKRHIQLFLKRLRKALSKYNISLKYV 109
Query 117 FITELGHEKTERLHLHGIVWGIGTDQ------LIKEKWNYGITYTGN----FVNEKTINY 166
+ ELG E T R H H I + + +++ W+ G +G+ +N ++Y
Sbjct 110 IVGELG-ETTHRPHYHAIFYLSSSVNPFKFRIMVRNSWSLGFIKSGDNNGIILNNDAVSY 168
Query 167 ITKYMTKIDE 176
+ KYM K D
Sbjct 169 VIKYMHKTDS 178
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 107/236 (45%), Gaps = 50/236 (21%)
Query 56 MSEELRTEPNAYFMTLTISDENY-------EILKNTCKSEDKNTIATKAIRLTLERIRKK 108
M E P + F+TLT DE+ ++ K T ++ + I+L ++R+RKK
Sbjct 1 MQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTV-----GVVSKRDIQLFMKRLRKK 55
Query 109 TGKSIKHWFITELGHEKTERLHLHGIVWGI------GTDQLIKEKWNYGITYTGNFVNEK 162
+ +F+T + R H H I++G G D L+ E W G + + K
Sbjct 56 YAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGD-LLAECWKNGFV-QAHPLTTK 113
Query 163 TINYITKYM------TKIDEDHPEFVGKVLCSKGIGAGYIKRADASKHKYEKGKTIETYR 216
I+Y+TKYM I + E+ +LCSK G GY + + + ++ YR
Sbjct 114 EISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGY---------HFLREQILDFYR 164
Query 217 LR--------NGAKINLPIYYRNKLFTE-------EERELLFIDKIEKGFIYVLGT 257
L NG ++ +P YY +KL+ + E RE FI+++++ + + + T
Sbjct 165 LHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQEWYHYINT 220
>gi|410493159|ref|YP_006908225.1| replication-associated protein [Dragonfly-associated microphage
1]
gi|406870779|gb|AFS65317.1| replication-associated protein [Dragonfly-associated microphage
1]
Length=270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 101/236 (43%), Gaps = 38/236 (16%)
Query 30 RTAYITAACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENYEILKNTCKSED 89
RT CG+C CR +R+W R+ E + F+TLT D+ E++ ED
Sbjct 9 RTGSGEFGCGQCTPCRVNVRRQWTGRILLEANCFADNVFVTLTYRDDPGELV-----PED 63
Query 90 KNTIATKAIRLTLERIRKKTGKSIKHWFITELGHEKTERLHLHGIVWGIGT--DQLIKEK 147
L+R R + ++++ + E G + R H H ++G+ + +I +
Sbjct 64 MTNF--------LKRFRYYLQRKVRYFGVGEYGDLRG-RPHFHMALFGVSPLEEGIIAKA 114
Query 148 WNYGITYTGNFVNEKT---INYITKYMTK-----IDEDHPEFVGKVLCSKGIGAG---YI 196
W+ G + G+ + Y+TK MTK +D +PEF ++ GIGA +
Sbjct 115 WSIGFVHVGDLTKDSAQYLCGYVTKKMTKAEDERLDGRYPEFA-RMSKDPGIGASALPVL 173
Query 197 KRADASKHK----YEKGKTIETYRLRNGAKINLPIYYRNKLFTEEERELLFIDKIE 248
+ A A + G T R R G ++ L Y R KL RE L D ++
Sbjct 174 REALAPDGDVTLMHRNGDVPSTMRTR-GKELPLGRYLRGKL-----RESLGWDALQ 223
>gi|575096096|emb|CDL66976.1| unnamed protein product [uncultured bacterium]
Length=296
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/233 (27%), Positives = 94/233 (40%), Gaps = 55/233 (24%)
Query 33 YITAACGKCLECRKQKQREWLVRMSEELRTEPNAYFMTLTISDENYEILKNTCKSEDKNT 92
Y+ CG+CLECR + EW +R EL++ F+TLT +D+N T
Sbjct 33 YVLVPCGQCLECRLHRASEWALRCCHELKSHDKGIFLTLTYNDDNL---------PPNGT 83
Query 93 IATKAIRLTLERIRKKTG-----KSIKHWFITELGHEKTERLHLHGIVWG---------- 137
+ K ++ ++R+R+ I++ E G + + R H H +V+G
Sbjct 84 LVKKHVQDFIKRLRRHIDYYGDCTKIRYLCAGEYG-DLSLRPHYHLLVFGYYPSDPRLLH 142
Query 138 ----IGTDQLIKEK-----WNYGITYTGNFVNEK---TINYITKYMTKIDEDH------- 178
IG + L W G G E T Y K T + H
Sbjct 143 GLQKIGKNSLFTSPTLTKLWGKGHISFGAITFESARYTCQYALKKQTG-EHSHYYVDRGV 201
Query 179 -PEFVGKVLCSKGIGAGYIKRADASKHKYEKGKTIETYRLRNGAKINLPIYYR 230
PEF ++CS G GY A + + +E+G Y NG KI +P YY+
Sbjct 202 IPEF---MICSNRNGLGY-DFAVSHDNMFERG-----YLTMNGKKIGIPRYYQ 245
Lambda K H a alpha
0.318 0.137 0.415 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1896974068200