bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-23_CDS_annotation_glimmer3.pl_2_5
Length=334
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312922|ref|WP_022044634.1| putative replication initiation... 71.6 9e-11
gi|609718275|emb|CDN73649.1| conserved hypothetical protein 59.7 5e-07
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 58.5 1e-06
gi|557745630|ref|YP_008798246.1| replication initiator 58.9 1e-06
gi|492501778|ref|WP_005867316.1| hypothetical protein 57.8 3e-06
gi|313766930|gb|ADR80656.1| putative replication initiation protein 53.9 7e-05
gi|575094560|emb|CDL65924.1| unnamed protein product 53.1 1e-04
gi|9634955|ref|NP_054653.1| nonstructural protein 51.6 3e-04
gi|77020121|ref|YP_338244.1| putative replication protein 50.8 5e-04
gi|495507506|ref|WP_008232152.1| hypothetical protein 49.3 0.002
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 71.6 bits (174), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/205 (27%), Positives = 93/205 (45%), Gaps = 25/205 (12%)
Query 4 PWDYFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDSA 63
P DY + VPCG C C + N + +RL E + K +FVT+T + +
Sbjct 35 PPDYILE---VPCGYCHSCQKSYNNQYRIRLLYELR--KYPPGTCLFVTLTFNDDSLEKF 89
Query 64 LLNPSSFIRMWFERIRRCFGHSIKHAVFQEFG-MHPEQGNEPRLHFHGVLWDVSYS---- 118
+ + +R++ +R R+ +G I+H EFG +H R H+HG+L++V +
Sbjct 90 SKDTNKAVRLFLDRFRKVYGKQIRHWFVCEFGTLH------GRPHYHGILFNVPQALIDG 143
Query 119 -------YNAIREAVKDLGFVWISSITDKRLRYVVKYVGKSVYMDDSSAVFAKSLPITVG 171
++ + + GFV++ ++D+ Y+ KYV KS+ D S I
Sbjct 144 YDSDMPGHHPLLASCWKYGFVFVGYVSDETCSYITKYVTKSINGDKVRPRVISSFGIGSN 203
Query 172 KLNTNLYDF--LQNRKYRRKFVSPG 194
LNT L N++Y+ V G
Sbjct 204 YLNTEESSLHKLGNQRYQPFMVLNG 228
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 70/143 (49%), Gaps = 15/143 (10%)
Query 15 PCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAP---EYYDSALLNPS-SF 70
PCG+C EC + + N W+ RL E K KS H FVT+T + Y D+ L++
Sbjct 24 PCGKCLECRKARTNSWFARLTEELKVSKSAH----FVTLTYSDVYLPYSDNGLISLDYRD 79
Query 71 IRMWFERIRRCFGHSIKHAVFQEFGMHPEQGNEPRLHFHGVLWDVSYSYNAIREAVKDLG 130
+++ +R R+ IK+ + E+G R H+H +++ V + E +G
Sbjct 80 FQLFMKRARKLQKSKIKYFLVGEYG-----AQTYRPHYHAIVFGVENIDAFLGEW--RMG 132
Query 131 FVWISSITDKRLRYVVKYVGKSV 153
V ++T K + Y +KY KS+
Sbjct 133 NVHAGTVTAKSIYYTLKYCTKSI 155
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 81/167 (49%), Gaps = 26/167 (16%)
Query 7 YFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDSALLN 66
+ R VPCGRC C + +R W RL+ E + S+FVT+T E+ +A++
Sbjct 8 HLPDRGAVPCGRCVNCRKNKRQSWVYRLQAEA----DEYPFSLFVTLTYDDEHIPTAMIG 63
Query 67 PSSF-----------IRMWFERIRRCFG-HSIKHAVFQEFGMHPEQGNEPRLHFHGVLWD 114
F I+++ +R+R+ + + +++ + E+G QG P H+H +L+
Sbjct 64 EDLFKTTVGVVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYG---SQGGRP--HYHMILFG 118
Query 115 VSYS----YNAIREAVKDLGFVWISSITDKRLRYVVKYVGKSVYMDD 157
++ + + E K+ GFV +T K + YV KY+ + + D
Sbjct 119 FPFTGKHGGDLLAECWKN-GFVQAHPLTTKEISYVTKYMYEKSMIPD 164
>gi|557745630|ref|YP_008798246.1| replication initiator [Marine gokushovirus]
gi|530695343|gb|AGT39900.1| replication initiator [Marine gokushovirus]
Length=312
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/226 (27%), Positives = 98/226 (43%), Gaps = 42/226 (19%)
Query 5 WDYFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDSAL 64
W + PCG C+ C +++ +W VR E + ++ R+S F+T+T YD+
Sbjct 24 WPEMYPPMTRPCGYCKGCRFKKQQEWTVRCLNEQQILETKGRSSSFITLT-----YDNKH 78
Query 65 LNPSSFI--RMWFERIR----RCFGHSIKHAVFQEFGMHPEQGNEPRLHFHGVLW----- 113
L P++ + W + IR R G SI++ E+G N R HFH +L+
Sbjct 79 LPPNNSLDYTHWQKFIRSLKKRNNGKSIRYFGVGEYGE-----NFGRPHFHAILFGHTFN 133
Query 114 -------DVSYSYNAIREAVKDL-----GFVWISSITDKRLRYVVKYVGKSVYMDDSSAV 161
++S S + + L GFV + +T + + YV YV K ++ +A
Sbjct 134 DLIPMHSNISKSQQLLSAWCEPLTQEPRGFVSVGDVTPESISYVCGYVQKKIFGQGQAAH 193
Query 162 F----AKSLPITVGK-LNTNLYDFLQNRKYRRKFVSPGVGDYLGDF 202
+ S IT K N +Y L++ RR PG+G D
Sbjct 194 YKYIDKDSGEITTKKPTNGKIYRLLKSFMSRR----PGIGSEFYDL 235
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 78/159 (49%), Gaps = 26/159 (16%)
Query 7 YFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDSALLN 66
+ R VPCGRC C + +R W RL+ E + S+FVT+T E+ +A++
Sbjct 8 HLPDRGAVPCGRCVNCRKNKRQSWVYRLQAEA----DEYPFSLFVTLTYDDEHMPTAMIG 63
Query 67 PSSF-----------IRMWFERIRRCFG-HSIKHAVFQEFGMHPEQGNEPRLHFHGVLWD 114
F I+++ +R+R+ + + +++ + E+G QG P H+H +L+
Sbjct 64 EDLFKSTVGVVSKRDIQLFMKRLRKKYDQYRLRYFLTSEYG---SQGGRP--HYHMILFG 118
Query 115 VSYS----YNAIREAVKDLGFVWISSITDKRLRYVVKYV 149
++ + + E K+ GFV +T K + YV KY+
Sbjct 119 FPFTGKHGGDLLAECWKN-GFVQAHPLTTKEIAYVTKYM 156
>gi|313766930|gb|ADR80656.1| putative replication initiation protein [Uncultured Microviridae]
Length=402
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 110/265 (42%), Gaps = 36/265 (14%)
Query 2 NRPWDYFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYD 61
N+P+ Y + +PCG+C C Q +W +R E + +H ++ F+T+TI PE +
Sbjct 117 NKPFAY-AKGFNLPCGQCWGCRLQHSREWAIRCMHEAQ----MHDHNCFITLTINPETLE 171
Query 62 SALLNPSSFIRMWFE----RIRRCFGHSIKHAVFQEFGMHPEQGNEPRLHFHGVLWDVSY 117
P S + F+ R+RR G IK+ E+G R H+H +++ +
Sbjct 172 RR-PRPWSLEKKEFQEFVHRLRRKIGKKIKYFHCGEYG-----DENKRPHYHAIIFGYDF 225
Query 118 SYNAIRE-----------AVKDL---GFVWISSITDKRLRYVVKYVGKSVYMDDSSAVFA 163
+ E +++L G+ I + T + YV +YV K + +
Sbjct 226 PDKQLWERKLGNELYISPELENLWPHGYHRIGACTYESAHYVARYVMKRAKGEGPPEQYI 285
Query 164 KSLPITVGKLNTNLYDFLQNRKYRRKFVSPGVGDYLGDFKAPGVTSGLWSYTDYRTGVVY 223
V N Y + + +K G+G+ + G T DY
Sbjct 286 NPETGEVEYDLDNQYATMS--RGNKKQPQNGIGNQW--YWKYGWTDA--HCHDYIVHDGI 339
Query 224 RYRIPRYYDKYLSQ-DALFFRKIST 247
+ ++PRYYDK L + D +F+++
Sbjct 340 KMKVPRYYDKELEKYDPEYFQELKA 364
>gi|575094560|emb|CDL65924.1| unnamed protein product [uncultured bacterium]
Length=320
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/294 (24%), Positives = 128/294 (44%), Gaps = 63/294 (21%)
Query 14 VPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDS-ALLNPSSFIR 72
+PCG+C C + D VR E+ L+ + F+T+T +PE+ L P
Sbjct 45 IPCGQCIGCRLDRSLDSAVRAHHES----LLYDRNYFLTLTYSPEHLPPFGSLIPRDLTL 100
Query 73 MWFERIRRCFGHSIKHAVFQEFGMHPEQGNEPRLHFHGVLWDV----------------S 116
W +R+R+ G S+++ E+G R H+H +++++ +
Sbjct 101 FW-KRLRK-RGVSLRYMACGEYG-----STYGRPHYHAIIFNLPPLELKQIGTTSTGFPT 153
Query 117 YSYNAIREAVKDLGFVWISSITDKRLRYVVKYVGKSVYMDDSSAVFAKSLPITVGKLNTN 176
+ + I E LGF ++ ++ + YV +YV K + + D V+ K P+T G+++
Sbjct 154 FISDVISECW-SLGFHTLNPVSFQTCAYVARYVTKKI-LGDGKQVYEKFDPVT-GEVDCR 210
Query 177 LYDFLQNRKYRRKFVSPGVG-DYLGDFKAPGVTSGLWSYTDYRTGVVY----RYRIPRYY 231
+ +F R PG+G DY + W Y+ +++IPRYY
Sbjct 211 VKEF------SRWSTKPGIGHDYFMKY---------WR-DFYKIDCCLINNKKFKIPRYY 254
Query 232 DKYLSQD------ALFFRKISTAWTYACVFGSSLALGFLREV-----AQRVLRP 274
D+ L +D + ++I +A Y + + +RE A+R+LRP
Sbjct 255 DRLLLRDHPDVFEIVKQKRILSAQDYRLTPDAQKSRLLVREEVKRLRAERLLRP 308
>gi|9634955|ref|NP_054653.1| nonstructural protein [Chlamydia phage 2]
gi|7406595|emb|CAB85595.1| nonstructural protein [Chlamydia phage 2]
Length=336
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 104/255 (41%), Gaps = 52/255 (20%)
Query 3 RPWDYFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDS 62
+P +Y + +++PC RC+ C Q W R E SL+ + F+T+T Y D
Sbjct 63 QPEEYRVRWVLMPCRRCKFCRVQNAKIWSYRCMHEA----SLYSQNCFLTLT----YEDR 114
Query 63 ALLNPSSFI----RMWFERIRR-CFGHSIKHAVFQEFGMHPEQGNEPRLHFHGVLWDVSY 117
L S + R++ R+R + H I++ E+G + R H+H ++++ +
Sbjct 115 HLPENGSLVRDHPRLFLMRLREHIYPHKIRYFGCGEYGSKLQ-----RPHYHLLIYNYDF 169
Query 118 SYNA-----------IREAVKDL---GFVWISSITDKRLRYVVKYVGKSVYMDDSSAVFA 163
+ E + L GF + S+T + YV +Y K V D S +
Sbjct 170 PDKKLLSKKRGNPLFVSEKLMRLWPFGFSTVGSVTRQSAGYVARYSLKKVNGDISQDHYG 229
Query 164 KSLPITVGKLNTNLYDFLQNRKYRRKFVSPGVG-DYLGDFKAPGVTSGLWSYTDYRTGVV 222
+ LP +FL + PG+G D+ +K D G
Sbjct 230 QRLP-----------EFLMCS------LKPGIGADWYEKYKCDVYPQDYLVVQD--KGKS 270
Query 223 YRYRIPRYYDKYLSQ 237
++ R PRYYDK S+
Sbjct 271 FKTRPPRYYDKLHSR 285
>gi|77020121|ref|YP_338244.1| putative replication protein [Chlamydia phage 4]
gi|59940020|gb|AAX12549.1| putative replication protein [Chlamydia phage 4]
Length=315
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 63/257 (25%), Positives = 102/257 (40%), Gaps = 52/257 (20%)
Query 1 MNRPWDYFTQRIMVPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYY 60
+P +Y + I++PC RC+ C Q W R E SL+ + F+T+T Y
Sbjct 40 QTQPEEYRKRWILMPCRRCKFCRVQNAKIWSYRCMHEA----SLYSQNCFLTLT----YE 91
Query 61 DSALLNPSSFIR----MWFERIRRCFG-HSIKHAVFQEFGMHPEQGNEPRLHFHGVLWDV 115
D L S +R ++ R+R H I++ E+G + R H+H ++++
Sbjct 92 DQHLPENGSLVRNHPTLFLRRLREHISPHKIRYFGCGEYGSKLQ-----RPHYHLLIYNY 146
Query 116 SYSYNA-----------IREAVKDL---GFVWISSITDKRLRYVVKYVGKSVYMDDSSAV 161
+ + E + L GF + S+T + YV +Y K V D S
Sbjct 147 DFPDKKLLSKKRGNPLFVSEKLMQLWPYGFSTVGSVTRQSAGYVARYSLKKVSRDISQDH 206
Query 162 FAKSLPITVGKLNTNLYDFLQNRKYRRKFVSPGVG-DYLGDFKAPGVTSGLWSYTDYRTG 220
+ + LP +FL + PG+G D+ +K D G
Sbjct 207 YGQRLP-----------EFLMCS------LKPGIGADWYEKYKRDVYPQDYLVVQD--KG 247
Query 221 VVYRYRIPRYYDKYLSQ 237
+ R PRYYDK S+
Sbjct 248 KSFTTRPPRYYDKLHSR 264
>gi|495507506|ref|WP_008232152.1| hypothetical protein [Richelia intracellularis]
gi|471331139|emb|CCH66547.1| hypothetical protein RINTHH_3920 [Richelia intracellularis HH01]
Length=306
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 104/257 (40%), Gaps = 57/257 (22%)
Query 14 VPCGRCEECLRQQRNDWYVRLERETKYQKSLHRNSVFVTITIAPEYYDSALLNP--SSFI 71
+PCG CE CL ++ W VR E + L + FVT+T Y ++ N S
Sbjct 31 LPCGHCEGCLLERSRQWAVRCMHEAQ----LWERNCFVTLT----YEETPPWNSLRHSDF 82
Query 72 RMWFERIRRCF-GHS-------------IKHAVFQEFGMHPEQGNEPRLHFHGVLWDVSY 117
+ + +R+R+ F GH I++ + E+G H G P H+H L++ ++
Sbjct 83 QKFMKRLRKRFKGHKENIDVRTGKSSYPIRYYMAGEYGTH---GGRP--HYHACLFNFAF 137
Query 118 S---------------YNAIREAVKDLGFVWISSITDKRLRYVVKYVGKSVYMDDSSAVF 162
+A E++ GF + +T + YV +YV K M+ +
Sbjct 138 EDIEFLRRTNSGSNLYRSAQLESLWPHGFSSVGDVTFESAAYVARYVMKK--MNKEAIEK 195
Query 163 AKSLPITVGKLNTNLYDFLQNRKYRRKFVSPGVGDYLGDFKAPGVTSGLWSYTDYRTGVV 222
+ + G++ L + Y + + PG+G D V DY
Sbjct 196 GQEINWETGEVMPRLPE------YNKMSLKPGIGANFIDKYQSDVFP-----NDYVIVNG 244
Query 223 YRYRIPRYYDKYLSQDA 239
++ + PRYY K L Q A
Sbjct 245 HKAKPPRYYFKRLKQAA 261
Lambda K H a alpha
0.327 0.141 0.450 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1917593351550