bitscore colors: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-21_CDS_annotation_glimmer3.pl_2_2
Length=313
                                                                      Score     E
Sequences producing significant alignments:                           (Bits)  Value
gi|547312922|ref|WP_022044634.1| putative replication initiation...   590     0.0
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808            103     4e-22
gi|492501778|ref|WP_005867316.1| hypothetical protein                 102     9e-22
gi|547920048|ref|WP_022322419.1| putative replication protein         96.3    9e-20
gi|530695361|gb|AGT39916.1| replication initiator                     93.6    8e-19
gi|609718275|emb|CDN73649.1| conserved hypothetical protein           90.5    7e-18
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339            85.9    3e-16
gi|495507506|ref|WP_008232152.1| hypothetical protein                 85.1    7e-16
gi|575094560|emb|CDL65924.1| unnamed protein product                  84.3    2e-15
gi|575094569|emb|CDL65925.1| unnamed protein product                  84.3    2e-15
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 590 bits (1521), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 277/313 (88%), Positives = 295/313 (94%), Gaps = 0/313 (0%)
Query 1 MIVNRRYKDMTFNEVVDYAETYYGCFWPPDYYLEVPCGYCHSCQKSYNNQYRIRLLYELR 60
+IVNRRY +MT E+V+YA+ YYGCFWPPDY LEVPCGYCHSCQKSYNNQYRIRLLYELR
Sbjct 8 VIVNRRYANMTNTEIVNYAKVYYGCFWPPDYILEVPCGYCHSCQKSYNNQYRIRLLYELR 67
Query 61 KYPPGTCLFVTLTFDDDNLKKFSKDTNKAVRLFLDRLRKDYGKQIRHWFVCEFGTLYGRP 120
KYPPGTCLFVTLTF+DD+L+KFSKDTNKAVRLFLDR RK YGKQIRHWFVCEFGTL+GRP
Sbjct 68 KYPPGTCLFVTLTFNDDSLEKFSKDTNKAVRLFLDRFRKVYGKQIRHWFVCEFGTLHGRP 127
Query 121 HYHGILFDVPQTLIDGYSPDVPGHHPLLASRWKYGFVFVGYVSDETCSYITKYVTKSING 180
HYHGILF+VPQ LIDGY D+PGHHPLLAS WKYGFVFVGYVSDETCSYITKYVTKSING
Sbjct 128 HYHGILFNVPQALIDGYDSDMPGHHPLLASCWKYGFVFVGYVSDETCSYITKYVTKSING 187
Query 181 DKVRPRIISSFGIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSDVDK 240
DKVRPR+ISSFGIGSNY +TEES+LHKLG QRYQPFMVLNGFQQAMPRYYYNKIFSDVDK
Sbjct 188 DKVRPRVISSFGIGSNYLNTEESSLHKLGNQRYQPFMVLNGFQQAMPRYYYNKIFSDVDK 247
Query 241 QNIVLDRFVNPPVEFSWQGQKFSSKLERDEMRRSTLNQNITSGLTPALPLPHTERVSSFD 300
QN+V+DR +NPPVEFSWQGQKFSSKLERDEMRRSTLNQNI SGLTP LPLPHTERVSSFD
Sbjct 248 QNMVVDRLINPPVEFSWQGQKFSSKLERDEMRRSTLNQNIASGLTPVLPLPHTERVSSFD 307
Query 301 RFKENMDKNKEFK 313
FK+ MDKNKEFK
Sbjct 308 IFKKYMDKNKEFK 320
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 103 bits (256), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/230 (34%), Positives = 121/230 (53%), Gaps = 37/230 (16%)
Query 35 VPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNL--KKFSKD------- 85
VPCG C +C+K+ + RL E +YP LFVTLT+DD+++ +D
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYP--FSLFVTLTYDDEHIPTAMIGEDLFKTTVG 72
Query 86 --TNKAVRLFLDRLRKDYGK-QIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDVP 142
+ + ++LF+ RLRK Y + ++R++ E+G+ GRPHYH ILF P T
Sbjct 73 VVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFT---------- 122
Query 143 GHH--PLLASRWKYGFVFVGYVSDETCSYITKYV-TKSINGDKVR------PRIISSF-- 191
G H LLA WK GFV ++ + SY+TKY+ KS+ D ++ P ++ S
Sbjct 123 GKHGGDLLAECWKNGFVQAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMP 182
Query 192 GIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSDVDKQ 241
GIG ++ + ++L + Y NG + AMPRYY +K++ D K+
Sbjct 183 GIGYHFLREQILDFYRLHPRDY--VRAFNGMRMAMPRYYADKLYDDDMKE 230
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 102 bits (254), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 78/230 (34%), Positives = 121/230 (53%), Gaps = 37/230 (16%)
Query 35 VPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNL--KKFSKDTNKA--- 89
VPCG C +C+K+ + RL E +YP LFVTLT+DD+++ +D K+
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYP--FSLFVTLTYDDEHMPTAMIGEDLFKSTVG 72
Query 90 ------VRLFLDRLRKDYGK-QIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDVP 142
++LF+ RLRK Y + ++R++ E+G+ GRPHYH ILF P T
Sbjct 73 VVSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFGFPFT---------- 122
Query 143 GHH--PLLASRWKYGFVFVGYVSDETCSYITKYV-TKSINGDKVR------PRIISSF-- 191
G H LLA WK GFV ++ + +Y+TKY+ KS+ D ++ P ++ S
Sbjct 123 GKHGGDLLAECWKNGFVQAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIP 182
Query 192 GIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSDVDKQ 241
GIG ++ + ++L + Y NG + AMPRYY +K++ D K+
Sbjct 183 GIGYHFLREQILDFYRLHPRDY--VRAFNGMRMAMPRYYADKLYDDDMKE 230
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 96.3 bits (238), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 74/225 (33%), Positives = 119/225 (53%), Gaps = 33/225 (15%)
Query 34 EVPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNL--KKFSKD---TNK 88
+VPCG+C +C+++ + RL E ++YP LFVTLT+DD++L ++ D TN
Sbjct 9 KVPCGWCVNCRQNKRQSWVYRLQAEAKEYP--LSLFVTLTYDDEHLPIERIGSDLFQTNV 66
Query 89 AV------RLFLDRLRKDYGK-QIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDV 141
AV +LF+ RLRK Y ++R++ E+G GRPHYH ILF P ++ +
Sbjct 67 AVVSKRDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFP------FTGKM 120
Query 142 PGHHPLLASRWKYGFVFVGYVSDETCSYITKYV-TKSI------NGDKVRPRIISSF--G 192
G LLA W+ GFV ++ + +Y+ KY+ KS+ + K +P ++ S G
Sbjct 121 AGD--LLAECWQNGFVQAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPG 178
Query 193 IGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSD 237
IG + + ++ + Y G + AMPRYY +K++ D
Sbjct 179 IGFGFMKADIIEFYRRHPRDY--VRAWAGHKMAMPRYYADKLYDD 221
>gi|530695361|gb|AGT39916.1| replication initiator [Marine gokushovirus]
Length=289
Score = 93.6 bits (231), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 101/220 (46%), Gaps = 28/220 (13%)
Query 35 VPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNLKKFSKDT---NKAVR 91
+PCG C C+ Y+ Q+ IR ++E + + F+TLTFD++++ K N +
Sbjct 30 LPCGQCIGCRLDYSRQWAIRCVHEAQTHEDNC--FITLTFDNEHIAKRKNPESLDNTEFQ 87
Query 92 LFLDRLRKDYGKQIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDVPGHHPL---- 147
F+ RLRK Y +IR + E+G RPHYH +LF D G L
Sbjct 88 RFMKRLRKKYPHKIRFFHCGEYGDQNKRPHYHALLFG--HDFKDKKLWSNKGDFKLFVSQ 145
Query 148 -LASRWKYGFVFVGYVSDETCSYITKYVTKSINGDKVRP---RIISSFGIGSNYFDTEES 203
LA W YGF +G VS +T +Y +YV K + GD + G N E
Sbjct 146 ELAELWPYGFHTIGAVSFDTAAYCARYVMKKVTGDAAASHYREVDLETGEVINEIKPEYC 205
Query 204 TLHKLGGQRYQ-------------PFMVLNGFQQAMPRYY 230
T+ ++ G Y+ ++V+NG++ PRYY
Sbjct 206 TMSRMPGIGYEWYQKYGYHDCHKHDYIVINGYKVRPPRYY 245
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 90.5 bits (223), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 102/217 (47%), Gaps = 29/217 (13%)
Query 36 PCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNL----KKFSKDTNKAVR 91
PCG C C+K+ N + RL EL+ + FVTLT+ D L + +
Sbjct 24 PCGKCLECRKARTNSWFARLTEELK--VSKSAHFVTLTYSDVYLPYSDNGLISLDYRDFQ 81
Query 92 LFLDRLRKDYGKQIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDVPGHHPLLASR 151
LF+ R RK +I+++ V E+G RPHYH I+F V ID +
Sbjct 82 LFMKRARKLQKSKIKYFLVGEYGAQTYRPHYHAIVFGVEN--IDAF-----------LGE 128
Query 152 WKYGFVFVGYVSDETCSYITKYVTKSIN--------GDKVRPRIISSFGIGSNYFDTEES 203
W+ G V G V+ ++ Y KY TKSI D+ + + S G+G ++ ES
Sbjct 129 WRMGNVHAGTVTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHL--TES 186
Query 204 TLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSDVDK 240
+ + F +L G A+PRYY +K+FSD++K
Sbjct 187 MIKYYKDDVSRSFSLLGGTTIALPRYYRDKVFSDIEK 223
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 85.9 bits (211), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 105/200 (53%), Gaps = 35/200 (18%)
Query 67 CLFVTLTFDDDNL--KKFSKD---------TNKAVRLFLDRLRKDYGK-QIRHWFVCEFG 114
LFVTLT+DD+++ +D + + ++LF+ RLRK Y + ++R++ E+G
Sbjct 11 SLFVTLTYDDEHIPTAMIGEDLFKTTVGVVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYG 70
Query 115 TLYGRPHYHGILFDVPQTLIDGYSPDVPGHH--PLLASRWKYGFVFVGYVSDETCSYITK 172
+ GRPHYH ILF P T G H LLA WK GFV ++ + SY+TK
Sbjct 71 SQGGRPHYHMILFGFPFT----------GKHGGDLLAECWKNGFVQAHPLTTKEISYVTK 120
Query 173 YV-TKSINGDKVR------PRIISSF--GIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQ 223
Y+ KS+ D ++ P ++ S GIG ++ + ++L + Y NG +
Sbjct 121 YMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFYRLHPRDY--VRAFNGMR 178
Query 224 QAMPRYYYNKIFSDVDKQNI 243
AMPRYY +K++ D K+ +
Sbjct 179 MAMPRYYADKLYDDDMKEYL 198
>gi|495507506|ref|WP_008232152.1| hypothetical protein [Richelia intracellularis]
gi|471331139|emb|CCH66547.1| hypothetical protein RINTHH_3920 [Richelia intracellularis HH01]
Length=306
Score = 85.1 bits (209), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/236 (27%), Positives = 109/236 (46%), Gaps = 43/236 (18%)
Query 34 EVPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDD---------DNLKKFSK 84
++PCG+C C + Q+ +R ++E + + FVTLT+++ + +KF K
Sbjct 30 KLPCGHCEGCLLERSRQWAVRCMHEAQLWERNC--FVTLTYEETPPWNSLRHSDFQKFMK 87
Query 85 DTNKAVRLFLDRLRKDYGKQ---IRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDV 141
K + + + GK IR++ E+GT GRPHYH LF+ I+
Sbjct 88 RLRKRFKGHKENIDVRTGKSSYPIRYYMAGEYGTHGGRPHYHACLFNFAFEDIEFLRRTN 147
Query 142 PGHH----PLLASRWKYGFVFVGYVSDETCSYITKYVTKSINGD-------------KVR 184
G + L S W +GF VG V+ E+ +Y+ +YV K +N + +V
Sbjct 148 SGSNLYRSAQLESLWPHGFSSVGDVTFESAAYVARYVMKKMNKEAIEKGQEINWETGEVM 207
Query 185 PRIIS------SFGIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKI 234
PR+ GIG+N+ D +S + ++++NG + PRYY+ ++
Sbjct 208 PRLPEYNKMSLKPGIGANFIDKYQSDVFP------NDYVIVNGHKAKPPRYYFKRL 257
>gi|575094560|emb|CDL65924.1| unnamed protein product [uncultured bacterium]
Length=320
Score = 84.3 bits (207), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 106/232 (46%), Gaps = 42/232 (18%)
Query 34 EVPCGYCHSCQ--KSYNNQYRIR---LLYELRKYPPGTCLFVTLTFDDDNLKKFSKDTNK 88
++PCG C C+ +S ++ R LLY+ R Y F+TLT+ ++L F +
Sbjct 44 QIPCGQCIGCRLDRSLDSAVRAHHESLLYD-RNY------FLTLTYSPEHLPPFGSLIPR 96
Query 89 AVRLFLDRLRKDYGKQIRHWFVCEFGTLYGRPHYHGILFDVPQTLIDGYSPDVPGH---- 144
+ LF RLRK G +R+ E+G+ YGRPHYH I+F++P + G
Sbjct 97 DLTLFWKRLRKR-GVSLRYMACGEYGSTYGRPHYHAIIFNLPPLELKQIGTTSTGFPTFI 155
Query 145 HPLLASRWKYGFVFVGYVSDETCSYITKYVTKSINGD-------------------KVRP 185
+++ W GF + VS +TC+Y+ +YVTK I GD K
Sbjct 156 SDVISECWSLGFHTLNPVSFQTCAYVARYVTKKILGDGKQVYEKFDPVTGEVDCRVKEFS 215
Query 186 RIISSFGIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSD 237
R + GIG +YF +K+ ++N + +PRYY + D
Sbjct 216 RWSTKPGIGHDYFMKYWRDFYKIDC------CLINNKKFKIPRYYDRLLLRD 261
>gi|575094569|emb|CDL65925.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 84.3 bits (207), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/241 (30%), Positives = 107/241 (44%), Gaps = 36/241 (15%)
Query 32 YLEVPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFDDDNL--------KKFS 83
++E+PCG C SC++ Y + RL+ EL+ + F+TLT+DDD++ + S
Sbjct 67 FIEIPCGKCISCRRRYAALWTDRLMLELQDHKESC--FITLTYDDDHICCVDSPIEENVS 124
Query 84 KDTNKAVRL--FLDRLRK------DYGKQIRHWFVCEFGTLYGRPHYHGILFD-VPQTLI 134
T V L F RLR+ + K+IR++ E+G RPHYH ILF P LI
Sbjct 125 MYTLNKVHLQCFWKRLRQYLVRHVEPEKRIRYFACGEYGDTTFRPHYHAILFGWRPTDLI 184
Query 135 D---GYSPDVPGHHPLLASRWKYGFVFVGYVSDETCSYITKYVTKSING--------DKV 183
+ D LAS W+ G V VG V+ E+C Y+ +Y K G V
Sbjct 185 QFKKNFQNDTLYLSKSLASIWQNGNVMVGDVTPESCRYVARYCLKKATGFDSEIYERLGV 244
Query 184 RPRIISSF---GIGSNYFDTEESTLHKLGGQRYQPFMVLNGFQQAMPRYYYNKIFSDVDK 240
P ++ GI YFD + K G +P Y+ ++ D+D
Sbjct 245 LPEFVTMSRKPGIARKYFDDHYDEIIKYKTINLSTLK--GGMSMQIPPYFI-RLIEDIDS 301
Query 241 Q 241
+
Sbjct 302 E 302
Lambda      K        H        a        alpha
0.323       0.141    0.449    0.792    4.96
Gapped
Lambda      K        H        a        alpha    sigma
0.267       0.0410   0.140    1.90     42.6     43.6
Effective search space used: 1719536379408
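
Note: the bit scores in this report can be cross-checked against the raw scores shown in parentheses using the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2 with the gapped Lambda and K listed above. The following is a minimal sketch of that check, not part of the BLAST output; the function and variable names are illustrative, and the raw/bit score pairs are copied from the alignments in this report. E-values are not recomputed here.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in the report) for several hits above.
reported = [(1521, 590), (256, 103), (254, 102), (238, 96.3), (231, 93.6)]

for raw, bits in reported:
    print(f"raw={raw:5d}  computed={bit_score(raw):6.1f}  reported={bits}")
```

The computed values should agree with the printed bit scores to within the rounding BLAST applies when formatting them.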