bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-5_CDS_annotation_glimmer3.pl_2_4
Length=450
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094327|emb|CDL65713.1| unnamed protein product 164 2e-41
gi|492501778|ref|WP_005867316.1| hypothetical protein 125 4e-29
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 120 1e-27
gi|547920048|ref|WP_022322419.1| putative replication protein 111 2e-24
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 96.3 3e-19
gi|504330771|ref|WP_014517873.1| hypothetical protein 90.1 4e-18
gi|47566147|ref|YP_022485.1| nonstructural protein 93.2 8e-18
gi|77020121|ref|YP_338244.1| putative replication protein 92.0 2e-17
gi|9634955|ref|NP_054653.1| nonstructural protein 90.9 5e-17
gi|9791179|ref|NP_063900.1| conserved hypothetical protein 90.5 6e-17
>gi|575094327|emb|CDL65713.1| unnamed protein product [uncultured bacterium]
Length=515
Score = 164 bits (415), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/507 (27%), Positives = 218/507 (43%), Gaps = 103/507 (20%)
Query 3 NLYCEHPVIIRHPRLK----DLLPVFGSYRTPNGLYDLTPSQSHTWKYRFPEWLFSARKC 58
++ C HPV++ +P + D ++ R + +L + W +S +K
Sbjct 2 DIKCLHPVVLINPEARKKALDFDRIYIRERCQCWVMEL---------FVIEPWRYSPKKW 52
Query 59 GVTLDNIDDFQIINLRTGEISPMFMAVPCGKCILCRDKKKREWSFRATCENVFSTSIPIF 118
+ L +I+ ++N TG++ P+++AVPCG CI+CR +K + RA E + S P+F
Sbjct 53 SIGLSDIEHCYLLNSETGDMIPLYIAVPCGSCIICRKRKANALATRAIMETEITGSSPLF 112
Query 119 VTLTYNNKHLPK--YGVF---KEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRP 173
+TLTYN +HLPK YG K ++QLF KRLR LD H +RY C EYGS + RP
Sbjct 113 ITLTYNPEHLPKNQYGYLTLRKLDLQLFFKRLRSLLDNQSIPHSLRYLACGEYGSNTKRP 172
Query 174 HYHAIFWNFPRDGALSNIWNVVKMIEKAWSY-----NGE--------------------- 207
HYH + W FP +I I+KAWSY NG+
Sbjct 173 HYHLLLWGFPL-SYFKDILKAQAFIQKAWSYFQVDENGKRIPYYSKCRTCPFNQYKDRRS 231
Query 208 -------------------------PLGFAYAVPCDKGAIGYVMKYMSKIPRIP-DNMNN 241
P+G +P + GA Y+ KYM K P +
Sbjct 232 CSDVAHFCTGVRLRYPSGAFIYRRYPIGSIKVLPANSGAPAYITKYMVKGSNAPHSSCEP 291
Query 242 VFFLSSRKDGGIGAAYARRLMPFYRANSQCLDITVCDPYSGMSTTVTLP--EYFKRLYYP 299
F +S + GGIG+AY R N + V D +G +P + K P
Sbjct 292 PFRTASNRGGGIGSAYIRAHKDEILRNPSLEALPVVDRVTGSGKLFYMPIDSWVKSTLIP 351
Query 300 ANSSVLSKEVRDAFKKLC-------DCISDRYTIHCAGNYTD-KLR-FSDI------EKK 344
+ SS L + + + C C Y ++ + D +L+ ++D+ +K
Sbjct 352 SPSSFLKTKEYETVRDFCYTFSLFEQCAQQLYRLYPMDRFIDGELQYYTDVFCDPLKREK 411
Query 345 VLRKYS----FLSPKICKYPIG---KKMDYYTQIPWQALDDRYVANECEIASLCRFLMLE 397
Y+ F+ P + IG K DY+ + + L +R + L R ++
Sbjct 412 WAEAYAHVREFIPPTVPNINIGSWIKDRDYFNEYMY-TLTER-------LDILARKVLEI 463
Query 398 SIDETWFKIRDEILQKRQRALNAKFSN 424
+ID+++F+ R++ L+ R K+ N
Sbjct 464 NIDDSYFRSREKYLKTRVEIFKKKYGN 490
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 125 bits (313), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 86/248 (35%), Positives = 125/248 (50%), Gaps = 40/248 (16%)
Query 84 AVPCGKCILCRDKKKREWSFRATCENVFSTSIPIFVTLTYNNKHLPK-----------YG 132
AVPCG+C+ CR K++ W +R E +FVTLTY+++H+P G
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAE-ADEYPFSLFVTLTYDDEHMPTAMIGEDLFKSTVG 72
Query 133 VF-KEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWNFPRDGALSNI 191
V K +IQLFMKRLR + D ++++RYF SEYGS+ GRPHYH I + FP G
Sbjct 73 VVSKRDIQLFMKRLRKKYD----QYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGG- 127
Query 192 WNVVKMIEKAWSYNGEPLGFAYAVPCDKGAIGYVMKYMSKIPRIPDNMNNV-----FFLS 246
++ + W GF A P I YV KYM + +PD + +V F L
Sbjct 128 ----DLLAECWKN-----GFVQAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLC 178
Query 247 SRKDGGIGAAYAR-RLMPFYRANSQCLDITVCDPYSGMSTTVTLPEYFKRLYYPANSSVL 305
SR GIG + R +++ FYR + + ++GM + +P Y+ Y +
Sbjct 179 SRIP-GIGYHFLREQILDFYRLHPR----DYVRAFNGMR--MAMPRYYADKLYDDDMKEY 231
Query 306 SKEVRDAF 313
KE+R+AF
Sbjct 232 LKELREAF 239
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 120 bits (302), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/248 (35%), Positives = 123/248 (50%), Gaps = 40/248 (16%)
Query 84 AVPCGKCILCRDKKKREWSFRATCENVFSTSIPIFVTLTYNNKHLPK-----------YG 132
AVPCG+C+ CR K++ W +R E +FVTLTY+++H+P G
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAE-ADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVG 72
Query 133 VF-KEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWNFPRDGALSNI 191
V K +IQLFMKRLR + ++++RYF SEYGS+ GRPHYH I + FP G
Sbjct 73 VVSKRDIQLFMKRLRKKYA----QYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGG- 127
Query 192 WNVVKMIEKAWSYNGEPLGFAYAVPCDKGAIGYVMKYMSKIPRIPDNMNNV-----FFLS 246
++ + W GF A P I YV KYM + IPD + V F L
Sbjct 128 ----DLLAECWKN-----GFVQAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLC 178
Query 247 SRKDGGIGAAYAR-RLMPFYRANSQCLDITVCDPYSGMSTTVTLPEYFKRLYYPANSSVL 305
S K GIG + R +++ FYR + + ++GM + +P Y+ Y +
Sbjct 179 S-KMPGIGYHFLREQILDFYRLHPR----DYVRAFNGMR--MAMPRYYADKLYDDDMKEY 231
Query 306 SKEVRDAF 313
KE+R+AF
Sbjct 232 LKELREAF 239
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 111 bits (278), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/247 (32%), Positives = 123/247 (50%), Gaps = 40/247 (16%)
Query 85 VPCGKCILCRDKKKREWSFRATCENVFSTSIPIFVTLTYNNKHLP--KYG---------- 132
VPCG C+ CR K++ W +R E + +FVTLTY+++HLP + G
Sbjct 10 VPCGWCVNCRQNKRQSWVYRLQAE-AKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAV 68
Query 133 VFKEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWNFPRDGALSNIW 192
V K ++QLFMKRLR + + +++RYF SEYG+K+GRPHYH I + FP G ++
Sbjct 69 VSKRDVQLFMKRLRKKYE----DYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAG-- 122
Query 193 NVVKMIEKAWSYNGEPLGFAYAVPCDKGAIGYVMKYMSKIPRIPDNMNNV-----FFLSS 247
++ + W GF A P I YV KYM + P+ + + F L S
Sbjct 123 ---DLLAECWQN-----GFVQAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCS 174
Query 248 RKDGGIGAAYARR-LMPFYRANSQCLDITVCDPYSGMSTTVTLPEYFKRLYYPANSSVLS 306
R + GIG + + ++ FYR + + ++G + +P Y+ Y +
Sbjct 175 R-NPGIGFGFMKADIIEFYRRHPR----DYVRAWAG--HKMAMPRYYADKLYDDDMKAFL 227
Query 307 KEVRDAF 313
KE+R+ F
Sbjct 228 KEMREEF 234
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 96.3 bits (238), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 106/215 (49%), Gaps = 39/215 (18%)
Query 117 IFVTLTYNNKHLPK-----------YGVF-KEEIQLFMKRLRIRLDRLGYKHQIRYFFCS 164
+FVTLTY+++H+P GV K +IQLFMKRLR + ++++RYF S
Sbjct 12 LFVTLTYDDEHIPTAMIGEDLFKTTVGVVSKRDIQLFMKRLRKKYA----QYRLRYFLTS 67
Query 165 EYGSKSGRPHYHAIFWNFPRDGALSNIWNVVKMIEKAWSYNGEPLGFAYAVPCDKGAIGY 224
EYGS+ GRPHYH I + FP G ++ + W GF A P I Y
Sbjct 68 EYGSQGGRPHYHMILFGFPFTGKHGG-----DLLAECWKN-----GFVQAHPLTTKEISY 117
Query 225 VMKYMSKIPRIPDNMNNV-----FFLSSRKDGGIGAAYAR-RLMPFYRANSQCLDITVCD 278
V KYM + IPD + V F L S K GIG + R +++ FYR + +
Sbjct 118 VTKYMYEKSMIPDILKGVKEYQPFMLCS-KMPGIGYHFLREQILDFYRLHPR----DYVR 172
Query 279 PYSGMSTTVTLPEYFKRLYYPANSSVLSKEVRDAF 313
++GM + +P Y+ Y + KE+R+AF
Sbjct 173 AFNGMR--MAMPRYYADKLYDDDMKEYLKELREAF 205
>gi|504330771|ref|WP_014517873.1| hypothetical protein [Chlamydia pneumoniae]
gi|384449518|ref|YP_005662120.1| hypothetical protein CPK_ORF00729 [Chlamydophila pneumoniae LPCoLN]
gi|269303098|gb|ACZ33198.1| conserved hypothetical protein [Chlamydophila pneumoniae LPCoLN]
gi|572023919|gb|ETR80121.1| nonstructural protein [Chlamydia pneumoniae B21]
Length=121
Score = 90.1 bits (222), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 65/102 (64%), Gaps = 6/102 (6%)
Query 82 FMAVPCGKCILCRDKKKREWSFRATCENVFSTSIPIFVTLTYNNKHLPKYG-VFKEEIQL 140
++ +PC KC CR + + WS+R E F+TLTY++KHLP+YG + K +QL
Sbjct 19 WVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEK-NCFLTLTYDDKHLPQYGSLVKLHLQL 77
Query 141 FMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWNF 182
F+KRLR R+ H+IRYF C EYG+K RPHYH + +N+
Sbjct 78 FLKRLRDRIS----PHKIRYFGCGEYGTKLQRPHYHLLIFNY 115
>gi|47566147|ref|YP_022485.1| nonstructural protein [Chlamydia phage 3]
gi|47522482|emb|CAD79483.1| nonstructural protein [Chlamydia phage 3]
Length=315
Score = 93.2 bits (230), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 83/291 (29%), Positives = 134/291 (46%), Gaps = 51/291 (18%)
Query 82 FMAVPCGKCILCRDKKKREWSFRATCE-NVFSTSIPIFVTLTYNNKHLPKYG-VFKEEIQ 139
++ PC KC CR + + WS+R E +++S + F+TLTY ++HLP+ G + ++ +
Sbjct 50 WVVKPCLKCRFCRVQNAKIWSYRCMHEASLYSQNC--FLTLTYEDRHLPENGSLVRDHPR 107
Query 140 LFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWN--FPRDGALS----NIWN 193
LF++RLR + Y H+IRYF C EYGSK RPHYH + +N FP LS N
Sbjct 108 LFLRRLREHI----YPHKIRYFGCGEYGSKLQRPHYHLLIYNYDFPDKKLLSKKRGNPLF 163
Query 194 VVKMIEKAWSYNGEPLGFAYAVPCDKGAIGYVMKY-MSKI----------PRIPDNMNNV 242
V + + + W P GF+ + + GYV +Y + K+ R+P+
Sbjct 164 VSEKLMQLW-----PFGFSTVGSVTRQSAGYVARYSLKKVNGDSSQDHYGQRLPE----- 213
Query 243 FFLSSRKDGGIGAAYARRLMPFYRANSQCLDITVCDPYSGMSTTVTLPEYFKRLYY---P 299
FL GIGA + + Y+ + D V G S P Y+ +L+ P
Sbjct 214 -FLMCSLKPGIGADWYEK----YKRDVYPQDYLVVQD-KGKSFKTRPPRYYDKLHSRFDP 267
Query 300 ANSSVLSKEVRDAFKKLCDCISDRYTIHCAGNYTDKLRFSDIEKKVLRKYS 350
+ + + F L + D+ + + F+D K++ R Y
Sbjct 268 EEMEEIKQRRVEKFMALPELTQDKAEV-------KQYIFNDRTKRLFRDYE 311
>gi|77020121|ref|YP_338244.1| putative replication protein [Chlamydia phage 4]
gi|59940020|gb|AAX12549.1| putative replication protein [Chlamydia phage 4]
Length=315
Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/261 (30%), Positives = 120/261 (46%), Gaps = 30/261 (11%)
Query 51 WLFSARKCGVTLDNIDDFQII-NLRTGEISPMFMAVPCGKCILCRDKKKREWSFRATCE- 108
W K D + + + E ++ +PC +C CR + + WS+R E
Sbjct 18 WFPKGEKSSKPWDKVRELNAFEQTQPEEYRKRWILMPCRRCKFCRVQNAKIWSYRCMHEA 77
Query 109 NVFSTSIPIFVTLTYNNKHLPKYG-VFKEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYG 167
+++S + F+TLTY ++HLP+ G + + LF++RLR + H+IRYF C EYG
Sbjct 78 SLYSQNC--FLTLTYEDQHLPENGSLVRNHPTLFLRRLREHIS----PHKIRYFGCGEYG 131
Query 168 SKSGRPHYHAIFWN--FPRDGALS----NIWNVVKMIEKAWSYNGEPLGFAYAVPCDKGA 221
SK RPHYH + +N FP LS N V + + + W P GF+ + +
Sbjct 132 SKLQRPHYHLLIYNYDFPDKKLLSKKRGNPLFVSEKLMQLW-----PYGFSTVGSVTRQS 186
Query 222 IGYVMKY-MSKIPR--IPDNMNNVF--FLSSRKDGGIGAAYARRLMPFYRANSQCLDITV 276
GYV +Y + K+ R D+ FL GIGA + + Y+ + D V
Sbjct 187 AGYVARYSLKKVSRDISQDHYGQRLPEFLMCSLKPGIGADWYEK----YKRDVYPQDYLV 242
Query 277 CDPYSGMSTTVTLPEYFKRLY 297
G S T P Y+ +L+
Sbjct 243 VQD-KGKSFTTRPPRYYDKLH 262
>gi|9634955|ref|NP_054653.1| nonstructural protein [Chlamydia phage 2]
gi|7406595|emb|CAB85595.1| nonstructural protein [Chlamydia phage 2]
Length=336
Score = 90.9 bits (224), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/235 (31%), Positives = 115/235 (49%), Gaps = 41/235 (17%)
Query 82 FMAVPCGKCILCRDKKKREWSFRATCE-NVFSTSIPIFVTLTYNNKHLPKYG-VFKEEIQ 139
++ +PC +C CR + + WS+R E +++S + F+TLTY ++HLP+ G + ++ +
Sbjct 71 WVLMPCRRCKFCRVQNAKIWSYRCMHEASLYSQNC--FLTLTYEDRHLPENGSLVRDHPR 128
Query 140 LFMKRLRIRLDRLGYKHQIRYFFCSEYGSKSGRPHYHAIFWN--FPRDGALS----NIWN 193
LF+ RLR + Y H+IRYF C EYGSK RPHYH + +N FP LS N
Sbjct 129 LFLMRLREHI----YPHKIRYFGCGEYGSKLQRPHYHLLIYNYDFPDKKLLSKKRGNPLF 184
Query 194 VVKMIEKAWSYNGEPLGFAYAVPCDKGAIGYVMKY-MSKI----------PRIPDNMNNV 242
V + + + W P GF+ + + GYV +Y + K+ R+P+
Sbjct 185 VSEKLMRLW-----PFGFSTVGSVTRQSAGYVARYSLKKVNGDISQDHYGQRLPE----- 234
Query 243 FFLSSRKDGGIGAAYARRLMPFYRANSQCLDITVCDPYSGMSTTVTLPEYFKRLY 297
FL GIGA + + Y+ + D V G S P Y+ +L+
Sbjct 235 -FLMCSLKPGIGADWYEK----YKCDVYPQDYLVVQD-KGKSFKTRPPRYYDKLH 283
>gi|9791179|ref|NP_063900.1| conserved hypothetical protein [Chlamydia pneumoniae phage CPAR39]
gi|7190960|gb|AAF39720.1| conserved hypothetical protein [Chlamydia pneumoniae phage CPAR39]
Length=327
Score = 90.5 bits (223), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 86/323 (27%), Positives = 142/323 (44%), Gaps = 52/323 (16%)
Query 51 WLFSARKCGVTLDNIDDFQII-NLRTGEISPMFMAVPCGKCILCRDKKKREWSFRATCE- 108
W +K D + + + E ++ +PC KC CR + + WS+R E
Sbjct 30 WFPKGKKSPKPWDKVRELNSFEQTQPEEYRKRWVVMPCLKCRFCRVRNAKIWSYRCMHEA 89
Query 109 NVFSTSIPIFVTLTYNNKHLPKYG-VFKEEIQLFMKRLRIRLDRLGYKHQIRYFFCSEYG 167
+++S + F+TLTY +++LP+ G + + +LF+ RLR + Y H+IRYF C EYG
Sbjct 90 SLYSQNC--FLTLTYEDRYLPENGSLVRNHPRLFLMRLRKEI----YPHKIRYFGCGEYG 143
Query 168 SKSGRPHYHAIFWN--FPRDGALS----NIWNVVKMIEKAWSYNGEPLGFAYAVPCDKGA 221
SK RPHYH + +N FP LS N V + + + W P GF+ + +
Sbjct 144 SKLQRPHYHLLIYNYDFPDKKLLSKKRGNPLFVSEKLMRLW-----PFGFSTVGSVTRQS 198
Query 222 IGYVMKY-MSKI----------PRIPDNMNNVFFLSSRKDGGIGAAYARRLMPFYRANSQ 270
GYV +Y + K+ R+P+ FL GIGA + + Y+ +
Sbjct 199 AGYVARYSLKKVNGDISQDHYGQRLPE------FLMCSLKPGIGADWYEK----YKRDVY 248
Query 271 CLDITVCDPYSGMSTTVTLPEYFKRLYY---PANSSVLSKEVRDAFKKLCDCISDRYTIH 327
D V G S P Y+ +L+ P + ++ + F L + D+ +
Sbjct 249 PQDYLVVQD-KGKSFKTRPPRYYDKLHSRFDPEEMEEIKQKRVEKFMALPELTQDKAEV- 306
Query 328 CAGNYTDKLRFSDIEKKVLRKYS 350
+ F+D K++ R Y
Sbjct 307 ------KQYIFNDRTKRLFRDYE 323
Lambda K H a alpha
0.325 0.140 0.448 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3038619802764