bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-35_CDS_annotation_glimmer3.pl_2_2
Length=605
Score E
Sequences producing significant alignments: (Bits) Value
gi|648626869|ref|WP_026318620.1| hypothetical protein 142 3e-35
gi|547920049|ref|WP_022322420.1| capsid protein VP1 105 1e-20
gi|492501782|ref|WP_005867318.1| hypothetical protein 103 7e-20
gi|649555287|gb|KDS61824.1| capsid family protein 101 2e-19
gi|609718276|emb|CDN73650.1| conserved hypothetical protein 74.7 8e-11
gi|444297926|dbj|GAC77876.1| major capsid protein 73.9 1e-10
gi|17402851|ref|NP_510872.1| hypothetical protein PhiCPG1p2 73.2 2e-10
gi|599087675|gb|AHN52763.1| major capsid protein 69.7 5e-10
gi|9791178|ref|NP_063895.1| hypothetical protein 71.6 7e-10
gi|9629155|ref|NP_044312.1| VP1 71.6 8e-10
>gi|648626869|ref|WP_026318620.1| hypothetical protein [Alistipes onderdonkii]
Length=231
Score = 142 bits (358), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/246 (35%), Positives = 126/246 (51%), Gaps = 31/246 (13%)
Query 375 SGGETFRQRSYHFGENGYFVVMASLVPDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPL 434
SGG++F++R++HF E+GYF+ + S+VP V+Y ++P + LG Y PALDNI M+PL
Sbjct 2 SGGDSFKRRTFHFNESGYFMEITSVVPTVMYPNYLNPTLLQTNLGQRYAPALDNIQMQPL 61
Query 435 MVEQVDALPSLLS---------------LQRGTDSVYTLSFRPDKLIKNSALGYVPAWSK 479
V P+LL GT + T++ + A+GY PAW++
Sbjct 62 TV------PTLLGNAYFNTGSGSYSHVLNHMGTGELRTVAVDKLSAAEGIAVGYQPAWAE 115
Query 480 VMQSTSRAHGRLTTDLKYWLLNRDYGVDIDQIKNTGVIGVDVLKNLASELQQAYQAGYIS 539
+M S+ HGRL DL YW R YG + + V L+ L +E+ +
Sbjct 116 LMTGVSKPHGRLCNDLDYWAFQRRYGTVLYSSNDAQDASV-FLEELGNEVDT------LD 168
Query 540 FEQMDALVTLFSRIQALPEYTPYVLPNAYNDVFADTSNQAQNFVLTCTFSMSCNREKGKV 599
E +A + ++ PY+LP YN VFADT AQNFVL + +S REK KV
Sbjct 169 VETFNAWL---KNTYVSTDFVPYILPAMYNYVFADTDPNAQNFVLDNSAEISVYREKSKV 225
Query 600 NTPTTI 605
N P T+
Sbjct 226 NVPNTL 231
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 126/532 (24%), Positives = 206/532 (39%), Gaps = 83/532 (16%)
Query 10 RNKKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKLC 69
R +++ NLS+ S TL G L+PI + +GD R K + V+ P+ AP+++ +
Sbjct 12 RPRRNAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVNVF 71
Query 70 LEYFFVPDRLY--NWELLMDNTGVTDDPDKVKFPQISSPAEYSTGTIKFSLNSQSAARGD 127
YFFVP+RL WE + T D D FP+I +Q +
Sbjct 72 THYFFVPNRLVWNEWEDFI--TKGVDGEDMPMFPKIQI--------------NQDSHLVS 115
Query 128 AARLANSIVQPGSLADYCGFPVGLF---PTYDIVSDTDDRN--QFCALKLLGVLDIFYHY 182
+A L SL DY G P +YD+V+ + Q AL I+ Y
Sbjct 116 SASLIKEYFGDSSLWDYLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEY 175
Query 183 YVNQQIERFPTASFTPTLNSGSE---DE-------RNVDYPVTMLRSFLDFVKRSPNPAS 232
Y +Q + T TL SG+ D+ R + S L +++R P
Sbjct 176 YRDQNL----TEPIDFTLGSGTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTV 231
Query 233 AI-GEWATSNPNGNPIFGTWSWFCSRASIFQ----------RCLPPYYLESWLATSGYED 281
+ G + + + W S F+ R P G +
Sbjct 232 PVQGAGGSMDVVYERQSDSQKWVDSSGREFENGHAYDITMARANDPNSALMVAVNGGTNN 291
Query 282 SEIKVD------LDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDV----S 331
++D ++ D I+ ++ + +QRW + GGSRY + I S F V +
Sbjct 292 RAPELDPNGTLKVNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDA 351
Query 332 RLKHTTSPLYLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENG 391
RL+ P +LG R + + + QT+ D +SP AG ++F E+G
Sbjct 352 RLQR---PQFLGGGRMPISVSEVLQTSST-DETSPQANMAGHGISAGINNGFKHYFEEHG 407
Query 392 YFVVMASLVPDVIYSRGM-DPFNREKTLGDVYVPALDNIAMEPLMVEQVDALPSLLSLQR 450
Y + + S+ P Y +G+ F + + D Y P +++ + + ++
Sbjct 408 YIIGIMSITPRSGYQQGVPRDFTKFDNM-DFYFPEFAHLSEQEIKNQE------------ 454
Query 451 GTDSVYTLSFRPDKLIKNSALGYVPAWSKVMQSTSRAHGRLTTDLKYWLLNR 502
L D N GY P +++ S AHG +L +W LNR
Sbjct 455 -------LFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDFRGNLSFWHLNR 499
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 129/522 (25%), Positives = 209/522 (40%), Gaps = 76/522 (15%)
Query 9 SRNKKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKL 68
R +++ NLS+ + T G L+PI + GD R V+ P+ AP+++ +
Sbjct 11 KRPRRNVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDV 70
Query 69 CLEYFFVPDRLY--NWELLMDNTGVTDDPDKVKFPQISSPAEYSTGTIKFSLNSQSAARG 126
YFFVP+RL WE + T D D FP+I+ ++ +N SAA
Sbjct 71 FTHYFFVPNRLLWNQWEDFI--TKGVDGTDTPVFPKIALRPDW--------VNPTSAA-- 118
Query 127 DAARLANSIVQPGSLADYCGFP-VGLFPTYDIVSDTDDR------NQFCALKLLGVLDIF 179
++ GSL DY G P +G F + + + Q AL I+
Sbjct 119 -------VLLDDGSLWDYLGLPTIGGFNNVAFPNRSPNSVMPPVGYQVSALPFRAYQLIY 171
Query 180 YHYYVNQQIERFPTASFTPTLNSG---SEDE-------RNVDYPVTMLRSFLDFVKRSPN 229
YY +Q + T +LNSG S DE R + S L +V+R P
Sbjct 172 NEYYRDQNL----TKPIEFSLNSGIVLSADEVTRLLTLRRRTWEKDYFTSALPWVQRGPE 227
Query 230 ---PASAIGEWATSNPNGNPIFGTWSW--FCSRASIFQRCLPPYYLESWLATSGYEDSEI 284
P G + T+ +R + + + + + E
Sbjct 228 VTVPIQGSGGNLDVTLKNDAHADTYRMPGTSNRPAGAMQLVGGALIAGGTDGAYLEPDNF 287
Query 285 KVDLDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDV----SRLKHTTSPL 340
+V++D G SI+ ++ + +QRW + GSRY + I S F V +RL+ P
Sbjct 288 QVNVDELGVSIN--DLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQR---PQ 342
Query 341 YLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENGYFVVMASLV 400
+LG R + + + QT+ A DS+SP AG +F E+GY + + S+
Sbjct 343 FLGGGRTPISVSEVLQTS-ATDSTSPQANMAGHGISAGVNHGFKRYFEEHGYIIGIMSIR 401
Query 401 PDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPLMVEQVDALPSLLSLQRGTDSVYTLSF 460
P Y +G+ R+ D Y P ++ + + E+V LQ+ S
Sbjct 402 PRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEV-------YLQQTPAS------ 448
Query 461 RPDKLIKNSALGYVPAWSKVMQSTSRAHGRLTTDLKYWLLNR 502
N GY P +++ S + HG ++ +W LNR
Sbjct 449 ------NNGTFGYTPRYAEYKYSMNEVHGDFRGNMAFWHLNR 484
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 130/537 (24%), Positives = 202/537 (38%), Gaps = 103/537 (19%)
Query 9 SRNKKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKL 68
R +++ NLS+ + T+ G LIPI + GD R V+ P+ AP+++ +
Sbjct 11 KRPRRNVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDV 70
Query 69 CLEYFFVPDRLY--NWELLMDNTGVTDDPDKVKFPQISSPAEYSTGTIKFSLNSQSAARG 126
YFFVP+RL WE + T D D FP S P+ T S G
Sbjct 71 FTHYFFVPNRLIWNKWEDFI--TKGVDGTDSPVFPTYSFPSTVDTANAHNSF-------G 121
Query 127 DAARLANSIVQPGSLADYCGFPVGLFPTYDIVSDTDDRN--------QFCALKLLGVLDI 178
D GSL DY G P + + V N + AL I
Sbjct 122 D-----------GSLWDYLGLP-SINQIGEAVFQVQSPNGVKAPAGFKVSALPFRAYHLI 169
Query 179 FYHYYVNQQIERFPTASFTPTLNSGSEDERNVDYPVT--------------MLRSFLDFV 224
+ YY +Q + T+ TL+SG N PV S L +V
Sbjct 170 YNEYYRDQNL----TSELEITLDSG-----NYQLPVNSSLWQLHRRAWEKDYFTSALPWV 220
Query 225 KRSPN---PASAIGEWA------------TSNPNGNPIFGTWSWFCSRASIFQRCLPPYY 269
+R P P + GE T+ P+ PI G+ + + + L
Sbjct 221 QRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPS-----VLSYGQ 275
Query 270 LESWLATSGYEDSEIKVDLDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFD 329
+ S + E V+ D G +I+ +I + +QRW + GSRY + I S F
Sbjct 276 IGSIKGQALIEPDNFVVNTDQMGVNIN--DIRTSNALQRWFERNARSGSRYIEQILSHFG 333
Query 330 V----SRLKHTTSPLYLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSY 385
V +RL+ P +LG R + + + QT+ DS+SP AG +
Sbjct 334 VRSSDARLQR---PQFLGGGRTPISVSEVLQTSST-DSTSPQANMAGHGISAGVNHGFTR 389
Query 386 HFGENGYFVVMASLVPDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPLMVEQVDALPSL 445
+F E+GY + + S+ P Y +G+ R+ D Y P ++ + + E+
Sbjct 390 YFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEE------- 442
Query 446 LSLQRGTDSVYTLSFRPDKLIKNSALGYVPAWSKVMQSTSRAHGRLTTDLKYWLLNR 502
L GY P +++ S + HG ++ +W LNR
Sbjct 443 ------------LYLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNR 487
>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 61/228 (27%), Positives = 99/228 (43%), Gaps = 28/228 (12%)
Query 306 IQRWLDLALAGGSRYSDYINSQFDVS----RLKHTTSPLYLGSDRQYLGSNVIYQTTGAG 361
+Q WL+ GSRY++ I S F V RL+ P +LG ++ + + + Q + A
Sbjct 300 LQEWLEKNARAGSRYAESILSFFGVKTSDGRLQR---PEFLGGNKSPIMISEVLQQS-AT 355
Query 362 DSSSPLGAFAGQASGGETFRQRSYHFGENGYFVVMASLVPDVIYSRGMDPFNREKTLGDV 421
DS++P G AG G S F E+GY + + S++P YS+G+ + D
Sbjct 356 DSTTPQGNMAGHGIGIGKDGGFSRFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDY 415
Query 422 YVPALDNIAMEPLMVEQVDALPSLLSLQRGTDSVYTLSFRPDKLIKNSALGYVPAWSKVM 481
+ P ++I +P+ +++ A D + GY+P +S+
Sbjct 416 FWPQFEHIGEQPVYNKEIFAK------------------NIDAFDSEAVFGYLPRYSEYK 457
Query 482 QSTSRAHGRLTTDLKYWLLNRDYGVDIDQIKNTGVIGVDVLKNLASEL 529
S S HG DL +W L R + D + N I D KN S +
Sbjct 458 FSPSTVHGDFKDDLYFWHLGRIFDTDKPPVLNQSFIECD--KNALSRI 503
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 59/140 (42%), Gaps = 32/140 (23%)
Query 12 KKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKLCLE 71
K S N+S+ ++ G+L+PI + GD + P + PM AP+++ + +
Sbjct 13 KSSTFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNVFIH 72
Query 72 YFFVPDRLY--NWELLMDNTGVTDDPDKVKFPQISSPAEYSTGTIKFSLNSQSAARGDAA 129
YFFVP+R+ NWE + TG D+ P
Sbjct 73 YFFVPNRIIWSNWEQFI--TGGESGLDQHLMP---------------------------- 102
Query 130 RLANSIVQPGSLADYCGFPV 149
R+ N V GSLAD+ G P+
Sbjct 103 RVGNLPVSKGSLADHLGLPL 122
>gi|444297926|dbj|GAC77876.1| major capsid protein [uncultured marine virus]
Length=583
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/461 (22%), Positives = 184/461 (40%), Gaps = 67/461 (15%)
Query 13 KSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKLCLEY 72
+S+ +LSH TT L+PI I GD + + S F + P+++ L +
Sbjct 17 RSSFDLSHGLKTTFNASQLVPILSLEILPGDTINLRASLFGRMATPVKPVLDNLYLETFF 76
Query 73 FFVPDRLYNWELLMDNTGVTDDPDKVKFPQISSPAEYSTGTIKFSLNSQSAARGDAARLA 132
FF P R W + G ++P +I FS+ + +
Sbjct 77 FFTPWRQV-WPNFIKMMGEQEEPGD---------------SIDFSVPTMTG--------- 111
Query 133 NSIVQPGSLADYCGFPVGLFPTYDIVSDTDDRNQFCALKLLGVLDIFYHYYVNQQIER-- 190
+ + G+L+DY G P+G+ P + L IF ++ ++ + +
Sbjct 112 TTPIVSGTLSDYFGLPLGIIP---------NGTPHSTLPFRCYNAIFNFWFRDENLMQPR 162
Query 191 -FPTA------SFTPTLNSGSEDERNVDYPVTMLRSFLDFVKRSPNPASAIGE----WAT 239
PT S N S +R DY + S L F ++ P+ ++G+ A
Sbjct 163 PVPTGDGPDEPSSNQFTNLASRRKRR-DY----ITSGLPFPQKGPDVTVSLGDRAQVMAQ 217
Query 240 SNPNGNPIFGTWSWFCSRASIFQRCLPPYYLESWLATSGYEDSEIKVDLDADGKSISFRN 299
+ +P G S L + ++++ + + +L+ D IS +
Sbjct 218 TIGGTDPAVGAEIGVELGTSGIFHALDTDAARAEISSTTAALAPLYANLE-DATGISIND 276
Query 300 IAAHSHIQRWLDLALAGGSRYSDYINSQFDVSR---LKHTTSPLYLGSDRQYLGSNVIYQ 356
+ IQR L+ GG+RY + + S F VS L H PL+LG + N + Q
Sbjct 277 LRESIQIQRLLERDARGGTRYPEILRSHFQVSDPGLLVHQ-RPLFLGGGSSQININPVAQ 335
Query 357 TTGAGDSSSP-------LGAFAGQASGGETFRQRSYHFGENGYFVVMASLVPDVIYSRGM 409
T+ D S L A+ + F + F E+G+ + + ++ D+ Y +G+
Sbjct 336 TSAKDDQHSATHTPQANLAAYGTVSGNNHGF---TASFTEHGHILGLVNVRADLTYQQGL 392
Query 410 DPFNREKTLGDVYVPALDNIAMEPLMVEQVDALPSLLSLQR 450
+ + +T D Y PAL ++ + + +++ + LS+ R
Sbjct 393 ERYWSRQTRFDFYWPALSHLGEQAVKIKKSSYRMTRLSMTR 433
>gi|17402851|ref|NP_510872.1| hypothetical protein PhiCPG1p2 [Guinea pig Chlamydia phage]
Length=553
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 119/536 (22%), Positives = 204/536 (38%), Gaps = 98/536 (18%)
Query 9 SRNKKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKL 68
+R ++S+ + S TT G LIPI + GD K + + PL++ +L
Sbjct 21 ARIQRSSFDRSCGLKTTFDAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRL 80
Query 69 CLEYFFVPDRLYNWELLMDNTGVTDDP-DKVKF--PQISSPAEYSTGTIKFSLNSQSAAR 125
+YFFVP RL W G D+P D F P +++P S G I+
Sbjct 81 DTQYFFVPLRLL-WSNFQKFCGEQDNPGDSTDFLTPVLTAP---SGGFIE---------- 126
Query 126 GDAARLANSIVQPGSLADYCGFPVGLFPTYDIVSDTDDRNQFCALKLLGVLDIFYHYYVN 185
GS+ DY G P + + A I+ YY +
Sbjct 127 -------------GSIHDYLGLPTKVAGI-----------ECVAFWHRAYNLIWNQYYRD 162
Query 186 QQIERFPTASFTPT----LNSGSEDERNVDYPVTMLRSFLDFVKRSPNPASAIG------ 235
+ I+ T +N+ +R Y S L + ++ P +G
Sbjct 163 ENIQESVDVEMGDTTSNEVNNYKLLKRGKRY--DYFTSCLPWPQKGPAVTIGVGGIVPVQ 220
Query 236 ----EWATSNPNGNPIFGTWSWFCSRASIFQRCLPP--------------YYLESWLATS 277
+W S+ NPI + SW S F P YY++ +
Sbjct 221 GLGIQWGNSSAP-NPITAS-SWINSVNPTFINSTTPTPTGTNKILNYGQAYYIKKPGEAT 278
Query 278 GYEDSEIKVDLDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDV----SRL 333
VDL + ++ ++ +Q+ + GG+RY + I S F+V +RL
Sbjct 279 TDPTPRAYVDLGST-SPVTINSLREAFQLQKLYERDARGGTRYIEIIRSHFNVQSPDARL 337
Query 334 KHTTSPLYLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENGYF 393
+ YLG + + I QT+ DS+SP G A + + R + F E+G
Sbjct 338 QRAE---YLGGSSTPVNISPIPQTSST-DSTSPQGNLAAYGTAIGSKRVFTKSFTEHGVI 393
Query 394 VVMASLVPDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPLMVEQVDAL-PSLLSLQRGT 452
+ +AS+ D+ Y +G+D +T D Y PAL ++ + ++ +++ P++ Q G
Sbjct 394 LGLASVRADLNYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGPAVKDAQNG- 452
Query 453 DSVYTLSFRPDKLIKNSALGYVPAWSKVMQSTSRAHGRL----TTDLKYWLLNRDY 504
+ ++ GY +++ TS+ G+ T L W L + +
Sbjct 453 ----------NVVVDEQVFGYQERFAEYRYKTSKITGKFRSNATGSLDAWHLAQQF 498
>gi|599087675|gb|AHN52763.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=228
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/167 (27%), Positives = 83/167 (50%), Gaps = 6/167 (4%)
Query 273 WLATSGYEDSEIKVDLDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDVSR 332
W+ T G++ +I DL ++ + + ++ IQR L+ GG+RY++ I S F V+
Sbjct 67 WIETDGFKKPKIYADL-SNATAATINDLRESFQIQRLLERDARGGTRYAEIIGSHFGVNF 125
Query 333 LKHTTSPLYLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENGY 392
+ T P YLG + N I +T A ++ LGAF + G F + F E+ Y
Sbjct 126 MDVTYRPEYLGGGSTPININPIAET--AAGTAGELGAFGTASFNGHGFVKS---FTEHCY 180
Query 393 FVVMASLVPDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPLMVEQV 439
+ + ++ D+ Y +G++ T D Y P+L I + ++ +++
Sbjct 181 IIGLVNVRADLTYQQGLNRLWSRSTRYDFYWPSLAQIGEQAVLNKEI 227
>gi|9791178|ref|NP_063895.1| hypothetical protein [Chlamydia pneumoniae phage CPAR39]
gi|7190965|gb|AAF39725.1| hypothetical protein [Chlamydia pneumoniae phage CPAR39]
Length=553
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 117/536 (22%), Positives = 202/536 (38%), Gaps = 98/536 (18%)
Query 9 SRNKKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKL 68
+R ++S+ + S TT G LIPI + GD K + + PL++ +L
Sbjct 21 ARIQRSSFDRSCGLKTTFDAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRL 80
Query 69 CLEYFFVPDRLYNWELLMDNTGVTDDP-DKVKF--PQISSPAEYSTGTIKFSLNSQSAAR 125
+YFFVP RL W G D+P D F P +++P+ T
Sbjct 81 DTQYFFVPLRLI-WSNFQKFCGEQDNPGDSTDFLTPVLTAPSGGFT-------------- 125
Query 126 GDAARLANSIVQPGSLADYCGFPVGLFPTYDIVSDTDDRNQFCALKLLGVLDIFYHYYVN 185
GS+ DY G P + + A I+ YY +
Sbjct 126 ------------EGSIHDYLGLPTKVAGI-----------ECVAFWHRAYNLIWNQYYRD 162
Query 186 QQIERFPTASFTPT----LNSGSEDERNVDYPVTMLRSFLDFVKRSPNPASAIG------ 235
+ I+ T +N+ +R Y S L + ++ P +G
Sbjct 163 ENIQESVDVEMGDTTSNEVNNYKLLKRGKRY--DYFTSCLPWPQKGPAVTIGVGGIVPVQ 220
Query 236 ----EWATSNPNGNPIFGTWSWFCSRASIFQRCLPP--------------YYLESWLATS 277
+W S+ NPI + SW S F P YY++ +
Sbjct 221 GLGIQWGNSSAP-NPITAS-SWINSVNPTFINSTTPTPTGTNQILNYGQAYYIKKPGEAT 278
Query 278 GYEDSEIKVDLDADGKSISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDV----SRL 333
VDL + ++ ++ +Q+ + GG+RY + I S F+V +RL
Sbjct 279 TDPTPRAYVDLGST-SPVTINSLREAFQLQKLYERDARGGTRYIEIIRSHFNVQSPDARL 337
Query 334 KHTTSPLYLGSDRQYLGSNVIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENGYF 393
+ YLG + + I QT+ DS+SP G A + + R + F E+G
Sbjct 338 QRAE---YLGGSSTPVNISPIPQTSST-DSTSPQGNLAAYGTAIGSKRVFTKSFTEHGVI 393
Query 394 VVMASLVPDVIYSRGMDPFNREKTLGDVYVPALDNIAMEPLMVEQVDAL-PSLLSLQRGT 452
+ +AS+ D+ Y +G+D +T D Y PAL ++ + ++ +++ P++ Q G
Sbjct 394 LGLASVRADLNYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGPAVKDAQNG- 452
Query 453 DSVYTLSFRPDKLIKNSALGYVPAWSKVMQSTSRAHGRL----TTDLKYWLLNRDY 504
+ ++ GY +++ TS+ G+ T L W L + +
Sbjct 453 ----------NVVVDEQVFGYQERFAEYRYKTSKITGKFRSNATGSLDAWHLAQQF 498
>gi|9629155|ref|NP_044312.1| VP1 [Chlamydia phage 1]
gi|139180|sp|P19192.2|F_BPCHP RecName: Full=Capsid protein VP1; AltName: Full=Protein VP1;
Short=VP1 [Chlamydia phage 1]
gi|93817|pir||JU0345 major capsid protein VP1 - Chlamydophila psittaci phage Chp1
gi|217762|dbj|BAA00515.1| VP1 [Chlamydia phage 1]
Length=596
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 103/507 (20%), Positives = 188/507 (37%), Gaps = 124/507 (24%)
Query 12 KKSNVNLSHVSPTTLAPGNLIPISFTRIFAGDDLRFKPSAFVQAMPMNAPLVNGFKLCLE 71
++S+ + SH TT L+P + GD + + + P+++ +L +
Sbjct 24 RRSSFDRSHGYKTTFDMDYLVPFFVDEVLPGDTFSLSETHLCRLTTLVQPIMDNIQLTTQ 83
Query 72 YFFVPDRLY--NWELLMDNTG--------VTDDPDKVKFPQISSPAEYSTGTIKFSLNSQ 121
+FFVP+RL NWE + TG T+ ++ PQ++SP
Sbjct 84 FFFVPNRLLWDNWESFI--TGGDEPVAWTSTNPANEYFVPQVTSP--------------- 126
Query 122 SAARGDAARLANSIVQPGSLADYCGFPVGLFPTYDIVSDTDDRNQFCALKLLGVLDIFYH 181
D NSI DY G P + + R+Q L+ + IF
Sbjct 127 -----DGGYAENSIY------DYFGLPTKV---------ANYRHQVLPLRAYNL--IFNE 164
Query 182 YYVNQQI-ERFPT------ASFTPTLNSGSEDERNVDYPVTMLR---------SFLDFVK 225
YY ++ + E P PT S+++ V Y ++R S L ++
Sbjct 165 YYRDENLQESLPVWTGDADPKVDPTTGEESQEDDAVPYVYKLMRRNKRYDYFTSALPGLQ 224
Query 226 RSPNPASAIGEWATSNPNGN-PIFG------------------------TWSWFCSRASI 260
+ P ++G T +G P+ G + WF + +
Sbjct 225 KGP----SVGIGITGGDSGRLPVHGLAIRSYLDDSSDDQFSFGVSYVNASQKWFTADGRL 280
Query 261 ----------------FQRCLPPYYLESWLATSGYE-----------DSEIKVDLDADGK 293
+ P Y + +A +G D + VDL A
Sbjct 281 TSGMGSVPVGTTGNFPIDNVVYPSYFGTTVAQTGSPSSSSTPPFVKGDFPVYVDLAA-SS 339
Query 294 SISFRNIAAHSHIQRWLDLALAGGSRYSDYINSQFDVSRLKHTTS-PLYLGSDRQYLGSN 352
S++ ++ +Q+W + + GSRY + + F V + P+YLG + Y+ N
Sbjct 340 SVTINSLRNAITLQQWFEKSARYGSRYVESVQGHFGVHLGDYRAQRPIYLGGSKSYVSVN 399
Query 353 VIYQTTGAGDSSSPLGAFAGQASGGETFRQRSYHFGENGYFVVMASLVPDVIYSRGMDPF 412
+ Q + DS SP G + A +T + F E+G+ + + S D+ Y +G++
Sbjct 400 PVVQNSST-DSVSPQGNLSAYALSTDTKHLFTKSFVEHGFVIGLLSATADLTYQQGLERQ 458
Query 413 NREKTLGDVYVPALDNIAMEPLMVEQV 439
+ D Y P ++ +P+ +++
Sbjct 459 WSRFSRYDYYWPTFAHLGEQPVYNKEI 485
Lambda K H a alpha
0.319 0.135 0.406 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4517024053047