bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-41_CDS_annotation_glimmer3.pl_2_2
Length=582
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094544|emb|CDL65904.1| unnamed protein product 576 0.0
gi|575094572|emb|CDL65928.1| unnamed protein product 554 0.0
gi|575094492|emb|CDL65859.1| unnamed protein product 548 0.0
gi|575096056|emb|CDL66947.1| unnamed protein product 534 2e-180
gi|575094496|emb|CDL65862.1| unnamed protein product 523 3e-176
gi|575094415|emb|CDL65790.1| unnamed protein product 446 2e-146
gi|575094431|emb|CDL65804.1| unnamed protein product 407 2e-131
gi|575094564|emb|CDL65921.1| unnamed protein product 392 3e-125
gi|530695351|gb|AGT39907.1| major capsid protein 380 5e-121
gi|9629155|ref|NP_044312.1| VP1 377 5e-119
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 576 bits (1485), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/585 (51%), Positives = 381/585 (65%), Gaps = 41/585 (7%)
Query 1 VESHFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTM 60
VESHF++LP+ +I RS FDRS KT+F GD+IPF +DEVLPGD+FN+ +SKV+R Q++
Sbjct 5 VESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIRMQSL 64
Query 61 LTPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGT 120
+TPIMDN++LDTYYFFVPNRLVW HW++F GEN E AW PT EY VP + P G++ GT
Sbjct 65 VTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAPANGWSIGT 124
Query 121 IADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNG 180
IADY G+P GV +ALPFR YALICNE+FRDENL+DPL I + DA GSNG
Sbjct 125 IADYFGIPTGVACSV------NALPFRAYALICNEWFRDENLSDPLNIPISDATVVGSNG 178
Query 181 DDYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTV 240
D+Y D+ GG PFKA + HDYF+SCLP+ QKG V +P+ + PVTT+D
Sbjct 179 DNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPL-----SSSPVPVTTSDTMVD 233
Query 241 PA--SAVPAVFGLFTTPLNGTIDGRTAYPASTGSSELEQGQKIFVSDGHSAPGPMCWMAG 298
P S P G+ + L+ T+ P E +G V H G + +
Sbjct 234 PLQYSKYPMA-GVDSWNLSPTLMRNIIRPF-----EGVEGANYQV---HQFTGDIPTI-- 282
Query 299 SNITYARNVWSPINLSTTIPAAGSGDDVDASFTVNELRLAFAYQRFLESMARNGSRYTEL 358
+ + P+NL + A + ++N+LRLAF QR E AR G+RY E+
Sbjct 283 -------DAFRPLNLVANLQNATAA-------SINQLRLAFQIQRLYERDARGGTRYIEI 328
Query 359 LLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTNNAQTPEDY-LGDLGAKSSTGDVNHDF 417
L FGV SPDARLQRPEYLGGNRIPI++++V ++T G+ +S T D N DF
Sbjct 329 LKSHFGVTSPDARLQRPEYLGGNRIPININQVLQQSETTSTSPQGNPVGQSLTTDTNADF 388
Query 418 IKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGETGVYQAEIMATP 477
+KSF EHG++ GL V RYDH+Y QG+ERFW+RK D+Y P FAH+GE V EI +
Sbjct 389 VKSFVEHGFVIGLMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSG 448
Query 478 ENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAKVPTLSDGWIRE 537
+ D +VFG+QE +ADYRYKPS VTGEMR SL WHLAD YA +P+LSD WIRE
Sbjct 449 TAVDD--EVFGYQEAYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRE 506
Query 538 DKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPGSLDHF 582
+ VDRVLAV S+V+ Q +CD++I N TRPMPMYS+PG +DHF
Sbjct 507 SASTVDRVLAVSSNVSAQLFCDIYIQNRSTRPMPMYSVPGLIDHF 551
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 554 bits (1427), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/589 (51%), Positives = 374/589 (63%), Gaps = 46/589 (8%)
Query 1 VESHFAQLPA-AEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQT 59
VESHFA+ P +I RSTFDRS K +F G+IIPF ++EVLPGD+F + TSKV+R QT
Sbjct 5 VESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVIRLQT 64
Query 60 MLTPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKG 119
+LTP+MDN++LDTYYFFVPNRLVW+HW+EF GEN + AW P VEY +P + P GG+ G
Sbjct 65 LLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLTAPEGGWNIG 124
Query 120 TIADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSN 179
T+ADY G+P GV ++ +ALPFR YAL+CNE+FRD+NL+DPL I + DA G N
Sbjct 125 TLADYFGIPTGVS-----GISVNALPFRAYALVCNEWFRDQNLSDPLNIPVGDATVTGVN 179
Query 180 GDDYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWT 239
+ DV GG P+ AA+ HDYF+SCLP+ QKG V T PVT+ N
Sbjct 180 TGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDV------------TIPVTSGHN-- 225
Query 240 VPASAVPAVFGLFTTPLNGTIDGRTAYPASTG--SSELEQ---GQKIFVSDGHSAPGPMC 294
+P +F LN T D P G +SEL ++
Sbjct 226 -----LPVMF------LNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSST 274
Query 295 WMAGSNIT-YARNVWSPINLSTTIPAAGSGDDVDASFTVNELRLAFAYQRFLESMARNGS 353
GS+ T +N W+P N+ A SGD A T+N+LRLAF Q+ E AR G+
Sbjct 275 VEVGSDGTGIGQNFWTPTNMW----AVESGDVGMA--TINQLRLAFQLQKLYEKDARGGT 328
Query 354 RYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTNNAQTPEDY-LGDLGAKSSTGD 412
RYTE++ FGV SPD+RLQRPEYLGGNRIPI+V+++ +Q+ E LG L S T D
Sbjct 329 RYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQSQSTEQSPLGALAGMSVTTD 388
Query 413 VNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGETGVYQAE 472
N DFIKSF EHGY+ GL V RYDH+Y QG++R W+RK DFY P A++GE V E
Sbjct 389 KNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKE 448
Query 473 IMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAKVPTLSD 532
I + D +VFG+QE WA+YRYKP+ V GEMR SL WHL D Y+ +P LSD
Sbjct 449 IYIDGSDTDD--EVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSD 506
Query 533 GWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPGSLDH 581
WIREDK NVDRVLAV SSV+DQ + D++I N TRPMPMYSIPG +DH
Sbjct 507 SWIREDKTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGLIDH 555
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 548 bits (1412), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 288/561 (51%), Positives = 363/561 (65%), Gaps = 42/561 (7%)
Query 24 YKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTMLTPIMDNMFLDTYYFFVPNRLVW 83
YKT+F GD+IPF VDE+LPGD+F+I TSKVVR Q++LTP+MDN++LDTY+FFVPNRL W
Sbjct 29 YKTTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTW 88
Query 84 KHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTIADYMGLPIGVEWKATDPLAPSA 143
HWRE GEN + AW P VEY+VP I P GG+ GTIADYMG+P GV L+ +A
Sbjct 89 SHWRELMGENTQSAWTPQVEYSVPQITAPEGGWNVGTIADYMGIPTGVS-----GLSVNA 143
Query 144 LPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNGDDYANDVANGGKPFKAARMHDYF 203
+PFR YALICNE+FRDENLTDPL I + DA G N Y DVA GG PFKAA+ HDYF
Sbjct 144 MPFRAYALICNEWFRDENLTDPLNIPVGDATVAGVNTGTYVTDVAKGGLPFKAAKYHDYF 203
Query 204 SSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTVPASAVPAVFGLFTTPLNGTIDG- 262
+SCLP+ QKG V I G PVT TDN LN G
Sbjct 204 TSCLPAPQKGPDVLIS----AVGSGIVPVTATDN--------------DNDSLNVNSPGM 245
Query 263 RTAYPASTGSSELE--QGQKIFVSDGHSAPGPMCWMAGSNITYARNVWSPINLSTTIPAA 320
R +ST + L G V+D P P + G ++ N+W+ ++ +T +P A
Sbjct 246 RFVGNSSTSVNYLAFGGGDGYVVTD---TPKPSTPIHGISMI-PTNLWADLSTATDLPVA 301
Query 321 GSGDDVDASFTVNELRLAFAYQRFLESMARNGSRYTELLLGLFGVRSPDARLQRPEYLGG 380
T+N+LR AF Q+ E AR G+RY E+L FGV SPDARLQRPEYLGG
Sbjct 302 ----------TINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGG 351
Query 381 NRIPISVSEVTNNAQTPEDYLGDLGAKSSTGDVNHDFIKSFTEHGYLFGLFVVRYDHSYS 440
+R+PI++++V +++T G+ A S T D + +F KSF EHG++ GL V RYDHSY
Sbjct 352 SRVPININQVIQSSETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQ 411
Query 441 QGIERFWTRKKFTDFYNPKFAHLGETGVYQAEIMATPENMADPTKVFGFQEIWADYRYKP 500
QG++RFW+RK D+Y P FA+LGE V EI A ++ D +VFG+QE WADYRYKP
Sbjct 412 QGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDD--EVFGYQEAWADYRYKP 469
Query 501 SLVTGEMRPGVTNSLAHWHLADHYAKVPTLSDGWIREDKANVDRVLAVQSSVADQFWCDL 560
S+VTGEMR SL WHLAD Y +P+LSD WIRED + V+RVLAV SV+ Q +CD+
Sbjct 470 SVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDI 529
Query 561 WISNMCTRPMPMYSIPGSLDH 581
+I + TRPMP+YSIPG +DH
Sbjct 530 YIRCLATRPMPLYSIPGLIDH 550
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 534 bits (1375), Expect = 2e-180, Method: Compositional matrix adjust.
Identities = 292/599 (49%), Positives = 375/599 (63%), Gaps = 55/599 (9%)
Query 2 ESHFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTML 61
ESHF+ LP +I RS FDRS KT+F AGD++PF ++EVLPGD+F++ +SKVVR QT+L
Sbjct 7 ESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVRMQTLL 66
Query 62 TPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTI 121
TP+MDN++LDTYYFFVPNRLVW+HW+EFCGEN E AW P EY +P + P GGF GTI
Sbjct 67 TPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQLKSPVGGFEVGTI 126
Query 122 ADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNGD 181
ADY GLP GV L+ SALPFR YALI NE+FRDENL DPL++ DDA G N
Sbjct 127 ADYFGLPTGVA-----NLSVSALPFRAYALIMNEWFRDENLMDPLVVPTDDATVTGVNTG 181
Query 182 DYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTVP 241
+ DVA GGKPF AA+ HDYF+S LP+ QKG V I PV + N+ V
Sbjct 182 IFVTDVAKGGKPFVAAKYHDYFTSALPAPQKGPDVVI------------PVASAGNYNVV 229
Query 242 ASAVPAVFGLFTTPLNGTIDGRTAYPASTG-SSELEQGQKIFVSDGHSAPGPMCWMAGSN 300
+ GL + DG G S QG ++F S G +
Sbjct 230 GNGK----GLALS------DGSKMSIICNGLSGSNGQGTELFAS-GILGSQVGSSGGFGS 278
Query 301 ITYARNVWSPINLSTTIPAAGSGDDVDAS------------FTVNELRLAFAYQRFLESM 348
+ R + + T AA G++++ S T+N+LR+AF Q+F E
Sbjct 279 GSSLRGDGIILGVPT---AAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQ 335
Query 349 ARNGSRYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTNNA------QTPEDYLG 402
AR GSRYTE++ FGV SPDARLQR EYLGGNRIPI++++V + TP+ G
Sbjct 336 ARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQ---G 392
Query 403 DLGAKSSTGDVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAH 462
+ S T D + DF KSFTEHG++ G+ RYDH+Y QGI+R W+RK D+Y P F++
Sbjct 393 TVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSN 452
Query 463 LGETGVYQAEIMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLAD 522
+GE + EI A + A +VFG+QE WA+YRYKPS VTGEMR SL WHLAD
Sbjct 453 IGEQAIKNKEIYA--QGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLAD 510
Query 523 HYAKVPTLSDGWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPGSLDH 581
Y+K+P+LSD WIRED ++RVLAV ++QF+ D+++ N+CTRPMPMYSIPG +DH
Sbjct 511 DYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGLIDH 569
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 523 bits (1346), Expect = 3e-176, Method: Compositional matrix adjust.
Identities = 287/590 (49%), Positives = 363/590 (62%), Gaps = 39/590 (7%)
Query 3 SHFAQLPAA-EIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTML 61
S F++ P +IQRSTF+RS YKTS G++IPF DEVLPGD+F + T+KVVR Q ++
Sbjct 6 SRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRLQPLV 65
Query 62 TPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPG-GFAKGT 120
+ MDN++ DTYYFFVPNRLVW+HW EF GEN++GAW P EYT+P I P GF GT
Sbjct 66 SAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQITSPASTGFEIGT 125
Query 121 IADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNG 180
IADY G+P GV L+ SALPFR YALI +E+FRD+NL PL I LDD QG N
Sbjct 126 IADYFGIPTGVP-----NLSVSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQGVNT 180
Query 181 DDYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTV 240
DY D GGKPF AA+ HDYF+SCLPS QKG V I A G FPV T D
Sbjct 181 GDYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIA------AVGDFPVYTGDPHNN 234
Query 241 PASAVPAVFGLFTTPLNGTIDGRTAYPASTGSSELEQGQKIF---VSDGHSAPGP-MCWM 296
S +G+ S+GS QG I ++ G + P +
Sbjct 235 NGSNKALHYGISNI--------------SSGSVSFSQGNYIIPSVLTTGSTQSVPAQGKL 280
Query 297 AGSNITYARNVWSPINLSTTIPAAGSGDDVDAS----FTVNELRLAFAYQRFLESMARNG 352
SNIT + SP + S + D++ AS T+N+LR+AF Q+ E AR G
Sbjct 281 NASNITMTTSPGSP-DSSFGSKLSVYPDNLYASSGTATTINQLRMAFQIQKLYEKDARAG 339
Query 353 SRYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTNNAQTPE-DYLGDLGAKSSTG 411
SRY EL+ F V DAR+Q PEYLGGNRIPI++++V +QT + G++ +S T
Sbjct 340 SRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTSQTSDVSPQGNVAGQSLTS 399
Query 412 DVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGETGVYQA 471
D + DFIKSFTEHG L G+ V RYDH+Y QG+ + W+RK D+Y P A++GE V
Sbjct 400 DSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQAVLNK 459
Query 472 EIMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAKVPTLS 531
EI A + A +VFG+QE WA+YRYKPS+VTGEMR SL WH AD Y +P LS
Sbjct 460 EIYA--QGTAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPKLS 517
Query 532 DGWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPGSLDH 581
WI+EDK N+DRVLAV SSV++Q++ D +I N TR +P YSIPG +DH
Sbjct 518 ADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGLIDH 567
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 446 bits (1148), Expect = 2e-146, Method: Compositional matrix adjust.
Identities = 246/588 (42%), Positives = 327/588 (56%), Gaps = 40/588 (7%)
Query 2 ESHFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTML 61
E+H++Q+P A IQR+ F R Y T+ GD++P VDEVLPGD+ I +VR T L
Sbjct 6 EAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVRMSTPL 65
Query 62 TPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTI 121
P+MDN +LD +YFFVP RLVW HW+ GEN + WAP V+YT P + P GG+ GTI
Sbjct 66 YPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTPLTSAPSGGWQVGTI 125
Query 122 ADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNGD 181
ADYMG+P GV + +++P R YA I NE+FRDENL P+ DDA GSN
Sbjct 126 ADYMGIPTGVS-----GIKVNSMPMRAYARIWNEWFRDENLQQPVTQHSDDATTTGSNTG 180
Query 182 DYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIH----VPGFAGGTFPVTTTDN 237
D +GG P K A+ DYF+SCLP+ QKG ++G + V G G FP+ T
Sbjct 181 TELTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFNQTPKVKGI-GLVFPLETNTG 239
Query 238 -------WTVPASAVPAVFGLFTTPLNGTIDGRTAYPASTGSSELEQGQKIFVSDGHSAP 290
W P + + V + T N + + T + + + F ++G
Sbjct 240 HTATDILWRQPDAQL--VGENYNTSYNN-------FNSITTQTTVNGKKAFFFNNGK--- 287
Query 291 GPMCWMAGSNITYARNVWSPINLSTTIPAAGSGDDVDASFTVNELRLAFAYQRFLESMAR 350
GPM A Y V + ++ ++N+LR A A Q LE+ AR
Sbjct 288 GPML-SARFEDDYNGGV-------EQVELTAVAENSTNFLSINDLRQAIALQHILEADAR 339
Query 351 NGSRYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVT-NNAQTPEDYLGDLGAKSS 409
G+RY E+L FGV SPDARLQR EY+GG RIPI+VS+V ++A G+ A S
Sbjct 340 GGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSASDTTSPQGNAAAYSL 399
Query 410 TGDVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGETGVY 469
T N S EHGY+ GL +R DHSY QG+ R WTR +Y+P A+LGE V
Sbjct 400 TTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPMLANLGEQAVL 459
Query 470 QAEIMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAKVPT 529
EI A + T+VFG+QE WADYRY+ +++TGEMR SL WH D Y +P
Sbjct 460 NQEIYA--QGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGDKYTDLPR 517
Query 530 LSDGWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPG 577
LS+ WI+E + N+DR LAVQS + QF C+L+ RPMP+YS+PG
Sbjct 518 LSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPG 565
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 407 bits (1047), Expect = 2e-131, Method: Compositional matrix adjust.
Identities = 229/595 (38%), Positives = 318/595 (53%), Gaps = 60/595 (10%)
Query 4 HFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTMLTP 63
+FA+ P + RS F+R+ +F G+I+P VDEVLPGD+F + + ++R T + P
Sbjct 8 NFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIRGSTPIFP 67
Query 64 IMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTIAD 123
+MDN FLD Y+FFVPNRL W+HWRE GENR AW V+Y+VP + P GG+ + ++AD
Sbjct 68 VMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTAPAGGWEELSLAD 127
Query 124 YMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNGDDY 183
+MG+P V D ++ +ALPFR Y LI NEFFR++NLT+P + + DAN G N +D
Sbjct 128 HMGIPTKV-----DNISVNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDANIAGKNPNDV 182
Query 184 AND---VANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTV 240
N G K K+A+ DYF+ LP QKG V I +
Sbjct 183 KNSNDWAITGAKCLKSAKFFDYFTGALPQPQKGEPVEI--------------------NL 222
Query 241 PASAVPAVFGLFTTPLNGTIDGRT---AYPASTGSSELEQGQKIFVSDGHSAP------- 290
+S +P G + PL+ + T P+S G+++ + +G P
Sbjct 223 ASSWLPVGIGDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPNGLKNFE 282
Query 291 ---GPMCWMAGSNITYARNVWSPINLSTTIPAAGSGDDVDASFTVNELRLAFAYQRFLES 347
G +G+ Y N+W+ V A+ TVN+LR AF Q+ LE
Sbjct 283 TKAGGSFSESGAVAAYPTNLWA--------------SPVTAAATVNQLRQAFQVQKLLEK 328
Query 348 MARNGSRYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTN-NAQTPEDYLGDLGA 406
AR G+RY E+L FGV + DAR+Q PEYLGG ++PI+VS+V +A T G+ A
Sbjct 329 DARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTSASTDASPQGNTAA 388
Query 407 KSSTGDVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGET 466
S T F KSF EHG++ G+ R SY QGIER W+RK D+Y P A++GE
Sbjct 389 ISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQ 448
Query 467 GVYQAEIMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAK 526
+ EI A + A + FG+QE WADYRYKP+ + G R SL WH Y K
Sbjct 449 AILNKEIYA--QGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDK 506
Query 527 VPTLSDGWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPGSLDH 581
+PTLS W+ + + R LAVQ+ F + + R MP+YSIPG +DH
Sbjct 507 LPTLSTDWMEQSDIEMKRTLAVQTE--PDFIANFRFNCKTVRVMPLYSIPGLIDH 559
>gi|575094564|emb|CDL65921.1| unnamed protein product [uncultured bacterium]
Length=582
Score = 392 bits (1007), Expect = 3e-125, Method: Compositional matrix adjust.
Identities = 245/605 (40%), Positives = 339/605 (56%), Gaps = 58/605 (10%)
Query 2 ESHFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTML 61
+ F+Q+P + IQRS FDRSH YKT+ AG +IPF VDEVLPGD+F + + VR T++
Sbjct 12 NNRFSQIPNSPIQRSVFDRSHDYKTTLDAGYLIPFFVDEVLPGDTFKLRVNAFVRMNTLV 71
Query 62 TPIMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTI 121
P MDN+F+DT++FFVP+RLVW +W+ FCGE + + ++ +PS++ F G+I
Sbjct 72 APFMDNVFMDTFFFFVPSRLVWDNWQRFCGEQKNPG--DSTDFLIPSLSGT-NTFTNGSI 128
Query 122 ADYMGLPIGVEWKATD-PLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNG 180
DYMGLP GV T+ P+ +ALPFR Y LI NE+FRDENL D + ++ D SN
Sbjct 129 FDYMGLPTGVPLNPTNTPI--NALPFRAYNLIYNEWFRDENLIDSIPVTTGDGPDPISNY 186
Query 181 DDYANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIH----VPGFAGG-TFPVTTT 235
K A+ HDYF+S LP QKG SV + + V GF G T+ +
Sbjct 187 -----------TLRKRAKRHDYFTSALPWPQKGPSVDVGLTGNAPVVGFGDGQTWNFMSN 235
Query 236 DNWTVPASAVPAVFGLFTTPLNGT----------IDGRTAYPASTGSSELEQGQKIFVSD 285
++ S AV G T L+ T P +++ + I D
Sbjct 236 TSY----SGNQAVLGNPTDVLDNVGLQVFINREQFSTATLIPIIQETNQSGRWANIGNQD 291
Query 286 GHSAP--GPMCWMAGSNITYARNVWSPINLSTTIPAAG-SGDDVDASFTVNELRLAFAYQ 342
S P+ + G + + S N S P A SG ++ T+N+LR AF Q
Sbjct 292 QSSGTDVSPIRAIRGDGFYFPNGILS--NSSGQQPYADLSGV---SAITINDLRQAFQIQ 346
Query 343 RFLESMARNGSRYTELLLGLFGVRSPDARLQRPEYLGG-----NRIPISVSEVTNNAQTP 397
+F E AR GSRYTE L +F V SPDARLQRPEYLGG N +P + + T++ +P
Sbjct 347 KFYEKWARGGSRYTETLRVMFNVISPDARLQRPEYLGGTHSRVNVVPTAQTSSTDSV-SP 405
Query 398 EDYLGDLGAKSSTGDVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYN 457
+ L G GD H F KSF EHGY+ GL +R D +Y QG+ R W+R++ DFY
Sbjct 406 QSNLSAFGV---LGDSAHGFNKSFVEHGYVIGLVCLRADITYQQGLNRMWSRRQLFDFYW 462
Query 458 PKFAHLGETGVYQAEIMATPENMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAH 517
P AHLGE VY EI + D VFG+QE +A+YRYKPS++TG++R + +L
Sbjct 463 PTLAHLGEQVVYNREIYT--QGTDDDNGVFGYQERYAEYRYKPSMITGKLRSTDSQTLDV 520
Query 518 WHLADHYAKVPTLSDGWIREDKANVDRVLAVQSSVADQFWCDLWISNMCTRPMPMYSIPG 577
WHLA + +P L+ +I E+ ++RV+AVQ+ QF+ D W +RPMP+YS+PG
Sbjct 521 WHLAQKFDTLPKLNQDFIEENPP-INRVIAVQNE--PQFFADFWFDLKTSRPMPVYSVPG 577
Query 578 SLDHF 582
+DHF
Sbjct 578 LVDHF 582
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 380 bits (976), Expect = 5e-121, Method: Compositional matrix adjust.
Identities = 232/585 (40%), Positives = 314/585 (54%), Gaps = 63/585 (11%)
Query 4 HFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTMLTP 63
F+ +P AEI RS FD KT+F +G ++P +VDEVLPGDS N+ + R T L P
Sbjct 12 QFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTRLATPLFP 71
Query 64 IMDNMFLDTYYFFVPNRLVWKHWREFCGENREGAWAPTVEYTVPSIAPPPGGFAKGTIAD 123
+MDNM+LDT++FFVPNRL+W +W+ F GE R+ +++YT+P++ P GG+A ++ D
Sbjct 72 VMDNMYLDTFFFFVPNRLLWSNWQRFMGE-RDPDPDSSIDYTIPTMTSPNGGYAVNSLQD 130
Query 124 YMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQGSNGDDY 183
YMGLP + A ++ ++L R Y LI NE+FRDENL D +++ +G D Y
Sbjct 131 YMGLPTAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDSVVV------DKGDGPDTY 184
Query 184 ANDVANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPVTTTDNWTVPAS 243
+ + + HDYF+S LP QKG +V +P+ GG+ V D
Sbjct 185 TDYTL-----LRRGKRHDYFTSALPWPQKGDAVTLPL------GGSANVVYNDTGDPAYI 233
Query 244 AVPAVFGLFTTPLNGTIDGRTAYPASTGSSELEQGQKIFVSDGHSAPGPMCWMAGS-NIT 302
+ ++TTP ++ A G M GS N
Sbjct 234 REVSTGNVWTTPSRESV-------------------------SKEANGNMSVPTGSVNAQ 268
Query 303 YARNVWSPINLSTTIPAAGSGDDVDASFTVNELRLAFAYQRFLESMARNGSRYTELLLGL 362
Y N +LST A T+N +R +F QR LE AR G+RYTE++
Sbjct 269 YDPNGSLVADLSTATAA-----------TINAIRQSFQIQRLLERDARGGTRYTEIVRSH 317
Query 363 FGVRSPDARLQRPEYLGGNRIPISVSEVTNN----AQTPEDYLGDLGAKSSTGDVNHDFI 418
FGV SPDAR+QRPEYLGG PI V+ V A + LG LGA + H F
Sbjct 318 FGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFA 377
Query 419 KSFTEHGYLFGLFVVRYDHSYSQGIERFWTRKKFTDFYNPKFAHLGETGVYQAEIMATPE 478
SFTEHG + GL VR D +Y QG+ R ++R DF+ P F+HLGE + E+ AT
Sbjct 378 SSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGT 437
Query 479 NMADPTKVFGFQEIWADYRYKPSLVTGEMRPGVTNSLAHWHLADHYAKVPTLSDGWIRED 538
+ D VFG+QE WA+YRYKPS VTG MR +L WHLA ++ +PTL+ +I ED
Sbjct 438 STDD--DVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-ED 494
Query 539 KANVDRVLAVQSSV-ADQFWCDLWISNMCTRPMPMYSIPGSLDHF 582
VDRV+AV S QF D + RPMPMYS+PG +DHF
Sbjct 495 TPPVDRVVAVGSEANGQQFIFDAFFDINMARPMPMYSVPGLVDHF 539
>gi|9629155|ref|NP_044312.1| VP1 [Chlamydia phage 1]
gi|139180|sp|P19192.2|F_BPCHP RecName: Full=Capsid protein VP1; AltName: Full=Protein VP1;
Short=VP1 [Chlamydia phage 1]
gi|93817|pir||JU0345 major capsid protein VP1 - Chlamydophila psittaci phage Chp1
gi|217762|dbj|BAA00515.1| VP1 [Chlamydia phage 1]
Length=596
Score = 377 bits (967), Expect = 5e-119, Method: Compositional matrix adjust.
Identities = 241/616 (39%), Positives = 332/616 (54%), Gaps = 73/616 (12%)
Query 1 VESHFAQLPAAEIQRSTFDRSHGYKTSFCAGDIIPFMVDEVLPGDSFNISTSKVVRSQTM 60
+++ F+++P A I+RS+FDRSHGYKT+F ++PF VDEVLPGD+F++S + + R T+
Sbjct 11 MKNRFSEVPTATIRRSSFDRSHGYKTTFDMDYLVPFFVDEVLPGDTFSLSETHLCRLTTL 70
Query 61 LTPIMDNMFLDTYYFFVPNRLVWKHWREFC-GENREGAWA---PTVEYTVPSIAPPPGGF 116
+ PIMDN+ L T +FFVPNRL+W +W F G + AW P EY VP + P GG+
Sbjct 71 VQPIMDNIQLTTQFFFVPNRLLWDNWESFITGGDEPVAWTSTNPANEYFVPQVTSPDGGY 130
Query 117 AKGTIADYMGLPIGVEWKATDPLAPSALPFRGYALICNEFFRDENLTDPLLISLDDANQQ 176
A+ +I DY GLP V LP R Y LI NE++RDENL + L + DA+ +
Sbjct 131 AENSIYDYFGLPTKVA-----NYRHQVLPLRAYNLIFNEYYRDENLQESLPVWTGDADPK 185
Query 177 --GSNGDDYAND--VANGGKPFKAARMHDYFSSCLPSSQKGSSVGIPIHVPGFAGGTFPV 232
+ G++ D V K + + +DYF+S LP QKG SVGI G GG
Sbjct 186 VDPTTGEESQEDDAVPYVYKLMRRNKRYDYFTSALPGLQKGPSVGI-----GITGG---- 236
Query 233 TTTDNWTVPASAVPAVFGLFTTPLNGTIDGRTAYPASTGSSELEQGQKIFVSDGHSAPGP 292
D+ +P V GL + +D + S G S + QK F +DG G
Sbjct 237 ---DSGRLP------VHGL---AIRSYLDDSSDDQFSFGVSYVNASQKWFTADGRLTSGM 284
Query 293 MCWMAGSNITY-ARNVWSPINLSTTIPAAG-----------SGD-------DVDASFTVN 333
G+ + NV P TT+ G GD +S T+N
Sbjct 285 GSVPVGTTGNFPIDNVVYPSYFGTTVAQTGSPSSSSTPPFVKGDFPVYVDLAASSSVTIN 344
Query 334 ELRLAFAYQRFLESMARNGSRYTELLLGLFGVRSPDARLQRPEYLGGNRIPISVSEVTNN 393
LR A Q++ E AR GSRY E + G FGV D R QRP YLGG++ +SV+ V N
Sbjct 345 SLRNAITLQQWFEKSARYGSRYVESVQGHFGVHLGDYRAQRPIYLGGSKSYVSVNPVVQN 404
Query 394 AQT----PEDYLGDLGAKSSTGDVNHDFIKSFTEHGYLFGLFVVRYDHSYSQGIERFWTR 449
+ T P+ G+L A + + D H F KSF EHG++ GL D +Y QG+ER W+R
Sbjct 405 SSTDSVSPQ---GNLSAYALSTDTKHLFTKSFVEHGFVIGLLSATADLTYQQGLERQWSR 461
Query 450 KKFTDFYNPKFAHLGETGVYQAEIMATPENMADPTKV------FGFQEIWADYRYKPSLV 503
D+Y P FAHLGE VY EI + + DP+ FG+QE +A+YRYKPS V
Sbjct 462 FSRYDYYWPTFAHLGEQPVYNKEIYCQSDTVMDPSGSAVNDVPFGYQERYAEYRYKPSKV 521
Query 504 TGEMRPGVTNSLAHWHLADHYAKVPTLSDGWIREDKANVDRVLAVQSSVADQ--FWCDLW 561
TG R T +L WHL+ ++A +PTL++ +I+ + +DR LA V DQ F CD +
Sbjct 522 TGLFRSNATGTLDSWHLSQNFANLPTLNETFIQSNTP-IDRALA----VPDQPDFICDFY 576
Query 562 ISNMCTRPMPMYSIPG 577
+ C RPMP+YS+PG
Sbjct 577 FNYRCIRPMPVYSVPG 592
Lambda K H a alpha
0.319 0.136 0.429 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4286665841916