bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-28_CDS_annotation_glimmer3.pl_2_1
Length=508
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094572|emb|CDL65928.1| unnamed protein product 747 0.0
gi|575094544|emb|CDL65904.1| unnamed protein product 717 0.0
gi|575094492|emb|CDL65859.1| unnamed protein product 712 0.0
gi|575096056|emb|CDL66947.1| unnamed protein product 706 0.0
gi|575094496|emb|CDL65862.1| unnamed protein product 674 0.0
gi|575094415|emb|CDL65790.1| unnamed protein product 520 3e-176
gi|575094431|emb|CDL65804.1| unnamed protein product 503 8e-170
gi|530695385|gb|AGT39938.1| major capsid protein 436 4e-144
gi|444297960|dbj|GAC77859.1| major capsid protein 429 1e-141
gi|313766927|gb|ADR80653.1| putative major coat protein 430 1e-141
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/516 (71%), Positives = 416/516 (81%), Gaps = 10/516 (2%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
++EVLPGDTFK+KTSKV+RLQTL+TPMMDN+YLDTY+FFVPNRLVW HWKEFNGENTQSA
Sbjct 43 IEEVLPGDTFKVKTSKVIRLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSA 102
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
W+P EY+IPQ+TAP GGW+IGT+ADY GIPTGV +SVNALPFRAYALV NEWFRD+N
Sbjct 103 WIPEVEYQIPQLTAPE-GGWNIGTLADYFGIPTGVSGISVNALPFRAYALVCNEWFRDQN 161
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPGG 180
L+DPL +P+ DATV GVNTG ++TDV KGGLP+ AAKYHDYFTSCLPAPQKGPDV IP
Sbjct 162 LSDPLNIPVGDATVTGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTIPVT 221
Query 181 TGMSVPVI---PQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGSQTNGVSAGSSE 237
+G ++PV+ D P + + L + + ++ V GS
Sbjct 222 SGHNLPVMFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVGSDG 281
Query 238 DALP----VIDNLWAVGDG-VATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFGVT 292
+ N+WAV G V ATINQLRLAFQ+QKLYEKDARGGTRYTEI+RSHFGV
Sbjct 282 TGIGQNFWTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFGVV 341
Query 293 SPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSDNHGDFTKSFTEH 352
SPDSRLQRPEYLGGNRIPI +NQI+QQS + E S P G G+S+T+D + DF KSF EH
Sbjct 342 SPDSRLQRPEYLGGNRIPINVNQIIQQSQSTEQS-PLGALAGMSVTTDKNSDFIKSFVEH 400
Query 353 GFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDDEV 412
G+I+GL+VARYDHTYQQGLDRM+SRK RFD+YWPV ANIGEQAVLNKEIY G++ DDEV
Sbjct 401 GYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTDDEV 460
Query 413 FGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDRVL 472
FGYQEAWA+YRYKPNRVCGEMRS APQSLDVWHLGDDYS LP LSD WIREDKTNVDRVL
Sbjct 461 FGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVDRVL 520
Query 473 AVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDHH 508
AV SSVS+QLFADIY+ N+ TRPMPMYSIPGLIDHH
Sbjct 521 AVTSSVSDQLFADIYICNKATRPMPMYSIPGLIDHH 556
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/516 (68%), Positives = 406/516 (79%), Gaps = 16/516 (3%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DEVLPGDTF +K+SKV+R+Q+L+TP+MDN+YLDTY+FFVPNRLVWSHW++FNGENT+SA
Sbjct 42 IDEVLPGDTFNVKSSKVIRMQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESA 101
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
WLPTTEY++PQ+TAPA GWSIGTIADY GIPTGV SVNALPFRAYAL+ NEWFRDEN
Sbjct 102 WLPTTEYQVPQVTAPA-NGWSIGTIADYFGIPTGVA-CSVNALPFRAYALICNEWFRDEN 159
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPGG 180
L+DPL +P+ DATV G N Y+TD+ KGG+PF A KYHDYFTSCLPAPQKGPDV++P
Sbjct 160 LSDPLNIPISDATVVGSNGDNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPLS 219
Query 181 TGMSVPVIPQADKV-PSGLITMPYTAT---FLNETPVRSTTGIFFNDSGSQTNGVSAGSS 236
+ VPV V P P L+ T +R+ F G +
Sbjct 220 SS-PVPVTTSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPF---EGVEGANYQVHQF 275
Query 237 EDALPVID-----NLWAVGDGVATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFGV 291
+P ID NL A A+INQLRLAFQIQ+LYE+DARGGTRY EIL+SHFGV
Sbjct 276 TGDIPTIDAFRPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGV 335
Query 292 TSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSDNHGDFTKSFTE 351
TSPD+RLQRPEYLGGNRIPI INQ++QQS T ++PQGNPVG SLT+D + DF KSF E
Sbjct 336 TSPDARLQRPEYLGGNRIPININQVLQQSET-TSTSPQGNPVGQSLTTDTNADFVKSFVE 394
Query 352 HGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDDE 411
HGF++GLMVARYDHTYQQGL+R +SRK RFDYYWPVFA+IGEQAVLNKEIY GT DDE
Sbjct 395 HGFVIGLMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDE 454
Query 412 VFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDRV 471
VFGYQEA+ADYRYKP+RV GEMRS APQSLDVWHL DDY+ LPSLSD WIRE + VDRV
Sbjct 455 VFGYQEAYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRV 514
Query 472 LAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDH 507
LAV S+VS QLF DIY+QNR TRPMPMYS+PGLIDH
Sbjct 515 LAVSSNVSAQLFCDIYIQNRSTRPMPMYSVPGLIDH 550
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/518 (68%), Positives = 405/518 (78%), Gaps = 19/518 (4%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DE+LPGDTF I TSKVVR+Q+L+TP+MDN+YLDTYFFFVPNRL WSHW+E GENTQSA
Sbjct 43 VDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSHWRELMGENTQSA 102
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
W P EY +PQITAP GGW++GTIADY+GIPTGV LSVNA+PFRAYAL+ NEWFRDEN
Sbjct 103 WTPQVEYSVPQITAPE-GGWNVGTIADYMGIPTGVSGLSVNAMPFRAYALICNEWFRDEN 161
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPG- 179
LTDPL +P+ DATVAGVNTG YVTDVAKGGLPF AAKYHDYFTSCLPAPQKGPDV+I
Sbjct 162 LTDPLNIPVGDATVAGVNTGTYVTDVAKGGLPFKAAKYHDYFTSCLPAPQKGPDVLISAV 221
Query 180 GTGMSVPVIPQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGSQTNGVSAGSSEDA 239
G+G+ VPV + S + P N S+T + + G V + + +
Sbjct 222 GSGI-VPVTATDNDNDSLNVNSPGMRFVGN-----SSTSVNYLAFGGGDGYVVTDTPKPS 275
Query 240 LPVI------DNLWA---VGDGVATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFG 290
P+ NLWA + ATINQLR AFQIQKLYE+DARGGTRY EIL+SHFG
Sbjct 276 TPIHGISMIPTNLWADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFG 335
Query 291 VTSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSDNHGDFTKSFT 350
VTSPD+RLQRPEYLGG+R+PI INQ++Q S T G+TPQGN SLT+D+H +FTKSF
Sbjct 336 VTSPDARLQRPEYLGGSRVPININQVIQSSET--GATPQGNAAAYSLTTDSHSEFTKSFV 393
Query 351 EHGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDD 410
EHGFI+GLMVARYDH+YQQGL R +SRK RFDYYWPVFAN+GE AV NKEI+AQGT+ DD
Sbjct 394 EHGFIIGLMVARYDHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDD 453
Query 411 EVFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDR 470
EVFGYQEAWADYRYKP+ V GEMRSQ QSLD+WHL DDY LPSLSD WIRED + V+R
Sbjct 454 EVFGYQEAWADYRYKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNR 513
Query 471 VLAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDHH 508
VLAV SVS QLF DIY++ TRPMP+YSIPGLIDHH
Sbjct 514 VLAVSDSVSAQLFCDIYIRCLATRPMPLYSIPGLIDHH 551
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/530 (67%), Positives = 407/530 (77%), Gaps = 24/530 (5%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
L+EVLPGDTF + +SKVVR+QTL+TPMMDN+YLDTY+FFVPNRLVW HWKEF GEN +SA
Sbjct 43 LEEVLPGDTFSVDSSKVVRMQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESA 102
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
W+P TEY IPQ+ +P GG+ +GTIADY G+PTGV +LSV+ALPFRAYAL+MNEWFRDEN
Sbjct 103 WIPQTEYAIPQLKSPV-GGFEVGTIADYFGLPTGVANLSVSALPFRAYALIMNEWFRDEN 161
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIP-- 178
L DPLVVP DDATV GVNTG +VTDVAKGG PFVAAKYHDYFTS LPAPQKGPDVVIP
Sbjct 162 LMDPLVVPTDDATVTGVNTGIFVTDVAKGGKPFVAAKYHDYFTSALPAPQKGPDVVIPVA 221
Query 179 ---------GGTGMSVPVIPQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGS--- 226
G G+++ + + +GL T L + + + GS
Sbjct 222 SAGNYNVVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSSGGFGSGSS 281
Query 227 -QTNGVSAGSSEDALPVIDNLWAVG------DGVATATINQLRLAFQIQKLYEKDARGGT 279
+ +G+ G A + +NL G A ATINQLR+AFQIQK YEK ARGG+
Sbjct 282 LRGDGIILGVPT-AAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQARGGS 340
Query 280 RYTEILRSHFGVTSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGST-PQGNPVGLSLT 338
RYTE++RS FGVTSPD+RLQR EYLGGNRIPI INQ++QQS T ST PQG VG+S T
Sbjct 341 RYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQGTVVGMSQT 400
Query 339 SDNHGDFTKSFTEHGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLN 398
+D H DFTKSFTEHGFI+G+M ARYDHTYQQG+DRM+SRK +FDYYWPVF+NIGEQA+ N
Sbjct 401 TDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSNIGEQAIKN 460
Query 399 KEIYAQGTNEDDEVFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSD 458
KEIYAQG DDEVFGYQEAWA+YRYKP+RV GEMRS QSLDVWHL DDYSKLPSLSD
Sbjct 461 KEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDYSKLPSLSD 520
Query 459 EWIREDKTNVDRVLAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDHH 508
EWIRED ++RVLAV SNQ FADIYV+N CTRPMPMYSIPGLIDHH
Sbjct 521 EWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGLIDHH 570
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/533 (63%), Positives = 398/533 (75%), Gaps = 33/533 (6%)
Query 2 DEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSAW 61
DEVLPGDTF++KT+KVVRLQ L++ MDNLY DTY+FFVPNRLVW HW+EF GEN Q AW
Sbjct 43 DEVLPGDTFQVKTNKVVRLQPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAW 102
Query 62 LPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDENL 121
+P TEY IPQIT+PA+ G+ IGTIADY GIPTGVP+LSV+ALPFRAYAL+++EWFRD+NL
Sbjct 103 IPQTEYTIPQITSPASTGFEIGTIADYFGIPTGVPNLSVSALPFRAYALIVDEWFRDQNL 162
Query 122 TDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIP--- 178
PL +PLDD T+ GVNTG YVTD KGG PFVAAKYHDYFTSCLP+PQKGPDV I
Sbjct 163 QLPLNIPLDDTTLQGVNTGDYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVG 222
Query 179 ---------------------GGTGMSVPVIP--QADKVPSGLITMPYTATFLNETPVRS 215
G + +S + Q + + ++T T + + + +
Sbjct 223 DFPVYTGDPHNNNGSNKALHYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQGKLNA 282
Query 216 TTGIFFNDSGSQTNGVSAGSSEDALPVIDNLWAVGDGVATATINQLRLAFQIQKLYEKDA 275
+ GS + S GS P DNL+A G AT TINQLR+AFQIQKLYEKDA
Sbjct 283 SNITMTTSPGSPDS--SFGSKLSVYP--DNLYA-SSGTAT-TINQLRMAFQIQKLYEKDA 336
Query 276 RGGTRYTEILRSHFGVTSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGL 335
R G+RY E++RSHF VT D+R+Q PEYLGGNRIPI INQ+VQ S T + S PQGN G
Sbjct 337 RAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTSQTSDVS-PQGNVAGQ 395
Query 336 SLTSDNHGDFTKSFTEHGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQA 395
SLTSD+HGDF KSFTEHG ++G+ VARYDHTYQQG+ +++SRK+RFDYYWPV ANIGEQA
Sbjct 396 SLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQA 455
Query 396 VLNKEIYAQGTNEDDEVFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPS 455
VLNKEIYAQGT +D+EVFGYQEAWA+YRYKP+ V GEMRS A SLD WH DDY+ LP
Sbjct 456 VLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPK 515
Query 456 LSDEWIREDKTNVDRVLAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDHH 508
LS +WI+EDKTN+DRVLAV SSVSNQ FAD Y++N TR +P YSIPGLIDHH
Sbjct 516 LSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGLIDHH 568
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 520 bits (1339), Expect = 3e-176, Method: Compositional matrix adjust.
Identities = 264/529 (50%), Positives = 337/529 (64%), Gaps = 29/529 (5%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DEVLPGDT KIK +VR+ T + P+MDN YLD ++FFVP RLVW HW+ GENT+S
Sbjct 42 VDEVLPGDTIKIKQRSLVRMSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSY 101
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
W P +Y P +AP +GGW +GTIADY+GIPTGV + VN++P RAYA + NEWFRDEN
Sbjct 102 WAPDVQYTTPLTSAP-SGGWQVGTIADYMGIPTGVSGIKVNSMPMRAYARIWNEWFRDEN 160
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIP-- 178
L P+ DDAT G NTG +TD GGLP AK+ DYFTSCLPAPQKG +
Sbjct 161 LQQPVTQHSDDATTTGSNTGTELTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFN 220
Query 179 -----GGTGMSVPVIPQADKVPSG---------LITMPYTATFLN------ETPVRSTTG 218
G G+ P+ + L+ Y ++ N +T V
Sbjct 221 QTPKVKGIGLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSITTQTTVNGKKA 280
Query 219 IFFNDSGSQTNGVSAGSSEDALPVIDN--LWAVGDGVAT-ATINQLRLAFQIQKLYEKDA 275
FFN+ +SA +D ++ L AV + +IN LR A +Q + E DA
Sbjct 281 FFFNNGKGPM--LSARFEDDYNGGVEQVELTAVAENSTNFLSINDLRQAIALQHILEADA 338
Query 276 RGGTRYTEILRSHFGVTSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGL 335
RGGTRY EIL++ FGV+SPD+RLQR EY+GG RIPI ++Q++Q SA+ + ++PQGN
Sbjct 339 RGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSAS-DTTSPQGNAAAY 397
Query 336 SLTSDNHGDFTKSFTEHGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQA 395
SLT+ + S EHG+ILGL R DH+YQQGL RM++R RF YY P+ AN+GEQA
Sbjct 398 SLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPMLANLGEQA 457
Query 396 VLNKEIYAQGTNEDDEVFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPS 455
VLN+EIYAQGT D EVFGYQEAWADYRY+ N + GEMRS QSLD WH GD Y+ LP
Sbjct 458 VLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGDKYTDLPR 517
Query 456 LSDEWIREDKTNVDRVLAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGL 504
LS++WI+E + N+DR LAVQS S+Q ++Y RPMP+YS+PGL
Sbjct 518 LSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPGL 566
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 503 bits (1296), Expect = 8e-170, Method: Compositional matrix adjust.
Identities = 265/528 (50%), Positives = 340/528 (64%), Gaps = 29/528 (5%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DEVLPGDTF++ + ++R T I P+MDN +LD YFFFVPNRL W HW+E GEN +A
Sbjct 42 VDEVLPGDTFELDMTAIIRGSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTA 101
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
W +Y +PQ+TAPA GGW ++AD++GIPT V ++SVNALPFRAY L+ NE+FR++N
Sbjct 102 WTQPVDYSVPQVTAPA-GGWEELSLADHMGIPTKVDNISVNALPFRAYGLIYNEFFRNQN 160
Query 121 LTDPLVVPLDDATVAGVNTGAYVTD---VAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVI 177
LT+P V + DA +AG N G +AK+ DYFT LP PQKG V I
Sbjct 161 LTNPTQVEVTDANIAGKNPNDVKNSNDWAITGAKCLKSAKFFDYFTGALPQPQKGEPVEI 220
Query 178 -------PGGTG-MSVPVIPQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGSQTN 229
P G G P+ DKV + + + T G+ + N
Sbjct 221 NLASSWLPVGIGDYHGPL----DKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPN 276
Query 230 GV------SAGSSEDALPVI---DNLWAVGDGVATATINQLRLAFQIQKLYEKDARGGTR 280
G+ + GS ++ V NLWA A AT+NQLR AFQ+QKL EKDARGGTR
Sbjct 277 GLKNFETKAGGSFSESGAVAAYPTNLWA-SPVTAAATVNQLRQAFQVQKLLEKDARGGTR 335
Query 281 YTEILRSHFGVTSPDSRLQRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSD 340
Y EIL++HFGVT+ D+R+Q PEYLGG ++PI ++Q+VQ SA+ + S PQGN +S+T
Sbjct 336 YREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTSASTDAS-PQGNTAAISVTPF 394
Query 341 NHGDFTKSFTEHGFILGLMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKE 400
+ FTKSF EHGFI+G+ AR +YQQG++RM+SRK R DYY+PV ANIGEQA+LNKE
Sbjct 395 SKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNKE 454
Query 401 IYAQGTNEDDEVFGYQEAWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEW 460
IYAQG +DDE FGYQEAWADYRYKPN +CG RS A QSLD WH G DY KLP+LS +W
Sbjct 455 IYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTDW 514
Query 461 IREDKTNVDRVLAVQSSVSNQLFADIYVQNRCTRPMPMYSIPGLIDHH 508
+ + + R LAVQ+ A+ + R MP+YSIPGLIDH+
Sbjct 515 MEQSDIEMKRTLAVQTEPD--FIANFRFNCKTVRVMPLYSIPGLIDHN 560
>gi|530695385|gb|AGT39938.1| major capsid protein [Marine gokushovirus]
Length=514
Score = 436 bits (1120), Expect = 4e-144, Method: Compositional matrix adjust.
Identities = 248/509 (49%), Positives = 315/509 (62%), Gaps = 48/509 (9%)
Query 2 DEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSAW 61
DE LPGDTF + + RL T I P MDNLY++T+FF VP RL+W++W++F GE
Sbjct 50 DEALPGDTFTMDANGFGRLATPIAPFMDNLYIETFFFAVPYRLIWTNWEKFCGEQDNPG- 108
Query 62 LPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDENL 121
+T+Y +PQ TG S T+ DY G+PT V +L+ N L RAY LV NEWFRD+NL
Sbjct 109 -DSTDYLVPQ----TTGTISNSTLYDYFGVPTDV-NLTFNNLCGRAYNLVYNEWFRDQNL 162
Query 122 TDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPGGT 181
+ + V D G +T + T + +G K HDYFTS LP PQKG V +P GT
Sbjct 163 QNSVTVDKGD----GPDTASNYTLLKRG-------KRHDYFTSALPWPQKGEAVTLPLGT 211
Query 182 GMSVPVIPQADKVPSGLITMP--YTATFLNETPVRSTTGIF-FNDSGSQTNGVSAGSSED 238
+ P++ T P Y + N P + G + F +G G+
Sbjct 212 --TAPIMS------GDFTTTPTNYIPSNGNNIPPQDANGDYSFAGTGVGGYGI------- 256
Query 239 ALPVIDNLWAVGDGVATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFGVTSPDSRL 298
WA ATINQLR AFQIQ+LYEKDARGGTRYTE+++SHFGVTSPD+RL
Sbjct 257 --------WADLSDATAATINQLREAFQIQRLYEKDARGGTRYTEVIQSHFGVTSPDARL 308
Query 299 QRPEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSDNHGDFTKSFTEHGFILGL 358
QRPEYLGG + I IN I Q S+T + +TPQGN G T F KSFTEH +LGL
Sbjct 309 QRPEYLGGGKDRININPIAQTSST-DATTPQGNLSGYGTTGFTGHRFNKSFTEHSVVLGL 367
Query 359 MVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDDEVFGYQEA 418
D TYQQGL R FSR++R+D+YWP A++GEQAVLNKEIYAQGT +D+ VFGYQE
Sbjct 368 ACVFADLTYQQGLPRHFSRQTRWDFYWPALAHLGEQAVLNKEIYAQGTTDDNNVFGYQER 427
Query 419 WADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDRVLAVQSSV 478
+A+YRYKP+ + G+MRS QSLD+WHL D+ LP L+ +I E+ VDRV AVQ+
Sbjct 428 YAEYRYKPSSITGQMRSNFAQSLDIWHLAQDFGSLPVLNSSFIEENPP-VDRVTAVQNYP 486
Query 479 SNQLFADIYVQNRCTRPMPMYSIPGLIDH 507
+ L D+Y + +C RPMP Y +PGLIDH
Sbjct 487 N--LILDMYFKLKCARPMPTYGVPGLIDH 513
>gi|444297960|dbj|GAC77859.1| major capsid protein, partial [uncultured marine virus]
Length=494
Score = 429 bits (1103), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 230/510 (45%), Positives = 311/510 (61%), Gaps = 25/510 (5%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DE LPGDTF + ++ R+ T I P+MDNL +D++FF VP RL+W +W +GE
Sbjct 6 VDEALPGDTFSVSSTFFARMATPIFPIMDNLKMDSFFFAVPVRLLWDNWARMHGEQRNPG 65
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
+T++ +P +T+P G+ G++ DYLG+PTG+PDL ++L RA+ L+ NEWFRDEN
Sbjct 66 --DSTDFVVPTMTSPPINGYDEGSLEDYLGLPTGIPDLEHSSLFHRAHNLIHNEWFRDEN 123
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPGG 180
LTD ++ +DD + ++ Y +K HDYFTS LP PQKG + IP G
Sbjct 124 LTDSVINNVDDGPDSNLDYALYRR-----------SKRHDYFTSALPWPQKGESISIPLG 172
Query 181 TGMSVPVIPQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGSQTNGVSAGSSEDAL 240
T V I + D+ G Y + + S T I G + G + ED
Sbjct 173 TRADVKGIGKEDQT-FGASVNAYESGGTGQVQYLSATRI-----GDGSAGETHSMEEDPN 226
Query 241 -PVIDNLWAVGDGVATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFGVTSPDSRLQ 299
P N++A ATINQLR +FQIQK+ E+DARGGTR TE++ +HFGV SPD+R+Q
Sbjct 227 NPGFPNIYADLTTATAATINQLRQSFQIQKMLERDARGGTRLTEVILAHFGVRSPDARMQ 286
Query 300 RPEYLGGNRIPIRINQIVQQSATQ--EGSTPQGNPVGLSLTSDNHGDFTKSFTEHGFILG 357
RPEYLGG PI + Q+ E +TPQGN + ++ FTKSFTEH ILG
Sbjct 287 RPEYLGGGSAPIALQQVASTVPNDFTENNTPQGNLAAYGIGVSSNNSFTKSFTEHCIILG 346
Query 358 LMVARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDDEVFGYQE 417
+ R D TYQQGL+RMFSR +R+D+Y+P ++IGEQAVLNKEIYAQG D++VFGYQE
Sbjct 347 YVNVRADITYQQGLNRMFSRSTRYDFYYPALSHIGEQAVLNKEIYAQGLPADEDVFGYQE 406
Query 418 AWADYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDRVLAVQSS 477
A+YRYKP+++ G RS A LD WHL D++ LP L +I E+ +DRV+AV +
Sbjct 407 RHAEYRYKPSQISGAFRSSAAAPLDAWHLSQDFATLPVLDQTFIEENPP-IDRVIAVPTE 465
Query 478 VSNQLFADIYVQNRCTRPMPMYSIPGLIDH 507
D Y +C RPMP+Y +PGLIDH
Sbjct 466 P--HFLFDSYTSMKCARPMPVYGVPGLIDH 493
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 430 bits (1106), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 235/507 (46%), Positives = 319/507 (63%), Gaps = 39/507 (8%)
Query 1 LDEVLPGDTFKIKTSKVVRLQTLITPMMDNLYLDTYFFFVPNRLVWSHWKEFNGENTQSA 60
+DEVLPGDTF++ + RL T + P+MDN+Y++T+FF+VPNR++W +W++FNG Q
Sbjct 50 VDEVLPGDTFQMNATGFGRLATPLYPVMDNMYVETFFFYVPNRIIWDNWEKFNG--AQDD 107
Query 61 WLPTTEYEIPQITAPATGGWSIGTIADYLGIPTGVPDLSVNALPFRAYALVMNEWFRDEN 120
+T++ +PQI + + G++ DY+G+PT + + N L RAY L+ NEWFRDEN
Sbjct 108 PNDSTDFLVPQIQSATV---AEGSLFDYMGLPTQIAGIDFNNLHGRAYNLIWNEWFRDEN 164
Query 121 LTDPLVVPLDDATVAGVNTGAYVTDVAKGGLPFVAAKYHDYFTSCLPAPQKGPDVVIPGG 180
L D L VP DD G +T T +G K HDYFTS LP PQKG V +P G
Sbjct 165 LQDSLGVPKDD----GPDTYTGYTIQKRG-------KRHDYFTSALPWPQKGDAVSLPLG 213
Query 181 TGMSVPVIPQADKVPSGLITMPYTATFLNETPVRSTTGIFFNDSGSQTNGVSAGSSEDAL 240
T + A L PV +S G+
Sbjct 214 TSADIHTAAAAGTDIGIYSVGSSDFRLLTSDPVEV--------------ALSGGTP---- 255
Query 241 PVIDNLWAVGDGVATATINQLRLAFQIQKLYEKDARGGTRYTEILRSHFGVTSPDSRLQR 300
P + ++A ATINQLR AFQIQ+LYEKDARGGTRYTEIL+SHFGVTSPD+RLQR
Sbjct 256 PETNKMFADLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQR 315
Query 301 PEYLGGNRIPIRINQIVQQSATQEGSTPQGNPVGLSLTSDNHGDFTKSFTEHGFILGLMV 360
PEYLGG + + + Q V Q+++ + ++PQGN L T+ + G F+KSF EHG ++GL
Sbjct 316 PEYLGGQKTEVMM-QTVPQTSSTDSTSPQGNLAALG-TATSRGGFSKSFVEHGVLIGLAC 373
Query 361 ARYDHTYQQGLDRMFSRKSRFDYYWPVFANIGEQAVLNKEIYAQGTNEDDEVFGYQEAWA 420
D TYQQG++RM+SR+ R+D+YWP A++GEQAVLN+EIY QGT+ D + FGYQE +A
Sbjct 374 VFADLTYQQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFA 433
Query 421 DYRYKPNRVCGEMRSQAPQSLDVWHLGDDYSKLPSLSDEWIREDKTNVDRVLAVQSSVSN 480
+YRYKP+++ G+MRS A +LD WHL D++ LP+L+ +I E+ VDRV+AV S
Sbjct 434 EYRYKPSQITGKMRSNATGTLDAWHLAQDFTALPALNASFIEENPP-VDRVIAVPS--EP 490
Query 481 QLFADIYVQNRCTRPMPMYSIPGLIDH 507
+ D Y + TRPMP+YS+PGLIDH
Sbjct 491 EFIWDWYFDLKTTRPMPVYSVPGLIDH 517
Lambda K H a alpha
0.318 0.136 0.417 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3600440468988