bitscore colors: <40, 40-50, 50-80, 80-200, >200
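
The color key above partitions bit scores into five ranges. A minimal sketch of that binning follows. The names `BIN_EDGES`, `BIN_LABELS` and `score_bin` are hypothetical, and the report does not say which bin a score exactly on a boundary belongs to, so assigning it to the upper bin is an assumption.

```python
import bisect

# Bin boundaries taken from the color key line; labels mirror that line.
BIN_EDGES = [40, 50, 80, 200]
BIN_LABELS = ["<40", "40-50", "50-80", "80-200", ">200"]


def score_bin(bits):
    """Return the color-key range that a bit score falls into.

    Assumption: a score exactly on a boundary (e.g. 40) goes to the
    upper bin; the report itself does not specify this.
    """
    return BIN_LABELS[bisect.bisect_right(BIN_EDGES, bits)]


# Bit scores of the first and last hits listed in this report.
for bits in (480, 382):
    print(bits, score_bin(bits))
```

Every hit listed in this report scores above 200 bits, so all of them fall into the top range.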

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-22_CDS_annotation_glimmer3.pl_2_6
Length=561
                                                             Score  E
Sequences producing significant alignments:                 (Bits)  Value

gi|575094431|emb|CDL65804.1|  unnamed protein product          480  7e-160
gi|575094544|emb|CDL65904.1|  unnamed protein product          465  4e-154
gi|575096056|emb|CDL66947.1|  unnamed protein product          457  4e-151
gi|575094572|emb|CDL65928.1|  unnamed protein product          450  2e-148
gi|575094492|emb|CDL65859.1|  unnamed protein product          437  1e-143
gi|575094496|emb|CDL65862.1|  unnamed protein product          435  4e-142
gi|575094415|emb|CDL65790.1|  unnamed protein product          423  1e-137
gi|557745632|ref|YP_008798242.1|  major capsid protein         402  6e-130
gi|313766927|gb|ADR80653.1|  putative major coat protein       382  4e-122
gi|530695351|gb|AGT39907.1|  major capsid protein              382  5e-122

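
Each row of the summary table ends with a bit score and an E-value, so it can be split from the right regardless of how many words the description contains. The following is a minimal sketch, not part of BLAST itself. The function name `parse_hit_line` and the returned dictionary keys are hypothetical.

```python
def parse_hit_line(line):
    """Split one summary row into identifier, title, bit score and E-value.

    The last two whitespace-separated tokens are taken as the score and
    the E-value; everything before them is the sequence description.
    """
    description, bits, evalue = line.rsplit(None, 2)
    seq_id, _, title = description.partition(" ")
    return {
        "id": seq_id,
        "title": title.strip(),
        "bits": float(bits),
        "evalue": float(evalue),
    }


# Two rows copied from the table above.
rows = [
    "gi|575094431|emb|CDL65804.1| unnamed protein product 480 7e-160",
    "gi|557745632|ref|YP_008798242.1| major capsid protein 402 6e-130",
]
for row in rows:
    print(parse_hit_line(row))
```

Splitting from the right keeps the parser independent of column widths, so it works whether the columns are padded or separated by single spaces.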
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 480 bits (1235), Expect = 7e-160, Method: Compositional matrix adjust.
Identities = 268/578 (46%), Positives = 360/578 (62%), Gaps = 42/578 (7%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN+ +F + P + SR+RFNR L TFD+G+++P YVDEVLPGDTF +D +AIIR
Sbjct 1 MNRNSNFNFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVI--NGK 118
+TP +PVMD++F+D Y+F+ PNR+ W+++++ MGE T W +YSVP++ G
Sbjct 61 GSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTAPAGGW 120
Query 119 EKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178
E E S+ D+MGIPTKV + VNALP RAY I+NEFFR++N+ N ++ DA+
Sbjct 121 E------ELSLADHMGIPTKVDNI-SVNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDAN 173
Query 179 IDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGN-APV 237
I ++ N+ +++ D AI G +CL KF DYFT LP PQ+G V + + + PV
Sbjct 174 IAGKNPNDVKNSND----WAITGAKCLKSAKFFDYFTGALPQPQKGEPVEINLASSWLPV 229
Query 238 GM----------YKNDSLT-EFGTINGNSE--IFLNQALNGSALAPKISNSFKEGARRAL 284
G+ +D+LT E + GN++ L + P +F+ A
Sbjct 230 GIGDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPNGLKNFETKA---- 285
Query 285 VTGSTNPTTQVSDAAYLAANLGE---TTATTINDLRKAVAVQQYYEALARGGSRYREQVQ 341
GS + + V AAY NL T A T+N LR+A VQ+ E ARGG+RYRE ++
Sbjct 286 -GGSFSESGAV--AAY-PTNLWASPVTAAATVNQLRQAFQVQKLLEKDARGGTRYREILK 341
Query 342 ALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTD-TPIGETGAMSVTPVNESS 400
+ V SD +QIPEYLGG + +N++Q+VQTS +STD +P G T A+SVTP ++S
Sbjct 342 NHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTS---ASTDASPQGNTAAISVTPFSKSM 398
Query 401 FTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLT 460
FTKSF+EHGFIIGV R SYQQG+ER+WSR DRLDYY P AN+GEQ + KEI
Sbjct 399 FTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNKEIYAQ 458
Query 461 GEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAER 520
G A D+E FGYQEAWADYR KPN + + RSNA +LD WHY +Y +PTLS WM +
Sbjct 459 GNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTDWMEQS 518
Query 521 KTEIARTLIVQDEPQFFGAIRVANKTTRRMPLYSVPGL 558
E+ RTL VQ EP F R KT R MPLYS+PGL
Sbjct 519 DIEMKRTLAVQTEPDFIANFRFNCKTVRVMPLYSIPGL 556
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 465 bits (1196), Expect = 4e-154, Method: Compositional matrix adjust.
Identities = 247/564 (44%), Positives = 347/564 (62%), Gaps = 23/564 (4%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E HF+++P + SR++F+R ++ TTF+ G LIPFY+DEVLPGDTF+V +S +IR
Sbjct 1 MNRNVESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK 120
M + P+MD+ ++D YYF+ PNR++W +++QF GE E+ W+P EY VP++
Sbjct 61 MQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAPANGW 120
Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
S +I DY GIPT V VNALP RAY I NE+FRDEN+ + I DA++
Sbjct 121 S----IGTIADYFGIPTGV--ACSVNALPFRAYALICNEWFRDENLSDPLNIPISDATVV 174
Query 181 YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY 240
+G D+ + + GG K+HDYFTSCLP PQ+GP+V LP+ ++PV +
Sbjct 175 GSNG-------DNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPL-SSSPVPVT 226
Query 241 KNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSFK--EGARRAL--VTGSTNPTTQVS 296
+D++ + + ++ L I F+ EGA + TG PT
Sbjct 227 TSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPFEGVEGANYQVHQFTGDI-PTIDAF 285
Query 297 DAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIP 356
L ANL TA +IN LR A +Q+ YE ARGG+RY E +++ + V D +Q P
Sbjct 286 RPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRP 345
Query 357 EYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCC 416
EYLGG R +N+NQ++Q S ++++ +P G S+T + F KSF EHGF+IG+
Sbjct 346 EYLGGNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMV 403
Query 417 VRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWA 476
R++H+YQQGLER WSR DR DYY P FA++GEQ V KEI +G A D+E FGYQEA+A
Sbjct 404 ARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYA 463
Query 477 DYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EP 534
DYR KP+RV+ +MRS A +LD WH AD+Y S+P+LS W+ E + + R L V
Sbjct 464 DYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSA 523
Query 535 QFFGAIRVANKTTRRMPLYSVPGL 558
Q F I + N++TR MP+YSVPGL
Sbjct 524 QLFCDIYIQNRSTRPMPMYSVPGL 547
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 457 bits (1177), Expect = 4e-151, Method: Compositional matrix adjust.
Identities = 253/586 (43%), Positives = 355/586 (61%), Gaps = 49/586 (8%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E HF+ +P + SR+RF+R +I TTF++G ++PF+++EVLPGDTFSVD+S ++R
Sbjct 2 MNRNTESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVR 61
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK 120
M T P+MD+ ++D YYF+ PNR++W ++K+F GE E+ W+P+ EY++P++ K
Sbjct 62 MQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQL------K 115
Query 121 SP-EPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178
SP +E +I DY G+PT V + V+ALP RAY I NE+FRDEN+ + + TDDA+
Sbjct 116 SPVGGFEVGTIADYFGLPTGVANL-SVSALPFRAYALIMNEWFRDENLMDPLVVPTDDAT 174
Query 179 IDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQ--GNAP 236
+ + NT + GG+ K+HDYFTS LP PQ+GP+V +P+ GN
Sbjct 175 V-------TGVNTGIFVTDVAKGGKPFVAAKYHDYFTSALPAPQKGPDVVIPVASAGNYN 227
Query 237 V-----GMYKNDSLTEFGTING-------NSEIFLNQALNGSALAPKISNSFKEGARRAL 284
V G+ +D NG +E+F + L + S +
Sbjct 228 VVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSSGGFGSGSSLRGDGI 287
Query 285 VTGSTNPTTQVSDAAYLAANLGET----------TATTINDLRKAVAVQQYYEALARGGS 334
+ G V AA L NL + A TIN LR A +Q++YE ARGGS
Sbjct 288 ILG-------VPTAAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQARGGS 340
Query 335 RYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVT 394
RY E +++ + V D +Q EYLGG R +N+NQ++Q SG S++ TP G MS T
Sbjct 341 RYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQGTVVGMSQT 400
Query 395 PVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKK 454
S FTKSF EHGFIIGV C R++H+YQQG++R+WSR D+ DYY P F+N+GEQ +K
Sbjct 401 TDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSNIGEQAIKN 460
Query 455 KEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQ 514
KEI G A+D+E FGYQEAWA+YR KP+RV+ +MRS+ +LD WH AD+Y +P+LS
Sbjct 461 KEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDYSKLPSLSD 520
Query 515 GWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558
W+ E + R L V D+ QFF I V N TR MP+YS+PGL
Sbjct 521 EWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGL 566
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 450 bits (1158), Expect = 2e-148, Method: Compositional matrix adjust.
Identities = 246/575 (43%), Positives = 348/575 (61%), Gaps = 40/575 (7%)
Query 1 VNRNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAII 59
+NRN E HF + P + SR+ F+R ++ TF++G++IPF+++EVLPGDTF V TS +I
Sbjct 1 MNRNVESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVI 60
Query 60 RMTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKE 119
R+ T P+MD+ ++D YYF+ PNR++W+++K+F GE ++ W+P+ EY +P++
Sbjct 61 RLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLT----- 115
Query 120 KSPEPYED--SILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDA 177
+PE + ++ DY GIPT V + VNALP RAY + NE+FRD+N+ + I DA
Sbjct 116 -APEGGWNIGTLADYFGIPTGVSGI-SVNALPFRAYALVCNEWFRDQNLSDPLNIPVGDA 173
Query 178 SIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQG--NA 235
++ + NT + + GG K+HDYFTSCLP PQ+GP+VT+P+ N
Sbjct 174 TV-------TGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTIPVTSGHNL 226
Query 236 PVGMYKNDS-----LTEFGTINGNSEI---FLNQALNGSALAPKISNSFKEGARRALVTG 287
PV M+ N++ FG NSE+ + + + A + ++S E G
Sbjct 227 PV-MFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVGSDGTGIG 285
Query 288 ST--NPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWD 345
PT A G+ TIN LR A +Q+ YE ARGG+RY E +++ +
Sbjct 286 QNFWTPTNM------WAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFG 339
Query 346 VVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSF 405
VV D +Q PEYLGG R +N+NQI+Q S QS+ +P+G MSVT S F KSF
Sbjct 340 VVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSF 397
Query 406 EEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASD 465
EHG+IIG+ R++H+YQQGL+R+WSR DR D+Y P AN+GEQ V KEI + G +D
Sbjct 398 VEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTD 457
Query 466 EETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIA 525
+E FGYQEAWA+YR KPNRV +MRS+A +LD WH D+Y S+P LS W+ E KT +
Sbjct 458 DEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVD 517
Query 526 RTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL 558
R L V Q F I + NK TR MP+YS+PGL
Sbjct 518 RVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL 552
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 437 bits (1125), Expect = 1e-143, Method: Compositional matrix adjust.
Identities = 241/547 (44%), Positives = 325/547 (59%), Gaps = 48/547 (9%)
Query 30 TTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKYPVMDDAFIDFYYFYCPNRILWDN 89
TTF+ G LIPFYVDE+LPGDTFS+DTS ++RM + PVMD+ ++D Y+F+ PNR+ W +
Sbjct 31 TTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSH 90
Query 90 FKQFMGEVEETPWMPKQEYSVPKIVINGKEKSPEPYED--SILDYMGIPTKVKKVFKVNA 147
+++ MGE ++ W P+ EYSVP+I +PE + +I DYMGIPT V + VNA
Sbjct 91 WRELMGENTQSAWTPQVEYSVPQIT------APEGGWNVGTIADYMGIPTGVSGL-SVNA 143
Query 148 LPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGNESEDNTDDILKKAIGGGRCLPV 207
+P RAY I NE+FRDEN+ + I DA++ + NT + GG
Sbjct 144 MPFRAYALICNEWFRDENLTDPLNIPVGDATV-------AGVNTGTYVTDVAKGGLPFKA 196
Query 208 NKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLTEFGTIN-------GNSEIFLN 260
K+HDYFTSCLP PQ+GP+V + G+ V + D+ + +N GNS +N
Sbjct 197 AKYHDYFTSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFVGNSSTSVN 256
Query 261 QALNGSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLGE--TTAT-----TI 313
G G +VT + P+T + + + NL +TAT TI
Sbjct 257 YLAFG-------------GGDGYVVTDTPKPSTPIHGISMIPTNLWADLSTATDLPVATI 303
Query 314 NDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQ 373
N LR A +Q+ YE ARGG+RY E +++ + V D +Q PEYLGG R +N+NQ++Q
Sbjct 304 NQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQ 363
Query 374 TSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSR 433
+S + TP G A S+T + S FTKSF EHGFIIG+ R++HSYQQGL+R WSR
Sbjct 364 SS---ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSR 420
Query 434 TDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNA 493
DR DYY P FANLGE VK KEI G D+E FGYQEAWADYR KP+ V+ +MRS
Sbjct 421 KDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQY 480
Query 494 TGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EPQFFGAIRVANKTTRRMP 551
+LD WH AD+Y+++P+LS W+ E + + R L V D Q F I + TR MP
Sbjct 481 AQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMP 540
Query 552 LYSVPGL 558
LYS+PGL
Sbjct 541 LYSIPGL 547
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 435 bits (1118), Expect = 4e-142, Method: Compositional matrix adjust.
Identities = 250/590 (42%), Positives = 343/590 (58%), Gaps = 61/590 (10%)
Query 3 RNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRM 61
RN F++ P + R+ FNR T T+ + G+LIPFY DEVLPGDTF V T+ ++R+
Sbjct 2 RNENSRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRL 61
Query 62 TTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEKS 121
MD+ + D YYF+ PNR++W+++++FMGE ++ W+P+ EY++P+I +
Sbjct 62 QPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQIT----SPA 117
Query 122 PEPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
+E +I DY GIPT V + V+ALP RAY I +E+FRD+N+ I DD ++
Sbjct 118 STGFEIGTIADYFGIPTGVPNL-SVSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQ 176
Query 181 YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY 240
NT D + + GG+ K+HDYFTSCLP PQ+GP+VT+ G+ PV Y
Sbjct 177 -------GVNTGDYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVGDFPV--Y 227
Query 241 KNDSLTEFGTINGNSEIFLNQALN-GSALAPKISNSFKEG---ARRALVTGST------- 289
D G+ N+AL+ G + S SF +G L TGST
Sbjct 228 TGDPHNNNGS---------NKALHYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQG 278
Query 290 -----NPTTQVS----DAAY----------LAANLGETTATTINDLRKAVAVQQYYEALA 330
N T S D+++ L A+ G TATTIN LR A +Q+ YE A
Sbjct 279 KLNASNITMTTSPGSPDSSFGSKLSVYPDNLYASSG--TATTINQLRMAFQIQKLYEKDA 336
Query 331 RGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGA 390
R GSRYRE +++ + V D +Q+PEYLGG R +N+NQ+VQTS Q+S +P G
Sbjct 337 RAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTS--QTSDVSPQGNVAG 394
Query 391 MSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQ 450
S+T + F KSF EHG +IGV R++H+YQQG+ +LWSR R DYY P AN+GEQ
Sbjct 395 QSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQ 454
Query 451 PVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVP 510
V KEI G A DEE FGYQEAWA+YR KP+ V+ +MRS+A +LD WH+AD+Y S+P
Sbjct 455 AVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLP 514
Query 511 TLSQGWMAERKTEIARTLIVQDEP--QFFGAIRVANKTTRRMPLYSVPGL 558
LS W+ E KT I R L V Q+F + N+TTR +P YS+PGL
Sbjct 515 KLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGL 564
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 423 bits (1087), Expect = 1e-137, Method: Compositional matrix adjust.
Identities = 238/596 (40%), Positives = 326/596 (55%), Gaps = 68/596 (11%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E H++QIP R +F RD + LTT + G L+P YVDEVLPGDT + +++R
Sbjct 1 MNRNAEAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK 120
M+TP YPVMD+ ++D +YF+ P R++WD+++ MGE ++ W P +Y+ P
Sbjct 61 MSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTPLT----SAP 116
Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
S +I DYMGIPT V + KVN++P+RAY +IWNE+FRDEN+ T +DDA+
Sbjct 117 SGGWQVGTIADYMGIPTGVSGI-KVNSMPMRAYARIWNEWFRDENLQQPVTQHSDDATT- 174
Query 181 YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRG----------PEV--- 227
+ NT L A GG L V KF DYFTSCLP PQ+G P+V
Sbjct 175 ------TGSNTGTELTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFNQTPKVKGI 228
Query 228 --TLPMQGNAP---------------VGMYKNDSLTEFG------TINGNSEIFLNQALN 264
P++ N VG N S F T+NG F N
Sbjct 229 GLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSITTQTTVNGKKAFFFNNG-K 287
Query 265 GSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQ 324
G L+ + + + G + +T +A N T +INDLR+A+A+Q
Sbjct 288 GPMLSARFEDDYNGGVEQVELTA-------------VAEN--STNFLSINDLRQAIALQH 332
Query 325 YYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTP 384
EA ARGG+RY E ++ + V D +Q EY+GG R +N++Q++Q+S S T +P
Sbjct 333 ILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSA--SDTTSP 390
Query 385 IGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQF 444
G A S+T + S EHG+I+G+ +R +HSYQQGL R+W+R+DR YY P
Sbjct 391 QGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPML 450
Query 445 ANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYAD 504
ANLGEQ V +EI G +D E FGYQEAWADYR + N ++ +MRS +LD WHY D
Sbjct 451 ANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGD 510
Query 505 NYKSVPTLSQGWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558
Y +P LS W+ E + I RTL VQ E QF + R MP+YSVPGL
Sbjct 511 KYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPGL 566
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 402 bits (1033), Expect = 6e-130, Method: Compositional matrix adjust.
Identities = 232/570 (41%), Positives = 323/570 (57%), Gaps = 61/570 (11%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+ + F+++P R+ F+R + TTF++G+L+P YVDE LPGDTFS + +A R
Sbjct 13 IGSAKQHQFSEVPHADIQRSTFDRSHGLKTTFNAGQLVPIYVDEALPGDTFSCNLTAFSR 72
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGE-----------VEETPWMPKQEYS 109
+ TP +P MD+AF+D ++F P R++WD+F++FMGE ++ TP
Sbjct 73 LATPIHPTMDNAFMDTHFFAVPVRLVWDDFEEFMGETKTYKAAGSDRLDGTPDFSVAAPV 132
Query 110 VPKIVINGKEKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNA 169
P I +G ++ E S+ DY GIPTKV + + +AL RAY +WN++FRDEN+
Sbjct 133 PPTITASGSGEA----EASLSDYFGIPTKVGGL-EFSALWHRAYTLVWNDWFRDENLQAP 187
Query 170 ATIKTDDASIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTL 229
TI D GN++ T +L + K HDYFTS LP+PQ+G +VT+
Sbjct 188 KTI-------DTTSGNDT--TTYALLNRG----------KKHDYFTSALPWPQKGADVTI 228
Query 230 PMQGNAPV--GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSFKEGARRALVTG 287
P+ +APV N +T F GN+ FLN A + + P N+ + ARR
Sbjct 229 PLGTSAPVTTANSSNQDVTIFTPNIGNTHRFLNSA--STNVYPGDENT--DEARR----- 279
Query 288 STNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVV 347
L A+L E T+ TIN LR A A Q++ E ARGGSRY E ++ ++V
Sbjct 280 -------------LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVT 326
Query 348 ISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEE 407
D +Q PEYLGGG VN++ + QTS ++T P G A+ T ++ SFTKSF E
Sbjct 327 SPDARLQRPEYLGGGSSPVNISPVAQTSSTDATT--PQGNLSAIGTTVLSGHSFTKSFTE 384
Query 408 HGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEE 467
H +IG+ VR + +YQQGL R++SR DYY P + +GEQ VK KEI G A+DE
Sbjct 385 HTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADET 444
Query 468 TFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIART 527
TFGYQE +A+YR KP+ V+ K RSNATGTL+ WHYA Y S+P L W+ T + RT
Sbjct 445 TFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRT 504
Query 528 LIVQDEPQFFGAIRVANKTTRRMPLYSVPG 557
L V EPQF + TR MP+ S+PG
Sbjct 505 LAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 382 bits (980), Expect = 4e-122, Method: Compositional matrix adjust.
Identities = 219/552 (40%), Positives = 316/552 (57%), Gaps = 52/552 (9%)
Query 7 RHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKY 66
F+++P+ R+ F+R + TTF+SG LIP YVDEVLPGDTF ++ + R+ TP Y
Sbjct 15 HEFSRVPQADIQRSTFSRVHGLKTTFNSGDLIPIYVDEVLPGDTFQMNATGFGRLATPLY 74
Query 67 PVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEKSPEPYE 126
PVMD+ +++ ++FY PNRI+WDN+++F G ++ ++ VP+I +S E
Sbjct 75 PVMDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--NDSTDFLVPQI------QSATVAE 126
Query 127 DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGNE 186
S+ DYMG+PT++ + N L RAY IWNE+FRDEN+ ++ + DD Y
Sbjct 127 GSLFDYMGLPTQIAGI-DFNNLHGRAYNLIWNEWFRDENLQDSLGVPKDDGPDTY----- 180
Query 187 SEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLT 246
T ++K K HDYFTS LP+PQ+G V+LP+ +A + T
Sbjct 181 ----TGYTIQKR---------GKRHDYFTSALPWPQKGDAVSLPLGTSADIHTAAAAG-T 226
Query 247 EFGTINGNSEIFLNQALNGSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLG 306
+ G + S F + L + +S G T P T + A+L
Sbjct 227 DIGIYSVGSSDF--RLLTSDPVEVALS-------------GGTPPETN-----KMFADLS 266
Query 307 ETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHV 366
TA TIN LR+A +Q+ YE ARGG+RY E +Q+ + V D +Q PEYLGG + V
Sbjct 267 NATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEV 326
Query 367 NMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQG 426
M + QTS S+ +P G A+ T + F+KSF EHG +IG+ CV + +YQQG
Sbjct 327 MMQTVPQTSSTDST--SPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTYQQG 383
Query 427 LERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVS 486
+ R+WSR DR D+Y P A+LGEQ V +EI G ++D +TFGYQE +A+YR KP++++
Sbjct 384 MNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQIT 443
Query 487 SKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQDEPQFFGAIRVANKT 546
KMRSNATGTLD WH A ++ ++P L+ ++ E + R + V EP+F KT
Sbjct 444 GKMRSNATGTLDAWHLAQDFTALPALNASFI-EENPPVDRVIAVPSEPEFIWDWYFDLKT 502
Query 547 TRRMPLYSVPGL 558
TR MP+YSVPGL
Sbjct 503 TRPMPVYSVPGL 514
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 382 bits (980), Expect = 5e-122, Method: Compositional matrix adjust.
Identities = 219/569 (38%), Positives = 323/569 (57%), Gaps = 50/569 (9%)
Query 2 NRNNERH-FNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
N++ H F+ IP + R++F+ +T+ T FDSG L+P VDEVLPGD+ ++ +A R
Sbjct 5 NKSASAHQFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTR 64
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVI-NGKE 119
+ TP +PVMD+ ++D ++F+ PNR+LW N+++FMGE + P +Y++P + NG
Sbjct 65 LATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DSSIDYTIPTMTSPNGGY 123
Query 120 KSPEPYEDSILDYMGIPTK----VKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTD 175
+S+ DYMG+PT N+L RAY IWNE+FRDEN+ ++ +
Sbjct 124 AV-----NSLQDYMGLPTAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDSVVVDKG 178
Query 176 DASIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNA 235
D Y D +L++ K HDYFTS LP+PQ+G VTLP+ G+A
Sbjct 179 DGPDTYTDYT--------LLRRG----------KRHDYFTSALPWPQKGDAVTLPLGGSA 220
Query 236 PVGMYKNDSLTEFGTINGNSEIFLNQALNGSA-LAPKISNSFKEG-ARRALVTGSTNPTT 293
V ND+ ++ + G+ P + KE ++ TGS N
Sbjct 221 NV--VYNDT---------GDPAYIREVSTGNVWTTPSRESVSKEANGNMSVPTGSVN--A 267
Query 294 QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV 353
Q L A+L TA TIN +R++ +Q+ E ARGG+RY E V++ + V+ D +
Sbjct 268 QYDPNGSLVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARM 327
Query 354 QIPEYLGGGRYHVNMNQIVQTSGQQSS-TDTPIGETGAMSVTPVNESSFTKSFEEHGFII 412
Q PEYLGGG + +N + Q S +S TDTP+G GA+ + F SF EHG ++
Sbjct 328 QRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVV 387
Query 413 GVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQ 472
G+C VR + +YQQGL R++SR+ R D++ P F++LGEQP+ KE+ TG ++D++ FGYQ
Sbjct 388 GLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQ 447
Query 473 EAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD 532
EAWA+YR KP++V+ MRS A GTLD WH A N+ S+PTL+ ++ E + R + V
Sbjct 448 EAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGS 506
Query 533 EP---QFFGAIRVANKTTRRMPLYSVPGL 558
E QF R MP+YSVPGL
Sbjct 507 EANGQQFIFDAFFDINMARPMPMYSVPGL 535
Lambda      K        H        a     alpha
 0.316    0.133    0.398    0.792    4.96

Gapped
Lambda      K        H        a     alpha    sigma
 0.267   0.0410    0.140     1.90    42.6     43.6
Effective search space used: 4106350928880
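
The gapped Lambda and K values above relate each raw score (the number in parentheses on a "Score =" line) to its bit score through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The sketch below applies that formula. The names `bit_score`, `GAPPED_LAMBDA` and `GAPPED_K` are hypothetical. Because the report prints Lambda and K rounded to three significant figures, the results should agree with the reported bit scores only to within about one bit. The E-values are not recomputed here, since the report does not state how they were derived under compositional matrix adjustment.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from three "Score =" lines in this report.
for raw in (1235, 1196, 980):
    print(raw, round(bit_score(raw), 1))
```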