bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-1_CDS_annotation_glimmer3.pl_2_6
Length=561
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094431|emb|CDL65804.1| unnamed protein product 479 2e-159
gi|575094544|emb|CDL65904.1| unnamed protein product 464 5e-154
gi|575096056|emb|CDL66947.1| unnamed protein product 457 9e-151
gi|575094572|emb|CDL65928.1| unnamed protein product 449 1e-147
gi|575094492|emb|CDL65859.1| unnamed protein product 438 9e-144
gi|575094496|emb|CDL65862.1| unnamed protein product 430 2e-140
gi|575094415|emb|CDL65790.1| unnamed protein product 420 2e-136
gi|557745632|ref|YP_008798242.1| major capsid protein 401 1e-129
gi|530695351|gb|AGT39907.1| major capsid protein 384 1e-122
gi|313766927|gb|ADR80653.1| putative major coat protein 383 1e-122
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 479 bits (1232), Expect = 2e-159, Method: Compositional matrix adjust.
Identities = 264/583 (45%), Positives = 355/583 (61%), Gaps = 52/583 (9%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN+ +F + P + SR+RFNR L TFD+G+++P YVDEVLPGDTF +D +AIIR
Sbjct 1 MNRNSNFNFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVI--NGK 118
+TP +PVMD++F+D Y+F+ PNR+ W+++++ MGE T W +YSVP++ G
Sbjct 61 GSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTAPAGGW 120
Query 119 EKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178
E E S+ D+MGIPTKV + VNALP RAY I+NEFFR++N+ N ++ DA+
Sbjct 121 E------ELSLADHMGIPTKVDNI-SVNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDAN 173
Query 179 IDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGN-APV 237
I ++ + ++ D AI G +CL KF DYFT LP PQ+G V + + + PV
Sbjct 174 IAGKNPNDVKNSND----WAITGAKCLKSAKFFDYFTGALPQPQKGEPVEINLASSWLPV 229
Query 238 GMYKNDSLTEFGTINGNSEIFLNQALNGSAL---APKISNSSKEGARRALVT--GSTNPT 292
G+ G+ L++ N L +P ++K +V G NP
Sbjct 230 GI-------------GDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPN 276
Query 293 T----------QVSDAAYLAA---NLGE---TTATTINDLRKAVAVQQYYEALARGGSRY 336
S++ +AA NL T A T+N LR+A VQ+ E ARGG+RY
Sbjct 277 GLKNFETKAGGSFSESGAVAAYPTNLWASPVTAAATVNQLRQAFQVQKLLEKDARGGTRY 336
Query 337 REQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTD-TPIGETGAMSVTP 395
RE ++ + V SD +QIPEYLGG + +N++Q+VQTS +STD +P G T A+SVTP
Sbjct 337 REILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTS---ASTDASPQGNTAAISVTP 393
Query 396 VNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKK 455
++S FTKSF+EHGFIIGV R SYQQG+ER+WSR DRLDYY P AN+GEQ + K
Sbjct 394 FSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNK 453
Query 456 EIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQG 515
EI G A D+E FGYQEAWADYR KPN + + RSNA +LD WHY +Y +PTLS
Sbjct 454 EIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTD 513
Query 516 WMAERKTEIARTLIVQDEPQFFGAIRVANKTTRRMPLYSVPGL 558
WM + E+ RTL VQ EP F R KT R MPLYS+PGL
Sbjct 514 WMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRVMPLYSIPGL 556
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 464 bits (1195), Expect = 5e-154, Method: Compositional matrix adjust.
Identities = 248/567 (44%), Positives = 348/567 (61%), Gaps = 29/567 (5%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E HF+++P + SR++F+R ++ TTF+ G LIPFY+DEVLPGDTF+V +S +IR
Sbjct 1 MNRNVESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120
M + P+MD+ ++D YYF+ PNR++W +++QF GE E+ W+P EY VP++
Sbjct 61 MQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAPANGW 120
Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
S +I DY GIPT V VNALP RAY I NE+FRDEN+ + I DA++
Sbjct 121 S----IGTIADYFGIPTGV--ACSVNALPFRAYALICNEWFRDENLSDPLNIPISDATVV 174
Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY 240
+G D+ + + GG K+HDYFTSCLP PQ+GP+V LP+ ++PV +
Sbjct 175 GSNG-------DNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPL-SSSPVPVT 226
Query 241 KNDSLTE-----FGTINGNSEIFLNQALNGSALAPKISNSSKEGARRAL--VTGSTNPTT 293
+D++ + + G L+ L + + P EGA + TG PT
Sbjct 227 TSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPF---EGVEGANYQVHQFTGDI-PTI 282
Query 294 QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV 353
L ANL TA +IN LR A +Q+ YE ARGG+RY E +++ + V D +
Sbjct 283 DAFRPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARL 342
Query 354 QIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIG 413
Q PEYLGG R +N+NQ++Q S ++++ +P G S+T + F KSF EHGF+IG
Sbjct 343 QRPEYLGGNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIG 400
Query 414 VCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQE 473
+ R++H+YQQGLER WSR DR DYY P FA++GEQ V KEI +G A D+E FGYQE
Sbjct 401 LMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQE 460
Query 474 AWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD- 532
A+ADYR KP+RV+ +MRS A +LD WH AD+Y S+P+LS W+ E + + R L V
Sbjct 461 AYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSN 520
Query 533 -EPQFFGAIRVANKTTRRMPLYSVPGL 558
Q F I + N++TR MP+YSVPGL
Sbjct 521 VSAQLFCDIYIQNRSTRPMPMYSVPGL 547
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 457 bits (1175), Expect = 9e-151, Method: Compositional matrix adjust.
Identities = 254/586 (43%), Positives = 356/586 (61%), Gaps = 49/586 (8%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E HF+ +P + SR+RF+R +I TTF++G ++PF+++EVLPGDTFSVD+S ++R
Sbjct 2 MNRNTESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVR 61
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120
M T P+MD+ ++D YYF+ PNR++W ++K+F GE E+ W+P+ EY++P++ K
Sbjct 62 MQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQL------K 115
Query 121 SP-EPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178
SP +E +I DY G+PT V + V+ALP RAY I NE+FRDEN+ + + TDDA+
Sbjct 116 SPVGGFEVGTIADYFGLPTGVANL-SVSALPFRAYALIMNEWFRDENLMDPLVVPTDDAT 174
Query 179 IDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQ--GNAP 236
+ G + D+ K GG+ K+HDYFTS LP PQ+GP+V +P+ GN
Sbjct 175 VT---GVNTGIFVTDVAK----GGKPFVAAKYHDYFTSALPAPQKGPDVVIPVASAGNYN 227
Query 237 V-----GMYKNDSLTEFGTING-------NSEIFLNQALNGSALAPKISNSSKEGARRAL 284
V G+ +D NG +E+F + L + S +
Sbjct 228 VVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSSGGFGSGSSLRGDGI 287
Query 285 VTGSTNPTTQVSDAAYLAANLGET----------TATTINDLRKAVAVQQYYEALARGGS 334
+ G V AA L NL + A TIN LR A +Q++YE ARGGS
Sbjct 288 ILG-------VPTAAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQARGGS 340
Query 335 RYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVT 394
RY E +++ + V D +Q EYLGG R +N+NQ++Q SG S++ TP G MS T
Sbjct 341 RYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQGTVVGMSQT 400
Query 395 PVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKK 454
S FTKSF EHGFIIGV C R++H+YQQG++R+WSR D+ DYY P F+N+GEQ +K
Sbjct 401 TDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSNIGEQAIKN 460
Query 455 KEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQ 514
KEI G A+D+E FGYQEAWA+YR KP+RV+ +MRS+ +LD WH AD+Y +P+LS
Sbjct 461 KEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDYSKLPSLSD 520
Query 515 GWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558
W+ E + R L V D+ QFF I V N TR MP+YS+PGL
Sbjct 521 EWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGL 566
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 449 bits (1154), Expect = 1e-147, Method: Compositional matrix adjust.
Identities = 245/575 (43%), Positives = 348/575 (61%), Gaps = 40/575 (7%)
Query 1 VNRNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAII 59
+NRN E HF + P + SR+ F+R ++ TF++G++IPF+++EVLPGDTF V TS +I
Sbjct 1 MNRNVESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVI 60
Query 60 RMTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKE 119
R+ T P+MD+ ++D YYF+ PNR++W+++K+F GE ++ W+P+ EY +P++
Sbjct 61 RLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLT----- 115
Query 120 KSPEPYED--SILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDA 177
+PE + ++ DY GIPT V + VNALP RAY + NE+FRD+N+ + I DA
Sbjct 116 -APEGGWNIGTLADYFGIPTGVSGI-SVNALPFRAYALVCNEWFRDQNLSDPLNIPVGDA 173
Query 178 SIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQG--NA 235
++ + T + + GG K+HDYFTSCLP PQ+GP+VT+P+ N
Sbjct 174 TV-------TGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTIPVTSGHNL 226
Query 236 PVGMYKNDS-----LTEFGTINGNSEI---FLNQALNGSALAPKISNSSKEGARRALVTG 287
PV M+ N++ FG NSE+ + + + A + ++S+ E G
Sbjct 227 PV-MFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVGSDGTGIG 285
Query 288 ST--NPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWD 345
PT A G+ TIN LR A +Q+ YE ARGG+RY E +++ +
Sbjct 286 QNFWTPTNM------WAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFG 339
Query 346 VVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSF 405
VV D +Q PEYLGG R +N+NQI+Q S QS+ +P+G MSVT S F KSF
Sbjct 340 VVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSF 397
Query 406 EEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASD 465
EHG+IIG+ R++H+YQQGL+R+WSR DR D+Y P AN+GEQ V KEI + G +D
Sbjct 398 VEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTD 457
Query 466 EETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIA 525
+E FGYQEAWA+YR KPNRV +MRS+A +LD WH D+Y S+P LS W+ E KT +
Sbjct 458 DEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVD 517
Query 526 RTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL 558
R L V Q F I + NK TR MP+YS+PGL
Sbjct 518 RVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL 552
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 438 bits (1127), Expect = 9e-144, Method: Compositional matrix adjust.
Identities = 238/540 (44%), Positives = 327/540 (61%), Gaps = 34/540 (6%)
Query 30 TTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKYPVMDDAFIDFYYFYCPNRILWDN 89
TTF+ G LIPFYVDE+LPGDTFS+DTS ++RM + PVMD+ ++D Y+F+ PNR+ W +
Sbjct 31 TTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSH 90
Query 90 FKQFMGEVEETPWMPKKEYSVPKIVINGKEKSPEPYED--SILDYMGIPTKVKKVFKVNA 147
+++ MGE ++ W P+ EYSVP+I +PE + +I DYMGIPT V + VNA
Sbjct 91 WRELMGENTQSAWTPQVEYSVPQIT------APEGGWNVGTIADYMGIPTGVSGL-SVNA 143
Query 148 LPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGGESEDKTDDILKKAIGGGRCLPV 207
+P RAY I NE+FRDEN+ + I DA++ G + D+ K GG
Sbjct 144 MPFRAYALICNEWFRDENLTDPLNIPVGDATVA---GVNTGTYVTDVAK----GGLPFKA 196
Query 208 NKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLTEFGTINGNSEIFLNQALNGSA 267
K+HDYFTSCLP PQ+GP+V + G+ V + D+ + +N F+ +
Sbjct 197 AKYHDYFTSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFVGNS----- 251
Query 268 LAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLGE--TTAT-----TINDLRKAV 320
+ ++ + G +VT + P+T + + + NL +TAT TIN LR A
Sbjct 252 -STSVNYLAFGGGDGYVVTDTPKPSTPIHGISMIPTNLWADLSTATDLPVATINQLRTAF 310
Query 321 AVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSS 380
+Q+ YE ARGG+RY E +++ + V D +Q PEYLGG R +N+NQ++Q+S +
Sbjct 311 QIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQSS---ET 367
Query 381 TDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYY 440
TP G A S+T + S FTKSF EHGFIIG+ R++HSYQQGL+R WSR DR DYY
Sbjct 368 GATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSRKDRFDYY 427
Query 441 VPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFW 500
P FANLGE VK KEI G D+E FGYQEAWADYR KP+ V+ +MRS +LD W
Sbjct 428 WPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQYAQSLDIW 487
Query 501 HYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL 558
H AD+Y+++P+LS W+ E + + R L V D Q F I + TR MPLYS+PGL
Sbjct 488 HLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMPLYSIPGL 547
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 430 bits (1106), Expect = 2e-140, Method: Compositional matrix adjust.
Identities = 237/579 (41%), Positives = 339/579 (59%), Gaps = 39/579 (7%)
Query 3 RNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRM 61
RN F++ P + R+ FNR T T+ + G+LIPFY DEVLPGDTF V T+ ++R+
Sbjct 2 RNENSRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRL 61
Query 62 TTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEKS 121
MD+ + D YYF+ PNR++W+++++FMGE ++ W+P+ EY++P+I +
Sbjct 62 QPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQIT----SPA 117
Query 122 PEPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
+E +I DY GIPT V + V+ALP RAY I +E+FRD+N+ I DD ++
Sbjct 118 STGFEIGTIADYFGIPTGVPNL-SVSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQ 176
Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPV--- 237
+ G D + + GG+ K+HDYFTSCLP PQ+GP+VT+ G+ PV
Sbjct 177 GVNTG-------DYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVGDFPVYTG 229
Query 238 ----------GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVT- 286
++ S G+++ + ++ ++ + + K A +T
Sbjct 230 DPHNNNGSNKALHYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQGKLNASNITMTT 289
Query 287 --GSTNPTTQVSDAAY---LAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQ 341
GS + + + Y L A+ G TATTIN LR A +Q+ YE AR GSRYRE ++
Sbjct 290 SPGSPDSSFGSKLSVYPDNLYASSG--TATTINQLRMAFQIQKLYEKDARAGSRYRELIR 347
Query 342 ALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSF 401
+ + V D +Q+PEYLGG R +N+NQ+VQTS Q+S +P G S+T + F
Sbjct 348 SHFSVTPLDARMQVPEYLGGNRIPININQVVQTS--QTSDVSPQGNVAGQSLTSDSHGDF 405
Query 402 TKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTG 461
KSF EHG +IGV R++H+YQQG+ +LWSR R DYY P AN+GEQ V KEI G
Sbjct 406 IKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQG 465
Query 462 EASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERK 521
A DEE FGYQEAWA+YR KP+ V+ +MRS+A +LD WH+AD+Y S+P LS W+ E K
Sbjct 466 TAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDK 525
Query 522 TEIARTLIVQDEP--QFFGAIRVANKTTRRMPLYSVPGL 558
T I R L V Q+F + N+TTR +P YS+PGL
Sbjct 526 TNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGL 564
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 420 bits (1079), Expect = 2e-136, Method: Compositional matrix adjust.
Identities = 237/596 (40%), Positives = 324/596 (54%), Gaps = 68/596 (11%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+NRN E H++QIP R +F RD + LTT + G L+P YVDEVLPGDT + +++R
Sbjct 1 MNRNAEAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVR 60
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120
M+TP YPVMD+ ++D +YF+ P R++WD+++ MGE ++ W P +Y+ P
Sbjct 61 MSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTPLT----SAP 116
Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180
S +I DYMGIPT V + KVN++P+RAY +IWNE+FRDEN+ T +DDA+
Sbjct 117 SGGWQVGTIADYMGIPTGVSGI-KVNSMPMRAYARIWNEWFRDENLQQPVTQHSDDATTT 175
Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRG----------PEV--- 227
+ G L A GG L V KF DYFTSCLP PQ+G P+V
Sbjct 176 GSNTGTE-------LTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFNQTPKVKGI 228
Query 228 --TLPMQGNAP---------------VGMYKNDSLTEFG------TINGNSEIFLNQALN 264
P++ N VG N S F T+NG F N
Sbjct 229 GLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSITTQTTVNGKKAFFFNNG-K 287
Query 265 GSALAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQ 324
G L+ + + G + +T +A N T +INDLR+A+A+Q
Sbjct 288 GPMLSARFEDDYNGGVEQVELTA-------------VAEN--STNFLSINDLRQAIALQH 332
Query 325 YYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTP 384
EA ARGG+RY E ++ + V D +Q EY+GG R +N++Q++Q+S S T +P
Sbjct 333 ILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSA--SDTTSP 390
Query 385 IGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQF 444
G A S+T + S EHG+I+G+ +R +HSYQQGL R+W+R+DR YY P
Sbjct 391 QGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPML 450
Query 445 ANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYAD 504
ANLGEQ V +EI G +D E FGYQEAWADYR + N ++ +MRS +LD WHY D
Sbjct 451 ANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGD 510
Query 505 NYKSVPTLSQGWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558
Y +P LS W+ E + I RTL VQ E QF + R MP+YSVPGL
Sbjct 511 KYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPGL 566
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 401 bits (1031), Expect = 1e-129, Method: Compositional matrix adjust.
Identities = 231/570 (41%), Positives = 320/570 (56%), Gaps = 61/570 (11%)
Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
+ + F+++P R+ F+R + TTF++G+L+P YVDE LPGDTFS + +A R
Sbjct 13 IGSAKQHQFSEVPHADIQRSTFDRSHGLKTTFNAGQLVPIYVDEALPGDTFSCNLTAFSR 72
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGE-----------VEETPWMPKKEYS 109
+ TP +P MD+AF+D ++F P R++WD+F++FMGE ++ TP
Sbjct 73 LATPIHPTMDNAFMDTHFFAVPVRLVWDDFEEFMGETKTYKAAGSDRLDGTPDFSVAAPV 132
Query 110 VPKIVINGKEKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNA 169
P I +G ++ E S+ DY GIPTKV + + +AL RAY +WN++FRDEN+
Sbjct 133 PPTITASGSGEA----EASLSDYFGIPTKVGGL-EFSALWHRAYTLVWNDWFRDENLQAP 187
Query 170 ATIKTDDASIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTL 229
TI T +D A+ L K HDYFTS LP+PQ+G +VT+
Sbjct 188 KTIDTTSG--------------NDTTTYAL-----LNRGKKHDYFTSALPWPQKGADVTI 228
Query 230 PMQGNAPV--GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVTG 287
P+ +APV N +T F GN+ FLN A + + P N+ + ARR
Sbjct 229 PLGTSAPVTTANSSNQDVTIFTPNIGNTHRFLNSA--STNVYPGDENTDE--ARR----- 279
Query 288 STNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVV 347
L A+L E T+ TIN LR A A Q++ E ARGGSRY E ++ ++V
Sbjct 280 -------------LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVT 326
Query 348 ISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEE 407
D +Q PEYLGGG VN++ + QTS ++T P G A+ T ++ SFTKSF E
Sbjct 327 SPDARLQRPEYLGGGSSPVNISPVAQTSSTDATT--PQGNLSAIGTTVLSGHSFTKSFTE 384
Query 408 HGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEE 467
H +IG+ VR + +YQQGL R++SR DYY P + +GEQ VK KEI G A+DE
Sbjct 385 HTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADET 444
Query 468 TFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIART 527
TFGYQE +A+YR KP+ V+ K RSNATGTL+ WHYA Y S+P L W+ T + RT
Sbjct 445 TFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRT 504
Query 528 LIVQDEPQFFGAIRVANKTTRRMPLYSVPG 557
L V EPQF + TR MP+ S+PG
Sbjct 505 LAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 384 bits (985), Expect = 1e-122, Method: Compositional matrix adjust.
Identities = 220/569 (39%), Positives = 324/569 (57%), Gaps = 50/569 (9%)
Query 2 NRNNERH-FNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60
N++ H F+ IP + R++F+ +T+ T FDSG L+P VDEVLPGD+ ++ +A R
Sbjct 5 NKSASAHQFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTR 64
Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVI-NGKE 119
+ TP +PVMD+ ++D ++F+ PNR+LW N+++FMGE + P +Y++P + NG
Sbjct 65 LATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DSSIDYTIPTMTSPNGGY 123
Query 120 KSPEPYEDSILDYMGIPTK----VKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTD 175
+S+ DYMG+PT N+L RAY IWNE+FRDEN+ ++ +
Sbjct 124 AV-----NSLQDYMGLPTAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDSVVVDKG 178
Query 176 DASIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNA 235
D Y D +L++ K HDYFTS LP+PQ+G VTLP+ G+A
Sbjct 179 DGPDTYTD--------YTLLRRG----------KRHDYFTSALPWPQKGDAVTLPLGGSA 220
Query 236 PVGMYKNDSLTEFGTINGNSEIFLNQALNGSA-LAPKISNSSKEG-ARRALVTGSTNPTT 293
V ND+ ++ + G+ P + SKE ++ TGS N
Sbjct 221 NV--VYNDT---------GDPAYIREVSTGNVWTTPSRESVSKEANGNMSVPTGSVN--A 267
Query 294 QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV 353
Q L A+L TA TIN +R++ +Q+ E ARGG+RY E V++ + V+ D +
Sbjct 268 QYDPNGSLVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARM 327
Query 354 QIPEYLGGGRYHVNMNQIVQTSGQQSS-TDTPIGETGAMSVTPVNESSFTKSFEEHGFII 412
Q PEYLGGG + +N + Q S +S TDTP+G GA+ + F SF EHG ++
Sbjct 328 QRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVV 387
Query 413 GVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQ 472
G+C VR + +YQQGL R++SR+ R D++ P F++LGEQP+ KE+ TG ++D++ FGYQ
Sbjct 388 GLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQ 447
Query 473 EAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD 532
EAWA+YR KP++V+ MRS A GTLD WH A N+ S+PTL+ ++ E + R + V
Sbjct 448 EAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGS 506
Query 533 EP---QFFGAIRVANKTTRRMPLYSVPGL 558
E QF R MP+YSVPGL
Sbjct 507 EANGQQFIFDAFFDINMARPMPMYSVPGL 535
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 383 bits (983), Expect = 1e-122, Method: Compositional matrix adjust.
Identities = 220/552 (40%), Positives = 320/552 (58%), Gaps = 52/552 (9%)
Query 7 RHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKY 66
F+++P+ R+ F+R + TTF+SG LIP YVDEVLPGDTF ++ + R+ TP Y
Sbjct 15 HEFSRVPQADIQRSTFSRVHGLKTTFNSGDLIPIYVDEVLPGDTFQMNATGFGRLATPLY 74
Query 67 PVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEKSPEPYE 126
PVMD+ +++ ++FY PNRI+WDN+++F G ++ ++ VP+I +S E
Sbjct 75 PVMDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--NDSTDFLVPQI------QSATVAE 126
Query 127 DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGGE 186
S+ DYMG+PT++ + N L RAY IWNE+FRDEN+ ++ + DD Y
Sbjct 127 GSLFDYMGLPTQIAGI-DFNNLHGRAYNLIWNEWFRDENLQDSLGVPKDDGPDTY----- 180
Query 187 SEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLT 246
T ++K K HDYFTS LP+PQ+G V+LP+ +A + +
Sbjct 181 ----TGYTIQKR---------GKRHDYFTSALPWPQKGDAVSLPLGTSADI-----HTAA 222
Query 247 EFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLG 306
GT G + GS+ +++ E A ++G T P T + A+L
Sbjct 223 AAGTDIGIYSV-------GSSDFRLLTSDPVEVA----LSGGTPPETNK-----MFADLS 266
Query 307 ETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHV 366
TA TIN LR+A +Q+ YE ARGG+RY E +Q+ + V D +Q PEYLGG + V
Sbjct 267 NATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEV 326
Query 367 NMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQG 426
M + QTS S++ P G A+ T + F+KSF EHG +IG+ CV + +YQQG
Sbjct 327 MMQTVPQTSSTDSTS--PQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTYQQG 383
Query 427 LERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVS 486
+ R+WSR DR D+Y P A+LGEQ V +EI G ++D +TFGYQE +A+YR KP++++
Sbjct 384 MNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQIT 443
Query 487 SKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQDEPQFFGAIRVANKT 546
KMRSNATGTLD WH A ++ ++P L+ ++ E + R + V EP+F KT
Sbjct 444 GKMRSNATGTLDAWHLAQDFTALPALNASFI-EENPPVDRVIAVPSEPEFIWDWYFDLKT 502
Query 547 TRRMPLYSVPGL 558
TR MP+YSVPGL
Sbjct 503 TRPMPVYSVPGL 514
Lambda K H a alpha
0.316 0.133 0.397 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4106350928880