BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-23_CDS_annotation_glimmer3.pl_2_6
Length=538
                                                          Score   E
Sequences producing significant alignments:               (Bits)  Value
gi|575094431|emb|CDL65804.1| unnamed protein product      425     6e-139
gi|575094492|emb|CDL65859.1| unnamed protein product      415     5e-135
gi|575096056|emb|CDL66947.1| unnamed protein product      416     5e-135
gi|575094572|emb|CDL65928.1| unnamed protein product      409     1e-132
gi|575094544|emb|CDL65904.1| unnamed protein product      405     3e-131
gi|575094496|emb|CDL65862.1| unnamed protein product      392     5e-126
gi|575094415|emb|CDL65790.1| unnamed protein product      381     1e-121
gi|557745632|ref|YP_008798242.1| major capsid protein     354     8e-112
gi|530695351|gb|AGT39907.1| major capsid protein          354     1e-111
gi|313766927|gb|ADR80653.1| putative major coat protein   345     5e-108
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 425 bits (1093), Expect = 6e-139, Method: Compositional matrix adjust.
Identities = 238/547 (44%), Positives = 319/547 (58%), Gaps = 47/547 (9%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTF +D AIIR +TP +PVMD++++D Y+F+ PNR+ W++++ MGE W
Sbjct 45 VLPGDTFELDMTAIIRGSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQ 104
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y VP++ GG +E ++ D+MG+P K I +NALP RAY I
Sbjct 105 PVDYSVPQVTA--PAGGW-----EELSLADHMGIPTKV------DNISVNALPFRAYGLI 151
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
+NEFFR+QN+ NP + D + A + N ++V ++A G CL +F DYF
Sbjct 152 YNEFFRNQNLTNPTQVEVTDANIAGK----NPNDVKNSN--DWAITGAKCLKSAKFFDYF 205
Query 181 SSCLPYPQRGPEVTIALTG----------NAPLRAYSEKDLNNRKIGTGFFNNE--YNTG 228
+ LP PQ+G V I L + PL S D + + N + Y G
Sbjct 206 TGALPQPQKGEPVEINLASSWLPVGIGDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALG 265
Query 229 IVNHTNISFTKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAA 288
+V +EG VN N N G + ++ + A + W + AA
Sbjct 266 MVQ-------QEG---EVNPNGLKNFETKAGGSFSESGAV-AAYPTNLWASPVTA---AA 311
Query 289 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 348
T+NQLRQAF VQ E ARGG+RYRE ++ FGV+ SD +QIPEYLGG + +N++Q+
Sbjct 312 TVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQV 371
Query 349 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 408
VQTS S +P G T A+SVTP ++S FTKSF+EHGF+IGV R SYQQG+ER W
Sbjct 372 VQTSA--STDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMW 429
Query 409 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 468
SR DRLDYYFP AN+GEQ + KEI G + D+E FGYQEAWADYR KPN + G+ RS
Sbjct 430 SRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRS 489
Query 469 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMP 528
NA+ +LD WHY +Y +PTLS +WM++ E+ RTL V+ EP F R KT R MP
Sbjct 490 NAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRVMP 549
Query 529 LYSVPGL 535
LYS+PGL
Sbjct 550 LYSIPGL 556
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 415 bits (1066), Expect = 5e-135, Method: Compositional matrix adjust.
Identities = 235/542 (43%), Positives = 315/542 (58%), Gaps = 47/542 (9%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
+LPGDTFS+DT+ ++RM + PVMD+ Y+D Y+F+ PNR+ W +++ MGE + W P
Sbjct 46 ILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSHWRELMGENTQSAWTP 105
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y VP+I EGG + TI DYMG+P G + +NA+P RAY I
Sbjct 106 QVEYSVPQITA--PEGGW-----NVGTIADYMGIP------TGVSGLSVNAMPFRAYALI 152
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
NE+FRD+N+ +P + GD A AG + V++ GG ++HDYF
Sbjct 153 CNEWFRDENLTDPLNIPVGD---ATVAGVNTGTYVTD------VAKGGLPFKAAKYHDYF 203
Query 181 SSCLPYPQRGPEVTIALTGN--APLRAYSEKD--LNNRKIGTGFFNNEYNTGIVNHTNIS 236
+SCLP PQ+GP+V I+ G+ P+ A + LN G F N + VN+ ++
Sbjct 204 TSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFVGNSSTS--VNY--LA 259
Query 237 FTKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFF-DAWLGTDLSNIEAATINQLRQ 295
F G + V +T I +S N + D TDL ATINQLR
Sbjct 260 F-GGGDGYVVTDTPKPSTP-------IHGISMIPTNLWADLSTATDL---PVATINQLRT 308
Query 296 AFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQE 355
AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG R +N+NQ++Q+S
Sbjct 309 AFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQSS--- 365
Query 356 SNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLD 415
TP G A S+T + S FTKSF EHGF+IG+M R+DHSYQQGL+RFWSR DR D
Sbjct 366 ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSRKDRFD 425
Query 416 YYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLD 475
YY+P FANLGE VK KEI G D+E FGYQEAWADYR KP+ V+G+MRS +LD
Sbjct 426 YYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQYAQSLD 485
Query 476 FWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIRVMNKTTRCMPLYSVP 533
WH AD+Y +P+LS W++E + + R L V + Q F I + TR MPLYS+P
Sbjct 486 IWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMPLYSIP 545
Query 534 GL 535
GL
Sbjct 546 GL 547
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 416 bits (1068), Expect = 5e-135, Method: Compositional matrix adjust.
Identities = 230/550 (42%), Positives = 319/550 (58%), Gaps = 44/550 (8%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTFSVD++ ++RM T P+MD+ Y+D YYF+ PNR++W ++K F GE +++ W+P
Sbjct 46 VLPGDTFSVDSSKVVRMQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIP 105
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y +P++ + GG + TI DY G+P G + ++ALP RAY I
Sbjct 106 QTEYAIPQL--KSPVGGF-----EVGTIADYFGLPT------GVANLSVSALPFRAYALI 152
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
NE+FRD+N+ +P V+ T D+A G V++ GG ++HDYF
Sbjct 153 MNEWFRDENLMDPLVVPT---DDATVTGVNTGIFVTD------VAKGGKPFVAAKYHDYF 203
Query 181 SSCLPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKE 240
+S LP PQ+GP+V I P+ + ++ G + + I N + S +
Sbjct 204 TSALPAPQKGPDVVI------PVASAGNYNVVGNGKGLALSDGSKMSIICNGLSGS-NGQ 256
Query 241 GTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANF---FDAWLGTDLSN----------IEA 287
GT+ + ++ D A LG +L N A
Sbjct 257 GTELFASGILGSQVGSSGGFGSGSSLRGDGIILGVPTAAQLGNNLENSGLIAIASGNAAA 316
Query 288 ATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQ 347
ATINQLR AF +Q +YE ARGGSRY E +R+ FGV+ D +Q EYLGG R +N+NQ
Sbjct 317 ATINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQ 376
Query 348 IVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERF 407
++Q SG S TP G MS T S FTKSF EHGF+IGVMC R+DH+YQQG++R
Sbjct 377 VIQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRM 436
Query 408 WSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMR 467
WSR D+ DYY+P F+N+GEQ +K KEI G +TD+E FGYQEAWA+YR KP+RV+G+MR
Sbjct 437 WSRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMR 496
Query 468 SNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTR 525
S+ +LD WH AD+Y+ +P+LS EW++E + R L V +N QFF I V N TR
Sbjct 497 SSYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTR 556
Query 526 CMPLYSVPGL 535
MP+YS+PGL
Sbjct 557 PMPMYSIPGL 566
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 409 bits (1051), Expect = 1e-132, Method: Compositional matrix adjust.
Identities = 231/550 (42%), Positives = 317/550 (58%), Gaps = 58/550 (11%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTF V T+ +IR+ T P+MD+ Y+D YYF+ PNR++W+++K F GE + W+P
Sbjct 46 VLPGDTFKVKTSKVIRLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIP 105
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y++P++ EGG + T+ DY G+P G I +NALP RAY +
Sbjct 106 EVEYQIPQLTA--PEGGW-----NIGTLADYFGIP------TGVSGISVNALPFRAYALV 152
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
NE+FRDQN+ +P + GD + V+ + GG ++HDYF
Sbjct 153 CNEWFRDQNLSDPLNIPVGD---------ATVTGVNTGTFITDVVKGGLPYTAAKYHDYF 203
Query 181 SSCLPYPQRGPEVTIALTG--NAPLRAYSEKDLNN--RKIGTGFFNNE----YNTGIVNH 232
+SCLP PQ+GP+VTI +T N P+ +E + G G N+E Y G +
Sbjct 204 TSCLPAPQKGPDVTIPVTSGHNLPVMFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSS 263
Query 233 TNISFTKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEA----- 287
S + + V + G GQ NF W T++ +E+
Sbjct 264 GATSTSDTSSTVEVGSDGTGI------GQ----------NF---WTPTNMWAVESGDVGM 304
Query 288 ATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQ 347
ATINQLR AF +Q YE ARGG+RY E +R+ FGV D +Q PEYLGG R +N+NQ
Sbjct 305 ATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQ 364
Query 348 IVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERF 407
I+Q S +S +P+G MSVT S F KSF EHG++IG++ R+DH+YQQGL+R
Sbjct 365 IIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRM 422
Query 408 WSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMR 467
WSR DR D+Y+P AN+GEQ V KEI + G TD+E FGYQEAWA+YR KPNRV G+MR
Sbjct 423 WSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTDDEVFGYQEAWAEYRYKPNRVCGEMR 482
Query 468 SNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIRVMNKTTR 525
S+A +LD WH D+Y+++P LS W++E K + R L V + Q F I + NK TR
Sbjct 483 SSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVDRVLAVTSSVSDQLFADIYICNKATR 542
Query 526 CMPLYSVPGL 535
MP+YS+PGL
Sbjct 543 PMPMYSIPGL 552
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 405 bits (1041), Expect = 3e-131, Method: Compositional matrix adjust.
Identities = 227/549 (41%), Positives = 328/549 (60%), Gaps = 60/549 (11%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTF+V ++ +IRM + P+MD+ Y+D YYF+ PNR++W ++++F GE ++ W+P
Sbjct 45 VLPGDTFNVKSSKVIRMQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLP 104
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTG-RIEINALPVRAYVK 119
T Y+VP++ G ++ TI DY G+P TG +NALP RAY
Sbjct 105 TTEYQVPQVTAP-ANGWSI------GTIADYFGIP--------TGVACSVNALPFRAYAL 149
Query 120 IWNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDY 179
I NE+FRD+N+ +P + D A GS ++ +++ I++ GG ++HDY
Sbjct 150 ICNEWFRDENLSDPLNIPISD---ATVVGSNGDNYITD--IVK----GGMPFKACKYHDY 200
Query 180 FSSCLPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTK 239
F+SCLP PQ+GP+V + L+ + S+ ++ + ++Y V+ N+S T
Sbjct 201 FTSCLPAPQKGPDVLLPLSSSPVPVTTSDTMVDPLQY------SKYPMAGVDSWNLSPTL 254
Query 240 -----------EGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAA 288
EG + V+ Q+ + DA F L +L N AA
Sbjct 255 MRNIIRPFEGVEGANYQVH-------------QFTGDIPTIDA-FRPLNLVANLQNATAA 300
Query 289 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 348
+INQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG R +N+NQ+
Sbjct 301 SINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPININQV 360
Query 349 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 408
+Q S E+ +P G S+T + F KSF EHGFVIG+M R+DH+YQQGLERFW
Sbjct 361 LQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYDHTYQQGLERFW 418
Query 409 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 468
SR DR DYY+P FA++GEQ V KEI +G + D+E FGYQEA+ADYR KP+RV+G+MRS
Sbjct 419 SRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRYKPSRVTGEMRS 478
Query 469 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIRVMNKTTRC 526
A +LD WH AD+YA++P+LS W++E + + R L V + Q F I + N++TR
Sbjct 479 AAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFCDIYIQNRSTRP 538
Query 527 MPLYSVPGL 535
MP+YSVPGL
Sbjct 539 MPMYSVPGL 547
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 392 bits (1007), Expect = 5e-126, Method: Compositional matrix adjust.
Identities = 223/550 (41%), Positives = 312/550 (57%), Gaps = 45/550 (8%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTF V T ++R+ MD+ Y D YYF+ PNR++W++++ FMGE W+P
Sbjct 45 VLPGDTFQVKTNKVVRLQPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIP 104
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y +P+I G + TI DY G+P G + ++ALP RAY I
Sbjct 105 QTEYTIPQITSPASTGFEI------GTIADYFGIP------TGVPNLSVSALPFRAYALI 152
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
+E+FRDQN+ P LN +D + + + K GG ++HDYF
Sbjct 153 VDEWFRDQNLQLP--LNIPLDDTTLQGVNTGDYVTDTVK-------GGKPFVAAKYHDYF 203
Query 181 SSCLPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKE 240
+SCLP PQ+GP+VTIA G+ P+ Y+ NN Y ++ ++SF++
Sbjct 204 TSCLPSPQKGPDVTIAAVGDFPV--YTGDPHNNNGSNKAL---HYGISNISSGSVSFSQG 258
Query 241 GTKF-SVNKNNNGNTAPL---VNGQYIQTMSQDDANFFDAWLGTDLS----NI-----EA 287
SV + + P +N I TM+ + D+ G+ LS N+ A
Sbjct 259 NYIIPSVLTTGSTQSVPAQGKLNASNI-TMTTSPGSP-DSSFGSKLSVYPDNLYASSGTA 316
Query 288 ATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQ 347
TINQLR AF +Q YE AR GSRYRE +R+ F V+ D +Q+PEYLGG R +N+NQ
Sbjct 317 TTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQ 376
Query 348 IVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERF 407
+VQTS +++ +P G S+T + F KSF EHG +IGV R+DH+YQQG+ +
Sbjct 377 VVQTS--QTSDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKL 434
Query 408 WSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMR 467
WSR R DYY+P AN+GEQ V KEI G + DEE FGYQEAWA+YR KP+ V+G+MR
Sbjct 435 WSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEMR 494
Query 468 SNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEP--QFFGAIRVMNKTTR 525
S+A +LD WH+AD+Y ++P LS +W+KE K I R L V + Q+F + N+TTR
Sbjct 495 SSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTR 554
Query 526 CMPLYSVPGL 535
+P YS+PGL
Sbjct 555 ALPFYSIPGL 564
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 381 bits (978), Expect = 1e-121, Method: Compositional matrix adjust.
Identities = 217/565 (38%), Positives = 308/565 (55%), Gaps = 67/565 (12%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDT + +++RM+TP YPVMD+ Y+D +YF+ P R++WD+++ MGE + W P
Sbjct 45 VLPGDTIKIKQRSLVRMSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAP 104
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
Y P + GG TI DYMG+P G I++N++P+RAY +I
Sbjct 105 DVQYTTP--LTSAPSGGW-----QVGTIADYMGIPT------GVSGIKVNSMPMRAYARI 151
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
WNE+FRD+N+ P T D+A GS +E+++ A GG L V +F DYF
Sbjct 152 WNEWFRDENLQQPV---TQHSDDATTTGSNTGTELTD------AESGGLPLKVAKFKDYF 202
Query 181 SSCLPYPQRGPEVTIALTGNAPLRA------------YSEKDLNNRKIGTGFFNNEYNTG 228
+SCLP PQ+G + ++ ++ D+ R+ YNT
Sbjct 203 TSCLPAPQKGEAIGFDFNQTPKVKGIGLVFPLETNTGHTATDILWRQPDAQLVGENYNTS 262
Query 229 IVNHTNISFTKEGTKFSVNKN-----NNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLS 283
N +I+ T+ +VN NNG P+++ ++ +DD N G +
Sbjct 263 YNNFNSIT-----TQTTVNGKKAFFFNNGK-GPMLSARF-----EDDYNG-----GVEQV 306
Query 284 NIEAA--------TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEY 335
+ A +IN LRQA A+QH EA ARGG+RY E ++ FGVS D +Q EY
Sbjct 307 ELTAVAENSTNFLSINDLRQAIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEY 366
Query 336 LGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVR 395
+GG R +N++Q++Q+S ++ +P G A S+T + S EHG+++G+ +R
Sbjct 367 IGGERIPINVSQVIQSSASDTT--SPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIR 424
Query 396 HDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADY 455
DHSYQQGL R W+RSDR YY P ANLGEQ V +EI G + D E FGYQEAWADY
Sbjct 425 VDHSYQQGLSRMWTRSDRFSYYHPMLANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADY 484
Query 456 RMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQF 513
R + N ++G+MRS +LD WHY D Y +P LS +W+KEG+ I RTL V EN QF
Sbjct 485 RYRTNMITGEMRSTYAQSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQF 544
Query 514 FGAIRVMNKTTRCMPLYSVPGLEKL 538
+ R MP+YSVPGL +
Sbjct 545 ICNLYFDQTWVRPMPIYSVPGLSMI 569
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 354 bits (909), Expect = 8e-112, Method: Compositional matrix adjust.
Identities = 217/552 (39%), Positives = 292/552 (53%), Gaps = 92/552 (17%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
LPGDTFS + A R+ TP +P MD+A++D ++F P R++WD+F+ FMGE
Sbjct 57 ALPGDTFSCNLTAFSRLATPIHPTMDNAFMDTHFFAVPVRLVWDDFEEFMGE-------- 108
Query 61 TKTYK------------------VPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVG 102
TKTYK VP I + G A E+++ DY G+P K VG
Sbjct 109 TKTYKAAGSDRLDGTPDFSVAAPVPPTITASGSGEA------EASLSDYFGIPTK---VG 159
Query 103 GTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILE 162
G +E +AL RAY +WN++FRD+N+ P ++T SGN++
Sbjct 160 G---LEFSALWHRAYTLVWNDWFRDENLQAPKTIDTT---------SGNDTTT------- 200
Query 163 YAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFN 222
YA L + HDYF+S LP+PQ+G +VTI L +AP+
Sbjct 201 YA-----LLNRGKKHDYFTSALPWPQKGADVTIPLGTSAPVTT----------------- 238
Query 223 NEYNTGIVNHTNISFTKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDL 282
N +N T + N GNT +N D+ L DL
Sbjct 239 -------ANSSNQDVT-------IFTPNIGNTHRFLNSASTNVYPGDENTDEARRLYADL 284
Query 283 SNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYH 342
S +ATINQLR AFA Q + E ARGGSRY E ++ F V+ D +Q PEYLGGG
Sbjct 285 SEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVTSPDARLQRPEYLGGGSSP 344
Query 343 VNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQ 402
VN++ + QTS ++ TP G A+ T ++ SFTKSF EH VIG++ VR D +YQQ
Sbjct 345 VNISPVAQTSSTDAT--TPQGNLSAIGTTVLSGHSFTKSFTEHTIVIGMVSVRTDLTYQQ 402
Query 403 GLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRV 462
GL R +SR DYY+P + +GEQ VK KEI G + DE TFGYQE +A+YR KP+ V
Sbjct 403 GLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADETTFGYQERYAEYRYKPSSV 462
Query 463 SGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNK 522
+GK RSNA GTL+ WHYA YA++P L W++ + RTL V +EPQF +
Sbjct 463 TGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRTLAVASEPQFIFDSLFKLR 522
Query 523 TTRCMPLYSVPG 534
TR MP+ S+PG
Sbjct 523 CTRPMPVNSIPG 534
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 354 bits (908), Expect = 1e-111, Method: Compositional matrix adjust.
Identities = 224/543 (41%), Positives = 306/543 (56%), Gaps = 64/543 (12%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGD+ ++ A R+ TP +PVMD+ Y+D ++F+ PNR+LW N++RFMGE D P
Sbjct 49 VLPGDSMNLRMTAFTRLATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DS 107
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
+ Y +P + N G AV +++ DYMG+P A V I N+L RAY I
Sbjct 108 SIDYTIPTMTSPNG-GYAV------NSLQDYMGLP-TAGQVDAGSSISHNSLFTRAYNLI 159
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYF 180
WNE+FRD+N+ + V++ GD + Y +Y L + HDYF
Sbjct 160 WNEWFRDENLQDSVVVDKGDGPDTYT---------------DYT-----LLRRGKRHDYF 199
Query 181 SSCLPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKE 240
+S LP+PQ+G VT+ L G+A ++ G + E +TG V T
Sbjct 200 TSALPWPQKGDAVTLPLGGSA--------NVVYNDTGDPAYIREVSTGNVWTTP------ 245
Query 241 GTKFSVNKNNNGN-TAPL--VNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAF 297
++ SV+K NGN + P VN QY D N L DLS AATIN +RQ+F
Sbjct 246 -SRESVSKEANGNMSVPTGSVNAQY-------DPN---GSLVADLSTATAATINAIRQSF 294
Query 298 AVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQ-ES 356
+Q E ARGG+RY E VR+ FGV D +Q PEYLGGG + +N + Q S S
Sbjct 295 QIQRLLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGAS 354
Query 357 NYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDY 416
TP+G GA+ + F SF EHG V+G+ VR D +YQQGL R +SRS R D+
Sbjct 355 GTDTPLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDF 414
Query 417 YFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDF 476
+FP F++LGEQP+ KE+ TG STD++ FGYQEAWA+YR KP++V+G MRS A GTLD
Sbjct 415 FFPVFSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDA 474
Query 477 WHYADNYATVPTLSQEWMKEGKNEIARTLIVENEP---QF-FGAIRVMNKTTRCMPLYSV 532
WH A N+ ++PTL+ ++ E + R + V +E QF F A +N R MP+YSV
Sbjct 475 WHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQFIFDAFFDINM-ARPMPMYSV 532
Query 533 PGL 535
PGL
Sbjct 533 PGL 535
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 345 bits (884), Expect = 5e-108, Method: Compositional matrix adjust.
Identities = 207/538 (38%), Positives = 292/538 (54%), Gaps = 79/538 (15%)
Query 1 VLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMP 60
VLPGDTF ++ R+ TP YPVMD+ Y++ ++FY PNRI+WDN+++F G DD
Sbjct 53 VLPGDTFQMNATGFGRLATPLYPVMDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--ND 110
Query 61 TKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKI 120
+ + VP+I A E ++ DYMG+P + + G I+ N L RAY I
Sbjct 111 STDFLVPQI---------QSATVAEGSLFDYMGLPTQ---IAG---IDFNNLHGRAYNLI 155
Query 121 WNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCL-PVNRFHDY 179
WNE+FRD+N+ + + D + Y GY + + HDY
Sbjct 156 WNEWFRDENLQDSLGVPKDDGPDTYT---------------------GYTIQKRGKRHDY 194
Query 180 FSSCLPYPQRGPEVTIALTGNAPLR--AYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISF 237
F+S LP+PQ+G V++ L +A + A + D+ +G+ F
Sbjct 195 FTSALPWPQKGDAVSLPLGTSADIHTAAAAGTDIGIYSVGSSDF---------------- 238
Query 238 TKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAF 297
+ T V +G T P N + DLSN AATINQLR+AF
Sbjct 239 -RLLTSDPVEVALSGGTPPETNKMF-----------------ADLSNATAATINQLREAF 280
Query 298 AVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESN 357
+Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG + V M + QTS +S
Sbjct 281 QIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEVMMQTVPQTSSTDST 340
Query 358 YGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYY 417
+P G A+ T + F+KSF EHG +IG+ CV D +YQQG+ R WSR DR D+Y
Sbjct 341 --SPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTYQQGMNRMWSRRDRWDFY 397
Query 418 FPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFW 477
+P A+LGEQ V +EI G S D +TFGYQE +A+YR KP++++GKMRSNA GTLD W
Sbjct 398 WPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQITGKMRSNATGTLDAW 457
Query 478 HYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPGL 535
H A ++ +P L+ +++E + R + V +EP+F KTTR MP+YSVPGL
Sbjct 458 HLAQDFTALPALNASFIEENP-PVDRVIAVPSEPEFIWDWYFDLKTTRPMPVYSVPGL 514
Lambda    K         H         a         alpha
0.317     0.135     0.411     0.792     4.96
Gapped
Lambda    K         H         a         alpha     sigma
0.267     0.0410    0.140     1.90      42.6      43.6
Effective search space used: 3874865459850