bitscore colors: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-19_CDS_annotation_glimmer3.pl_2_5
Length=582
                                                          Score      E
Sequences producing significant alignments:               (Bits)   Value
gi|575094431|emb|CDL65804.1|  unnamed protein product       469    2e-155
gi|575094544|emb|CDL65904.1|  unnamed protein product       458    3e-151
gi|575096056|emb|CDL66947.1|  unnamed protein product       452    1e-148
gi|575094572|emb|CDL65928.1|  unnamed protein product       449    1e-147
gi|575094492|emb|CDL65859.1|  unnamed protein product       440    3e-144
gi|575094415|emb|CDL65790.1|  unnamed protein product       434    1e-141
gi|575094496|emb|CDL65862.1|  unnamed protein product       421    2e-136
gi|557745632|ref|YP_008798242.1|  major capsid protein      394    2e-126
gi|530695351|gb|AGT39907.1|  major capsid protein           389    2e-124
gi|313766927|gb|ADR80653.1|  putative major coat protein    384    1e-122
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 469 bits (1208), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 259/591 (44%), Positives = 350/591 (59%), Gaps = 47/591 (8%)
Query 1 MNRNNERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIR 60
MNRN+ +F + P +SR+RF R + L TFD+G+++P YVDEVLPGDTF +D AIIR
Sbjct 1 MNRNSNFNFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIR 60
Query 61 MTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEG 120
+TP +PVMD++++D Y+F+ PNR+ W++++ MGE W Y VP++ G
Sbjct 61 GSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTA--PAG 118
Query 121 GAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVL 180
G +E ++ D+MG+P K I +NALP RAY I+NEFFR+QN+ NP +
Sbjct 119 GW-----EELSLADHMGIPTKV------DNISVNALPFRAYGLIYNEFFRNQNLTNPTQV 167
Query 181 NTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTIA 240
D + A + N ++V ++A G CL +F DYF+ LP PQ+G V I
Sbjct 168 EVTDANIAGK----NPNDVKNSN--DWAITGAKCLKSAKFFDYFTGALPQPQKGEPVEIN 221
Query 241 LTGN----------APLRAYSEKDLNNRKIGTGFFNNE--YNTGIVNHTNISFTKEGTKF 288
L + PL S D + + N + Y G+V +EG
Sbjct 222 LASSWLPVGIGDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQ-------QEG--- 271
Query 289 SVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYE 348
VN N N G + ++ + A + W + AAT+NQLRQAF VQ E
Sbjct 272 EVNPNGLKNFETKAGGSFSESGAV-AAYPTNLWASPVTA---AATVNQLRQAFQVQKLLE 327
Query 349 ALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGE 408
ARGG+RYRE ++ FGV+ SD +QIPEYLGG + +N++Q+VQTS S +P G
Sbjct 328 KDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTSA--STDASPQGN 385
Query 409 TGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANL 468
T A+SVTP ++S FTKSF+EHGF+IGV R SYQQG+ER WSR DRLDYYFP AN+
Sbjct 386 TAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANI 445
Query 469 GEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYA 528
GEQ + KEI G + D+E FGYQEAWADYR KPN + G+ RSNA+ +LD WHY +Y
Sbjct 446 GEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYD 505
Query 529 TVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPGL 579
+PTLS +WM++ E+ RTL V+ EP F R KT R MPLYS+PGL
Sbjct 506 KLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRVMPLYSIPGL 556
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 458 bits (1179), Expect = 3e-151, Method: Compositional matrix adjust.
Identities = 250/593 (42%), Positives = 361/593 (61%), Gaps = 60/593 (10%)
Query 1 MNRNNERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIR 60
MNRN E HF+++P +SR++F R ++ TTF+ G LIPFY+DEVLPGDTF+V ++ +IR
Sbjct 1 MNRNVESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIR 60
Query 61 MTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEG 120
M + P+MD+ Y+D YYF+ PNR++W ++++F GE ++ W+PT Y+VP++ G
Sbjct 61 MQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAP-ANG 119
Query 121 GAVRAYPDESTILDYMGVPPKAIPVGGTG-RIEINALPVRAYVKIWNEFFRDQNVGNPAV 179
++ TI DY G+P TG +NALP RAY I NE+FRD+N+ +P
Sbjct 120 WSI------GTIADYFGIP--------TGVACSVNALPFRAYALICNEWFRDENLSDPLN 165
Query 180 LNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTI 239
+ D A GS ++ +++ I++ GG ++HDYF+SCLP PQ+GP+V +
Sbjct 166 IPISD---ATVVGSNGDNYITD--IVK----GGMPFKACKYHDYFTSCLPAPQKGPDVLL 216
Query 240 ALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTK-----------EGTKF 288
L+ + S+ ++ + ++Y V+ N+S T EG +
Sbjct 217 PLSSSPVPVTTSDTMVDPLQY------SKYPMAGVDSWNLSPTLMRNIIRPFEGVEGANY 270
Query 289 SVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYE 348
V+ Q+ + DA F L +L N AA+INQLR AF +Q YE
Sbjct 271 QVH-------------QFTGDIPTIDA-FRPLNLVANLQNATAASINQLRLAFQIQRLYE 316
Query 349 ALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGE 408
ARGG+RY E +++ FGV+ D +Q PEYLGG R +N+NQ++Q S E+ +P G
Sbjct 317 RDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPININQVLQQS--ETTSTSPQGN 374
Query 409 TGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANL 468
S+T + F KSF EHGFVIG+M R+DH+YQQGLERFWSR DR DYY+P FA++
Sbjct 375 PVGQSLTTDTNADFVKSFVEHGFVIGLMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHI 434
Query 469 GEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYA 528
GEQ V KEI +G + D+E FGYQEA+ADYR KP+RV+G+MRS A +LD WH AD+YA
Sbjct 435 GEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYA 494
Query 529 TVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIRVMNKTTRCMPLYSVPGL 579
++P+LS W++E + + R L V + Q F I + N++TR MP+YSVPGL
Sbjct 495 SLPSLSDSWIRESASTVDRVLAVSSNVSAQLFCDIYIQNRSTRPMPMYSVPGL 547
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 452 bits (1164), Expect = 1e-148, Method: Compositional matrix adjust.
Identities = 251/594 (42%), Positives = 352/594 (59%), Gaps = 44/594 (7%)
Query 1 MNRNNERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIR 60
MNRN E HF+ +P +SR+RF R +I TTF++G ++PF+++EVLPGDTFSVD++ ++R
Sbjct 2 MNRNTESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVR 61
Query 61 MTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEG 120
M T P+MD+ Y+D YYF+ PNR++W ++K F GE +++ W+P Y +P++ + G
Sbjct 62 MQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQL--KSPVG 119
Query 121 GAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVL 180
G + TI DY G+P G + ++ALP RAY I NE+FRD+N+ +P V+
Sbjct 120 GF-----EVGTIADYFGLPT------GVANLSVSALPFRAYALIMNEWFRDENLMDPLVV 168
Query 181 NTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTIA 240
T D+A G V++ GG ++HDYF+S LP PQ+GP+V I
Sbjct 169 PT---DDATVTGVNTGIFVTD------VAKGGKPFVAAKYHDYFTSALPAPQKGPDVVI- 218
Query 241 LTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTKFSVNKNNNGNTAP 300
P+ + ++ G + + I N + S +GT+ +
Sbjct 219 -----PVASAGNYNVVGNGKGLALSDGSKMSIICNGLSGS-NGQGTELFASGILGSQVGS 272
Query 301 LVNGQYIQTMSQDDANF---FDAWLGTDLSN----------IEAATINQLRQAFAVQHYY 347
++ D A LG +L N AATINQLR AF +Q +Y
Sbjct 273 SGGFGSGSSLRGDGIILGVPTAAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFY 332
Query 348 EALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIG 407
E ARGGSRY E +R+ FGV+ D +Q EYLGG R +N+NQ++Q SG S TP G
Sbjct 333 EKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQG 392
Query 408 ETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFAN 467
MS T S FTKSF EHGF+IGVMC R+DH+YQQG++R WSR D+ DYY+P F+N
Sbjct 393 TVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSN 452
Query 468 LGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNY 527
+GEQ +K KEI G +TD+E FGYQEAWA+YR KP+RV+G+MRS+ +LD WH AD+Y
Sbjct 453 IGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDY 512
Query 528 ATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRCMPLYSVPGL 579
+ +P+LS EW++E + R L V +N QFF I V N TR MP+YS+PGL
Sbjct 513 SKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGL 566
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 449 bits (1156), Expect = 1e-147, Method: Compositional matrix adjust.
Identities = 250/595 (42%), Positives = 348/595 (58%), Gaps = 59/595 (10%)
Query 1 MNRNNERHFNQVP-ETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAII 59
MNRN E HF + P +SR+ F R ++ TF++G++IPF+++EVLPGDTF V T+ +I
Sbjct 1 MNRNVESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVI 60
Query 60 RMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEE 119
R+ T P+MD+ Y+D YYF+ PNR++W+++K F GE + W+P Y++P++ E
Sbjct 61 RLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLTA--PE 118
Query 120 GGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAV 179
GG + T+ DY G+P G I +NALP RAY + NE+FRDQN+ +P
Sbjct 119 GGW-----NIGTLADYFGIPT------GVSGISVNALPFRAYALVCNEWFRDQNLSDPLN 167
Query 180 LNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTI 239
+ GD + V+ + GG ++HDYF+SCLP PQ+GP+VTI
Sbjct 168 IPVGD---------ATVTGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTI 218
Query 240 ALTG--NAPLRAYSEKDLNN--RKIGTGFFNNE----YNTGIVNHTNISFTKEGTKFSVN 291
+T N P+ +E + G G N+E Y G + S + + V
Sbjct 219 PVTSGHNLPVMFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVG 278
Query 292 KNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEA-----ATINQLRQAFAVQHY 346
+ G GQ NF W T++ +E+ ATINQLR AF +Q
Sbjct 279 SDGTGI------GQ----------NF---WTPTNMWAVESGDVGMATINQLRLAFQLQKL 319
Query 347 YEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPI 406
YE ARGG+RY E +R+ FGV D +Q PEYLGG R +N+NQI+Q S +S +P+
Sbjct 320 YEKDARGGTRYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPL 377
Query 407 GETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFA 466
G MSVT S F KSF EHG++IG++ R+DH+YQQGL+R WSR DR D+Y+P A
Sbjct 378 GALAGMSVTTDKNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLA 437
Query 467 NLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADN 526
N+GEQ V KEI + G TD+E FGYQEAWA+YR KPNRV G+MRS+A +LD WH D+
Sbjct 438 NIGEQAVLNKEIYIDGSDTDDEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDD 497
Query 527 YATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIRVMNKTTRCMPLYSVPGL 579
Y+++P LS W++E K + R L V + Q F I + NK TR MP+YS+PGL
Sbjct 498 YSSLPYLSDSWIREDKTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL 552
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 440 bits (1131), Expect = 3e-144, Method: Compositional matrix adjust.
Identities = 247/557 (44%), Positives = 328/557 (59%), Gaps = 47/557 (8%)
Query 30 TTFDSGKLIPFYVDEVLPGDTFSVDTAAIIRMTTPKYPVMDDAYIDFYYFYCPNRILWDN 89
TTF+ G LIPFYVDE+LPGDTFS+DT+ ++RM + PVMD+ Y+D Y+F+ PNR+ W +
Sbjct 31 TTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSH 90
Query 90 FKRFMGEADDAPWMPTKTYKVPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTG 149
++ MGE + W P Y VP+I EGG + TI DYMG+P G
Sbjct 91 WRELMGENTQSAWTPQVEYSVPQITA--PEGGW-----NVGTIADYMGIP------TGVS 137
Query 150 RIEINALPVRAYVKIWNEFFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAH 209
+ +NA+P RAY I NE+FRD+N+ +P + GD A AG + V++
Sbjct 138 GLSVNAMPFRAYALICNEWFRDENLTDPLNIPVGD---ATVAGVNTGTYVTD------VA 188
Query 210 IGGYCLPVNRFHDYFSSCLPYPQRGPEVTIALTGN--APLRAYSEKD--LNNRKIGTGFF 265
GG ++HDYF+SCLP PQ+GP+V I+ G+ P+ A + LN G F
Sbjct 189 KGGLPFKAAKYHDYFTSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFV 248
Query 266 NNEYNTGIVNHTNISFTKEGTKFSVNKNNNGNTAPLVNGQYIQTMSQDDANFF-DAWLGT 324
N + VN+ ++F G + V +T I +S N + D T
Sbjct 249 GNSSTS--VNY--LAF-GGGDGYVVTDTPKPSTP-------IHGISMIPTNLWADLSTAT 296
Query 325 DLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGR 384
DL ATINQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG R
Sbjct 297 DL---PVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSR 353
Query 385 YHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSY 444
+N+NQ++Q+S TP G A S+T + S FTKSF EHGF+IG+M R+DHSY
Sbjct 354 VPININQVIQSS---ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSY 410
Query 445 QQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPN 504
QQGL+RFWSR DR DYY+P FANLGE VK KEI G D+E FGYQEAWADYR KP+
Sbjct 411 QQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPS 470
Query 505 RVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFGAIR 562
V+G+MRS +LD WH AD+Y +P+LS W++E + + R L V + Q F I
Sbjct 471 VVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIY 530
Query 563 VMNKTTRCMPLYSVPGL 579
+ TR MPLYS+PGL
Sbjct 531 IRCLATRPMPLYSIPGL 547
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 434 bits (1116), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 240/609 (39%), Positives = 340/609 (56%), Gaps = 67/609 (11%)
Query 1 MNRNNERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIR 60
MNRN E H++Q+P ++ R +FKRD + LTT + G L+P YVDEVLPGDT + +++R
Sbjct 1 MNRNAEAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVR 60
Query 61 MTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEG 120
M+TP YPVMD+ Y+D +YF+ P R++WD+++ MGE + W P Y P + G
Sbjct 61 MSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTP--LTSAPSG 118
Query 121 GAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVL 180
G TI DYMG+P G I++N++P+RAY +IWNE+FRD+N+ P
Sbjct 119 GW-----QVGTIADYMGIPT------GVSGIKVNSMPMRAYARIWNEWFRDENLQQPV-- 165
Query 181 NTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTIA 240
T D+A GS +E+++ A GG L V +F DYF+SCLP PQ+G +
Sbjct 166 -TQHSDDATTTGSNTGTELTD------AESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFD 218
Query 241 LTGNAPLRA------------YSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTKF 288
++ ++ D+ R+ YNT N +I+ T+
Sbjct 219 FNQTPKVKGIGLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSIT-----TQT 273
Query 289 SVNKN-----NNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAA--------TIN 335
+VN NNG P+++ ++ +DD N G + + A +IN
Sbjct 274 TVNGKKAFFFNNGK-GPMLSARF-----EDDYNG-----GVEQVELTAVAENSTNFLSIN 322
Query 336 QLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQT 395
LRQA A+QH EA ARGG+RY E ++ FGVS D +Q EY+GG R +N++Q++Q+
Sbjct 323 DLRQAIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQS 382
Query 396 SGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRS 455
S ++ +P G A S+T + S EHG+++G+ +R DHSYQQGL R W+RS
Sbjct 383 SASDTT--SPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRS 440
Query 456 DRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAE 515
DR YY P ANLGEQ V +EI G + D E FGYQEAWADYR + N ++G+MRS
Sbjct 441 DRFSYYHPMLANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYA 500
Query 516 GTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRCMPL 573
+LD WHY D Y +P LS +W+KEG+ I RTL V EN QF + R MP+
Sbjct 501 QSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPI 560
Query 574 YSVPGLEKL 582
YSVPGL +
Sbjct 561 YSVPGLSMI 569
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 421 bits (1081), Expect = 2e-136, Method: Compositional matrix adjust.
Identities = 239/593 (40%), Positives = 335/593 (56%), Gaps = 46/593 (8%)
Query 3 RNNERHFNQVPET-HVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIRM 61
RN F++ P T + R+ F R T+ + G+LIPFY DEVLPGDTF V T ++R+
Sbjct 2 RNENSRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRL 61
Query 62 TTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEGG 121
MD+ Y D YYF+ PNR++W++++ FMGE W+P Y +P+I G
Sbjct 62 QPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQITSPASTGF 121
Query 122 AVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVLN 181
+ TI DY G+P G + ++ALP RAY I +E+FRDQN+ P LN
Sbjct 122 EI------GTIADYFGIP------TGVPNLSVSALPFRAYALIVDEWFRDQNLQLP--LN 167
Query 182 TGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEVTIAL 241
+D + + + K GG ++HDYF+SCLP PQ+GP+VTIA
Sbjct 168 IPLDDTTLQGVNTGDYVTDTVK-------GGKPFVAAKYHDYFTSCLPSPQKGPDVTIAA 220
Query 242 TGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTKF-SVNKNNNGNTAP 300
G+ P+ Y+ NN Y ++ ++SF++ SV + + P
Sbjct 221 VGDFPV--YTGDPHNNNGSNKAL---HYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVP 275
Query 301 L---VNGQYIQTMSQDDANFFDAWLGTDLS---------NIEAATINQLRQAFAVQHYYE 348
+N I TM+ + D+ G+ LS + A TINQLR AF +Q YE
Sbjct 276 AQGKLNASNI-TMTTSPGSP-DSSFGSKLSVYPDNLYASSGTATTINQLRMAFQIQKLYE 333
Query 349 ALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGE 408
AR GSRYRE +R+ F V+ D +Q+PEYLGG R +N+NQ+VQTS +++ +P G
Sbjct 334 KDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTS--QTSDVSPQGN 391
Query 409 TGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANL 468
S+T + F KSF EHG +IGV R+DH+YQQG+ + WSR R DYY+P AN+
Sbjct 392 VAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANI 451
Query 469 GEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYA 528
GEQ V KEI G + DEE FGYQEAWA+YR KP+ V+G+MRS+A +LD WH+AD+Y
Sbjct 452 GEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYN 511
Query 529 TVPTLSQEWMKEGKNEIARTLIVENEP--QFFGAIRVMNKTTRCMPLYSVPGL 579
++P LS +W+KE K I R L V + Q+F + N+TTR +P YS+PGL
Sbjct 512 SLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGL 564
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 394 bits (1012), Expect = 2e-126, Method: Compositional matrix adjust.
Identities = 233/591 (39%), Positives = 318/591 (54%), Gaps = 92/591 (16%)
Query 6 ERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIRMTTPK 65
+ F++VP + R+ F R + TTF++G+L+P YVDE LPGDTFS + A R+ TP
Sbjct 18 QHQFSEVPHADIQRSTFDRSHGLKTTFNAGQLVPIYVDEALPGDTFSCNLTAFSRLATPI 77
Query 66 YPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYK---------------- 109
+P MD+A++D ++F P R++WD+F+ FMGE TKTYK
Sbjct 78 HPTMDNAFMDTHFFAVPVRLVWDDFEEFMGE--------TKTYKAAGSDRLDGTPDFSVA 129
Query 110 --VPKIIIDNEEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNE 167
VP I + G A E+++ DY G+P K VGG +E +AL RAY +WN+
Sbjct 130 APVPPTITASGSGEA------EASLSDYFGIPTK---VGG---LEFSALWHRAYTLVWND 177
Query 168 FFRDQNVGNPAVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSC 227
+FRD+N+ P ++T SGN++ YA L + HDYF+S
Sbjct 178 WFRDENLQAPKTIDTT---------SGNDTTT-------YA-----LLNRGKKHDYFTSA 216
Query 228 LPYPQRGPEVTIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTK 287
LP+PQ+G +VTI L +AP+ N +N T
Sbjct 217 LPWPQKGADVTIPLGTSAPVTT------------------------ANSSNQDVT----- 247
Query 288 FSVNKNNNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYY 347
+ N GNT +N D+ L DLS +ATINQLR AFA Q +
Sbjct 248 --IFTPNIGNTHRFLNSASTNVYPGDENTDEARRLYADLSEATSATINQLRLAFATQKFL 305
Query 348 EALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIG 407
E ARGGSRY E ++ F V+ D +Q PEYLGGG VN++ + QTS ++ TP G
Sbjct 306 EIQARGGSRYIEVIKNHFNVTSPDARLQRPEYLGGGSSPVNISPVAQTSSTDAT--TPQG 363
Query 408 ETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFAN 467
A+ T ++ SFTKSF EH VIG++ VR D +YQQGL R +SR DYY+P +
Sbjct 364 NLSAIGTTVLSGHSFTKSFTEHTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLST 423
Query 468 LGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNY 527
+GEQ VK KEI G + DE TFGYQE +A+YR KP+ V+GK RSNA GTL+ WHYA Y
Sbjct 424 IGEQAVKNKEIYAQGSAADETTFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEY 483
Query 528 ATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPG 578
A++P L W++ + RTL V +EPQF + TR MP+ S+PG
Sbjct 484 ASLPLLGDSWIQVTDTNVQRTLAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 389 bits (999), Expect = 2e-124, Method: Compositional matrix adjust.
Identities = 241/590 (41%), Positives = 333/590 (56%), Gaps = 67/590 (11%)
Query 1 MNRN---NERHFNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAA 57
M+RN + F+ +P + R++F + + T FDSG L+P VDEVLPGD+ ++ A
Sbjct 2 MHRNKSASAHQFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTA 61
Query 58 IIRMTTPKYPVMDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDN 117
R+ TP +PVMD+ Y+D ++F+ PNR+LW N++RFMGE D P + Y +P + N
Sbjct 62 FTRLATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DSSIDYTIPTMTSPN 120
Query 118 EEGGAVRAYPDESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNP 177
G AV +++ DYMG+P A V I N+L RAY IWNE+FRD+N+ +
Sbjct 121 G-GYAV------NSLQDYMGLP-TAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDS 172
Query 178 AVLNTGDEDEAYRAGSGNESEVSEEKILEYAHIGGYCLPVNRFHDYFSSCLPYPQRGPEV 237
V++ GD + Y +Y L + HDYF+S LP+PQ+G V
Sbjct 173 VVVDKGDGPDTYT---------------DYT-----LLRRGKRHDYFTSALPWPQKGDAV 212
Query 238 TIALTGNAPLRAYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTKFSVNKNNNGN 297
T+ L G+A ++ G + E +TG V T ++ SV+K NGN
Sbjct 213 TLPLGGSA--------NVVYNDTGDPAYIREVSTGNVWTTP-------SRESVSKEANGN 257
Query 298 -TAPL--VNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYEALARGG 354
+ P VN QY D N L DLS AATIN +RQ+F +Q E ARGG
Sbjct 258 MSVPTGSVNAQY-------DPN---GSLVADLSTATAATINAIRQSFQIQRLLERDARGG 307
Query 355 SRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQ-ESNYGTPIGETGAMS 413
+RY E VR+ FGV D +Q PEYLGGG + +N + Q S S TP+G GA+
Sbjct 308 TRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVG 367
Query 414 VTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPV 473
+ F SF EHG V+G+ VR D +YQQGL R +SRS R D++FP F++LGEQP+
Sbjct 368 TGLASGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPI 427
Query 474 KKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTL 533
KE+ TG STD++ FGYQEAWA+YR KP++V+G MRS A GTLD WH A N+ ++PTL
Sbjct 428 LNKELYATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTL 487
Query 534 SQEWMKEGKNEIARTLIVENEP---QF-FGAIRVMNKTTRCMPLYSVPGL 579
+ ++ E + R + V +E QF F A +N R MP+YSVPGL
Sbjct 488 NSTFI-EDTPPVDRVVAVGSEANGQQFIFDAFFDINM-ARPMPMYSVPGL 535
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 384 bits (985), Expect = 1e-122, Method: Compositional matrix adjust.
Identities = 225/574 (39%), Positives = 317/574 (55%), Gaps = 79/574 (14%)
Query 9 FNQVPETHVSRTRFKRDQNILTTFDSGKLIPFYVDEVLPGDTFSVDTAAIIRMTTPKYPV 68
F++VP+ + R+ F R + TTF+SG LIP YVDEVLPGDTF ++ R+ TP YPV
Sbjct 17 FSRVPQADIQRSTFSRVHGLKTTFNSGDLIPIYVDEVLPGDTFQMNATGFGRLATPLYPV 76
Query 69 MDDAYIDFYYFYCPNRILWDNFKRFMGEADDAPWMPTKTYKVPKIIIDNEEGGAVRAYPD 128
MD+ Y++ ++FY PNRI+WDN+++F G DD + + VP+I A
Sbjct 77 MDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--NDSTDFLVPQI---------QSATVA 125
Query 129 ESTILDYMGVPPKAIPVGGTGRIEINALPVRAYVKIWNEFFRDQNVGNPAVLNTGDEDEA 188
E ++ DYMG+P + + G I+ N L RAY IWNE+FRD+N+ + + D +
Sbjct 126 EGSLFDYMGLPTQ---IAG---IDFNNLHGRAYNLIWNEWFRDENLQDSLGVPKDDGPDT 179
Query 189 YRAGSGNESEVSEEKILEYAHIGGYCL-PVNRFHDYFSSCLPYPQRGPEVTIALTGNAPL 247
Y GY + + HDYF+S LP+PQ+G V++ L +A +
Sbjct 180 YT---------------------GYTIQKRGKRHDYFTSALPWPQKGDAVSLPLGTSADI 218
Query 248 R--AYSEKDLNNRKIGTGFFNNEYNTGIVNHTNISFTKEGTKFSVNKNNNGNTAPLVNGQ 305
A + D+ +G+ F + T V +G T P N
Sbjct 219 HTAAAAGTDIGIYSVGSSDF-----------------RLLTSDPVEVALSGGTPPETNKM 261
Query 306 YIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALF 365
+ DLSN AATINQLR+AF +Q YE ARGG+RY E +++ F
Sbjct 262 F-----------------ADLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHF 304
Query 366 GVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKS 425
GV+ D +Q PEYLGG + V M + QTS +S +P G A+ T + F+KS
Sbjct 305 GVTSPDARLQRPEYLGGQKTEVMMQTVPQTSSTDST--SPQGNLAALG-TATSRGGFSKS 361
Query 426 FEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKST 485
F EHG +IG+ CV D +YQQG+ R WSR DR D+Y+P A+LGEQ V +EI G S
Sbjct 362 FVEHGVLIGLACVFADLTYQQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSA 421
Query 486 DEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEI 545
D +TFGYQE +A+YR KP++++GKMRSNA GTLD WH A ++ +P L+ +++E +
Sbjct 422 DTQTFGYQERFAEYRYKPSQITGKMRSNATGTLDAWHLAQDFTALPALNASFIEENP-PV 480
Query 546 ARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPGL 579
R + V +EP+F KTTR MP+YSVPGL
Sbjct 481 DRVIAVPSEPEFIWDWYFDLKTTRPMPVYSVPGL 514
Lambda    K        H        a        alpha
0.317     0.135    0.410    0.792    4.96
Gapped
Lambda    K        H        a        alpha    sigma
0.267     0.0410   0.140    1.90     42.6     43.6
Effective search space used: 4286665841916