bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters



Query= Contig-43_CDS_annotation_glimmer3.pl_2_1

Length=306
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|575094431|emb|CDL65804.1|  unnamed protein product                   293   4e-91
gi|575096056|emb|CDL66947.1|  unnamed protein product                   286   4e-88
gi|575094572|emb|CDL65928.1|  unnamed protein product                   272   4e-83
gi|575094544|emb|CDL65904.1|  unnamed protein product                   270   2e-82
gi|575094492|emb|CDL65859.1|  unnamed protein product                   265   2e-80
gi|575094496|emb|CDL65862.1|  unnamed protein product                   261   6e-79
gi|557745632|ref|YP_008798242.1|  major capsid protein                  240   5e-71
gi|444298010|dbj|GAC77834.1|  major capsid protein                      235   8e-70
gi|530695351|gb|AGT39907.1|  major capsid protein                       237   9e-70
gi|313766927|gb|ADR80653.1|  putative major coat protein                232   5e-68


>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560

 Score =   293 bits (750),  Expect = 4e-91, Method: Compositional matrix adjust.
 Identities = 138/249 (55%), Positives = 176/249 (71%), Gaps = 2/249 (1%)

Query  55   SATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNIN  114
            +AT+NQLRQA  VQ+  E  ARGG+RYRE ++  + V  SD  +Q+PEYLGG +  +N++
Sbjct  310  AATVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVS  369

Query  115  QIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLER  174
            Q+VQTS   +++ +P G T A+SVTP ++S FTKSF+EHGF+IGV   R  +SYQQG+ER
Sbjct  370  QVVQTSA--STDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIER  427

Query  175  FWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLM  234
             WSRKDRLDYY P  AN+GEQ +  KEI   G+  DDE FGYQEAWADYR KPN + G  
Sbjct  428  MWSRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRF  487

Query  235  RSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAIRIANKTTRR  294
            RSNA  +L+ WHY  +Y K+PTLS +WM +   E+ RTL V+ EP F    R   KT R 
Sbjct  488  RSNAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRV  547

Query  295  MPLYSVPGL  303
            MPLYS+PGL
Sbjct  548  MPLYSIPGL  556


>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570

 Score =   286 bits (731),  Expect = 4e-88, Method: Compositional matrix adjust.
 Identities = 134/249 (54%), Positives = 174/249 (70%), Gaps = 2/249 (1%)

Query  57   TINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNINQI  116
            TINQLR A  +Q++YE  ARGGSRY E IR+ + V   D  +Q  EYLGG R  +NINQ+
Sbjct  318  TINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQV  377

Query  117  VQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLERFW  176
            +Q SG  +++ TP G    MS T    S FTKSF EHGF+IGV C R++ +YQQG++R W
Sbjct  378  IQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMW  437

Query  177  SRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLMRS  236
            SRKD+ DYY P F+N+GEQ +K KEI   G+ TDDE FGYQEAWA+YR KP+RV+G MRS
Sbjct  438  SRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRS  497

Query  237  NATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDE--PQFFGAIRIANKTTRR  294
            +   +L+ WH AD+Y+K+P+LS EW+ E  + + R L V D+   QFF  I + N  TR 
Sbjct  498  SYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRP  557

Query  295  MPLYSVPGL  303
            MP+YS+PGL
Sbjct  558  MPMYSIPGL  566


>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556

 Score =   272 bits (696),  Expect = 4e-83, Method: Compositional matrix adjust.
 Identities = 138/280 (49%), Positives = 177/280 (63%), Gaps = 9/280 (3%)

Query  31   VLLGKDAGGV--STWVPME---ARLDNATSATINQLRQAISVQQYYEALARGGSRYREQI  85
            V +G D  G+  + W P         +   ATINQLR A  +Q+ YE  ARGG+RY E I
Sbjct  275  VEVGSDGTGIGQNFWTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEII  334

Query  86   RAIWDVIISDKTVQVPEYLGGGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESS  145
            R+ + V+  D  +Q PEYLGG R  +N+NQI+Q S  Q++  +P+G    MSVT    S 
Sbjct  335  RSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSD  392

Query  146  FTKSFEEHGFVIGVCCVRHNRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLT  205
            F KSF EHG++IG+   R++ +YQQGL+R WSRKDR D+Y P  AN+GEQ V  KEI + 
Sbjct  393  FIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYID  452

Query  206  GDTTDDETFGYQEAWADYRMKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEG  265
            G  TDDE FGYQEAWA+YR KPNRV G MRS+A  +L+ WH  D+Y+ +P LS  W+ E 
Sbjct  453  GSDTDDEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIRED  512

Query  266  KEEIARTLIVED--EPQFFGAIRIANKTTRRMPLYSVPGL  303
            K  + R L V      Q F  I I NK TR MP+YS+PGL
Sbjct  513  KTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL  552


>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551

 Score =   270 bits (690),  Expect = 2e-82, Method: Compositional matrix adjust.
 Identities = 135/260 (52%), Positives = 175/260 (67%), Gaps = 4/260 (2%)

Query  46   MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG  105
            + A L NAT+A+INQLR A  +Q+ YE  ARGG+RY E +++ + V   D  +Q PEYLG
Sbjct  290  LVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLG  349

Query  106  GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN  165
            G R  +NINQ++Q S  +T++ +P G     S+T    + F KSF EHGFVIG+   R++
Sbjct  350  GNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYD  407

Query  166  RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM  225
             +YQQGLERFWSRKDR DYY P FA++GEQ V  KEI  +G   DDE FGYQEA+ADYR 
Sbjct  408  HTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRY  467

Query  226  KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFFG  283
            KP+RV+G MRS A  +L+ WH AD+YA +P+LS  W+ E    + R L V      Q F 
Sbjct  468  KPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFC  527

Query  284  AIRIANKTTRRMPLYSVPGL  303
             I I N++TR MP+YSVPGL
Sbjct  528  DIYIQNRSTRPMPMYSVPGL  547


>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551

 Score =   265 bits (678),  Expect = 2e-80, Method: Compositional matrix adjust.
 Identities = 137/261 (52%), Positives = 167/261 (64%), Gaps = 8/261 (3%)

Query  48   ARLDNATS---ATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYL  104
            A L  AT    ATINQLR A  +Q+ YE  ARGG+RY E +++ + V   D  +Q PEYL
Sbjct  290  ADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYL  349

Query  105  GGGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRH  164
            GG R  +NINQ++Q+S    +  TP G   A S+T  + S FTKSF EHGF+IG+   R+
Sbjct  350  GGSRVPININQVIQSS---ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARY  406

Query  165  NRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYR  224
            + SYQQGL+RFWSRKDR DYY P FANLGE  VK KEI   G   DDE FGYQEAWADYR
Sbjct  407  DHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYR  466

Query  225  MKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFF  282
             KP+ V+G MRS    +L+ WH AD+Y  +P+LS  W+ E    + R L V D    Q F
Sbjct  467  YKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLF  526

Query  283  GAIRIANKTTRRMPLYSVPGL  303
              I I    TR MPLYS+PGL
Sbjct  527  CDIYIRCLATRPMPLYSIPGL  547


>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568

 Score =   261 bits (668),  Expect = 6e-79, Method: Compositional matrix adjust.
 Identities = 130/254 (51%), Positives = 166/254 (65%), Gaps = 4/254 (2%)

Query  52   NATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHV  111
            + T+ TINQLR A  +Q+ YE  AR GSRYRE IR+ + V   D  +QVPEYLGG R  +
Sbjct  313  SGTATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPI  372

Query  112  NINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQG  171
            NINQ+VQTS  QTS+ +P G     S+T  +   F KSF EHG +IGV   R++ +YQQG
Sbjct  373  NINQVVQTS--QTSDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQG  430

Query  172  LERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVS  231
            + + WSRK R DYY P  AN+GEQ V  KEI   G   D+E FGYQEAWA+YR KP+ V+
Sbjct  431  VSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVT  490

Query  232  GLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFFGAIRIAN  289
            G MRS+A  +L+ WH+AD+Y  +P LS +W+ E K  I R L V      Q+F    I N
Sbjct  491  GEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIEN  550

Query  290  KTTRRMPLYSVPGL  303
            +TTR +P YS+PGL
Sbjct  551  ETTRALPFYSIPGL  564


>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
 gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538

 Score =   240 bits (612),  Expect = 5e-71, Method: Compositional matrix adjust.
 Identities = 127/257 (49%), Positives = 158/257 (61%), Gaps = 2/257 (1%)

Query  46   MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG  105
            + A L  ATSATINQLR A + Q++ E  ARGGSRY E I+  ++V   D  +Q PEYLG
Sbjct  280  LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVTSPDARLQRPEYLG  339

Query  106  GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN  165
            GG   VNI+ + QTS   T   TP G   A+  T ++  SFTKSF EH  VIG+  VR +
Sbjct  340  GGSSPVNISPVAQTS--STDATTPQGNLSAIGTTVLSGHSFTKSFTEHTIVIGMVSVRTD  397

Query  166  RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM  225
             +YQQGL R +SR+   DYY P  + +GEQ VK KEI   G   D+ TFGYQE +A+YR 
Sbjct  398  LTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADETTFGYQERYAEYRY  457

Query  226  KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAI  285
            KP+ V+G  RSNATGTLE WHYA  YA +P L   W+      + RTL V  EPQF    
Sbjct  458  KPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRTLAVASEPQFIFDS  517

Query  286  RIANKTTRRMPLYSVPG  302
                + TR MP+ S+PG
Sbjct  518  LFKLRCTRPMPVNSIPG  534


>gi|444298010|dbj|GAC77834.1| major capsid protein [uncultured marine virus]
Length=480

 Score =   235 bits (600),  Expect = 8e-70, Method: Compositional matrix adjust.
 Identities = 124/304 (41%), Positives = 175/304 (58%), Gaps = 18/304 (6%)

Query  2    TSNGANHLVPASGNTLGAPKTDEDNGKPRVLLGKDAGGVSTWVPMEARLDNATSATINQL  61
            T + ANH+  A+  +    +  ++ G P +                A L NAT+ATINQL
Sbjct  189  TVSYANHIESATAASFAFEEDPDNAGFPNI---------------RADLTNATAATINQL  233

Query  62   RQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNINQIVQT--  119
            RQA  +Q+  E  ARGG+RY E IRA + V+  D  +Q PEYLGGG  ++NI  I QT  
Sbjct  234  RQAFQIQKLLERDARGGTRYTEIIRAHFSVLSPDSRLQRPEYLGGGSSNINITPIAQTQR  293

Query  120  SGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLERFWSRK  179
            S   T ++TP G   A+  +  +   FTKSF EHG+++G+C VR + +YQQG++R WSR 
Sbjct  294  SDTTTPDETPQGNLAAIGTSAFSGHGFTKSFTEHGYILGLCEVRADLTYQQGIDRLWSRD  353

Query  180  DRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLMRSNAT  239
             R D+Y P  +++GEQ V  KEI       D++ FGYQE +A+YR KP+R+S L RSNA 
Sbjct  354  TRYDFYWPALSHIGEQAVLSKEIFADATAGDEDVFGYQERFAEYRYKPSRISSLFRSNAA  413

Query  240  GTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAIRIANKTTRRMPLYS  299
             +L+ WH + ++A  P L+  ++ E    I R + V DEP          +  R +PLY 
Sbjct  414  ASLDVWHLSQDFAARPVLNSTFI-EDTPPIDRVIAVTDEPHILLDAYFKLRCARPLPLYG  472

Query  300  VPGL  303
            VPGL
Sbjct  473  VPGL  476


>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539

 Score =   237 bits (604),  Expect = 9e-70, Method: Compositional matrix adjust.
 Identities = 120/262 (46%), Positives = 163/262 (62%), Gaps = 5/262 (2%)

Query  46   MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG  105
            + A L  AT+ATIN +RQ+  +Q+  E  ARGG+RY E +R+ + VI  D  +Q PEYLG
Sbjct  275  LVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLG  334

Query  106  GGRYHVNINQIVQTSGQQTS-NDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRH  164
            GG   + +N + Q S    S  DTP+G  GA+     +   F  SF EHG V+G+C VR 
Sbjct  335  GGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRA  394

Query  165  NRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYR  224
            + +YQQGL R +SR  R D++ P F++LGEQP+  KE+  TG +TDD+ FGYQEAWA+YR
Sbjct  395  DLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYR  454

Query  225  MKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEP---QF  281
             KP++V+GLMRS A GTL+ WH A N+  +PTL+  ++ E    + R + V  E    QF
Sbjct  455  YKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQF  513

Query  282  FGAIRIANKTTRRMPLYSVPGL  303
                       R MP+YSVPGL
Sbjct  514  IFDAFFDINMARPMPMYSVPGL  535


>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533

 Score =   232 bits (591),  Expect = 5e-68, Method: Compositional matrix adjust.
 Identities = 120/258 (47%), Positives = 166/258 (64%), Gaps = 4/258 (2%)

Query  46   MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG  105
            M A L NAT+ATINQLR+A  +Q+ YE  ARGG+RY E +++ + V   D  +Q PEYLG
Sbjct  261  MFADLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLG  320

Query  106  GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN  165
            G +  V +  + QTS   T + +P G   A+  T  +   F+KSF EHG +IG+ CV  +
Sbjct  321  GQKTEVMMQTVPQTS--STDSTSPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFAD  377

Query  166  RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM  225
             +YQQG+ R WSR+DR D+Y P  A+LGEQ V  +EI   G + D +TFGYQE +A+YR 
Sbjct  378  LTYQQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRY  437

Query  226  KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAI  285
            KP++++G MRSNATGTL+ WH A ++  +P L+  ++ E    + R + V  EP+F    
Sbjct  438  KPSQITGKMRSNATGTLDAWHLAQDFTALPALNASFIEE-NPPVDRVIAVPSEPEFIWDW  496

Query  286  RIANKTTRRMPLYSVPGL  303
                KTTR MP+YSVPGL
Sbjct  497  YFDLKTTRPMPVYSVPGL  514



Lambda      K        H        a         alpha
   0.315    0.132    0.395    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 1647025809192