bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-6_CDS_annotation_glimmer3.pl_2_8

Length=632
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547312923|ref|WP_022044635.1|  putative uncharacterized protein    68.2    4e-09
gi|530695267|gb|AGT39863.1|  major capsid protein                     67.0    5e-09
gi|492501782|ref|WP_005867318.1|  hypothetical protein                68.2    8e-09
gi|599088027|gb|AHN52939.1|  major capsid protein                     65.5    1e-08
gi|547920049|ref|WP_022322420.1|  capsid protein VP1                  67.4    2e-08
gi|599088021|gb|AHN52936.1|  major capsid protein                     64.7    2e-08
gi|649557305|gb|KDS63784.1|  capsid family protein                    64.7    2e-08
gi|599087961|gb|AHN52906.1|  major capsid protein                     63.9    3e-08
gi|599087475|gb|AHN52663.1|  major capsid protein                     63.5    4e-08
gi|649569140|gb|KDS75238.1|  capsid family protein                    65.1    6e-08


>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
 gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338

 Score = 68.2 bits (165),  Expect = 4e-09, Method: Compositional matrix adjust.
 Identities = 86/334 (26%), Positives = 138/334 (41%), Gaps = 53/334 (16%)

Query  344  GLAVKTYQSDIFNNWIQTEWLEGENG---INAITAVDTSIG---SFTMDTLNLAKKVYNM  397
            GL    Y  D+F N I+    +G +    I  + A+D +I    S  +  L L  K+ N 
Sbjct  11   GLLSVPYSPDLFGNIIK----QGSSPAVEIEVMNALDLNISTGFSVAVPELRLRTKIQNW  66

Query  398  LNRIAVSDG----TYKSWMETVFSAKYVERTETPIYYGGMSQEII----FEEVVSTSATG  449
            ++R+ VS G     +++   T  SA YV + +    + G+ Q  I       + + SA+G
Sbjct  67   MDRLFVSGGRVGDVFRTLWGTKSSAIYVNKPD----FLGVWQASINPSNVRAMANGSASG  122

Query  450  EEP-LGTLAGK-GRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQGNNFTMNWETMN  507
            E+  LG LA    R         +     EP   + I  + P   YSQG +  +   +  
Sbjct  123  EDANLGQLAACVDRYCDFSGHSGIDYYAKEPGTFMLITMLVPEPAYSQGLHPDLASISFG  182

Query  508  DLHKPALDMIGYQ-----------------DLTMEKAAWWTEEHTSD-TEFAQKSIGKTV  549
            D   P L+ IG+Q                  L  E + W+    T    +    S+G+ V
Sbjct  183  DDFNPELNGIGFQLVPRHRFSMMPRGFNFTGLDQEASPWFGHTGTGVLVDPNMVSVGEEV  242

Query  550  AWVDYMTNYNKNYGNFASGENENFMTLDRNYNVENPD----------FT-TYIDPAKYNG  598
            AW    T+Y++ +G+FA   N  +  L R +    PD          +T TYI+P  +  
Sbjct  243  AWSWLRTDYSRLHGDFAQNGNYQYWVLTRRFTTYFPDDGTGFYQDGEYTGTYINPLDWQY  302

Query  599  IFADQSRSSMNFWVQIGVNWKVRRKISAKSIPNL  632
            +F DQ+  + NF      +  V   +SA  +P L
Sbjct  303  VFVDQTLMAGNFAYYGTFDLNVTSSLSANYMPYL  336


>gi|530695267|gb|AGT39863.1| major capsid protein, partial [uncultured marine Microviridae]
Length=263

 Score = 67.0 bits (162),  Expect = 5e-09, Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 71/160 (44%), Gaps = 4/160 (3%)

Query  368  NGINAITAVDTSIGSFTMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVE-RTET  426
            NG  AI AV +S G   ++TL  A  V  +L R A     Y   +++ F     + R + 
Sbjct  106  NGYPAIYAVLSSTGGIPINTLRQAWMVQALLERDARGGTRYIEIIKSHFGVTSPDFRLQR  165

Query  427  PIYYGGMSQEIIFEEVVSTSATGEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICS  486
            P Y GG S ++    +  T   G  PLG + G G  A + +         E  YIIGI S
Sbjct  166  PEYIGGGSTDLNITPIAQTVPGGGNPLGQIGGAGTAAGSHRASYAAT---EHGYIIGIIS  222

Query  487  ITPRLDYSQGNNFTMNWETMNDLHKPALDMIGYQDLTMEK  526
            +   L Y QG N   +  T  D + PA   +G Q +T  +
Sbjct  223  VKSELSYQQGINKMWDRHTRYDFYFPATAQLGEQAITQRE  262


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score = 68.2 bits (165),  Expect = 8e-09, Method: Compositional matrix adjust.
 Identities = 67/252 (27%), Positives = 109/252 (43%), Gaps = 13/252 (5%)

Query  384  TMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVE-RTETPIYYGGMSQEIIFEEV  442
            +++ L  +  +     R A S   Y   + + F  +  + R + P + GG    I   EV
Sbjct  297  SINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEV  356

Query  443  VSTSAT-GEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQGNNFTM  501
            + TSAT    P   +AG G  A    G K     +E  YIIGI SI PR  Y QG     
Sbjct  357  LQTSATDSTSPQANMAGHGISAGVNHGFKRYF--EEHGYIIGIMSIRPRTGYQQGVPKDF  414

Query  502  NWETMNDLHKPALDMIGYQDLTMEKAAWWTEEHTSDTEFAQKSIGKTVAWVDYMTNYNKN  561
                  D + P    +G Q++  E+        +++  F     G T  + +Y  + N+ 
Sbjct  415  RKFDNMDFYFPEFAHLGEQEIKNEEVYLQQTPASNNGTF-----GYTPRYAEYKYSMNEV  469

Query  562  YGNFASGENENFMTLDRNYNVENPDF-TTYIDPAKYNGIFADQSRSSMNFWVQIGVNWKV  620
            +G+F    N  F  L+R ++ E+P+  TT+++    N +FA    S   +W+Q+  + K 
Sbjct  470  HGDFRG--NMAFWHLNRIFS-ESPNLNTTFVECNPSNRVFATAETSDDKYWIQLYQDVKA  526

Query  621  RRKISAKSIPNL  632
             R +     P L
Sbjct  527  LRLMPKYGTPML  538


>gi|599088027|gb|AHN52939.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=219

 Score = 65.5 bits (158),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 45/137 (33%), Positives = 62/137 (45%), Gaps = 2/137 (1%)

Query  384  TMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVERTETPIYYGGMSQEIIFEEVV  443
            T++ L  A ++  +L R A S   Y   ++  F   +++ T  P + GG S  I    V 
Sbjct  77   TINQLRQAFQIQKLLERDARSGTRYAEIVKAHFGVNFMDVTYRPEFLGGTSTPINVTSVP  136

Query  444  STSATGEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQGNNFTMNW  503
             TS +G  P GTLA  G    N  GG       E   ++GI S+   L Y QG N   + 
Sbjct  137  QTSESGTTPQGTLAAFGTATVN--GGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFSR  194

Query  504  ETMNDLHKPALDMIGYQ  520
             T  D + PAL  IG Q
Sbjct  195  STRYDFYFPALAHIGEQ  211


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score = 67.4 bits (163),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 62/214 (29%), Positives = 99/214 (46%), Gaps = 16/214 (7%)

Query  423  RTETPIYYGGMSQEIIFEEVVSTSATGE-EPLGTLAGKGRLAPNKKGGKVVIKIDEPSYI  481
            R + P + GG    I   EV+ TS+T E  P   +AG G  A    G K     +E  YI
Sbjct  352  RLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYF--EEHGYI  409

Query  482  IGICSITPRLDYSQG--NNFTMNWETMNDLHKPALDMIGYQDLTMEKAAWWTEEHTSDTE  539
            IGI SITPR  Y QG   +FT  ++ M D + P    +  Q++  ++        + D  
Sbjct  410  IGIMSITPRSGYQQGVPRDFT-KFDNM-DFYFPEFAHLSEQEIKNQELFV-----SEDAA  462

Query  540  FAQKSIGKTVAWVDYMTNYNKNYGNFASGENENFMTLDRNYNVENPDF-TTYIDPAKYNG  598
            +   + G T  + +Y  + ++ +G+F    N +F  L+R +  + P+  TT+++    N 
Sbjct  463  YNNGTFGYTPRYAEYKYHPSEAHGDFRG--NLSFWHLNRIFE-DKPNLNTTFVECKPSNR  519

Query  599  IFADQSRSSMNFWVQIGVNWKVRRKISAKSIPNL  632
            +FA        FWVQ+  + K  R +     P L
Sbjct  520  VFATSETEDDKFWVQMYQDVKALRLMPKYGTPML  553


>gi|599088021|gb|AHN52936.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=220

 Score = 64.7 bits (156),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 48/155 (31%), Positives = 70/155 (45%), Gaps = 2/155 (1%)

Query  368  NGINAITAVDTSIGSFTMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVERTETP  427
            N   A+ A  +S  + T++ L  A ++  +L R A S   Y   ++  F   +++ T  P
Sbjct  62   NANRALYADLSSATAATINQLRQAFQIQKLLERDARSGTRYSEIVKAHFGVNFMDVTYRP  121

Query  428  IYYGGMSQEIIFEEVVSTSATGEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICSI  487
             + GG S  +    V  TS +G  P GTLA  G    N  GG       E   ++GI S+
Sbjct  122  EFLGGSSTPVNVTSVPQTSESGTTPQGTLAAFGTATIN--GGGFTKSFTEHCIVMGIASV  179

Query  488  TPRLDYSQGNNFTMNWETMNDLHKPALDMIGYQDL  522
               L Y QG N   +  T  D + PAL  IG Q +
Sbjct  180  RADLTYQQGLNRMFSRSTRYDFYFPALAHIGEQSV  214


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score = 64.7 bits (156),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 109/241 (45%), Gaps = 19/241 (8%)

Query  398  LNRIAVSDGTYKSWMETVFSAKYVE-RTETPIYYGGMSQEIIFEEVVSTSAT-GEEPLGT  455
              R A S   Y   + + F  +  + R + P + GG    I   EV+ TS+T    P   
Sbjct  18   FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  77

Query  456  LAGKGRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQG--NNFTMNWETMNDLHKPA  513
            +AG G  A    G       +E  YI+GI SI PR  Y QG   +F   ++ M D + P 
Sbjct  78   MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFR-KFDNM-DFYFPE  133

Query  514  LDMIGYQDLTMEKAAWWTEEHTSDTEFA-QKSIGKTVAWVDYMTNYNKNYGNFASGENEN  572
               +G Q++  E      E + ++++ A + + G T  + +Y  + N+ +G+F    N  
Sbjct  134  FAHLGEQEIKNE------ELYLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRG--NMA  185

Query  573  FMTLDRNYNVENPDF-TTYIDPAKYNGIFADQSRSSMNFWVQIGVNWKVRRKISAKSIPN  631
            F  L+R +  E P+  TT+++    N +FA    S   +WVQI  + K  R +     P 
Sbjct  186  FWHLNRIFK-EKPNLNTTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPM  244

Query  632  L  632
            L
Sbjct  245  L  245


>gi|599087961|gb|AHN52906.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 63.9 bits (154),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 45/139 (32%), Positives = 63/139 (45%), Gaps = 2/139 (1%)

Query  384  TMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVERTETPIYYGGMSQEIIFEEVV  443
            T++ L  A ++  +L R A S   Y   ++  F   +++ T  P + GG S  I    V 
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHFGVNFMDVTYRPEFLGGTSTPINVTSVP  127

Query  444  STSATGEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQGNNFTMNW  503
             TS +G  P GTLA  G    N  GG       E   ++GI S+   L Y QG N   + 
Sbjct  128  QTSESGTTPQGTLAAFGTATIN--GGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFSR  185

Query  504  ETMNDLHKPALDMIGYQDL  522
             T  D + PAL  IG Q +
Sbjct  186  STRYDFYFPALAHIGEQSV  204


>gi|599087475|gb|AHN52663.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 63.5 bits (153),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 45/139 (32%), Positives = 63/139 (45%), Gaps = 2/139 (1%)

Query  384  TMDTLNLAKKVYNMLNRIAVSDGTYKSWMETVFSAKYVERTETPIYYGGMSQEIIFEEVV  443
            T++ L  A ++  +L R A S   Y   ++  F   +++ T  P + GG S  I    V 
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHFGVNFMDVTYRPEFLGGTSTPINVTSVP  127

Query  444  STSATGEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYIIGICSITPRLDYSQGNNFTMNW  503
             TS +G  P GTLA  G    N  GG       E   ++GI S+   L Y QG N   + 
Sbjct  128  QTSESGTTPQGTLAAFGTATIN--GGGFTKSFTEHCILMGIASVRADLTYQQGLNRMFSR  185

Query  504  ETMNDLHKPALDMIGYQDL  522
             T  D + PAL  IG Q +
Sbjct  186  STRYDFYFPALAHIGEQSV  204


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score = 65.1 bits (157),  Expect = 6e-08, Method: Compositional matrix adjust.
 Identities = 61/213 (29%), Positives = 95/213 (45%), Gaps = 14/213 (7%)

Query  423  RTETPIYYGGMSQEIIFEEVVSTSAT-GEEPLGTLAGKGRLAPNKKGGKVVIKIDEPSYI  481
            R + P + GG    I   EV+ TS+T    P   +AG G  A    G       +E  YI
Sbjct  189  RLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGVNHG--FTRYFEEHGYI  246

Query  482  IGICSITPRLDYSQGNNFTMNWETMNDLHKPALDMIGYQDLTMEKAAWWTEEHTSDTEFA  541
            +GI SI PR  Y QG           D + P    +G Q++  E      E + ++++ A
Sbjct  247  MGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNE------ELYLNESDAA  300

Query  542  -QKSIGKTVAWVDYMTNYNKNYGNFASGENENFMTLDRNYNVENPDF-TTYIDPAKYNGI  599
             + + G T  + +Y  + N+ +G+F    N  F  L+R +  E P+  TT+++    N +
Sbjct  301  NEGTFGYTPRYAEYKYSQNEVHGDFRG--NMAFWHLNRIFK-EKPNLNTTFVECNPSNRV  357

Query  600  FADQSRSSMNFWVQIGVNWKVRRKISAKSIPNL  632
            FA    S   +WVQI  + K  R +     P L
Sbjct  358  FATAETSDDKYWVQIYQDIKALRLMPKYGTPML  390



Lambda      K        H        a         alpha
   0.315    0.133    0.399    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4787444561766