bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters



Query= Contig-32_CDS_annotation_glimmer3.pl_2_1

Length=411
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|648626869|ref|WP_026318620.1|  hypothetical protein                  234   6e-71
gi|547312923|ref|WP_022044635.1|  putative uncharacterized protein    63.5    8e-08
gi|517172762|ref|WP_018361580.1|  hypothetical protein                60.8    7e-07
gi|547920049|ref|WP_022322420.1|  capsid protein VP1                  56.6    2e-05
gi|492501782|ref|WP_005867318.1|  hypothetical protein                55.8    4e-05
gi|649557305|gb|KDS63784.1|  capsid family protein                    50.1    0.001
gi|649569140|gb|KDS75238.1|  capsid family protein                    50.8    0.001
gi|649555287|gb|KDS61824.1|  capsid family protein                    50.8    0.001
gi|494308783|ref|WP_007173938.1|  hypothetical protein                50.1    0.002
gi|609718276|emb|CDN73650.1|  conserved hypothetical protein          48.5    0.006


>gi|648626869|ref|WP_026318620.1| hypothetical protein [Alistipes onderdonkii]
Length=231

 Score =   234 bits (597),  Expect = 6e-71, Method: Compositional matrix adjust.
 Identities = 120/231 (52%), Positives = 149/231 (65%), Gaps = 9/231 (4%)

Query  190  LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPTSRQISLGQQYAPALDNIAMQG  249
            +SGG  F+RR +HFN+ GYFMEITS+VP V YP+Y+NPT  Q +LGQ+YAPALDNI MQ 
Sbjct  1    MSGGDSFKRRTFHFNESGYFMEITSVVPTVMYPNYLNPTLLQTNLGQRYAPALDNIQMQP  60

Query  250  LKASTVFGEVQ-NLGANTVTYANS--------TLSIPGFKLQESNYVGYEPAWSELMTAV  300
            L   T+ G    N G+ + ++  +        T+++      E   VGY+PAW+ELMT V
Sbjct  61   LTVPTLLGNAYFNTGSGSYSHVLNHMGTGELRTVAVDKLSAAEGIAVGYQPAWAELMTGV  120

Query  301  SKPHGRLCNDLDYWVLSRDYGRNLASVMDTPAYSDFIKAAGTYVDELSLQRLTAFLKRIY  360
            SKPHGRLCNDLDYW   R YG  L S  D    S F++  G  VD L ++   A+LK  Y
Sbjct  121  SKPHGRLCNDLDYWAFQRRYGTVLYSSNDAQDASVFLEELGNEVDTLDVETFNAWLKNTY  180

Query  361  VSPSSCPYILCGDFNYVFYDQRPTAENFVLDNVADIVVFREKSKVNVATTL  411
            VS    PYIL   +NYVF D  P A+NFVLDN A+I V+REKSKVNV  TL
Sbjct  181  VSTDFVPYILPAMYNYVFADTDPNAQNFVLDNSAEISVYREKSKVNVPNTL  231


>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
 gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338

 Score = 63.5 bits (153),  Expect = 8e-08, Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 123/326 (38%), Gaps = 70/326 (21%)

Query  101  AVDISVS-GQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDNT-CPAFL  158
            A+D+++S G SV++  +   +++Q +MD  F  GGR  D + + +  K S      P FL
Sbjct  41   ALDLNISTGFSVAVPELRLRTKIQNWMDRLFVSGGRVGDVFRTLWGTKSSAIYVNKPDFL  100

Query  159  GSDSFDMNVNTLY-----QTTGFEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEIT  213
            G     +N + +        +G + N   L A   +    +     +Y+  + G FM IT
Sbjct  101  GVWQASINPSNVRAMANGSASGEDANLGQLAACVDRYCDFSGHSGIDYYAKEPGTFMLIT  160

Query  214  SIVPRVYYPSYINPTSRQISLGQQYAPALDNIAMQ-----------------GL--KAST  254
             +VP   Y   ++P    IS G  + P L+ I  Q                 GL  +AS 
Sbjct  161  MLVPEPAYSQGLHPDLASISFGDDFNPELNGIGFQLVPRHRFSMMPRGFNFTGLDQEASP  220

Query  255  VFGEVQNLGANTVTYANSTLSIPGFKLQESNYVGYEPAWSELMTAVSKPHGRLC--NDLD  312
             FG     G   +   N               VG E AWS L T  S+ HG      +  
Sbjct  221  WFGHT---GTGVLVDPNMV------------SVGEEVAWSWLRTDYSRLHGDFAQNGNYQ  265

Query  313  YWVLSRDYGRNLASVMDTPAYSDFIKAAGTYVDELSLQRLTAFLKRIYVSPSSCPYILCG  372
            YWVL+R +        D   +    +  GTY++ L                         
Sbjct  266  YWVLTRRFTTYFPD--DGTGFYQDGEYTGTYINPL-------------------------  298

Query  373  DFNYVFYDQRPTAENFVLDNVADIVV  398
            D+ YVF DQ   A NF      D+ V
Sbjct  299  DWQYVFVDQTLMAGNFAYYGTFDLNV  324


>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

 Score = 60.8 bits (146),  Expect = 7e-07, Method: Compositional matrix adjust.
 Identities = 64/261 (25%), Positives = 112/261 (43%), Gaps = 40/261 (15%)

Query  111  VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQ--DNTCPAFLGSDSFDMNVN  168
            +S+ +I  A  +++   +    G    +  E+ F + + +  D  C    G DS ++ V 
Sbjct  304  ISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDS-NIQVG  362

Query  169  TLYQT-----TGFEDNS------SPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEITSIVP  217
             + Q+     TG +D S         G  +G  SG  RF  + +     G  M I S+VP
Sbjct  363  DVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRFDAKEH-----GILMCIYSLVP  417

Query  218  RVYYPSY-INPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI  276
             V Y S  ++P  ++I  G  + P  +N+ MQ L A  +  +  N  AN+          
Sbjct  418  DVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPLFAKNISYKYNNNTANS----------  467

Query  277  PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCND--LDYWVLSRDYGR-----NLASVMD  329
               +++     G++P +SE  TA+   HG+  +   L YW ++R  G      N+++   
Sbjct  468  ---RIKNLGAFGWQPRYSEYKTALDINHGQFVHQEPLSYWTVARARGESMSNFNISTFKI  524

Query  330  TPAYSDFIKAAGTYVDELSLQ  350
             P + D + A      EL+ Q
Sbjct  525  NPKWLDDVFAVNYNGTELTDQ  545


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score = 56.6 bits (135),  Expect = 2e-05, Method: Compositional matrix adjust.
 Identities = 52/222 (23%), Positives = 94/222 (42%), Gaps = 23/222 (10%)

Query  99   DAAVDISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAF  157
            +  + ++V    +++ ++  ++ +QR+ +    GG R  +   S F V+ S      P F
Sbjct  299  NGTLKVNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQF  358

Query  158  LGSDSFDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIV  216
            LG     ++V+ + QT+   D +SP    +G  +S G     ++Y F + GY + I SI 
Sbjct  359  LGGGRMPISVSEVLQTSS-TDETSPQANMAGHGISAGINNGFKHY-FEEHGYIIGIMSIT  416

Query  217  PRVYYPSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI  276
            PR  Y   +     +      Y P   +++ Q +K   +F       +    Y N T   
Sbjct  417  PRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFV------SEDAAYNNGTF--  468

Query  277  PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR  318
                       GY P ++E     S+ HG    +L +W L+R
Sbjct  469  -----------GYTPRYAEYKYHPSEAHGDFRGNLSFWHLNR  499


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score = 55.8 bits (133),  Expect = 4e-05, Method: Compositional matrix adjust.
 Identities = 52/217 (24%), Positives = 95/217 (44%), Gaps = 23/217 (11%)

Query  104  ISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDS  162
            ++V    VS+ ++  ++ +QR+ +     G R  +   S F V+ S      P FLG   
Sbjct  289  VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  348

Query  163  FDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYY  221
              ++V+ + QT+   D++SP    +G  +S G     + Y F + GY + I SI PR  Y
Sbjct  349  TPISVSEVLQTSA-TDSTSPQANMAGHGISAGVNHGFKRY-FEEHGYIIGIMSIRPRTGY  406

Query  222  PSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKL  281
               +    R+      Y P   ++  Q +K   V+ + Q   +N  T+            
Sbjct  407  QQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEVYLQ-QTPASNNGTF------------  453

Query  282  QESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR  318
                  GY P ++E   ++++ HG    ++ +W L+R
Sbjct  454  ------GYTPRYAEYKYSMNEVHGDFRGNMAFWHLNR  484


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score = 50.1 bits (118),  Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 90/210 (43%), Gaps = 23/210 (11%)

Query  111  VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDSFDMNVNT  169
            V++ +I  ++ +QR+ +     G R  +   S F V+ S      P FLG     ++V+ 
Sbjct  3    VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSE  62

Query  170  LYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPT  228
            + QT+   D++SP    +G  +S G       Y F + GY M I SI PR  Y   +   
Sbjct  63   VLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGYQQGVPKD  120

Query  229  SRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKLQESNYVG  288
             R+      Y P   ++  Q +K   ++   ++  AN  T+                  G
Sbjct  121  FRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------------G  161

Query  289  YEPAWSELMTAVSKPHGRLCNDLDYWVLSR  318
            Y P ++E   + ++ HG    ++ +W L+R
Sbjct  162  YTPRYAEYKYSQNEVHGDFRGNMAFWHLNR  191


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score = 50.8 bits (120),  Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 90/210 (43%), Gaps = 23/210 (11%)

Query  111  VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDSFDMNVNT  169
            V++ +I  ++ +QR+ +     G R  +   S F V+ S      P FLG     ++V+ 
Sbjct  148  VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSE  207

Query  170  LYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPT  228
            + QT+   D++SP    +G  +S G       Y F + GY M I SI PR  Y   +   
Sbjct  208  VLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGYQQGVPKD  265

Query  229  SRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKLQESNYVG  288
             R+      Y P   ++  Q +K   ++   ++  AN  T+                  G
Sbjct  266  FRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------------G  306

Query  289  YEPAWSELMTAVSKPHGRLCNDLDYWVLSR  318
            Y P ++E   + ++ HG    ++ +W L+R
Sbjct  307  YTPRYAEYKYSQNEVHGDFRGNMAFWHLNR  336


>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=541

 Score = 50.8 bits (120),  Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 51/217 (24%), Positives = 92/217 (42%), Gaps = 23/217 (11%)

Query  104  ISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDS  162
            ++     V++ +I  ++ +QR+ +     G R  +   S F V+ S      P FLG   
Sbjct  292  VNTDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  351

Query  163  FDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYY  221
              ++V+ + QT+   D++SP    +G  +S G       Y F + GY M I SI PR  Y
Sbjct  352  TPISVSEVLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGY  409

Query  222  PSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKL  281
               +    R+      Y P   ++  Q +K   ++   ++  AN  T+            
Sbjct  410  QQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------  456

Query  282  QESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR  318
                  GY P ++E   + ++ HG    ++ +W L+R
Sbjct  457  ------GYTPRYAEYKYSQNEVHGDFRGNMAFWHLNR  487


>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
 gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 
17361]
Length=553

 Score = 50.1 bits (118),  Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 54/228 (24%), Positives = 96/228 (42%), Gaps = 31/228 (14%)

Query  101  AVDISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKL--SQDNTCPAFL  158
             VD   S    S+ ++  A  + + + +    G    D   + + V++  S+D       
Sbjct  286  GVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLG  345

Query  159  GSDSFDMNVNTLYQTTG-----FEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEIT  213
            G DS DM V+ + QT+G     ++  +  LG  +G+ +G  R  R  +   + G  M I 
Sbjct  346  GFDS-DMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGR-GRIVFDAKEHGVLMCIY  403

Query  214  SIVPRVYYP-SYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANS  272
            S+VP++ Y  + ++P   ++     + P  +N+ MQ L +S +         N V     
Sbjct  404  SLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQPLNSSYISSFCTTDPKNPV-----  458

Query  273  TLSIPGFKLQESNYVGYEPAWSELMTAVSKPHGRLCND--LDYWVLSR  318
                          +GY+P +SE  TA+   HG+      L  W +SR
Sbjct  459  --------------LGYQPRYSEYKTALDVNHGQFAQSDALSSWSVSR  492


>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537

 Score = 48.5 bits (114),  Expect = 0.006, Method: Compositional matrix adjust.
 Identities = 51/233 (22%), Positives = 100/233 (43%), Gaps = 27/233 (12%)

Query  102  VDISVSGQSVSMRN-ITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDNTC-PAFLG  159
            + ++++ ++VS  N +  A ++Q +++     G R ++   S F VK S      P FLG
Sbjct  279  LKLNMASENVSTVNDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLG  338

Query  160  SDSFDMNVNTLYQTTGFEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEITSIVPRV  219
             +   + ++ + Q +   D+++P G  +G   G  +    +  F + GY + + S++P+ 
Sbjct  339  GNKSPIMISEVLQQSA-TDSTTPQGNMAGHGIGIGKDGGFSRFFEEHGYVIGLMSVIPKT  397

Query  220  YYPSYINPTSRQISLGQQYA---PALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI  276
               SY     R  S   ++    P  ++I  Q +    +F   +N+ A            
Sbjct  398  ---SYSQGIPRHFSKSDKFDYFWPQFEHIGEQPVYNKEIFA--KNIDA------------  440

Query  277  PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSRDYGRNLASVMD  329
                       GY P +SE   + S  HG   +DL +W L R +  +   V++
Sbjct  441  ----FDSEAVFGYLPRYSEYKFSPSTVHGDFKDDLYFWHLGRIFDTDKPPVLN  489



Lambda      K        H        a         alpha
   0.318    0.133    0.390    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 2665232623989