bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-40_CDS_annotation_glimmer3.pl_2_2

Length=278
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|575096057|emb|CDL66940.1|  unnamed protein product                   186   1e-53
gi|575094486|emb|CDL65860.1|  unnamed protein product                   139   5e-35
gi|575094568|emb|CDL65929.1|  unnamed protein product                   130   5e-32
gi|575094545|emb|CDL65905.1|  unnamed protein product                   128   5e-31
gi|393707865|ref|YP_004732987.1|  structural protein VP2              62.0    3e-08
gi|575094495|emb|CDL65861.1|  unnamed protein product                 56.6    4e-06
gi|12085140|ref|NP_073542.1|  minor capsid protein                    55.1    6e-06
gi|547839281|ref|WP_022246923.1|  putative minor capsid protein       54.7    1e-05
gi|575094430|emb|CDL65810.1|  unnamed protein product                 53.9    3e-05
gi|568290031|gb|ETN78178.1|  hypothetical protein NECAME_18237        50.4    8e-05


>gi|575096057|emb|CDL66940.1| unnamed protein product [uncultured bacterium]
Length=275

 Score =   186 bits (473),  Expect = 1e-53, Method: Compositional matrix adjust.
 Identities = 114/154 (74%), Positives = 129/154 (84%), Gaps = 0/154 (0%)

Query  30   ASENTAKSAQMASEQRDWQERQNALAMQFNAQEAAKSRSWQEYMSNTAHQREIRDLKAAG  89
            A  N+A +A+ A  QRDWQE QNA AMQFN+ EAAK+R WQE MSNTAHQRE++DL AAG
Sbjct  32   AQANSAWNAEQAEIQRDWQEAQNAKAMQFNSMEAAKNRKWQEMMSNTAHQREVKDLMAAG  91

Query  90   LNPVLSAMggngaavtsgatasgvtsagaKGEVDTSANAALVQMLGSVLSAQTQLQTANV  149
            LNPVLSAM GNGAAV SGATASGVTSAGAKGE DTS + A+  +LGS+LSA T +Q ANV
Sbjct  92   LNPVLSAMNGNGAAVGSGATASGVTSAGAKGEADTSTSGAIANLLGSILSASTAIQAANV  151

Query  150  NARTQEAVADKYTAMEEIVANISRDATLGSAGIH  183
            NARTQEAVADKYTAM +IVA I++ ATLGSAGIH
Sbjct  152  NARTQEAVADKYTAMSQIVAEINKAATLGSAGIH  185


>gi|575094486|emb|CDL65860.1| unnamed protein product [uncultured bacterium]
Length=344

 Score =   139 bits (350),  Expect = 5e-35, Method: Compositional matrix adjust.
 Identities = 93/165 (56%), Positives = 116/165 (70%), Gaps = 1/165 (1%)

Query  19   LDSALSRITRTASENTAKSAQMASEQRDWQERQNALAMQFNAQEAAKSRSWQEYMSNTAH  78
            LD   SR+    + N A SA  A +Q D+Q  Q AL  QFN  EA  SR WQE MSNTAH
Sbjct  61   LDDYFSRLQSITASNNAWSAAQAQKQMDFQASQGALVRQFNHDEAELSRLWQERMSNTAH  120

Query  79   QREIRDLKAAGLNPVLSAMggngaavtsgatasg-vtsagaKGEVDTSANAALVQMLGSV  137
            QREI+DL+AAGLNPVLSAMGG+GA VTSG+TASG    +G+KG+ DTS   ALV +LGS 
Sbjct  121  QREIKDLQAAGLNPVLSAMGGSGAPVTSGSTASGYSPPSGSKGDTDTSLAGALVSLLGSS  180

Query  138  LSAQTQLQTANVNARTQEAVADKYTAMEEIVANISRDATLGSAGI  182
            + AQ  +    ++ARTQE+VADKYTAM ++VA I ++ TL ++ I
Sbjct  181  MMAQASMANTAMSARTQESVADKYTAMSKLVAEIQQETTLSASTI  225


>gi|575094568|emb|CDL65929.1| unnamed protein product [uncultured bacterium]
Length=310

 Score =   130 bits (327),  Expect = 5e-32, Method: Compositional matrix adjust.
 Identities = 93/166 (56%), Positives = 123/166 (74%), Gaps = 7/166 (4%)

Query  19   LDSALSRITRT-------ASENTAKSAQMASEQRDWQERQNALAMQFNAQEAAKSRSWQE  71
            L + L++IT++       + +NTA+S Q A   R WQE QN +AMQFNA EA K+R+WQE
Sbjct  32   LANQLTKITQSIDDVIGLSEKNTARSVQEAESLRTWQEEQNRIAMQFNAAEAEKNRNWQE  91

Query  72   YMSNTAHQREIRDLKAAGLNPVLSAMggngaavtsgatasgvtsagaKGEVDTSANAALV  131
             MSNTAHQRE+ DL AAGLNPVLSA GGNGAAVTSGATASGVTS+GAKG+VDTSA++A+V
Sbjct  92   IMSNTAHQREVNDLMAAGLNPVLSAGGGNGAAVTSGATASGVTSSGAKGDVDTSASSAVV  151

Query  132  QMLGSVLSAQTQLQTANVNARTQEAVADKYTAMEEIVANISRDATL  177
             +LGS+LS+ T +  AN +A T  A  +K   + +++A+ + +  L
Sbjct  152  GILGSMLSSLTNIANANTSAITSMANTEKLGQINQLIAHANNENAL  197


>gi|575094545|emb|CDL65905.1| unnamed protein product [uncultured bacterium]
Length=325

 Score =   128 bits (321),  Expect = 5e-31, Method: Compositional matrix adjust.
 Identities = 116/235 (49%), Positives = 161/235 (69%), Gaps = 0/235 (0%)

Query  2    TTGKDAAQVQSVPAVGNLDSALSRITRTASENTAKSAQMASEQRDWQERQNALAMQFNAQ  61
            T+ +  A + S P +G   + L +ITR AS+N+A +A  A+  R+WQ++QN +AMQF++ 
Sbjct  3    TSAQSFANIPSAPQLGQYSNYLGQITRMASDNSAFNASQAAANRNWQQQQNNIAMQFSSA  62

Query  62   EAAKSRSWQEYMSNTAHQREIRDLKAAGLNPVLSAMggngaavtsgatasgvtsagaKGE  121
            EAAK+R WQ YMSNTAHQRE+ DLKAAGLNPVLSAMGGNGAAVTSGATA G TS+G +  
Sbjct  63   EAAKNRDWQSYMSNTAHQREVADLKAAGLNPVLSAMGGNGAAVTSGATAQGYTSSGGQAS  122

Query  122  VDTSANAALVQMLGSVLSAQTQLQTANVNARTQEAVADKYTAMEEIVANISRDATLGSAG  181
             DTSA AALV +LGS+L+AQT +     NA    +VADKYT+     A++    T  SA 
Sbjct  123  ADTSATAALVGLLGSLLNAQTSIANTATNAVANLSVADKYTSATRYAADVGYAGTSYSAN  182

Query  182  IHagatryaadtsaaasryfaDKNYEGTKYSSDKHYQGTMYSANKSYEGTKYSSD  236
            + A A+R+A++ + AAS+Y +D +   +KY+SD+ Y  + +++       KY+ D
Sbjct  183  VAAYASRFASNNALAASKYASDNSRAASKYASDQSYLASKFASILQSNTAKYNID  237


>gi|393707865|ref|YP_004732987.1| structural protein VP2 [Microviridae phi-CA82]
 gi|311336637|gb|ADP89808.1| structural protein VP2 [Microviridae phi-CA82]
Length=234

 Score = 62.0 bits (149),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 104/235 (44%), Gaps = 39/235 (17%)

Query  41   ASEQRDWQERQNALAMQFNAQEAAKSRSWQEYMSNTAHQREIRDLKAAGLNPVLSAMggn  100
            A +Q  W   Q   + QFNAQEA K+R WQE MSNTA QR+++D + AGLNP+ +     
Sbjct  12   ADKQNKWNAEQTEKSNQFNAQEAQKNRDWQEQMSNTALQRKMQDAEKAGLNPIFA-----  66

Query  101  gaavtsgatasgvtsagaKGEVDTS---ANAALVQMLGSVLSAQTQLQTANVNARTQEAV  157
                             A+G   TS   A+A   +    +++A T    A    R Q  +
Sbjct  67   ---------------LNAQGASTTSGATASADSSKPATDIINAMTSYTNAQEQNRMQREL  111

Query  158  ADKYTAMEEIVANISRDATLGSAGIHagatryaadtsaaasryfaDKNYEGTKYSSDKHY  217
            A +   ++  +A +   A L  A +   A    A  SA             ++Y +D+ Y
Sbjct  112  ALQENRVKLQIAQMQMQAELQRARLATNAQMATAYASA-----------SASRYGADRSY  160

Query  218  QGTMYSANKSYEGTKYSSDNSVR----NPSSAVGYAREIGKVAVNIFSDLFG-WD  267
              + YS++ SY+  +  ++N++     NP+  +   +E  +   N  +  F  WD
Sbjct  161  NASRYSSDTSYKNVQSQNENNILTKGVNPTGVLYNNKEFQRRTNNAINKGFNLWD  215


>gi|575094495|emb|CDL65861.1| unnamed protein product [uncultured bacterium]
Length=266

 Score = 56.6 bits (135),  Expect = 4e-06, Method: Compositional matrix adjust.
 Identities = 25/41 (61%), Positives = 33/41 (80%), Gaps = 0/41 (0%)

Query  56   MQFNAQEAAKSRSWQEYMSNTAHQREIRDLKAAGLNPVLSA  96
            + +N Q A +  ++QE MS+TAHQRE++DL AAGLNPVLSA
Sbjct  64   LNYNTQSAREQMAFQERMSSTAHQREVKDLIAAGLNPVLSA  104


>gi|12085140|ref|NP_073542.1| minor capsid protein [Bdellovibrio phage phiMH2K]
 gi|75089169|sp|Q9G055.1|H_BPPHM RecName: Full=Minor spike protein H; AltName: Full=H protein; 
AltName: Full=Pilot protein; AltName: Full=Protein VP2; Short=VP2 
[Bdellovibrio phage phiMH2K]
 gi|12017988|gb|AAG45344.1|AF306496_5 Vp2 [Bdellovibrio phage phiMH2K]
Length=199

 Score = 55.1 bits (131),  Expect = 6e-06, Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 34/50 (68%), Gaps = 7/50 (14%)

Query  47  WQERQNALAMQFNAQEAAKSRSWQEYMSNTAHQREIRDLKAAGLNPVLSA  96
           WQ R+N         EAA++R WQE MSN+AHQRE  DL+ AGLN +L+A
Sbjct  37  WQTRENQA-------EAARNRKWQEQMSNSAHQREANDLQTAGLNRLLTA  79


>gi|547839281|ref|WP_022246923.1| putative minor capsid protein [Clostridium sp. CAG:306]
 gi|524476581|emb|CDC18646.1| putative minor capsid protein [Clostridium sp. CAG:306]
Length=236

 Score = 54.7 bits (130),  Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 22/29 (76%), Positives = 28/29 (97%), Gaps = 0/29 (0%)

Query  69  WQEYMSNTAHQREIRDLKAAGLNPVLSAM  97
           +QE MS+TAHQRE++DL+AAGLNP+LSAM
Sbjct  41  FQERMSSTAHQREVKDLRAAGLNPILSAM  69


>gi|575094430|emb|CDL65810.1| unnamed protein product [uncultured bacterium]
Length=274

 Score = 53.9 bits (128),  Expect = 3e-05, Method: Compositional matrix adjust.
 Identities = 20/41 (49%), Positives = 33/41 (80%), Gaps = 0/41 (0%)

Query  55   AMQFNAQEAAKSRSWQEYMSNTAHQREIRDLKAAGLNPVLS  95
            AMQ+N+ +A +   W+E+MSNT++QR + D++ AGLNP+L+
Sbjct  120  AMQYNSAQAMRQMKWEEHMSNTSYQRAMEDMRKAGLNPILA  160


>gi|568290031|gb|ETN78178.1| hypothetical protein NECAME_18237 [Necator americanus]
Length=112

 Score = 50.4 bits (119),  Expect = 8e-05, Method: Compositional matrix adjust.
 Identities = 21/26 (81%), Positives = 23/26 (88%), Gaps = 0/26 (0%)

Query  68  SWQEYMSNTAHQREIRDLKAAGLNPV  93
           +WQE MSNTAHQRE  DLKAAGLNP+
Sbjct  43  AWQERMSNTAHQREQADLKAAGLNPI  68



Lambda      K        H        a         alpha
   0.309    0.121    0.329    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 1373811661332
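
Note on the statistics above: the bit scores in this report appear to follow the standard Karlin-Altschul conversion from the raw score S (the value in parentheses after "bits"), S' = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K. The sketch below is only an illustration. The function name bit_score and the constant names are hypothetical and not part of BLAST, and the parameter values are the rounded ones printed in the "Gapped" block. It does not attempt to reproduce the E-values, which also depend on the compositional adjustment and length corrections applied by the program.

```python
import math

# Rounded gapped parameters as printed in the "Gapped" block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the alignment headers in this report.
    for raw in (473, 149, 135, 131):
        print(raw, f"{bit_score(raw):.1f}")
```

For the raw score 149 this gives about 62.0 bits, matching the phi-CA82 VP2 hit. For 473 it gives about 186.8, which the report shows as 186, so values of 100 or more seem to be displayed without rounding up.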