           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-35_CDS_annotation_glimmer3.pl_2_4

Length=343
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547312922|ref|WP_022044634.1|  putative replication initiation...    125   2e-29
gi|609718275|emb|CDN73649.1|  conserved hypothetical protein          95.1    2e-19
gi|649555288|gb|KDS61825.1|  hypothetical protein M095_3808           80.9    3e-14
gi|547920048|ref|WP_022322419.1|  putative replication protein        79.7    6e-14
gi|492501778|ref|WP_005867316.1|  hypothetical protein                79.3    1e-13
gi|568293148|gb|ETN80369.1|  hypothetical protein NECAME_18023        75.5    4e-12
gi|575094374|emb|CDL65755.1|  unnamed protein product                 71.6    2e-10
gi|649562725|gb|KDS68909.1|  hypothetical protein M096_3339           58.5    1e-06
gi|313766930|gb|ADR80656.1|  putative replication initiation protein  59.3    1e-06
gi|47566147|ref|YP_022485.1|  nonstructural protein                   56.6    9e-06


>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii 
CAG:68]
 gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii 
CAG:68]
Length=320

 Score =   125 bits (314),  Expect = 2e-29, Method: Compositional matrix adjust.
 Identities = 88/271 (32%), Positives = 137/271 (51%), Gaps = 43/271 (16%)

Query  2    CLYPKLIPNKRYLPTKKN----------GGIPPVCPDERLRYVTAACGDCYECRKQKQRQ  51
            C  PK+I N+RY                G   P  PD  L      CG C+ C+K    Q
Sbjct  3    CEQPKVIVNRRYANMTNTEIVNYAKVYYGCFWP--PDYILE---VPCGYCHSCQKSYNNQ  57

Query  52   WMVRMSEENRQTP--NAYFLTLTIDDKSYKQIKQKYNLKDNNDIATKAIRLCLERVRKLT  109
            + +R+  E R+ P     F+TLT +D S ++       KD N    KA+RL L+R RK+ 
Sbjct  58   YRIRLLYELRKYPPGTCLFVTLTFNDDSLEKFS-----KDTN----KAVRLFLDRFRKVY  108

Query  110  GKSVKHWFITELGHEKTERLHLHGIVWGL-------------GNGEKVTNNWKYGITFTG  156
            GK ++HWF+ E G     R H HGI++ +             G+   + + WKYG  F G
Sbjct  109  GKQIRHWFVCEFG-TLHGRPHYHGILFNVPQALIDGYDSDMPGHHPLLASCWKYGFVFVG  167

Query  157  YFVNEKTIKYITKYMLKVDEKHPKFRGKVLCSAGIGAGYLKREDAKRHVYIPGKTNESYR  216
            Y V+++T  YITKY+ K      K R +V+ S GIG+ YL  E++  H  +  +  + + 
Sbjct  168  Y-VSDETCSYITKYVTKSINGD-KVRPRVISSFGIGSNYLNTEESSLHK-LGNQRYQPFM  224

Query  217  MRNGEKLNLPIYYRNKIFTEEEREKLFLDKI  247
            + NG +  +P YY NKIF++ +++ + +D++
Sbjct  225  VLNGFQQAMPRYYYNKIFSDVDKQNMVVDRL  255


>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265

 Score = 95.1 bits (235),  Expect = 2e-19, Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 103/211 (49%), Gaps = 21/211 (10%)

Query  38   CGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQKYNLKDNNDIAT--  95
            CG C ECRK +   W  R++EE + + +A+F+TLT     Y  +   Y+  DN  I+   
Sbjct  25   CGKCLECRKARTNSWFARLTEELKVSKSAHFVTLT-----YSDVYLPYS--DNGLISLDY  77

Query  96   KAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLGNGEKVTNNWKYGITFT  155
            +  +L ++R RKL    +K++ + E G  +T R H H IV+G+ N +     W+ G    
Sbjct  78   RDFQLFMKRARKLQKSKIKYFLVGEYG-AQTYRPHYHAIVFGVENIDAFLGEWRMGNVHA  136

Query  156  GYFVNEKTIKYITKYMLK-------VDEKHPKFRGKVLCSAGIGAGYLKREDAKRHVYIP  208
            G  V  K+I Y  KY  K        D    +   K L S G+G  +L     K   Y  
Sbjct  137  GT-VTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLTESMIK---YYK  192

Query  209  GKTNESYRMRNGEKLNLPIYYRNKIFTEEER  239
               + S+ +  G  + LP YYR+K+F++ E+
Sbjct  193  DDVSRSFSLLGGTTIALPRYYRDKVFSDIEK  223


>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str. 
3999B T(B) 4]
 gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str. 
3999B T(B) 4]
 gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str. 
3999B T(B) 4]
Length=284

 Score = 80.9 bits (198),  Expect = 3e-14, Method: Compositional matrix adjust.
 Identities = 67/272 (25%), Positives = 126/272 (46%), Gaps = 28/272 (10%)

Query  35   TAACGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSY--KQIKQKYNLKDNND  92
               CG C  CRK K++ W+ R+  E  + P + F+TLT DD+      I +         
Sbjct  14   AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGV  73

Query  93   IATKAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLG-----NGEKVTNN  147
            ++ + I+L ++R+RK   +    +F+T     +  R H H I++G        G+ +   
Sbjct  74   VSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAEC  133

Query  148  WKYGITFTGYFVNEKTIKYITKYMLK------VDEKHPKFRGKVLCS--AGIGAGYLKRE  199
            WK G     + +  K I Y+TKYM +      + +   +++  +LCS   GIG  +L+ +
Sbjct  134  WKNGFV-QAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYHFLREQ  192

Query  200  DAKRHVYIPGKTNESYRMRNGEKLNLPIYYRNKIFTE-------EEREKLFLDKIEKGII  252
                +   P    +  R  NG ++ +P YY +K++ +       E RE  F++++++   
Sbjct  193  ILDFYRLHP---RDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQEWY  249

Query  253  YILGIKIDLK--TEELRYNGVLASERERCERL  282
            + +     L+   ++L     LA ER   ++L
Sbjct  250  HYINTSPRLRYIADQLETESKLAYERRAEDKL  281


>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
 gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278

 Score = 79.7 bits (195),  Expect = 6e-14, Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 112/237 (47%), Gaps = 36/237 (15%)

Query  38   CGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQKYNLKDNN--DIAT  95
            CG C  CR+ K++ W+ R+  E ++ P + F+TLT DD+     +   +L   N   ++ 
Sbjct  12   CGWCVNCRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAVVSK  71

Query  96   KAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLG-----NGEKVTNNWKY  150
            + ++L ++R+RK        +F+T     K  R H H I++G        G+ +   W+ 
Sbjct  72   RDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAGDLLAECWQN  131

Query  151  GITFTGYFVNEKTIKYITKYM--------LKVDEKHPKFRGKVLCS--AGIGAGYLKR--  198
            G     + +  K I Y+ KYM        +  DEK  K++  +LCS   GIG G++K   
Sbjct  132  GFV-QAHPLTIKEIAYVCKYMYEKSMCPEILRDEK--KYKPFMLCSRNPGIGFGFMKADI  188

Query  199  -EDAKRHVYIPGKTNESYRMRNGEKLNLPIYYRNKI-------FTEEEREKLFLDKI  247
             E  +RH        +  R   G K+ +P YY +K+       F +E RE+ F  K+
Sbjct  189  IEFYRRH------PRDYVRAWAGHKMAMPRYYADKLYDDDMKAFLKEMREEFFRHKM  239


>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis 
CL09T03C24]
Length=284

 Score = 79.3 bits (194),  Expect = 1e-13, Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 112/237 (47%), Gaps = 26/237 (11%)

Query  35   TAACGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSY--KQIKQKYNLKDNND  92
               CG C  CRK K++ W+ R+  E  + P + F+TLT DD+      I +         
Sbjct  14   AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHMPTAMIGEDLFKSTVGV  73

Query  93   IATKAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLG-----NGEKVTNN  147
            ++ + I+L ++R+RK   +    +F+T     +  R H H I++G        G+ +   
Sbjct  74   VSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAEC  133

Query  148  WKYGITFTGYFVNEKTIKYITKYMLK------VDEKHPKFRGKVLCS--AGIGAGYLKRE  199
            WK G     + +  K I Y+TKYM +      + +   +++  +LCS   GIG  +L+ +
Sbjct  134  WKNGFV-QAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIPGIGYHFLREQ  192

Query  200  DAKRHVYIPGKTNESYRMRNGEKLNLPIYYRNKIFTE-------EEREKLFLDKIEK  249
                +   P    +  R  NG ++ +P YY +K++ +       E RE  F++++++
Sbjct  193  ILDFYRLHP---RDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQ  246


>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345

 Score = 75.5 bits (184),  Expect = 4e-12, Method: Compositional matrix adjust.
 Identities = 65/249 (26%), Positives = 111/249 (45%), Gaps = 43/249 (17%)

Query  25   VCPDERLRYVTAACGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQK  84
            V P   L  V   CG C  C++++   W+ R+ +E  Q  NA F+TLT D +     K  
Sbjct  9    VLPKAALEKVPVPCGRCPPCKRRRVDSWVFRLLQEELQHENASFVTLTYDTRFVPISKNG  68

Query  85   YNLKDNNDIATKAIRLCLERVRKLT-GKSVKHWFITELGHEKTERLHLHGIVWGLGNGEK  143
            +   D  +         ++R+RKL  G+ +K++   E G ++  R H H I++G+     
Sbjct  69   FMTLDRGEFPR-----YMKRLRKLVPGRKLKYYMCGEYGSQRF-RPHYHAIIFGVPQDSL  122

Query  144  VTNNWKYGITFTG--------YFVNEKTIKYITKYMLKV--------DEKHPKFRGKVLC  187
              + W    T  G          V  K+I Y  KY+ K         D++ P+F    L 
Sbjct  123  FADAW----TLNGDSLGGVVVGTVTGKSIAYTMKYIDKSTWKQKHGRDDRVPEFS---LM  175

Query  188  SAGIGAGYLKREDAKRHVYIPGKTNESYRM----RNGEKLNLPIYYRNKIFTEEEREK--  241
            S G+G  YL  +  + H        +  R+      G ++ +P YYR KI+++++ +K  
Sbjct  176  SKGMGVSYLTPQMVEYH------KEDISRLFCTREGGSRIAMPRYYRQKIYSDDDLKKQV  229

Query  242  -LFLDKIEK  249
             L  + +E+
Sbjct  230  VLIAESVER  238


>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487

 Score = 71.6 bits (174),  Expect = 2e-10, Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 71/158 (45%), Gaps = 20/158 (13%)

Query  35   TAACGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQKYNLKDNND--  92
               CG CY+C+  K   W VR SEE      +YF TLT+D +    I     L D +   
Sbjct  25   VVPCGHCYDCKSAKTTDWQVRCSEELNNNSQSYFYTLTLDPRF---IDTYGTLPDGSPRY  81

Query  93   -IATKAIRLCLERVRKLTGK---SVKHWFITELGHEKTERLHLHGIVWGLGNGEK-----  143
                + I+L L+R+RK   K   S+K+  + ELG E T R H H I +   +        
Sbjct  82   VFNKRHIQLFLKRLRKALSKYNISLKYVIVGELG-ETTHRPHYHAIFYLSSSVNPFKFRI  140

Query  144  -VTNNWKYGITFT----GYFVNEKTIKYITKYMLKVDE  176
             V N+W  G   +    G  +N   + Y+ KYM K D 
Sbjct  141  MVRNSWSLGFIKSGDNNGIILNNDAVSYVIKYMHKTDS  178


>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=250

 Score = 58.5 bits (140),  Expect = 1e-06, Method: Compositional matrix adjust.
 Identities = 62/255 (24%), Positives = 112/255 (44%), Gaps = 36/255 (14%)

Query  56   MSEENRQTPNAYFLTLTIDDKSY--KQIKQKYNLKDNNDIATKAIRLCLERVRKLTGKSV  113
            M  E  + P + F+TLT DD+      I +         ++ + I+L ++R+RK   +  
Sbjct  1    MQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGVVSKRDIQLFMKRLRKKYAQYR  60

Query  114  KHWFITELGHEKTERLHLHGIVWGLG-----NGEKVTNNWKYGITFTGYFVNEKTIKYIT  168
              +F+T     +  R H H I++G        G+ +   WK G     + +  K I Y+T
Sbjct  61   LRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAECWKNGFV-QAHPLTTKEISYVT  119

Query  169  KYMLK----------VDEKHPKFRGKVLCS--AGIGAGYLKREDAKRHVYIPGKTNESYR  216
            KYM +          V E  P     +LCS   GIG  +L+ +    +   P    +  R
Sbjct  120  KYMYEKSMIPDILKGVKEYQP----FMLCSKMPGIGYHFLREQILDFYRLHP---RDYVR  172

Query  217  MRNGEKLNLPIYYRNKIFTE-------EEREKLFLDKIEKGIIYILGIKIDLK--TEELR  267
              NG ++ +P YY +K++ +       E RE  F++++++   + +     L+   ++L 
Sbjct  173  AFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQEWYHYINTSPRLRYIADQLE  232

Query  268  YNGVLASERERCERL  282
                LA ER   ++L
Sbjct  233  TESKLAYERRAEDKL  247


>gi|313766930|gb|ADR80656.1| putative replication initiation protein [Uncultured Microviridae]
Length=402

 Score = 59.3 bits (142),  Expect = 1e-06, Method: Compositional matrix adjust.
 Identities = 41/152 (27%), Positives = 74/152 (49%), Gaps = 24/152 (16%)

Query  38   CGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQKYNLKDNNDIATKA  97
            CG C+ CR Q  R+W +R   E +   +  F+TLTI+ ++ ++  + ++L+       K 
Sbjct  130  CGQCWGCRLQHSREWAIRCMHEAQMHDHNCFITLTINPETLERRPRPWSLE------KKE  183

Query  98   IRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWG------------LGN----G  141
             +  + R+R+  GK +K++   E G E  +R H H I++G            LGN     
Sbjct  184  FQEFVHRLRRKIGKKIKYFHCGEYGDE-NKRPHYHAIIFGYDFPDKQLWERKLGNELYIS  242

Query  142  EKVTNNWKYGITFTGYFVNEKTIKYITKYMLK  173
             ++ N W +G    G    E +  Y+ +Y++K
Sbjct  243  PELENLWPHGYHRIGACTYE-SAHYVARYVMK  273


>gi|47566147|ref|YP_022485.1| nonstructural protein [Chlamydia phage 3]
 gi|47522482|emb|CAD79483.1| nonstructural protein [Chlamydia phage 3]
Length=315

 Score = 56.6 bits (135),  Expect = 9e-06, Method: Compositional matrix adjust.
 Identities = 68/261 (26%), Positives = 111/261 (43%), Gaps = 61/261 (23%)

Query  27   PDE-RLRYVTAACGDCYECRKQKQRQWMVRMSEENRQTPNAYFLTLTIDDKSYKQIKQKY  85
            P+E R+R+V   C  C  CR Q  + W  R   E        FLTLT +D+         
Sbjct  43   PEEYRVRWVVKPCLKCRFCRVQNAKIWSYRCMHEASLYSQNCFLTLTYEDR---------  93

Query  86   NLKDNNDIATKAIRLCLERVRK-LTGKSVKHWFITELGHEKTERLHLHGIVWGL------  138
            +L +N  +     RL L R+R+ +    ++++   E G  K +R H H +++        
Sbjct  94   HLPENGSLVRDHPRLFLRRLREHIYPHKIRYFGCGEYG-SKLQRPHYHLLIYNYDFPDKK  152

Query  139  ------GN----GEKVTNNWKYGITFTGYFVNEKTIKYITKYMLK----------VDEKH  178
                  GN     EK+   W +G +  G  V  ++  Y+ +Y LK            ++ 
Sbjct  153  LLSKKRGNPLFVSEKLMQLWPFGFSTVGS-VTRQSAGYVARYSLKKVNGDSSQDHYGQRL  211

Query  179  PKFRGKVLCS--AGIGAGYLKREDAKRHVY-----IPGKTNESYRMRNGEKLNLPIYYRN  231
            P+F   ++CS   GIGA +   E  KR VY     +     +S++ R       P  Y +
Sbjct  212  PEF---LMCSLKPGIGADWY--EKYKRDVYPQDYLVVQDKGKSFKTR-------PPRYYD  259

Query  232  KI---FTEEEREKLFLDKIEK  249
            K+   F  EE E++   ++EK
Sbjct  260  KLHSRFDPEEMEEIKQRRVEK  280



Lambda      K        H        a         alpha
   0.320    0.138    0.423    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 2010380126625
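
The gapped Lambda and K values above link each raw score (the number in parentheses on a "Score =" line) to its bit score through the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. The following is a minimal sketch of that conversion, not part of the BLAST output itself; the function name `bit_score` and the constant names are illustrative assumptions, and the parameter values are copied from the "Gapped" block of this report.

```python
import math

# Gapped statistical parameters copied from the "Gapped" block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score =" lines of the ten hits, in report order.
    for raw in (314, 235, 198, 195, 194, 184, 174, 140, 142, 135):
        print(f"raw score {raw:>3} -> {bit_score(raw):.1f} bits")
```

Rounded to one decimal place, the computed values agree with the bit scores listed for the hits below 100 bits (for example, a raw score of 235 corresponds to 95.1 bits). The top hit is printed in the report as a whole number (125), which matches the computed value with its fractional part dropped. The E-values are not reproduced by this sketch, since they also depend on the search space and on the compositional adjustment named in each "Method:" field.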