BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-9_CDS_annotation_glimmer3.pl_2_6
Length=384
                                                                       Score  E
Sequences producing significant alignments:                           (Bits)  Value

gi|547226430|ref|WP_021963493.1| putative uncharacterized protein        268  2e-80
gi|496050829|ref|WP_008775336.1| hypothetical protein                    256  2e-75
gi|490418709|ref|WP_004291032.1| hypothetical protein                    255  2e-75
gi|575094354|emb|CDL65742.1| unnamed protein product                     245  3e-71
gi|494822885|ref|WP_007558293.1| hypothetical protein                    216  3e-60
gi|575094321|emb|CDL65708.1| unnamed protein product                     152  2e-37
gi|565841287|ref|WP_023924568.1| hypothetical protein                    134  4e-31
gi|517172762|ref|WP_018361580.1| hypothetical protein                    131  2e-30
gi|494306153|ref|WP_007173049.1| hypothetical protein                    122  2e-27
gi|647452987|ref|WP_025792807.1| hypothetical protein                    122  5e-27
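
The summary rows above can be read programmatically. The following is a minimal Python sketch, assuming every hit row ends with the bit score and then the E-value, separated by whitespace. The names `Hit` and `parse_hit_line` are illustrative and are not part of BLAST+.

```python
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one summary row; return None for headers and other lines."""
    # The last two whitespace-separated fields are the bit score and E-value.
    parts = line.rsplit(None, 2)
    if len(parts) != 3:
        return None
    head, bits, evalue = parts
    # Assumption: some BLAST versions print tiny E-values as "e-100";
    # prefix a mantissa so float() accepts them.
    if evalue.lower().startswith("e"):
        evalue = "1" + evalue
    try:
        bits_value, e_value = float(bits), float(evalue)
    except ValueError:
        return None  # e.g. the "(Bits) Value" header row
    # The first field is the sequence identifier; the rest is the title.
    seq_id, _, description = head.strip().partition(" ")
    return Hit(seq_id, description.strip(), bits_value, e_value)
```

Calling `parse_hit_line` on each line of the report and discarding the `None` results yields the ten hits listed above. Restricting the scan to the lines between the table header and the first `>` record avoids accidental matches elsewhere in the report.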
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 268 bits (686), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 168/393 (43%), Positives = 229/393 (58%), Gaps = 39/393 (10%)
Query 1 VDYFTGvspslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPD 60
+DY G S + S + D +K+ TMFDL YCN+ KD G+LP +Q+GDV+V P
Sbjct 211 LDYLYGKSSGFHIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVAS-PI 269
Query 61 SGDSNVVLGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQF 120
GD ++ G +S++T +AP N I + + S+ +
Sbjct 270 FGDLDI-----------GDSSSLTFASAP-------QQGANTIQSGVLVVNNNSNTTAGL 311
Query 121 TVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEV 180
+VLALRQAE LQ+W+EI+QSG DY+ Q++KHF V LS C Y+GG + NLDISEV
Sbjct 312 SVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEV 371
Query 181 VNNNLATEGDTAVIAGKGVGAGNGS-FEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLV 239
VN NL T + A I GKG G NG+ ++ ++EH ++MCIYH +PLLD+++ Q
Sbjct 372 VNTNL-TGDNQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFK 430
Query 240 TDAESLPIPEFDNIGMEVL-PMTQVFN-----SPKASIVNLFNAGYNPRYFNWKTKLDVI 293
T IPEFD++GM+ L P +F S +SI N GY PRY + KT +D I
Sbjct 431 TTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSI----NMGYVPRYADLKTSIDEI 486
Query 294 NGAFTTTLKSWVSPVTESLLSGW--FCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFG 351
+G+F TL SWVSP+T+S +S + C KD D + M Y FFKVNP ++D IFG
Sbjct 487 HGSFIDTLVSWVSPLTDSYISAYRQAC----KDAGFSD--ITMTYNFFKVNPHIVDNIFG 540
Query 352 VNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384
V ADST +TDQLL+NSY VRN +G+PY
Sbjct 541 VKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573
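
The "Identities / Positives / Gaps" line of each alignment gives a count, the alignment length, and a rounded percentage. The following is a minimal Python sketch for extracting those numbers, assuming the line keeps the `Name = n/len (p%)` layout seen in this report. The helper name `parse_hsp_stats` is illustrative.

```python
import re

# Matches e.g. "Identities = 168/393 (43%)"
_STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name.lower(): (int(count), int(length), int(percent))
        for name, count, length, percent in _STAT.findall(line)
    }


stats = parse_hsp_stats(
    "Identities = 168/393 (43%), Positives = 229/393 (58%), Gaps = 39/393 (10%)"
)
count, length, percent = stats["identities"]
# Exact fraction behind the rounded percentage printed by BLAST.
identity_fraction = count / length
```

Working from the counts rather than the printed percentage avoids rounding error when hits are filtered by identity.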
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 256 bits (653), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 151/361 (42%), Positives = 209/361 (58%), Gaps = 31/361 (9%)
Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAITSKT 87
MFDL+YCNW KD+ GVLP Q+GD A +++ SNV+ +
Sbjct 247 MFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV---NLSNVL-------------------S 284
Query 88 APFPLFALDASPENPIPINsklrldlsslks-QFTVLALRQAEALQRWKEISQSGDSDYR 146
A + + D P P +S + S FTVLALRQAE LQ+WKEI+QSG+ DY+
Sbjct 285 AQYMVQTPDGDPVGGSPFSSTGVNLQTVNGSGTFTVLALRQAEFLQKWKEITQSGNKDYK 344
Query 147 EQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVGAGNGSF 206
+QI KH+ V + +A S M Y+GG + +LDI+EVVNNN+ T + A IAGKGV GNG
Sbjct 345 DQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI-TGSNAADIAGKGVVVGNGRI 403
Query 207 EYTTTE-HCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFN 265
+ E + ++MCIYH++PLLDYT + ++ IPEFD +GME +P+ + N
Sbjct 404 SFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN 463
Query 266 SPKASIVNLFNA--GYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNK 323
P S N+ ++ GY PRY ++KT +D GAF TTLKSWV + + +
Sbjct 464 -PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNY---Q 519
Query 324 DDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVP 383
DD ++NY FKVNP+ +DP+F V A ++ DTDQ L +S+ VVRNL DG+P
Sbjct 520 DDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLP 579
Query 384 Y 384
Y
Sbjct 580 Y 580
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 255 bits (652), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/371 (40%), Positives = 202/371 (54%), Gaps = 31/371 (8%)
Query 21 DYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIA 80
+++++ FDL+YCNW KD+ GVLP+ Q+G+ AV I + L S+ S+VG +
Sbjct 232 EFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL---SNFSTVGTS 288
Query 81 SAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQS 140
S TA L A D + ++L LRQAE LQ+WKEI+QS
Sbjct 289 PTTASGTATKNLPAFDTVGD-------------------LSILVLRQAEFLQKWKEITQS 329
Query 141 GDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVG 200
G+ DY++Q+ KH+GV + S +CTY+GGVS ++DI+EV+N N+ T A IAGKGVG
Sbjct 330 GNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGSAAADIAGKGVG 388
Query 201 AGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLP 259
NG + + + ++MCIYH +PLLDYT D L ++ IPEFD +GM+ +P
Sbjct 389 VANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMP 448
Query 260 MTQVFNSPKASIVNL--FNAGYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWF 317
+ Q+ N P S N GY PRY ++KT +D G F TL SWV +
Sbjct 449 LVQLMN-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQV 507
Query 318 CFGYNKDDAAPDTKV----IMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYV 373
+ P V MN+ FFKVNP LDPIF V A +TDQ L +S+
Sbjct 508 TLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKA 567
Query 374 VRNLSRDGVPY 384
VRNL DG+PY
Sbjct 568 VRNLDTDGLPY 578
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 245 bits (626), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 151/395 (38%), Positives = 218/395 (55%), Gaps = 48/395 (12%)
Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAV------LDIPDSGDSNVV-------------- 67
FD++YCN+ KDM GVLP +Q+G +V L++ +GDS +
Sbjct 231 FFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDPGTPGTS 290
Query 68 -------LGTDSHKSSVGIASAITSKTAP-----FPLFALDASP--ENPIPINsklrldl 113
+G D+ V ++ K+A FP A S ENP I
Sbjct 291 YVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLI------IE 344
Query 114 sslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSR 173
++ +LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+K+ LS+ Y+GG +
Sbjct 345 NNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCAT 404
Query 174 NLDISEVVNNNLATEGDTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTG 232
+LDI+EV+NNN+ T + A IAGKG GNGS + + E+ ++MCIYH +P++DY +G
Sbjct 405 SLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSG 463
Query 233 QDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GYNPRYFNWKTKL 290
D + DA S PIPE D IGME +P+ + N K S + GY PRY +WKT +
Sbjct 464 VDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSV 523
Query 291 DVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKD-DAAPDTKVIMNYKFFKVNPSVLDPI 349
D G F +L++W PV + L+ + + + PD+ + FFKVNPS++DP+
Sbjct 524 DRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGFFKVNPSIVDPL 580
Query 350 FGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384
F V ADST TD+ L +S+ VVRNL +G+PY
Sbjct 581 FAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 216 bits (549), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 139/378 (37%), Positives = 211/378 (56%), Gaps = 26/378 (7%)
Query 25 SGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVG------ 78
S +FD++Y NW +D+L G +P +Q+G+ + +P SG VV G + G
Sbjct 244 SFNLFDMRYSNWQRDLLHGTIPQAQYGEASA--VPVSGSMQVVEGPTPPAFTTGQDGVAF 301
Query 79 IASAITSKTAPFPLFALDASPENPI-PINsklrldlsslksQF--TVLALRQAEALQRWK 135
+ +T + + L A + E+ I N+ + S F ++LALR+AEA Q+WK
Sbjct 302 LNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWK 361
Query 136 EISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIA 195
E++ + + DY QI H+G + +A S+MC ++G ++ +L I+EVVNNN+ E + A IA
Sbjct 362 EVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE-NAADIA 420
Query 196 GKGVGAGNGSFEYTT-TEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIG 254
GKG +GNGS + ++ +VMC++H +P LDY + +T+ PIPEFD IG
Sbjct 421 GKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIG 480
Query 255 MEVLPMTQVFNSPKAS------IVNLFNAGYNPRYFNWKTKLDVINGAFTTTLKSWVSPV 308
ME +P+ + N K NL+ GY P+Y+NWKT LD G F +LK+W+ P
Sbjct 481 MEQVPVIRGLNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPF 539
Query 309 -TESLLSG-WFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVN 366
E+LL+ F N + A K FFKV+PSVLD +F V A+S +TDQ L +
Sbjct 540 DDEALLAADSVDFPDNPNVEADSVKA----GFFKVSPSVLDNLFAVKANSDLNTDQFLCS 595
Query 367 SYIGCYVVRNLSRDGVPY 384
+ VVR+L +G+PY
Sbjct 596 TLFDVNVVRSLDPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 152 bits (384), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 186/394 (47%), Gaps = 61/394 (15%)
Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAV--LDIPDSGDSNVVLGTDSH-----KSSVG-- 78
+ D+++ N D GVLP SQFG +V L++ ++ S V+ GT S +++ G
Sbjct 271 LLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEW 330
Query 79 -----IASAITSK----TAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE 129
+AS+ + + D + + IN+ L +++ALR A
Sbjct 331 EMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAINTSLSG-------NLSIIALRNAL 383
Query 130 ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG 189
A Q++KEI + D D++ Q+ HFG+K P + +IGG S ++I+E +N NL+ G
Sbjct 384 AAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLS--G 440
Query 190 DTAVIAGKG-VGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP 248
D G G G+ S ++T + VV+ IY P+LD+ G D L TDA IP
Sbjct 441 DNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIP 500
Query 249 EFDNIGMEVLPMTQVFNSPKASIVNLFNA---------------GYNPRYFNWKTKLDVI 293
E D+IGM+ +V + A + F A GY PRY +KT D
Sbjct 501 EMDSIGMQQTFRCEV--AAPAPYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRY 558
Query 294 NGAFTTTLKSWVSPVTESLLSG--WFCF-GYNKDDAAPDTKVIMNYKFFKVNPSVLDPIF 350
NGAF +LKSWV+ + + W + G N AP+ F P ++ +F
Sbjct 559 NGAFCHSLKSWVTGINFDAIQNNVWNTWAGIN----APN--------MFACRPDIVKNLF 606
Query 351 GVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384
V++ + D DQL V CY RNLSR G+PY
Sbjct 607 LVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 134 bits (337), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 166/366 (45%), Gaps = 54/366 (15%)
Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAITSKT 87
M L+Y +W+KD + P + + D + ++PD + N T K V + ++
Sbjct 317 MCQLRYRHWSKDWVTSAYPTASY-DKGIFELPDYINGNTGFATTEVKRDV-----VNNRG 370
Query 88 APFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS-DYR 146
+ + ++DA I+ D +R AL++ E +++ + DY
Sbjct 371 SQLEIKSMDAGSLGSNNISYISPND------------IRAMFALEKMLERTRAANGLDYS 418
Query 147 EQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAV-------IAGKGV 199
QI HFG K+P++ N ++IGG + ISEVV + + TA + GKG+
Sbjct 419 NQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTTSNGSVDGTASTGSVVGQVFGKGI 478
Query 200 GAGN-GSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVL 258
GA N G Y EH ++MCIY P +DY D E PEF+N+GM+
Sbjct 479 GAMNSGHISYDVKEHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQ-- 536
Query 259 PMTQ-----VFNSPKASIVNLFN--AGYNPRYFNWKTKLDVINGAFTT--TLKSWVSPVT 309
P+ Q NS K+ + N GY+ RY +KT D+I G F + +L +W +P
Sbjct 537 PVIQSDLCLCINSAKSDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKN 596
Query 310 ESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYI 369
+ F + K + PD V+P VL+PIF V + + TDQ LVNSY
Sbjct 597 N------YTFEFGK-LSLPD---------LLVDPKVLEPIFAVKYNGSMSTDQFLVNSYF 640
Query 370 GCYVVR 375
+R
Sbjct 641 DVKAIR 646
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 131 bits (329), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 158/378 (42%), Gaps = 63/378 (17%)
Query 29 FDLKYCNWNKDMLMGVLP---------NSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGI 79
F L+Y N KD+L V P N QF DI NV GT ++ SV I
Sbjct 229 FTLRYRNAQKDLLTNVRPTPLFSIDDFNPQFF-TGGSDIVMEKGPNVTGGTHEYRDSVVI 287
Query 80 ASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQ 139
EN + S ++ +V +R A AL++ ++
Sbjct 288 VGKNLK--------------ENGV----------DSKRTMISVADIRNAFALEKLASVTM 323
Query 140 SGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLAT---------EGD 190
Y+EQ+ HFG+ + + CTYIGG N+ + +V ++ T G
Sbjct 324 RAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGY 383
Query 191 TAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEF 250
GK G+G+G + EH ++MCIY VP + Y D + + +PEF
Sbjct 384 LGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEF 443
Query 251 DNIGMEVLPMTQVF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLKSWV 305
+N+GM+ L + N+ + I NL G+ PRY +KT LD+ +G F
Sbjct 444 ENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF-------- 495
Query 306 SPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLV 365
V + LS W A ++ N FK+NP LD +F VN + T TDQ+
Sbjct 496 --VHQEPLSYWTV-----ARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVFG 548
Query 366 NSYIGCYVVRNLSRDGVP 383
Y V ++S DG+P
Sbjct 549 GCYFNIVKVSDMSIDGMP 566
>gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis]
gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=519
Score = 122 bits (306), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 34/277 (12%)
Query 120 FTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISE 179
F+V +LR A A+ + ++ +++Q+R H+GV++P + Y+GG +L +S+
Sbjct 262 FSVSSLRSAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDLQVSD 321
Query 180 VVNNN--LATE-----GDTAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTG 232
V + ATE G IAGKG G+G G + EH V+MCIY VP + Y T
Sbjct 322 VTQTSGTTATEYKPEAGYLGRIAGKGTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTR 381
Query 233 QDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFN----SPKASIVNLFNAGYNPRYFNWKT 288
D + D PEF+N+GM+ L + + + PK ++ GY PRY +KT
Sbjct 382 LDPMVDKLDRFDFFTPEFENLGMQPLNSSYISSFCTPDPKNPVL-----GYQPRYSEYKT 436
Query 289 KLDVINGAFTT--TLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVL 346
LD+ +G F L SW + S W F P ++ FK++P L
Sbjct 437 ALDINHGQFAQNDALSSW----SVSRFRRWTTF--------PQLEIAD----FKIDPGCL 480
Query 347 DPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVP 383
+ +F V + T TD + V ++S DG+P
Sbjct 481 NSVFPVEFNGTESTDCVFGGCNFNIVKVSDMSVDGMP 517
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 122 bits (305), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/384 (28%), Positives = 178/384 (46%), Gaps = 74/384 (19%)
Query 31 LKYCNWNKDMLMGVLPNSQFGDVAVLDIPD--SGDSNVVLGTDSHKSSVGIASAITSKTA 88
++Y + KD L + P + D + ++P+ G+ NV+L T++ SV + S S ++
Sbjct 240 MRYRPYAKDWLTSMKPTPNYSD-GIFNLPEYVRGNGNVIL-TNNKSGSVSLDSGTVSPSS 297
Query 89 PFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS-DYRE 147
F+V LR A AL + E ++ + DY
Sbjct 298 -------------------------------FSVNDLRAAFALDKMLEATRRANGLDYAS 326
Query 148 QIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVV--NNNLATEGDTAVI---AGKGVGA- 201
QI HFG K+P++ +N ++GG ++ +SEVV N N A++G A I GKG+G+
Sbjct 327 QIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKGIGSM 386
Query 202 GNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMT 261
+G+ E+ +TEH ++MCIY P +Y + D E PEF ++G + L +
Sbjct 387 SSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFADLGYQALIGS 446
Query 262 QV------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLDVINGAFTT--TLKSWVSPVT 309
+ N +A S + L N GY RY +KT D++ G F + +L W +P
Sbjct 447 DLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRF 506
Query 310 ESLLSGWFCFGYNKDDAAPDTKVIMNYKF-----------FKVNPSVLDPIFGVNADSTW 358
+ F +G + AP+ K +Y+ F +NP++++PIF +A
Sbjct 507 D------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLTSA---V 557
Query 359 DTDQLLVNSYIGCYVVRNLSRDGV 382
D +VNS++ VR +S G+
Sbjct 558 QADHFIVNSFLDVKAVRPMSVTGL 581
Lambda K H a alpha
0.318 0.136 0.416 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2389518904266
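
The gapped Lambda and K values above relate each raw score (in parentheses on the "Score =" lines) to its bit score through the standard Karlin-Altschul conversion, S' = (Lambda * S - ln K) / ln 2. The following is a minimal Python sketch, assuming the gapped parameters printed in this report apply to every alignment. The function name `bit_score` is illustrative. The reported bit scores look truncated rather than rounded; that is an observation from this output, not a documented guarantee.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped parameters as printed in this report (rounded to three digits).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# Raw scores taken from the "Score = ... bits (raw)" lines above.
for raw in (686, 653, 652, 626, 549):
    bits = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
    print(raw, math.floor(bits))
```

Because Lambda is printed with only three significant digits, the recomputed value can differ from the internal one by a fraction of a bit, so a comparison against the reported score should allow a tolerance of about one bit.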