bitscore colors: <40, 40-50, 50-80, 80-200, >200
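
The five ranges in the color key can be applied to the bit score of any hit in this report. The following is a minimal sketch, assuming lower bounds are inclusive and upper bounds exclusive; the key itself does not say how boundary values such as exactly 40 or 200 are assigned. The name `score_bin` is hypothetical and not part of BLAST.

```python
def score_bin(bits: float) -> str:
    """Return the color-key range label for a bit score.

    Assumption: each range includes its lower bound and excludes its
    upper bound, so a score of exactly 200 lands in the top bin.
    """
    if bits < 40:
        return "<40"
    if bits < 50:
        return "40-50"
    if bits < 80:
        return "50-80"
    if bits < 200:
        return "80-200"
    return ">200"


# Bit scores taken from the first and eighth hits in this report.
for bits in (338, 153):
    print(bits, score_bin(bits))
```

Under this assumption, every hit listed in this report falls into one of the two highest ranges.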

BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-7_CDS_annotation_glimmer3.pl_2_4

Length=568
                                                                    Score   E
Sequences producing significant alignments:                         (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein  338     2e-104
gi|575094354|emb|CDL65742.1|      unnamed protein product           325     4e-99
gi|490418709|ref|WP_004291032.1|  hypothetical protein              318     4e-97
gi|496050829|ref|WP_008775336.1|  hypothetical protein              297     7e-89
gi|494822885|ref|WP_007558293.1|  hypothetical protein              277     5e-81
gi|575094321|emb|CDL65708.1|      unnamed protein product           214     2e-57
gi|575094339|emb|CDL65730.1|      unnamed protein product           181     6e-46
gi|517172762|ref|WP_018361580.1|  hypothetical protein              153     8e-37
gi|647452987|ref|WP_025792807.1|  hypothetical protein              150     6e-36
gi|494308783|ref|WP_007173938.1|  hypothetical protein              150     1e-35
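
The rows of the summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line starts with the pipe-delimited identifier and ends with the bit score and E-value as its last two whitespace-separated fields, with everything in between treated as the description. The names `Hit` and `parse_hit_line` are hypothetical and not part of BLAST.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary-table row into identifier, description, score, E-value.

    Assumption: fields are whitespace-separated and the last two are numeric.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"not a hit line: {line!r}")
    seq_id = fields[0]
    description = " ".join(fields[1:-2])
    return Hit(seq_id, description, float(fields[-2]), float(fields[-1]))


# First row of the table in this report.
hit = parse_hit_line(
    "gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 338 2e-104"
)
print(hit.seq_id, hit.bits, hit.evalue)
```

Splitting on whitespace rather than on fixed columns keeps the sketch independent of how the table happens to be aligned.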
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 338 bits (866), Expect = 2e-104, Method: Compositional matrix adjust.
Identities = 205/567 (36%), Positives = 305/567 (54%), Gaps = 76/567 (13%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
M+ + S +KN +R+GFDL KNAFTAKVGELLP+ K PGDKF I + FTRTQP
Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLL 120
V+++A++R+REYY+++FVP L++ + +M + P++AA S+ +++ PW
Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPD-PHHAADLVSSVNLSQRHPWFTFF 119
Query 121 TLNNAVENVKA-----STYHDNMFGFSRALGFAKLYNYLGVG---QFDPSKTLA---NLR 169
+ + N+ + Y N FGFSR KL NYL G ++ K + ++
Sbjct 120 DIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYGFGKDYESVKVPSDSDDIV 179
Query 170 ISVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDT---TPVASLKDLFDTNPNDS 226
+S FP AYQKI DY+R+ QW+ P+ YN D+ G+ + P++S + D N +
Sbjct 180 LSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPMSSFTN--DAFKNPT 237
Query 227 VFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGV 286
+F+L Y N+ KD + G +P AQ+GDV+ +PI+
Sbjct 238 MFDLNYCNFQKDYFTGMLPRAQYGDVSV--------------------ASPIFG------ 271
Query 287 QPDAQIGLRGAVT--GAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVP 344
D IG ++T AP G G V + N+ T
Sbjct 272 --DLDIGDSSSLTFASAPQQGANTIQSG------------------VLVVNNNSNT---- 307
Query 345 YEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDG 404
++ VL LR AECLQKW+EIAQ+ +Y +Q++ HF VSP+ S C+ + G+
Sbjct 308 ---TAGLSVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTS 364
Query 405 SIDISAVENTNLSSD-EAIIRGKGIGGYRVNKPETFKTTEHGVLMCIYHAVPLLDYAPTG 463
++DIS V NTNL+ D +A I+GKG G NK + F+++EHG++MCIYH +PLLD++
Sbjct 365 NLDISEVVNTNLTGDNQADIQGKGTGTLNGNKVD-FESSEHGIIMCIYHCLPLLDWSINR 423
Query 464 PDLQFMTTVDGDSWPVPELDSVGFEEL-PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTS 522
Q T D + +PE DSVG ++L PS + D+ GYVPRY KTS
Sbjct 424 IARQNFKTTFTD-YAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSINMGYVPRYADLKTS 482
Query 523 VDVVRGAFIDTLKSWTAPIGEDYMKIY 549
+D + G+FIDTL SW +P+ + Y+ Y
Sbjct 483 IDEIHGSFIDTLVSWVSPLTDSYISAY 509
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 325 bits (832), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 210/575 (37%), Positives = 310/575 (54%), Gaps = 68/575 (12%)
Query 5 FSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTS 64
S DIKN+P R+GFDL K FTAK GELLPV K LPGD F I+ FTRTQP++TS
Sbjct 1 MSMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTS 60
Query 65 AFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGT--QSITFNRKLPWVDLLTL 122
AF R+REYY+++FVP M+ + I M +A+ T + + ++P+ +
Sbjct 61 AFARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQI 120
Query 123 NNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQFDP--SKT--------LANLRISV 172
+ + N +A+ N FGF+R+ KL YLG G ++ S+T L NL +S
Sbjct 121 ADYL-NDQATAARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAKPLLYNLELSP 179
Query 173 FPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASLKDLFDTNPND--SVFEL 230
FP AYQKIY+D+YR +QWE P T+N D+ G + L+ P+D + F++
Sbjct 180 FPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKG-----TSDLQMDLTGLPSDDNNFFDI 234
Query 231 RYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNT--------G 282
RY N+ KD++ G +P AQ+G + VP++ G+L V I G PI+ T G
Sbjct 235 RYCNYQKDMFHGVLPVAQYGSASVVPIN--GQLNV----ISNGDSGPIFKTSTPDPGTPG 288
Query 283 AGGVQPDAQIGLRG---AVTGAPDN-GQTV--TAYGADKTDAARPYFYAVPDGSVAHLKT 336
V IG+ V+G+ N G++ + YG + R + P+ +
Sbjct 289 TSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPN----LIIE 344
Query 337 NAKTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRC 396
N + VP +L LR AE LQKWKE++ + ++Y SQ++ H+G+ + SH+
Sbjct 345 NNQGFYVP--------ILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQA 396
Query 397 QRVCGFDGSIDISAVENTNLSSDEAI-IRGKGIGGYRVNKPETFKTT-EHGVLMCIYHAV 454
+ + G S+DI+ V N N++ D A I GKG + N F++ E+G++MCIYH +
Sbjct 397 RYLGGCATSLDINEVINNNITGDNAADIAGKGT--FTGNGSIRFESKGEYGIIMCIYHVL 454
Query 455 PLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRP----- 509
P++DY +G D T VD S+P+PELD +G E +P +N P+KE
Sbjct 455 PIVDYVGSGVD-HSCTLVDATSFPIPELDQIGMESVPLVRAMN-----PVKESDTPSADT 508
Query 510 -FGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGE 543
GY PRYI WKTSVD G F D+L++W P+G+
Sbjct 509 FLGYAPRYIDWKTSVDRSVGDFADSLRTWCLPVGD 543
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 318 bits (816), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 209/560 (37%), Positives = 285/560 (51%), Gaps = 79/560 (14%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S I+NKP R+GFDL K FTAK GELLPV K LPGD FKI+ + FTRTQP
Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAAS--GTQSITFNRKLPWVD 118
V+T+AF RIREYY++FFVP L++ +N + M + P +A S T++ + ++P++
Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT 120
Query 119 ---LLTLNNAVENVKA-STYHDNMFGFSRALGFAKLYNYLGVGQFDPSKT--------LA 166
+ + NA+ A + Y N FG++R+ KL YLG G ++ T +A
Sbjct 121 SEAIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFLTDDWNTAPLMA 180
Query 167 NLRISVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASLKDLFDTNPNDS 226
NL ++F AYQKIY+D+YR+SQWE P T+N D+ +G + F N N
Sbjct 181 NLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSMNLDNAYSTEFYQNYN-- 238
Query 227 VFELRYANWNKDLYMGAMPNAQFGDVAFVPV--DSSGKLPVSLPSIEVGGVAPIYNTGAG 284
F+LRY NW KDL+ G +P+ Q+G+ A + D +GKL +S N
Sbjct 239 FFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLS-------------NFSTV 285
Query 285 GVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVP 344
G P TA G + P F V D S
Sbjct 286 GTSP-------------------TTASGTATKNL--PAFDTVGDLS-------------- 310
Query 345 YEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDG 404
+L LR AE LQKWKEI Q+ ++Y Q++ H+GVS S C + G
Sbjct 311 --------ILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSS 362
Query 405 SIDISAVENTNLS-SDEAIIRGKGIGGYRVNKPETFKTT-EHGVLMCIYHAVPLLDYAPT 462
SIDI+ V NTN++ S A I GKG+G N F + +G++MCIYH +PLLDY
Sbjct 363 SIDINEVINTNITGSAAADIAGKGVG--VANGEINFNSNGRYGLIMCIYHCLPLLDYTTD 420
Query 463 GPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTS 522
D F+ V+ + +PE D VG + +P L+N GYVPRYI +KTS
Sbjct 421 MLDPAFL-KVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANASGLVLGYVPRYIDYKTS 479
Query 523 VDVVRGAFIDTLKSWTAPIG 542
VD G F TL SW G
Sbjct 480 VDQSVGGFKRTLNSWVISYG 499
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 297 bits (760), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 205/568 (36%), Positives = 284/568 (50%), Gaps = 96/568 (17%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S ++NK R+GFDL +K FTAK GELLPV LPGDK+ I + FTRTQP
Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWV--- 117
++T+AF R+REYY+++FVP +L++ +N + M + P +A S S N+ L V
Sbjct 61 LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSA--NQALAGVMPN 118
Query 118 -------DLLTLNNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQF----------- 159
D L L A + ++Y N FG+SR+LG AKL YLG G F
Sbjct 119 VTCKGIADYLNL-VAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYATSKNNTW 177
Query 160 DPSKTLANLRISVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNG--EDTTPVASLKD 217
S +NL+++++ AYQKIY D+ R+SQWE P +N D+ +G + + S+
Sbjct 178 TKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAMTIDSMIT 237
Query 218 LFDTNPNDSVFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLP----VSLPSIEVG 273
P ++F+LRY NW KDL+ G +P Q+GD A V V+ S L V P +
Sbjct 238 GQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDGDPV 297
Query 274 GVAPIYNTGAGGVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAH 333
G +P +TG N QTV G
Sbjct 298 GGSPFSSTGV--------------------NLQTVNGSGT-------------------- 317
Query 334 LKTNAKTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTS 393
F VL LR AE LQKWKEI Q+ ++Y Q++ H+ VS S
Sbjct 318 -----------------FTVLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYS 360
Query 394 HRCQRVCGFDGSIDISAVENTNLS-SDEAIIRGKG--IGGYRVNKPETFKTTE-HGVLMC 449
+ G S+DI+ V N N++ S+ A I GKG +G R+ +F E +G++MC
Sbjct 361 EMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRI----SFDAGERYGLIMC 416
Query 450 IYHAVPLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRP 509
IYH++PLLDY + F T ++ + +PE D VG E +P SL+N
Sbjct 417 IYHSLPLLDYTTDLVNPAF-TKINSTDFAIPEFDRVGMESVPLVSLMNPLQSSYNVGSSI 475
Query 510 FGYVPRYISWKTSVDVVRGAFIDTLKSW 537
GY PRYIS+KT VD GAF TLKSW
Sbjct 476 LGYAPRYISYKTDVDSSVGAFKTTLKSW 503
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 277 bits (709), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 193/581 (33%), Positives = 289/581 (50%), Gaps = 79/581 (14%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S ++NKP R+G+DL K FTAK G L+PV+W LP D + + F RTQP
Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGT--QSITFNRKLPWVD 118
++T+AF R+R Y++++FVP M+ AI M +A+ ++ + +LP+
Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPY-- 125
Query 119 LLTLNNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQFDP---------------SK 163
T + + + N FG+ RA + YLG G F P
Sbjct 126 -FTAEQVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGATWATRP 184
Query 164 TLANLRISVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTT-----PVASLKDL 218
L NL+ S FP +AYQKIY D+ R +QWE + P T+N D+ +G + V KD
Sbjct 185 MLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQLDFTVEGFKDS 244
Query 219 FDTNPNDSVFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPI 278
F+ +F++RY+NW +DL G +P AQ+G+ + VPV SG + V +E G P
Sbjct 245 FN------LFDMRYSNWQRDLLHGTIPQAQYGEASAVPV--SGSMQV----VE-GPTPPA 291
Query 279 YNTGAGGVQPDAQIGLRGAVTGAPDNG--QTVTAYGADKTDAARPYFYAVPDGSVAHLKT 336
+ TG GV L G VT +G Q T+ G + L+
Sbjct 292 FTTGQDGVA-----FLNGNVTIQGSSGYLQAQTSVGESRI-----------------LRF 329
Query 337 NAKTIQVPYEFSSKFDV--LQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSH 394
N + E S F V L LR AE QKWKE+A A+ ++Y SQ++AH+G S N S
Sbjct 330 NNTNSGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSD 389
Query 395 RCQRVCGFDGSIDISAVENTNLSSDEAI-IRGKGIGGYRVNKPETFKT-TEHGVLMCIYH 452
CQ + + + I+ V N N++ + A I GKG N F ++G++MC++H
Sbjct 390 MCQWLGSINIDLSINEVVNNNITGENAADIAGKGT--MSGNGSINFNVGGQYGIVMCVFH 447
Query 453 AVPLLDYAPTGPDLQFMTTVDGD-SWPVPELDSVGFEELPSYSLLNTSDVQP------IK 505
+P LDY + P F TT+ +P+PE D +G E++P LN V+P +
Sbjct 448 VLPQLDYITSAP--HFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNP--VKPKDGDFKVS 503
Query 506 EPRPFGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGEDYM 546
FGY P+Y +WKT++D G F +LK+W P ++ +
Sbjct 504 PNLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEAL 544
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 214 bits (545), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/601 (29%), Positives = 267/601 (44%), Gaps = 97/601 (16%)
Query 10 IKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRI 69
+KNKP R+ FDL ++N FTAKVGELLP + + PGD K+S +FTRT P+ ++AFTR+
Sbjct 13 LKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNAFTRL 72
Query 70 REYYEWFFVPLHLMYRNSNEAIMSMENQPN------YAAS--GTQSITFNRKLPWVDLLT 121
RE ++FFVP +++ + +++M N A+S G Q +T ++P V+ T
Sbjct 73 RENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVT--TQMPCVNYKT 130
Query 122 L--------NNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQFDPSKTLANLRI--- 170
L N + S + G R AKL LG G F + AN ++
Sbjct 131 LHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNF--PEQFANFKVNND 188
Query 171 --------------------SVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTT 210
S+F AY KI ND+Y QW+ N D+ +++
Sbjct 189 KHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDYLT-PNSS 247
Query 211 PVASLKDLFDTNPNDSV-------FELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKL 263
+ S+ D + P+DS+ ++R++N D + G +P +QFG + V ++
Sbjct 248 SLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLG--- 304
Query 264 PVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDNGQTV--TAYGADKTDAARP 321
G A + G D+ G TG + Q V +A G K D +
Sbjct 305 -------NASGSAVL----NGTTSKDS--GRWRTTTGEWEMEQRVASSANGNLKLDNSNG 351
Query 322 YFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVK 381
F ++H T + + + S ++ LR A QK+KEI AN ++ SQV+
Sbjct 352 TF-------ISHDHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVE 404
Query 382 AHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEAIIRG---KGIGGYRVNKPET 438
AHFG+ P+ + + G I+I+ N NLS D G +G G +
Sbjct 405 AHFGIKPDEKNENSL-FIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASI----K 459
Query 439 FKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEEL-------- 490
F +GV++ IY P+LD+A G D T D + +PE+DS+G ++
Sbjct 460 FTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKT-DASDFVIPEMDSIGMQQTFRCEVAAP 518
Query 491 ----PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGEDYM 546
+ D +GY PRY +KTS D GAF +LKSW I D +
Sbjct 519 APYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAI 578
Query 547 K 547
+
Sbjct 579 Q 579
>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588
Score = 181 bits (458), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 153/548 (28%), Positives = 238/548 (43%), Gaps = 87/548 (16%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
++GFD+ ++ FT+ VG+LLPV++ + PGDK +IS FTRTQP+ ++A R+ E+ E+
Sbjct 16 KNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQPMKSTAMARLTEHIEY 75
Query 76 FFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLLTLNNAVE-------- 127
FFVP M+ +++ + + ++T +P+ ++ A+E
Sbjct 76 FFVPFEQMFSLFGSVFYGIDDYNSSSLVKHNNLT----MPFFKSDAVSAALEAAYTSFSS 131
Query 128 NVKASTYHDNMFGFSRALGFAKLYNYLGVGQFDPSKTLA---NLRISVFPFYAYQKIYND 184
++ +M G R G +L LG G S + +SVF F AYQKI+ND
Sbjct 132 SINRKVLTPDMMGQPRVYGILRLSEMLGYGSLLLSNDNNLLPHADMSVFLFTAYQKIFND 191
Query 185 YYRNSQWEVNKPWTYNCDFWNGEDTTPVASLKDLFDTNPNDSVFELRYANWNKDLYMGAM 244
+YR + + +YN D+ G+ T ++S+FEL Y W KD + +
Sbjct 192 FYRLDDYTSVQHKSYNVDYAQGQPIT-------------DNSMFELHYRPWKKDYFTNVI 238
Query 245 PNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDN 304
PN F VD+ GAG D +GL
Sbjct 239 PNPYFSS-----VDNKSSF-----------------GGAGLF--DRPVGL---------- 264
Query 305 GQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQ-VPYEFSSK----FDVLQLRAA 359
++T++ D +D F P ++ ++ N Q +P +S V LR
Sbjct 265 --SITSFNFDGSD-----FLQAP-SDLSTMENNQPIFQELPVNLTSASSAGLSVSDLRYL 316
Query 360 ECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSD 419
K I Q G++Y +Q AHFG S + G + IS+VE+T + D
Sbjct 317 YATDKLLRITQFAGKHYDAQTLAHFGKRVPQGVSGEVYYIGGQSQPLQISSVESTATTFD 376
Query 420 EAIIRGKGIG-----GYRV---NKPETFKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTT 471
+ G +G GY K +F+ HGVLM IY AVP DY D T
Sbjct 377 SGDVVGSVLGELAGKGYSQTGNQKDFSFEAPCHGVLMAIYSAVPEADYLDERIDY-LNTL 435
Query 472 VDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFI 531
+ + + PE DS+G E P+Y L + + G+ RY K+ D++ GAF
Sbjct 436 IQSNDFYKPEFDSLGMEPFPNYEL---DQYRMVGNNSRLGWRYRYSGLKSKPDLISGAFK 492
Query 532 DTLKSWTA 539
TL+ W A
Sbjct 493 YTLRDWVA 500
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 153 bits (387), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 150/548 (27%), Positives = 233/548 (43%), Gaps = 83/548 (15%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
R+ FD+ ++ FTA G LLPV LP D +I+ F RT P++++AF +R YE+
Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77
Query 76 FFVPLHLMYRNSNEAIMSMENQPN---YAASGT---QSITFN-RKLPWVDLLTLNNAVEN 128
+FVP ++ ++ I M + + YA G ++F+ +KL VD N A ++
Sbjct 78 YFVPYKQLWSGFDQFITGMSDYKSSFMYAFKGKTPPSCVSFDVQKL--VDWCKTNTA-KD 134
Query 129 VKASTYHDNMFGFSRALGFAKLYNYLGVGQFDPSKTLANLRISVFPFYAYQKIYNDYYRN 188
+ + ++ LG+ K N GV +P+ T + + F AYQKIYND+YRN
Sbjct 135 IHGFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMG-KCTPFRGLAYQKIYNDFYRN 193
Query 189 SQWEVNKPWTYNCDFWNGEDTTPVASLKDLFDTNPND-SVFELRYANWNKDLYMGAMPNA 247
+ +E + ++N D + G +K+ P D F LRY N KDL P
Sbjct 194 TTYEEYQLESFNVDMFYGS-----GKVKETIPNEPWDYDWFTLRYRNAQKDLLTNVRPT- 247
Query 248 QFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDNGQT 307
P + P + TG +
Sbjct 248 --------------------PLFSIDDFNPQFFTGGSDI--------------------- 266
Query 308 VTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAECLQKWKE 367
V G + T Y SV + N K V + + V +R A L+K
Sbjct 267 VMEKGPNVTGGTHEY-----RDSVVIVGKNLKENGVDSK-RTMISVADIRNAFALEKLAS 320
Query 368 IAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEAIIRGKG 427
+ G+ Y Q++AHFG+S RC + GFD +I + V ++ ++ +
Sbjct 321 VTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTG-TKDTS 379
Query 428 IGGY--RVNKPET--------FKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTVDGDSW 477
GGY R T F EHG+LMCIY VP + Y D F+ ++ +
Sbjct 380 FGGYLGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVD-PFVQKIERGDF 438
Query 478 PVPELDSVGFEEL----PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFI-- 531
VPE +++G + L SY N + IK FG+ PRY +KT++D+ G F+
Sbjct 439 FVPEFENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQFVHQ 498
Query 532 DTLKSWTA 539
+ L WT
Sbjct 499 EPLSYWTV 506
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 150 bits (380), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 158/579 (27%), Positives = 241/579 (42%), Gaps = 130/579 (22%)
Query 13 KPR--RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIR 70
KPR R+GFDL ++ F+AK G+LLP+ P + FK S + RT ++T+++ R++
Sbjct 5 KPRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMK 64
Query 71 EYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQ---SITFNRKLPWVDLLTLNNAVE 127
EYY +FFV +++ ++ I+ N P+ A +G + + +N+ V L +
Sbjct 65 EYYHFFFVSYRSLWQWFDQFIVGT-NNPHSALNGVKKNGTTNYNQICSSVPTFDLGKLIT 123
Query 128 NVKASTYHDNMFGFSRALGFAKLYNYLGVGQFDPSK--TLANL----------------- 168
+K S F +S G AKL N L G + K L NL
Sbjct 124 RLKTSDMDSQGFNYSE--GAAKLLNMLNYGVTNKGKFMNLENLITSTSYLPSKDDKEPSS 181
Query 169 ----RISVFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDT---TPVASLKDLFDT 221
++S F AYQKI+ND+YRN W + ++N D + + P +LK
Sbjct 182 IYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNLTIEPDVALK----- 236
Query 222 NPNDSVFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVG-GVAPIYN 280
++RY + KD P + D F +LP G G + N
Sbjct 237 -----FCQMRYRPYAKDWLTSMKPTPNYSDGIF-----------NLPEYVRGNGNVILTN 280
Query 281 TGAGGVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKT 340
+G V D+ G+V+
Sbjct 281 NKSGSVSLDS--------------------------------------GTVS-------- 294
Query 341 IQVPYEFSSKFDVLQLRAAECLQKWKEIA-QANGQNYASQVKAHFGVSPNPMTSHRCQRV 399
P FS V LRAA L K E +ANG +YASQ++AHFG ++ + +
Sbjct 295 ---PSSFS----VNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFL 347
Query 400 CGFDGSIDISAV--ENTNLSSDEAI-----IRGKGIGGYRVNKPETFKTTEHGVLMCIYH 452
GFD SI +S V N N +SD + + GKGIG E F +TEHG++MCIY
Sbjct 348 GGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIE-FDSTEHGIIMCIYS 406
Query 453 AVPLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEP----- 507
P +Y + D F + + + PE +G++ L L+ ++ K+
Sbjct 407 VAPQSEYNASYLD-PFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDI 465
Query 508 ----RPFGYVPRYISWKTSVDVVRGAFID--TLKSWTAP 540
GY RY +KT+ D+V G F +L W P
Sbjct 466 ELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTP 504
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 150 bits (378), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 151/549 (28%), Positives = 225/549 (41%), Gaps = 100/549 (18%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
R+ FDL ++ FTA G LLPV +P D +I+ + F RT P++T+AF +R YE+
Sbjct 17 RNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEF 76
Query 76 FFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLLTLNNAVENVKAS--- 132
FFVP H ++ ++ I M + + A Q T ++P+ ++ ++ N++ K S
Sbjct 77 FFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTGKESGSG 136
Query 133 -------TYHDNMFGFSRALGFAKLYNYLGVGQFDPSKTLAN---LRISVFPFYAYQKIY 182
+ F LG+ + ++ G D L N SVF AY KIY
Sbjct 137 STDDLQYKFKYGAFRLLDLLGYGRKFDSFGTAYPDNVSGLKNNLDYNCSVFRILAYNKIY 196
Query 183 NDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASLKDLFDTNPNDSVFELRYANWNKDLYMG 242
DYYRNS +E ++N D + G L D +F+LRY N D +
Sbjct 197 QDYYRNSNYENFDTDSFNFDKFKG----------GLVDAKVVADLFKLRYRNAQTDYFTN 246
Query 243 AMPNAQFG-DVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGA 301
+ F AF VD+ +AP R V
Sbjct 247 LRQSQLFSFTTAFEDVDNI-------------NIAP-----------------RDYVKSD 276
Query 302 PDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAEC 361
N V +G D TD++ D SV+ L+ K + +RA +
Sbjct 277 GSNFTRVN-FGVD-TDSSE------GDFSVSSLRAAFAV--------DKLLSVTMRAGKT 320
Query 362 LQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEA 421
Q Q++AH+GV R + GFD + +S V T+ ++
Sbjct 321 FQ--------------DQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATE 366
Query 422 I---------IRGKGIGGYRVNKPETFKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTV 472
+ GKG G R F EHGVLMCIY VP + Y T D + +
Sbjct 367 YKPEAGYLGRVAGKGTGSGR--GRIVFDAKEHGVLMCIYSLVPQIQYDCTRLD-PMVDKL 423
Query 473 DGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFI- 531
D + PE +++G + L S + + P K P GY PRY +KT++DV G F
Sbjct 424 DRFDYFTPEFENLGMQPLNSSYISSFCTTDP-KNP-VLGYQPRYSEYKTALDVNHGQFAQ 481
Query 532 -DTLKSWTA 539
D L SW+
Sbjct 482 SDALSSWSV 490

Lambda      K        H        a         alpha
   0.318    0.136    0.423    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 4146447800358
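
The bit scores in the alignment headers are related to the raw scores shown in parentheses through the gapped Lambda and K listed above, using the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. The following is a minimal sketch, assuming the rounded parameter values printed in this report. `bit_score` is a hypothetical helper, and truncating the result to an integer for display is an assumption that happens to agree with the scores shown here.

```python
import math

# Gapped statistical parameters as printed in this report (rounded values).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: int,
              lam: float = GAPPED_LAMBDA,
              k: float = GAPPED_K) -> float:
    """Normalize a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores from the first three alignment headers in this report.
for raw in (866, 832, 816):
    # Assumption: the displayed score is the truncated integer part.
    print(raw, int(bit_score(raw)))
```

Because the parameters are printed to only three significant digits, the recomputed values are approximate and may drift from the report for scores that land close to an integer boundary.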