BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-27_CDS_annotation_glimmer3.pl_2_1
Length=612
                                                                  Score     E
Sequences producing significant alignments:                       (Bits)  Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein   391  2e-124
gi|575094354|emb|CDL65742.1| unnamed protein product                371  3e-116
gi|490418709|ref|WP_004291032.1| hypothetical protein               357  3e-111
gi|496050829|ref|WP_008775336.1| hypothetical protein               342  2e-105
gi|494822885|ref|WP_007558293.1| hypothetical protein               311  2e-93
gi|575094321|emb|CDL65708.1| unnamed protein product                224  8e-61
gi|575094339|emb|CDL65730.1| unnamed protein product                179  3e-45
gi|517172762|ref|WP_018361580.1| hypothetical protein               175  4e-44
gi|647452987|ref|WP_025792807.1| hypothetical protein               171  1e-42
gi|494308783|ref|WP_007173938.1| hypothetical protein               163  4e-40
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 391 bits (1004), Expect = 2e-124, Method: Compositional matrix adjust.
Identities = 235/631 (37%), Positives = 341/631 (54%), Gaps = 77/631 (12%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
M+ + S +KN +R+GFDL KNAFTAKVGELLP+ K PGDKF I + FTRTQP
Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLL 120
V+++A++R+REYY+++FVP L++ + +M + P++AA S+ +++ PW
Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPD-PHHAADLVSSVNLSQRHPWFTFF 119
Query 121 TLNNAVENVKA-----STYHDNMFGFSRALGFAKLYNYLGVG------QVDPSKTLANLR 169
+ + N+ + Y N FGFSR KL NYL G V ++
Sbjct 120 DIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYGFGKDYESVKVPSDSDDIV 179
Query 170 ISAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDT---TPVASSKDLFDTNPNDS 226
+S FP AYQKI DY+R+ QW+ P+ YN D+ G+ + P++S + D N +
Sbjct 180 LSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPMSSFTN--DAFKNPT 237
Query 227 IFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGV 286
+F+L Y N+ KD + G +P AQ+GDV+ +PI+
Sbjct 238 MFDLNYCNFQKDYFTGMLPRAQYGDVSVA--------------------SPIFG------ 271
Query 287 QPDAQIGLRGAVT--GAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVP 344
D IG ++T AP G G V + N+ T
Sbjct 272 --DLDIGDSSSLTFASAPQQGANTIQSG------------------VLVVNNNSNT---- 307
Query 345 YEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDG 404
++ VL LR AECLQKW+EIAQ+ +Y +Q++ HF VSP+ S C+ + G+
Sbjct 308 ---TAGLSVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTS 364
Query 405 SIDISAVENTNLSSD-EAIIRGKGIGGYRVNKPETFKTTEHGVLMCIYHAVPLLDYAPTG 463
++DIS V NTNL+ D +A I+GKG G NK + F+++EHG++MCIYH +PLLD++
Sbjct 365 NLDISEVVNTNLTGDNQADIQGKGTGTLNGNKVD-FESSEHGIIMCIYHCLPLLDWSINR 423
Query 464 PDLQFMTTVDGDSWPVPELDSVGFEEL-PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTS 522
Q T D + +PE DSVG ++L PS + D+ GYVPRY KTS
Sbjct 424 IARQNFKTTFTD-YAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSINMGYVPRYADLKTS 482
Query 523 VDVVRGAFIDTLKSWTAPIGEDYMKIYFDNNNVPGGAHFGF-YTWFKVNPSVVNPIFGVV 581
+D + G+FIDTL SW +P+ + Y+ Y G + Y +FKVNP +V+ IFGV
Sbjct 483 IDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHIVDNIFGVK 542
Query 582 ADGSWNTDQLLVNCDFDVRVARNLSYDGLPY 612
AD + NTDQLL+N FD++ RN Y+GLPY
Sbjct 543 ADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 371 bits (952), Expect = 3e-116, Method: Compositional matrix adjust.
Identities = 243/648 (38%), Positives = 352/648 (54%), Gaps = 73/648 (11%)
Query 5 FSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTS 64
S DIKN+P R+GFDL K FTAK GELLPV K LPGD F I+ FTRTQP++TS
Sbjct 1 MSMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTS 60
Query 65 AFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGT--QSITFNRKLPWVDLLTL 122
AF R+REYY+++FVP M+ + I M +A+ T + + ++P+ +
Sbjct 61 AFARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQI 120
Query 123 NNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQVDP--SKT--------LANLRISA 172
+ + N +A+ N FGF+R+ KL YLG G + S+T L NL +S
Sbjct 121 ADYL-NDQATAARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAKPLLYNLELSP 179
Query 173 FPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPNDSIFELRY 232
FP AYQKIY+D+YR +QWE P T+N D+ G + + D N + F++RY
Sbjct 180 FPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLPSDDN---NFFDIRY 236
Query 233 ANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNT--------GAG 284
N+ KD++ G +P AQ+G + VP++ G+L V I G PI+ T G
Sbjct 237 CNYQKDMFHGVLPVAQYGSASVVPIN--GQLNV----ISNGDSGPIFKTSTPDPGTPGTS 290
Query 285 GVQPDAQIGLRG---AVTGAPDN-GQTV--TAYGADKTDAARPYFYAVPDGSVAHLKTNA 338
V IG+ V+G+ N G++ + YG + R + P+ + N
Sbjct 291 YVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPN----LIIENN 346
Query 339 KTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQR 398
+ VP +L LR AE LQKWKE++ + ++Y SQ++ H+G+ + SH+ +
Sbjct 347 QGFYVP--------ILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARY 398
Query 399 VCGFDGSIDISAVENTNLSSDEAI-IRGKGIGGYRVNKPETFKTT-EHGVLMCIYHAVPL 456
+ G S+DI+ V N N++ D A I GKG + N F++ E+G++MCIYH +P+
Sbjct 399 LGGCATSLDINEVINNNITGDNAADIAGKGT--FTGNGSIRFESKGEYGIIMCIYHVLPI 456
Query 457 LDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRP------F 510
+DY +G D T VD S+P+PELD +G E +P +N P+KE
Sbjct 457 VDYVGSGVD-HSCTLVDATSFPIPELDQIGMESVPLVRAMN-----PVKESDTPSADTFL 510
Query 511 GYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGEDYM----KIYFDNN-NV-PGGAHFGFY 564
GY PRYI WKTSVD G F D+L++W P+G+ + + F +N NV P GF
Sbjct 511 GYAPRYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGF- 569
Query 565 TWFKVNPSVVNPIFGVVADGSWNTDQLLVNCDFDVRVARNLSYDGLPY 612
FKVNPS+V+P+F VVAD + TD+ L + FDV+V RNL +GLPY
Sbjct 570 --FKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 357 bits (916), Expect = 3e-111, Method: Compositional matrix adjust.
Identities = 240/640 (38%), Positives = 324/640 (51%), Gaps = 90/640 (14%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S I+NKP R+GFDL K FTAK GELLPV K LPGD FKI+ + FTRTQP
Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAAS--GTQSITFNRKLPWVD 118
V+T+AF RIREYY++FFVP L++ +N + M + P +A S T++ + ++P++
Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT 120
Query 119 ---LLTLNNAVENVKA-STYHDNMFGFSRALGFAKLYNYLGVGQVDPSKT--------LA 166
+ + NA+ A + Y N FG++R+ KL YLG G + T +A
Sbjct 121 SEAIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFLTDDWNTAPLMA 180
Query 167 NLRISAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPNDS 226
NL + F AYQKIY+D+YR+SQWE P T+N D+ +G + F N N
Sbjct 181 NLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSMNLDNAYSTEFYQNYN-- 238
Query 227 IFELRYANWNKDLYMGAMPNAQFGDVAFVPV--DSSGKLPVSLPSIEVGGVAPIYNTGAG 284
F+LRY NW KDL+ G +P+ Q+G+ A + D +GKL +S N
Sbjct 239 FFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLS-------------NFSTV 285
Query 285 GVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVP 344
G P TA G + P F V D S
Sbjct 286 GTSP-------------------TTASGTATKNL--PAFDTVGDLS-------------- 310
Query 345 YEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDG 404
+L LR AE LQKWKEI Q+ ++Y Q++ H+GVS S C + G
Sbjct 311 --------ILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSS 362
Query 405 SIDISAVENTNLS-SDEAIIRGKGIGGYRVNKPETFKTT-EHGVLMCIYHAVPLLDYAPT 462
SIDI+ V NTN++ S A I GKG+G N F + +G++MCIYH +PLLDY
Sbjct 363 SIDINEVINTNITGSAAADIAGKGVG--VANGEINFNSNGRYGLIMCIYHCLPLLDYTTD 420
Query 463 GPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTS 522
D F+ V+ + +PE D VG + +P L+N GYVPRYI +KTS
Sbjct 421 MLDPAFL-KVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANASGLVLGYVPRYIDYKTS 479
Query 523 VDVVRGAFIDTLKSWTAPIGEDYM--KIYFDNNN--------VPGGAHFGFYTWFKVNPS 572
VD G F TL SW G + ++ N+ VP A F T+FKVNP
Sbjct 480 VDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNF-TFFKVNPD 538
Query 573 VVNPIFGVVADGSWNTDQLLVNCDFDVRVARNLSYDGLPY 612
++PIF V A NTDQ L + FD++ RNL DGLPY
Sbjct 539 CLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 342 bits (877), Expect = 2e-105, Method: Compositional matrix adjust.
Identities = 241/649 (37%), Positives = 325/649 (50%), Gaps = 106/649 (16%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S ++NK R+GFDL +K FTAK GELLPV LPGDK+ I + FTRTQP
Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWV--- 117
++T+AF R+REYY+++FVP +L++ +N + M + P +A S S N+ L V
Sbjct 61 LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSA--NQALAGVMPN 118
Query 118 -------DLLTLNNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQV----------- 159
D L L A + ++Y N FG+SR+LG AKL YLG G
Sbjct 119 VTCKGIADYLNL-VAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYATSKNNTW 177
Query 160 DPSKTLANLRISAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLF 219
S +NL+++ + AYQKIY D+ R+SQWE P +N D+ +G T A + D
Sbjct 178 TKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSG--TVDSAMTIDSM 235
Query 220 DTN----PNDSIFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLP----VSLPSIE 271
T P ++F+LRY NW KDL+ G +P Q+GD A V V+ S L V P +
Sbjct 236 ITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDGD 295
Query 272 VGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSV 331
G +P +TG N QTV G
Sbjct 296 PVGGSPFSSTGV--------------------NLQTVNGSGT------------------ 317
Query 332 AHLKTNAKTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPM 391
F VL LR AE LQKWKEI Q+ ++Y Q++ H+ VS
Sbjct 318 -------------------FTVLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGEA 358
Query 392 TSHRCQRVCGFDGSIDISAVENTNLS-SDEAIIRGKG--IGGYRVNKPETFKTTE-HGVL 447
S + G S+DI+ V N N++ S+ A I GKG +G R+ +F E +G++
Sbjct 359 YSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRI----SFDAGERYGLI 414
Query 448 MCIYHAVPLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEP 507
MCIYH++PLLDY + F T ++ + +PE D VG E +P SL+N
Sbjct 415 MCIYHSLPLLDYTTDLVNPAF-TKINSTDFAIPEFDRVGMESVPLVSLMNPLQSSYNVGS 473
Query 508 RPFGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGE----DYMKIYFDNNNVPGGAHFGF 563
GY PRYIS+KT VD GAF TLKSW + + D NN PG
Sbjct 474 SILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPG--TLVN 531
Query 564 YTWFKVNPSVVNPIFGVVADGSWNTDQLLVNCDFDVRVARNLSYDGLPY 612
YT FKVNP+ V+P+F V A S +TDQ L + FDV+V RNL DGLPY
Sbjct 532 YTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 311 bits (797), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 221/649 (34%), Positives = 329/649 (51%), Gaps = 80/649 (12%)
Query 1 MAGLFSYGDIKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQP 60
MA + S ++NKP R+G+DL K FTAK G L+PV+W LP D + + F RTQP
Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67
Query 61 VDTSAFTRIREYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGT--QSITFNRKLPWVD 118
++T+AF R+R Y++++FVP M+ AI M +A+ ++ + +LP+
Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPY-- 125
Query 119 LLTLNNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQVDP---------------SK 163
T + + + N FG+ RA + YLG G P
Sbjct 126 -FTAEQVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGATWATRP 184
Query 164 TLANLRISAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGE-DTTPVASSKDLFDTN 222
L NL+ S FP +AYQKIY D+ R +QWE + P T+N D+ +G D+ + + + F +
Sbjct 185 MLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQLDFTVEGFKDS 244
Query 223 PNDSIFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTG 282
N +F++RY+NW +DL G +P AQ+G+ + VPV SG + V +E G P + TG
Sbjct 245 FN--LFDMRYSNWQRDLLHGTIPQAQYGEASAVPV--SGSMQV----VE-GPTPPAFTTG 295
Query 283 AGGVQPDAQIGLRGAVTGAPDNG--QTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKT 340
GV L G VT +G Q T+ G + L+ N
Sbjct 296 QDGVA-----FLNGNVTIQGSSGYLQAQTSVGESRI-----------------LRFNNTN 333
Query 341 IQVPYEFSSKFDV--LQLRAAECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQR 398
+ E S F V L LR AE QKWKE+A A+ ++Y SQ++AH+G S N S CQ
Sbjct 334 SGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQW 393
Query 399 VCGFDGSIDISAVENTNLSSDEAI-IRGKGIGGYRVNKPETFKT-TEHGVLMCIYHAVPL 456
+ + + I+ V N N++ + A I GKG N F ++G++MC++H +P
Sbjct 394 LGSINIDLSINEVVNNNITGENAADIAGKGT--MSGNGSINFNVGGQYGIVMCVFHVLPQ 451
Query 457 LDYAPTGPDLQFMTTVDGD-SWPVPELDSVGFEELPSYSLLNTSDVQP------IKEPRP 509
LDY + P F TT+ +P+PE D +G E++P LN V+P +
Sbjct 452 LDYITSAP--HFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNP--VKPKDGDFKVSPNLY 507
Query 510 FGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIGEDYMKI-----YFDNNNVPG-GAHFGF 563
FGY P+Y +WKT++D G F +LK+W P ++ + + DN NV GF
Sbjct 508 FGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGF 567
Query 564 YTWFKVNPSVVNPIFGVVADGSWNTDQLLVNCDFDVRVARNLSYDGLPY 612
FKV+PSV++ +F V A+ NTDQ L + FDV V R+L +GLPY
Sbjct 568 ---FKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 224 bits (571), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 197/670 (29%), Positives = 290/670 (43%), Gaps = 109/670 (16%)
Query 10 IKNKPRRSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRI 69
+KNKP R+ FDL ++N FTAKVGELLP + + PGD K+S +FTRT P+ ++AFTR+
Sbjct 13 LKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNAFTRL 72
Query 70 REYYEWFFVPLHLMYRNSNEAIMSMENQPN------YAAS--GTQSITFNRKLPWVDLLT 121
RE ++FFVP +++ + +++M N A+S G Q +T ++P V+ T
Sbjct 73 RENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVT--TQMPCVNYKT 130
Query 122 L--------NNAVENVKASTYHDNMFGFSRALGFAKLYNYLGVGQVDPSKTLANLRI--- 170
L N + S + G R AKL LG G + AN ++
Sbjct 131 LHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNF--PEQFANFKVNND 188
Query 171 --------------------SAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTT 210
S F AY KI ND+Y QW+ YN N + T
Sbjct 189 KHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQ-----PYNASLCNVDYLT 243
Query 211 PVASS----KDLFDTNPNDSI-------FELRYANWNKDLYMGAMPNAQFGDVAFVPVDS 259
P +SS D + P+DSI ++R++N D + G +P +QFG + V ++
Sbjct 244 PNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNL 303
Query 260 SGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDNGQTV--TAYGADKTD 317
G A + G D+ G TG + Q V +A G K D
Sbjct 304 G----------NASGSAVL----NGTTSKDS--GRWRTTTGEWEMEQRVASSANGNLKLD 347
Query 318 AARPYFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAECLQKWKEIAQANGQNYA 377
+ F ++H T + + + S ++ LR A QK+KEI AN ++
Sbjct 348 NSNGTF-------ISHDHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQ 400
Query 378 SQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEAIIRG---KGIGGYRVN 434
SQV+AHFG+ P+ + + G I+I+ N NLS D G +G G +
Sbjct 401 SQVEAHFGIKPDEKNENSL-FIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASI- 458
Query 435 KPETFKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEEL---- 490
F +GV++ IY P+LD+A G D T D + +PE+DS+G ++
Sbjct 459 ---KFTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKT-DASDFVIPEMDSIGMQQTFRCE 514
Query 491 --------PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFIDTLKSWTAPIG 542
+ D +GY PRY +KTS D GAF +LKSW I
Sbjct 515 VAAPAPYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGIN 574
Query 543 EDYMKIYFDNNNVPGGAHFGFYTWFKVNPSVVNPIFGVVADGSWNTDQLLVNCDFDVRVA 602
D ++ NN A F P +V +F V + + + DQL V
Sbjct 575 FDAIQ----NNVWNTWAGINAPNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYAT 630
Query 603 RNLSYDGLPY 612
RNLS GLPY
Sbjct 631 RNLSRYGLPY 640
>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588
Score = 179 bits (454), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 152/548 (28%), Positives = 238/548 (43%), Gaps = 87/548 (16%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
++GFD+ ++ FT+ VG+LLPV++ + PGDK +IS FTRTQP+ ++A R+ E+ E+
Sbjct 16 KNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQPMKSTAMARLTEHIEY 75
Query 76 FFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLLTLNNAVE-------- 127
FFVP M+ +++ + + ++T +P+ ++ A+E
Sbjct 76 FFVPFEQMFSLFGSVFYGIDDYNSSSLVKHNNLT----MPFFKSDAVSAALEAAYTSFSS 131
Query 128 NVKASTYHDNMFGFSRALGFAKLYNYLGVGQVDPSKTLA---NLRISAFPFYAYQKIYND 184
++ +M G R G +L LG G + S + +S F F AYQKI+ND
Sbjct 132 SINRKVLTPDMMGQPRVYGILRLSEMLGYGSLLLSNDNNLLPHADMSVFLFTAYQKIFND 191
Query 185 YYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPNDSIFELRYANWNKDLYMGAM 244
+YR + + +YN D+ G+ T ++S+FEL Y W KD + +
Sbjct 192 FYRLDDYTSVQHKSYNVDYAQGQPIT-------------DNSMFELHYRPWKKDYFTNVI 238
Query 245 PNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDN 304
PN F VD+ GAG D +GL
Sbjct 239 PNPYFSS-----VDNKSSF-----------------GGAGLF--DRPVGL---------- 264
Query 305 GQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQ-VPYEFSSK----FDVLQLRAA 359
++T++ D +D F P ++ ++ N Q +P +S V LR
Sbjct 265 --SITSFNFDGSD-----FLQAP-SDLSTMENNQPIFQELPVNLTSASSAGLSVSDLRYL 316
Query 360 ECLQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSD 419
K I Q G++Y +Q AHFG S + G + IS+VE+T + D
Sbjct 317 YATDKLLRITQFAGKHYDAQTLAHFGKRVPQGVSGEVYYIGGQSQPLQISSVESTATTFD 376
Query 420 EAIIRGKGIG-----GYRV---NKPETFKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTT 471
+ G +G GY K +F+ HGVLM IY AVP DY D T
Sbjct 377 SGDVVGSVLGELAGKGYSQTGNQKDFSFEAPCHGVLMAIYSAVPEADYLDERIDY-LNTL 435
Query 472 VDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFI 531
+ + + PE DS+G E P+Y L + + G+ RY K+ D++ GAF
Sbjct 436 IQSNDFYKPEFDSLGMEPFPNYEL---DQYRMVGNNSRLGWRYRYSGLKSKPDLISGAFK 492
Query 532 DTLKSWTA 539
TL+ W A
Sbjct 493 YTLRDWVA 500
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 175 bits (444), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 171/625 (27%), Positives = 269/625 (43%), Gaps = 105/625 (17%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
R+ FD+ ++ FTA G LLPV LP D +I+ F RT P++++AF +R YE+
Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77
Query 76 FFVPLHLMYRNSNEAIMSMENQPN---YAASGT---QSITFN-RKLPWVDLLTLNNAVEN 128
+FVP ++ ++ I M + + YA G ++F+ +KL VD N A ++
Sbjct 78 YFVPYKQLWSGFDQFITGMSDYKSSFMYAFKGKTPPSCVSFDVQKL--VDWCKTNTA-KD 134
Query 129 VKASTYHDNMFGFSRALGFAKLYNYLGVGQVDPSKTLANLRISAFPFYAYQKIYNDYYRN 188
+ + ++ LG+ K N GV +P+ T + + F AYQKIYND+YRN
Sbjct 135 IHGFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMG-KCTPFRGLAYQKIYNDFYRN 193
Query 189 SQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPND----SIFELRYANWNKDLYMGAM 244
+ +E + ++N D + G S + +T PN+ F LRY N KDL
Sbjct 194 TTYEEYQLESFNVDMFYG--------SGKVKETIPNEPWDYDWFTLRYRNAQKDLLTNVR 245
Query 245 PNAQFGDVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGAPDN 304
P P + P + TG +
Sbjct 246 PT---------------------PLFSIDDFNPQFFTGGSDI------------------ 266
Query 305 GQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAECLQK 364
V G + T Y SV + N K V + + V +R A L+K
Sbjct 267 ---VMEKGPNVTGGTHEY-----RDSVVIVGKNLKENGVDSK-RTMISVADIRNAFALEK 317
Query 365 WKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEAIIR 424
+ G+ Y Q++AHFG+S RC + GFD +I + V ++ ++ +
Sbjct 318 LASVTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTG-TK 376
Query 425 GKGIGGY--RVNKPET--------FKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTVDG 474
GGY R T F EHG+LMCIY VP + Y D F+ ++
Sbjct 377 DTSFGGYLGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVD-PFVQKIER 435
Query 475 DSWPVPELDSVGFEEL----PSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAF 530
+ VPE +++G + L SY N + IK FG+ PRY +KT++D+ G F
Sbjct 436 GDFFVPEFENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF 495
Query 531 I--DTLKSWTA--PIGEDYMKIYFDNNNVPGGAHFGFYTWFKVNPSVVNPIFGVVADGSW 586
+ + L WT GE N N+ + FK+NP ++ +F V +G+
Sbjct 496 VHQEPLSYWTVARARGES-----MSNFNI---------STFKINPKWLDDVFAVNYNGTE 541
Query 587 NTDQLLVNCDFDVRVARNLSYDGLP 611
TDQ+ C F++ ++S DG+P
Sbjct 542 LTDQVFGGCYFNIVKVSDMSIDGMP 566
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 171 bits (433), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 182/659 (28%), Positives = 274/659 (42%), Gaps = 143/659 (22%)
Query 13 KPR--RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIR 70
KPR R+GFDL ++ F+AK G+LLP+ P + FK S + RT ++T+++ R++
Sbjct 5 KPRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMK 64
Query 71 EYYEWFFVPLHLMYRNSNEAIMSMENQPNYAASGTQ---SITFNRKLPWVDLLTLNNAVE 127
EYY +FFV +++ ++ I+ N P+ A +G + + +N+ V L +
Sbjct 65 EYYHFFFVSYRSLWQWFDQFIVGT-NNPHSALNGVKKNGTTNYNQICSSVPTFDLGKLIT 123
Query 128 NVKASTYHDNMFGFSRALGFAKLYNYLGVGQVDPSK--TLANL----------------- 168
+K S F +S G AKL N L G + K L NL
Sbjct 124 RLKTSDMDSQGFNYSE--GAAKLLNMLNYGVTNKGKFMNLENLITSTSYLPSKDDKEPSS 181
Query 169 ----RISAFPFYAYQKIYNDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPN 224
++S F AYQKI+ND+YRN W + ++N D + + + L
Sbjct 182 IYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNLTIEPDVAL------ 235
Query 225 DSIFELRYANWNKDLYMGAMPNAQFGDVAFVPVDSSGKLPVSLPSIEVG-GVAPIYNTGA 283
++RY + KD P + D F +LP G G + N +
Sbjct 236 -KFCQMRYRPYAKDWLTSMKPTPNYSDGIF-----------NLPEYVRGNGNVILTNNKS 283
Query 284 GGVQPDAQIGLRGAVTGAPDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQV 343
G V D+ G+V+
Sbjct 284 GSVSLDS--------------------------------------GTVS----------- 294
Query 344 PYEFSSKFDVLQLRAAECLQKWKEIA-QANGQNYASQVKAHFGVSPNPMTSHRCQRVCGF 402
P FS V LRAA L K E +ANG +YASQ++AHFG ++ + + GF
Sbjct 295 PSSFS----VNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGF 350
Query 403 DGSIDISAV--ENTNLSSDEAI-----IRGKGIGGYRVNKPETFKTTEHGVLMCIYHAVP 455
D SI +S V N N +SD + + GKGIG E F +TEHG++MCIY P
Sbjct 351 DNSIVVSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIE-FDSTEHGIIMCIYSVAP 409
Query 456 LLDYAPTGPDLQFMTTVDGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEP-------- 507
+Y + D F + + + PE +G++ L L+ ++ K+
Sbjct 410 QSEYNASYLD-PFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELN 468
Query 508 -RPFGYVPRYISWKTSVDVVRGAFID--TLKSWTAP-----IGEDYMKIYFDNNNVPGGA 559
GY RY +KT+ D+V G F +L W P G+ KI +N GGA
Sbjct 469 NNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRFDFGYGDTEKKIAPENK---GGA 525
Query 560 HF----GFYTW----FKVNPSVVNPIFGVVADGSWNTDQLLVNCDFDVRVARNLSYDGL 610
+ W F +NP++VNPIF A D +VN DV+ R +S GL
Sbjct 526 DYRKKGNRSHWSSRNFYINPNLVNPIFLTSA---VQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 163 bits (413), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 166/621 (27%), Positives = 254/621 (41%), Gaps = 111/621 (18%)
Query 16 RSGFDLGNKNAFTAKVGELLPVYWKFCLPGDKFKISQEWFTRTQPVDTSAFTRIREYYEW 75
R+ FDL ++ FTA G LLPV +P D +I+ + F RT P++T+AF +R YE+
Sbjct 17 RNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEF 76
Query 76 FFVPLHLMYRNSNEAIMSMENQPNYAASGTQSITFNRKLPWVDLLTLNNAVENVKAS--- 132
FFVP H ++ ++ I M + + A Q T ++P+ ++ ++ N++ K S
Sbjct 77 FFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTGKESGSG 136
Query 133 -------TYHDNMFGFSRALGFAKLYNYLGVGQVDPSKTLAN---LRISAFPFYAYQKIY 182
+ F LG+ + ++ G D L N S F AY KIY
Sbjct 137 STDDLQYKFKYGAFRLLDLLGYGRKFDSFGTAYPDNVSGLKNNLDYNCSVFRILAYNKIY 196
Query 183 NDYYRNSQWEVNKPWTYNCDFWNGEDTTPVASSKDLFDTNPNDSIFELRYANWNKDLYMG 242
DYYRNS +E ++N D + G L D +F+LRY N D +
Sbjct 197 QDYYRNSNYENFDTDSFNFDKFKG----------GLVDAKVVADLFKLRYRNAQTDYFTN 246
Query 243 AMPNAQFG-DVAFVPVDSSGKLPVSLPSIEVGGVAPIYNTGAGGVQPDAQIGLRGAVTGA 301
+ F AF VD+ +AP R V
Sbjct 247 LRQSQLFSFTTAFEDVDNI-------------NIAP-----------------RDYVKSD 276
Query 302 PDNGQTVTAYGADKTDAARPYFYAVPDGSVAHLKTNAKTIQVPYEFSSKFDVLQLRAAEC 361
N V +G D TD++ D SV+ L+ K + +RA +
Sbjct 277 GSNFTRVN-FGVD-TDSSE------GDFSVSSLRAAFAV--------DKLLSVTMRAGKT 320
Query 362 LQKWKEIAQANGQNYASQVKAHFGVSPNPMTSHRCQRVCGFDGSIDISAVENTNLSSDEA 421
Q Q++AH+GV R + GFD + +S V T+ ++
Sbjct 321 FQ--------------DQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATE 366
Query 422 I---------IRGKGIGGYRVNKPETFKTTEHGVLMCIYHAVPLLDYAPTGPDLQFMTTV 472
+ GKG G R F EHGVLMCIY VP + Y T D + +
Sbjct 367 YKPEAGYLGRVAGKGTGSGR--GRIVFDAKEHGVLMCIYSLVPQIQYDCTRLD-PMVDKL 423
Query 473 DGDSWPVPELDSVGFEELPSYSLLNTSDVQPIKEPRPFGYVPRYISWKTSVDVVRGAFI- 531
D + PE +++G + L S + + P K P GY PRY +KT++DV G F
Sbjct 424 DRFDYFTPEFENLGMQPLNSSYISSFCTTDP-KNP-VLGYQPRYSEYKTALDVNHGQFAQ 481
Query 532 -DTLKSWTAPIGEDYMKIYFDNNNVPGGAHFGFYTWFKVNPSVVNPIFGVVADGSWNTDQ 590
D L SW+ + F + FK++P +N IF V +G+ D
Sbjct 482 SDALSSWSVSRFRRWTT--FPQLEIAD---------FKIDPGCLNSIFPVDYNGTEANDC 530
Query 591 LLVNCDFDVRVARNLSYDGLP 611
+ C+F++ ++S DG+P
Sbjct 531 VYGGCNFNIVKVSDMSVDGMP 551
Lambda      K        H        a     alpha
 0.318    0.136    0.428    0.792    4.96
Gapped
Lambda      K        H        a     alpha    sigma
 0.267   0.0410    0.140     1.90    42.6     43.6
Effective search space used: 4587133073826
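
The bit scores in the alignment headers can be cross-checked against the raw scores shown in parentheses, using the gapped Lambda and K values from the statistics footer and the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The sketch below is an illustration only: the function name `bit_score` is hypothetical, the parameters are the three-digit rounded values printed in this report, and compositional matrix adjustment may rescale scores per alignment, so agreement to within about one bit is the most that should be expected.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the statistics footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs copied from the first five alignment headers.
reported = [(1004, 391), (952, 371), (916, 357), (877, 342), (797, 311)]

for raw, bits in reported:
    # Print the reported value next to the recomputed one for comparison.
    print(f"raw={raw:5d}  reported={bits:4d} bits  recomputed={bit_score(raw):.1f} bits")
```

Because Lambda is printed to only three significant digits, the recomputed values carry an uncertainty of several tenths of a bit for raw scores of this size.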