bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-48_CDS_annotation_glimmer3.pl_2_1
Length=473
Score E
Sequences producing significant alignments: (Bits) Value
gi|490418709|ref|WP_004291032.1| hypothetical protein 313 4e-96
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 311 9e-96
gi|496050829|ref|WP_008775336.1| hypothetical protein 309 9e-95
gi|575094354|emb|CDL65742.1| unnamed protein product 301 2e-91
gi|494822885|ref|WP_007558293.1| hypothetical protein 279 9e-83
gi|575094321|emb|CDL65708.1| unnamed protein product 198 1e-52
gi|517172762|ref|WP_018361580.1| hypothetical protein 167 6e-42
gi|565841287|ref|WP_023924568.1| hypothetical protein 164 6e-41
gi|647452987|ref|WP_025792807.1| hypothetical protein 143 6e-34
gi|496521299|ref|WP_009229582.1| capsid protein 139 1e-32
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 313 bits (801), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 185/480 (39%), Positives = 259/480 (54%), Gaps = 53/480 (11%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
G++ + + KLL YLGYGN+ S T W+T+ + N N+F
Sbjct 145 FGYNRSKSSVKLLEYLGYGNY----ESFLTDDWNTA------------PLMANLNHNIFG 188
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120
LLAYQKIY DF+R SQWER +PS++NVDY G S +L Y+++++++ FDL+YC
Sbjct 189 LLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSMNLDNA---YSTEFYQNYNFFDLRYC 245
Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSSG 180
NW KD+ G+LP Q+GE SI G L +T G T+ ++SG
Sbjct 246 NWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLSNFSTVG-----------TSPTTASG 294
Query 181 LSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIRK 240
+T + + TV S+L LRQAE LQ+WKEI+QSG+ DY++Q+ K
Sbjct 295 TATKNLPAFDTVG--------------DLSILVLRQAEFLQKWKEITQSGNKDYKDQLEK 340
Query 241 HFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGSFTYTTD 300
H+GV++ S LCTY+GG+S ++DI+EV+N N+ A IAGKGVG NG + ++
Sbjct 341 HWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSA-AADIAGKGVGVANGEINFNSN 399
Query 301 -EHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQIFNSPKAS 359
+ ++MCIYH +PLLDYT D L ++ IPEFD +GM+ +P+ Q+ N P S
Sbjct 400 GRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMN-PLRS 458
Query 360 IVNL--FNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQDDVNK 417
N GY PRY ++KT +D G F TL SWV + +
Sbjct 459 FANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEP 518
Query 418 DTKV----VLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRDGVPY 473
V +N+ FFKVNP LDPIF V A +TDQ L +S+ RNL DG+PY
Sbjct 519 SEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 311 bits (798), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 202/484 (42%), Positives = 267/484 (55%), Gaps = 64/484 (13%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
GFS +L+ KLL+YL YG S + S S SP FP
Sbjct 143 FGFSRVELSVKLLNYLNYGF---GKDYESVKVPSDSDDIVLSP---------------FP 184
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120
LLAYQKI +D+FR QW+ A P YN+DY G SS + +T+D +K+ TMFDL YC
Sbjct 185 LLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPMSSFTNDAFKNPTMFDLNYC 244
Query 121 NWNKDMLMGILPDSQFGECCYKS-IFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSS 179
N+ KD G+LP +Q+G+ S IF GDL G ++ F SA T + S
Sbjct 245 NFQKDYFTGMLPRAQYGDVSVASPIF----GDLDIG-DSSSLTFASAPQQGANTIQ---S 296
Query 180 GLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIR 239
G+ S +T L SVLALRQAE LQ+W+EI+QSG DY+ Q++
Sbjct 297 GVLVVNNNSNTTAGL---------------SVLALRQAECLQKWREIAQSGKMDYQTQMQ 341
Query 240 KHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGS-FTYT 298
KHF V+ +LS C Y+GG + NLDISEVVN NL + + A I GKG G NG+ +
Sbjct 342 KHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGD-NQADIQGKGTGTLNGNKVDFE 400
Query 299 TDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVL-PMTQIFN--- 354
+ EH ++MCIYH +PLLD++I Q T IPEFD++GM+ L P IF
Sbjct 401 SSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLED 460
Query 355 --SPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQ 412
S +SI N GY PRY + KT +D ++G+F TL SWVSP+T+S +S +
Sbjct 461 LPSDPSSI----NMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAY------- 509
Query 413 DDVNKD---TKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRD 469
KD + + + Y FFKVNP ++D IFGV ADST +TDQLL+NSY RN +
Sbjct 510 RQACKDAGFSDITMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYN 569
Query 470 GVPY 473
G+PY
Sbjct 570 GLPY 573
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 309 bits (792), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 187/477 (39%), Positives = 265/477 (56%), Gaps = 46/477 (10%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
G+S + KLL YLGYGNF S + W + L N +N++
Sbjct 146 FGYSRSLGTAKLLEYLGYGNFYTYATSKNNTWTKSPLSS-------------NLQLNIYG 192
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSD-YWKSDTMFDLKY 119
+LAYQKIY D R SQWE+ +PS +NVDY SG S +T+ T + MFDL+Y
Sbjct 193 VLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAMTIDSMITGQGFAPFYNMFDLRY 252
Query 120 CNWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSS 179
CNW KD+ G+LP Q+G+ ++ + + +T DG V +P SS+
Sbjct 253 CNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDG---DPVGGSPF----SST 305
Query 180 GLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIR 239
G++ V T F+VLALRQAE LQ+WKEI+QSG+ DY++QI
Sbjct 306 GVNLQTVNGSGT-----------------FTVLALRQAEFLQKWKEITQSGNKDYKDQIE 348
Query 240 KHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGSFTYTT 299
KH+ V++ ++ S + Y+GG + +LDI+EVVNNN+ + A IAGKGV GNG ++
Sbjct 349 KHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGS-NAADIAGKGVVVGNGRISFDA 407
Query 300 DE-HCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQIFNSPKA 358
E + ++MCIYH++PLLDYT + ++ IPEFD +GME +P+ + N P
Sbjct 408 GERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN-PLQ 466
Query 359 SIVNLFNA--GYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQDDVN 416
S N+ ++ GY PRY ++KT +D GAF TTLKSWV + + QDD N
Sbjct 467 SSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNY---QDDPN 523
Query 417 KDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRDGVPY 473
++NY FKVNP+ +DP+F V A ++ DTDQ L +S+ V RNL DG+PY
Sbjct 524 NSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 301 bits (771), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 190/506 (38%), Positives = 272/506 (54%), Gaps = 59/506 (12%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
GF+ + L KLL YLGYG++ + + T WS A P Y + ++ FP
Sbjct 136 FGFNRSTLTCKLLQYLGYGDY--NSFDSETNTWS------AKPLLYNLE------LSPFP 181
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120
LLAYQKIY DF+R++QWE+ NPS++N+DY G +S L L SD + FD++YC
Sbjct 182 LLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKG-TSDLQMDLTGLPSD---DNNFFDIRYC 237
Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLT------- 173
N+ KDM G+LP +Q+G I G L G T P T
Sbjct 238 NYQKDMFHGVLPVAQYGSASVVPI----NGQLNVISNGDSGPIFKTSTPDPGTPGTSYVT 293
Query 174 ------TENSSSGLSTPGVTSG-----------STVALKSPLISDLSALQSQ-----FSV 211
+N S G+S + G S + +S L + + + +
Sbjct 294 VGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIENNQGFYVPI 353
Query 212 LALRQAEALQRWKEISQSGDSDYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVN 271
LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+ + LS+ Y+GG + +LDI+EV+N
Sbjct 354 LALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINEVIN 413
Query 272 NNLAAEGDTAVIAGKGVGAGNGSFTYTTD-EHCVVMCIYHAVPLLDYTITGQDGQLLVTD 330
NN+ + + A IAGKG GNGS + + E+ ++MCIYH +P++DY +G D + D
Sbjct 414 NNITGD-NAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLVD 472
Query 331 AESLPIPEFDNIGMEVLPMTQIFNSPKASIVNLFNA--GYNPRYFNWKTKLDVVNGAFTT 388
A S PIPE D IGME +P+ + N K S + GY PRY +WKT +D G F
Sbjct 473 ATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSVDRSVGDFAD 532
Query 389 TLKSWVSPVTESLLSGWFGFGY-SQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTW 447
+L++W PV + L+ + S +V D+ + FFKVNPS++DP+F V ADST
Sbjct 533 SLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGFFKVNPSIVDPLFAVVADSTV 589
Query 448 DTDQLLVNSYIGCYVARNLSRDGVPY 473
TD+ L +S+ V RNL +G+PY
Sbjct 590 KTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 279 bits (713), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 179/487 (37%), Positives = 265/487 (54%), Gaps = 32/487 (7%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
G+ A L +L YLGYG+F P + T T+ + N + FP
Sbjct 145 FGYYRAWLVCIILEYLGYGDFYP---------YIVEAAGGEGATWATRPMLNNLKFSPFP 195
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWK-SDTMFDLKY 119
L AYQKIY DF R++QWER+NPS++N+DY SG + SL D+T + +K S +FD++Y
Sbjct 196 LFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQL---DFTVEGFKDSFNLFDMRY 252
Query 120 CNWNKDMLMGILPDSQFGECCYKSI---FETPGGDLKAGFRTTDGKFISAVTNAPLTTEN 176
NW +D+L G +P +Q+GE + + G F T G+ A N +T +
Sbjct 253 SNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT--GQDGVAFLNGNVTIQG 310
Query 177 SSSGLSTPGVTSGSTVALKSPLISDLSAL-QSQF--SVLALRQAEALQRWKEISQSGDSD 233
SS L S + + S L S F S+LALR+AEA Q+WKE++ + + D
Sbjct 311 SSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEED 370
Query 234 YREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNG 293
Y QI H+G ++ ++ S++C ++G I+ +L I+EVVNNN+ E + A IAGKG +GNG
Sbjct 371 YPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE-NAADIAGKGTMSGNG 429
Query 294 SFTYTT-DEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQI 352
S + ++ +VMC++H +P LDY + +T+ PIPEFD IGME +P+ +
Sbjct 430 SINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRG 489
Query 353 FNSPKAS------IVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWF 406
N K NL+ GY P+Y+NWKT LD G F +LK+W+ P + L
Sbjct 490 LNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAAD 548
Query 407 GFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNL 466
+ D+ N + V FFKV+PSVLD +F V A+S +TDQ L ++ V R+L
Sbjct 549 SVDFP-DNPNVEADSV-KAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSL 606
Query 467 SRDGVPY 473
+G+PY
Sbjct 607 DPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 198 bits (503), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 154/502 (31%), Positives = 234/502 (47%), Gaps = 66/502 (13%)
Query 11 KLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNN-YVNLFPLLAYQKIYQ 69
KLL LGYGNF P + + K S + N+ Y+++F LLAY KI
Sbjct 166 KLLQLLGYGNF----PEQFANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICN 221
Query 70 DFFRWSQWERANPSSYNVDYYSGVSSSLVTV------LPDYTSDYWKSD--TMFDLKYCN 121
D + + QW+ N S NVDY + SSSL+++ +PD D K++ + D+++ N
Sbjct 222 DHYLYRQWQPYNASLCNVDYLTPNSSSLLSIDDALLSIPD---DSIKAEKLNLLDMRFSN 278
Query 122 WNKDMLMGILPDSQFG----------ECCYKSIFETPGGDLKAGFRTTDGKF-----ISA 166
D G+LP SQFG ++ +RTT G++ +++
Sbjct 279 LPLDYFTGVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVAS 338
Query 167 VTNAPLTTENSSSGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEI 226
N L +NS+ + T VA+ + L +LS ++ALR A A Q++KEI
Sbjct 339 SANGNLKLDNSNGTFISHDHTFSGNVAINTSLSGNLS-------IIALRNALAAQKYKEI 391
Query 227 SQSGDSDYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGK 286
+ D D++ Q+ HFG+ P + +IGG S ++I+E +N NL+ + + A
Sbjct 392 QLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLSGD-NKATYGAA 449
Query 287 GVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEV 346
G G+ S +T + VV+ IY P+LD+ G D L TDA IPE D+IGM+
Sbjct 450 PQGNGSASIKFTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQ 509
Query 347 LPMTQIFNSPKASIVNLFNA---------------GYNPRYFNWKTKLDVVNGAFTTTLK 391
++ + A + F A GY PRY +KT D NGAF +LK
Sbjct 510 TFRCEV--AAPAPYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLK 567
Query 392 SWVSPVTESLLSGWFGFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQ 451
SWV+ + F Q++V + F P ++ +F V++ + D DQ
Sbjct 568 SWVTGIN---------FDAIQNNVWNTWAGINAPNMFACRPDIVKNLFLVSSTNNSDDDQ 618
Query 452 LLVNSYIGCYVARNLSRDGVPY 473
L V CY RNLSR G+PY
Sbjct 619 LYVGMVNMCYATRNLSRYGLPY 640
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 167 bits (422), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 138/493 (28%), Positives = 211/493 (43%), Gaps = 85/493 (17%)
Query 2 GFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFPL 61
GF Y++L LGYG + S T ST++ K +P F
Sbjct 137 GFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMGK-CTP---------------FRG 180
Query 62 LAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDY-WKSDTMFDLKYC 120
LAYQKIY DF+R + +E S+NVD + G S + +P+ DY W F L+Y
Sbjct 181 LAYQKIYNDFYRNTTYEEYQLESFNVDMFYG-SGKVKETIPNEPWDYDW-----FTLRYR 234
Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSSG 180
N KD+L + P F + F T G D + E
Sbjct 235 NAQKDLLTNVRPTPLFSIDDFNPQFFTGGSD--------------------IVMEKG--- 271
Query 181 LSTPGVTSGSTVALKSPLI-------SDLSALQSQFSVLALRQAEALQRWKEISQSGDSD 233
P VT G+ S +I + + + ++ SV +R A AL++ ++
Sbjct 272 ---PNVTGGTHEYRDSVVIVGKNLKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKT 328
Query 234 YREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNN---LAAEGDTAV------IA 284
Y+EQ+ HFG+++ + CTYIGG N+ + +V ++ + DT+
Sbjct 329 YKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTT 388
Query 285 GKGVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGM 344
GK G+G+G + EH ++MCIY VP + Y D + + +PEF+N+GM
Sbjct 389 GKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGM 448
Query 345 EVLPMTQIF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTE 399
+ L I N+ + I NL G+ PRY +KT LD+ +G F V +
Sbjct 449 QPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF----------VHQ 498
Query 400 SLLSGWFGFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIG 459
LS W + ++ N FK+NP LD +F VN + T TDQ+ Y
Sbjct 499 EPLSYW-----TVARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVFGGCYFN 553
Query 460 CYVARNLSRDGVP 472
++S DG+P
Sbjct 554 IVKVSDMSIDGMP 566
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 164 bits (416), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 148/477 (31%), Positives = 216/477 (45%), Gaps = 70/477 (15%)
Query 9 AYKLLSYLGYG----NFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFPLLAY 64
A++LL +LGYG FI ++ +K + Y I+ N+F LLAY
Sbjct 219 AFRLLHFLGYGVDNNGFIVDFNASYAAGTGEIVKNVLAKKTYKLPDIK---ANVFRLLAY 275
Query 65 QKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYCNWNK 124
Q+IY DF+R WE A P +NVD+ +S ++ Y M L+Y +W+K
Sbjct 276 QRIYNDFYRNDLWEAAQPDVFNVDWCCNNNSLDISDELVY--------KMCQLRYRHWSK 327
Query 125 DMLMGILPDSQFGECCYKSIFETPGG-DLKAGFRTTDGKFISAVTNAPLTTENSSSGLST 183
D + P + + K IFE P + GF TT+ K V N N S L
Sbjct 328 DWVTSAYPTASYD----KGIFELPDYINGNTGFATTEVK--RDVVN------NRGSQLEI 375
Query 184 PGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDS-DYREQIRKHF 242
+ +GS L S IS +S +R AL++ E +++ + DY QI HF
Sbjct 376 KSMDAGS---LGSNNISYISPND-------IRAMFALEKMLERTRAANGLDYSNQIAAHF 425
Query 243 GVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAV-------IAGKGVGAGN-GS 294
G +P+S N ++IGG + ISEVV + + TA + GKG+GA N G
Sbjct 426 GFKVPESRKNCASFIGGFDNQISISEVVTTSNGSVDGTASTGSVVGQVFGKGIGAMNSGH 485
Query 295 FTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQI-- 352
+Y EH ++MCIY P +DY D E PEF+N+GM+ + + +
Sbjct 486 ISYDVKEHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQPVIQSDLCL 545
Query 353 -FNSPKASIVNLFN--AGYNPRYFNWKTKLDVVNGAFTT--TLKSWVSPVTESLLSGWFG 407
NS K+ + N GY+ RY +KT D++ G F + +L +W +P F
Sbjct 546 CINSAKSDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKNNYT----FE 601
Query 408 FGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVAR 464
FG L+ V+P VL+PIF V + + TDQ LVNSY R
Sbjct 602 FG------------KLSLPDLLVDPKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIR 646
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 143 bits (361), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 145/511 (28%), Positives = 229/511 (45%), Gaps = 104/511 (20%)
Query 2 GFSGADLAYKLLSYLGYG-----------NFIPSPPSNSTRWWSTSLKKEASPTGYTQQY 50
GF+ ++ A KLL+ L YG N I ST + + KE S
Sbjct 134 GFNYSEGAAKLLNMLNYGVTNKGKFMNLENLI-----TSTSYLPSKDDKEPSS------- 181
Query 51 IQNNYVNLFPLLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWK 110
I V+ F LLAYQKI+ DF+R W ++ S+NVD Y+ S+ +T+ PD + +
Sbjct 182 IYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSN--LTIEPDVALKFCQ 239
Query 111 SDTMFDLKYCNWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNA 170
++Y + KD L + P + + IF P +++ N
Sbjct 240 ------MRYRPYAKDWLTSMKPTPNYSD----GIFNLP-------------EYVRGNGNV 276
Query 171 PLTTENSSSGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSG 230
LT S S V+L S +S S FSV LR A AL + E ++
Sbjct 277 ILTNNKSGS------------VSLDSGTVS-----PSSFSVNDLRAAFALDKMLEATRRA 319
Query 231 DS-DYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVV--NNNLAAEGDTAVI---A 284
+ DY QI HFG +P+S +N ++GG ++ +SEVV N N A++G A I
Sbjct 320 NGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLG 379
Query 285 GKGVGA-GNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIG 343
GKG+G+ +G+ + + EH ++MCIY P +Y + D E PEF ++G
Sbjct 380 GKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFADLG 439
Query 344 MEVLPMTQI------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLDVVNGAFTT--TLK 391
+ L + + N +A S + L N GY RY +KT D+V G F + +L
Sbjct 440 YQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLS 499
Query 392 SWVSPVTESLLSGWFGFGYSQDDVNKDTKVVLNYKF-----------FKVNPSVLDPIFG 440
W +P + FG+G ++ + + K +Y+ F +NP++++PIF
Sbjct 500 YWCTPRFD------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFL 553
Query 441 VNADSTWDTDQLLVNSYIGCYVARNLSRDGV 471
+A D +VNS++ R +S G+
Sbjct 554 TSA---VQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 139 bits (350), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 130/486 (27%), Positives = 203/486 (42%), Gaps = 97/486 (20%)
Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60
G+ ++ + +L+ LGYG I S K P YT VNLF
Sbjct 137 FGYPHSNNSCRLMDLLGYGKPITS-------------SKTPVPLLYTGN------VNLFR 177
Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120
LLAY KIY D++R + +E + S+N+D+ G T +P T+D +K +L Y
Sbjct 178 LLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKG------TFVP--TADEFKK--YLNLHYR 227
Query 121 NWNKDMLMGILPDSQF--GECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSS 178
N D + P F G + S+ + L+ S
Sbjct 228 NAPLDFYTNLRPTPLFTIGSDSFSSVLQ-------------------------LSDPTGS 262
Query 179 SGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQI 238
+G S G + + + + SP + ++SA++S F AL + IS Y EQI
Sbjct 263 AGFSADG--NSAKLNMASPDVLNVSAIRSAF---------ALDKLLSISMRAGKTYAEQI 311
Query 239 RKHFGVNLPQSLSNLCTYIGGISRNLDISEV------VNNNLAAEGDTAV------IAGK 286
HFGV + + Y+GG N+ + +V N N++ G+ + I GK
Sbjct 312 EAHFGVTVSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGK 371
Query 287 GVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEV 346
G G+G G + E V+MCIY VP + Y D + IPEF+N+GM+
Sbjct 372 GTGSGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQ- 430
Query 347 LPMTQIFNSPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWF 406
P+ F S + N + G+ PRY +KT D+ +G F P++ ++
Sbjct 431 -PIVPAFVSLNRAKDNSY--GWQPRYSEYKTAFDINHGQFANG-----EPLSYWSIARAR 482
Query 407 GFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNL 466
G DT N K+NP LD +F VN + T TD + ++ ++
Sbjct 483 G---------SDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDM 533
Query 467 SRDGVP 472
+ DG+P
Sbjct 534 TEDGMP 539
Lambda K H a alpha
0.317 0.134 0.410 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3246464580183