Bit score colors: <40, 40-50, 50-80, 80-200, >200
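The legend above amounts to a five-way binning of bit scores. A minimal sketch of that binning follows; the names `EDGES`, `LABELS` and `score_bin` are hypothetical, and the handling of a score that falls exactly on an edge is an assumption, since the legend does not say which range such a score belongs to.

```python
import bisect

# Boundaries between the five ranges listed in the legend.
EDGES = [40, 50, 80, 200]
LABELS = ["<40", "40-50", "50-80", "80-200", ">200"]


def score_bin(bits):
    """Return the legend range that a bit score falls into.

    Assumption: a score exactly on a boundary (e.g. 200) is placed in the
    higher range; the legend itself does not specify this.
    """
    return LABELS[bisect.bisect_right(EDGES, bits)]


# Bit scores taken from the hit list of this report.
for bits in (313, 198, 139):
    print(bits, score_bin(bits))
```

Under this convention the five hits scoring 279 bits or more fall in the `>200` range and the remaining five (139 to 198 bits) fall in `80-200`.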
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-48_CDS_annotation_glimmer3.pl_2_1
Length=473

                                                                     Score  E
Sequences producing significant alignments:                         (Bits)  Value

gi|490418709|ref|WP_004291032.1| hypothetical protein                  313  4e-96
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein      311  9e-96
gi|496050829|ref|WP_008775336.1| hypothetical protein                  309  9e-95
gi|575094354|emb|CDL65742.1| unnamed protein product                   301  2e-91
gi|494822885|ref|WP_007558293.1| hypothetical protein                  279  9e-83
gi|575094321|emb|CDL65708.1| unnamed protein product                   198  1e-52
gi|517172762|ref|WP_018361580.1| hypothetical protein                  167  6e-42
gi|565841287|ref|WP_023924568.1| hypothetical protein                  164  6e-41
gi|647452987|ref|WP_025792807.1| hypothetical protein                  143  6e-34
gi|496521299|ref|WP_009229582.1| capsid protein                        139  1e-32

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

Score = 313 bits (801), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 185/480 (39%), Positives = 259/480 (54%), Gaps = 53/480 (11%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 G++ + + KLL YLGYGN+ S T W+T+ + N N+F Sbjct 145 FGYNRSKSSVKLLEYLGYGNY----ESFLTDDWNTA------------PLMANLNHNIFG 188 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120 LLAYQKIY DF+R SQWER +PS++NVDY G S +L Y+++++++ FDL+YC Sbjct 189 LLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSMNLDNA---YSTEFYQNYNFFDLRYC 245 Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSSG 180 NW KD+ G+LP Q+GE SI G L +T G T+ ++SG Sbjct 246 NWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLSNFSTVG-----------TSPTTASG 294 Query 181 LSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIRK 240 +T + + TV S+L LRQAE LQ+WKEI+QSG+ DY++Q+ K Sbjct 295 TATKNLPAFDTVG--------------DLSILVLRQAEFLQKWKEITQSGNKDYKDQLEK 340 Query 241 HFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGSFTYTTD 300 H+GV++ S LCTY+GG+S ++DI+EV+N N+ A IAGKGVG NG + ++ Sbjct 341 HWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSA-AADIAGKGVGVANGEINFNSN 399 Query 301 -EHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQIFNSPKAS 359 + ++MCIYH +PLLDYT D L ++ IPEFD +GM+ +P+ Q+ N P S Sbjct 400 GRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMN-PLRS 458 Query 360 IVNL--FNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQDDVNK 417 N GY PRY ++KT +D G F TL SWV + + Sbjct 459 FANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEP 518 Query 418 DTKV----VLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRDGVPY 473 V +N+ FFKVNP LDPIF V A +TDQ L +S+ RNL DG+PY Sbjct 519 SEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578 >gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=573 Score = 311 bits (798), Expect = 9e-96, Method: Compositional matrix adjust. 
Identities = 202/484 (42%), Positives = 267/484 (55%), Gaps = 64/484 (13%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 GFS +L+ KLL+YL YG S + S S SP FP Sbjct 143 FGFSRVELSVKLLNYLNYGF---GKDYESVKVPSDSDDIVLSP---------------FP 184 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120 LLAYQKI +D+FR QW+ A P YN+DY G SS + +T+D +K+ TMFDL YC Sbjct 185 LLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPMSSFTNDAFKNPTMFDLNYC 244 Query 121 NWNKDMLMGILPDSQFGECCYKS-IFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSS 179 N+ KD G+LP +Q+G+ S IF GDL G ++ F SA T + S Sbjct 245 NFQKDYFTGMLPRAQYGDVSVASPIF----GDLDIG-DSSSLTFASAPQQGANTIQ---S 296 Query 180 GLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIR 239 G+ S +T L SVLALRQAE LQ+W+EI+QSG DY+ Q++ Sbjct 297 GVLVVNNNSNTTAGL---------------SVLALRQAECLQKWREIAQSGKMDYQTQMQ 341 Query 240 KHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGS-FTYT 298 KHF V+ +LS C Y+GG + NLDISEVVN NL + + A I GKG G NG+ + Sbjct 342 KHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGD-NQADIQGKGTGTLNGNKVDFE 400 Query 299 TDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVL-PMTQIFN--- 354 + EH ++MCIYH +PLLD++I Q T IPEFD++GM+ L P IF Sbjct 401 SSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLED 460 Query 355 --SPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQ 412 S +SI N GY PRY + KT +D ++G+F TL SWVSP+T+S +S + Sbjct 461 LPSDPSSI----NMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAY------- 509 Query 413 DDVNKD---TKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRD 469 KD + + + Y FFKVNP ++D IFGV ADST +TDQLL+NSY RN + Sbjct 510 RQACKDAGFSDITMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYN 569 Query 470 GVPY 473 G+PY Sbjct 570 GLPY 573 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 309 bits (792), Expect = 9e-95, Method: Compositional matrix adjust. 
Identities = 187/477 (39%), Positives = 265/477 (56%), Gaps = 46/477 (10%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 G+S + KLL YLGYGNF S + W + L N +N++ Sbjct 146 FGYSRSLGTAKLLEYLGYGNFYTYATSKNNTWTKSPLSS-------------NLQLNIYG 192 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSD-YWKSDTMFDLKY 119 +LAYQKIY D R SQWE+ +PS +NVDY SG S +T+ T + MFDL+Y Sbjct 193 VLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAMTIDSMITGQGFAPFYNMFDLRY 252 Query 120 CNWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSS 179 CNW KD+ G+LP Q+G+ ++ + + +T DG V +P SS+ Sbjct 253 CNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDG---DPVGGSPF----SST 305 Query 180 GLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQIR 239 G++ V T F+VLALRQAE LQ+WKEI+QSG+ DY++QI Sbjct 306 GVNLQTVNGSGT-----------------FTVLALRQAEFLQKWKEITQSGNKDYKDQIE 348 Query 240 KHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNGSFTYTT 299 KH+ V++ ++ S + Y+GG + +LDI+EVVNNN+ + A IAGKGV GNG ++ Sbjct 349 KHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGS-NAADIAGKGVVVGNGRISFDA 407 Query 300 DE-HCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQIFNSPKA 358 E + ++MCIYH++PLLDYT + ++ IPEFD +GME +P+ + N P Sbjct 408 GERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN-PLQ 466 Query 359 SIVNLFNA--GYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWFGFGYSQDDVN 416 S N+ ++ GY PRY ++KT +D GAF TTLKSWV + + QDD N Sbjct 467 SSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNY---QDDPN 523 Query 417 KDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNLSRDGVPY 473 ++NY FKVNP+ +DP+F V A ++ DTDQ L +S+ V RNL DG+PY Sbjct 524 NSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 301 bits (771), Expect = 2e-91, Method: Compositional matrix adjust. 
Identities = 190/506 (38%), Positives = 272/506 (54%), Gaps = 59/506 (12%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 GF+ + L KLL YLGYG++ + + T WS A P Y + ++ FP Sbjct 136 FGFNRSTLTCKLLQYLGYGDY--NSFDSETNTWS------AKPLLYNLE------LSPFP 181 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120 LLAYQKIY DF+R++QWE+ NPS++N+DY G +S L L SD + FD++YC Sbjct 182 LLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKG-TSDLQMDLTGLPSD---DNNFFDIRYC 237 Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLT------- 173 N+ KDM G+LP +Q+G I G L G T P T Sbjct 238 NYQKDMFHGVLPVAQYGSASVVPI----NGQLNVISNGDSGPIFKTSTPDPGTPGTSYVT 293 Query 174 ------TENSSSGLSTPGVTSG-----------STVALKSPLISDLSALQSQ-----FSV 211 +N S G+S + G S + +S L + + + + Sbjct 294 VGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIENNQGFYVPI 353 Query 212 LALRQAEALQRWKEISQSGDSDYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVN 271 LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+ + LS+ Y+GG + +LDI+EV+N Sbjct 354 LALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINEVIN 413 Query 272 NNLAAEGDTAVIAGKGVGAGNGSFTYTTD-EHCVVMCIYHAVPLLDYTITGQDGQLLVTD 330 NN+ + + A IAGKG GNGS + + E+ ++MCIYH +P++DY +G D + D Sbjct 414 NNITGD-NAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLVD 472 Query 331 AESLPIPEFDNIGMEVLPMTQIFNSPKASIVNLFNA--GYNPRYFNWKTKLDVVNGAFTT 388 A S PIPE D IGME +P+ + N K S + GY PRY +WKT +D G F Sbjct 473 ATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSVDRSVGDFAD 532 Query 389 TLKSWVSPVTESLLSGWFGFGY-SQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTW 447 +L++W PV + L+ + S +V D+ + FFKVNPS++DP+F V ADST Sbjct 533 SLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGFFKVNPSIVDPLFAVVADSTV 589 Query 448 DTDQLLVNSYIGCYVARNLSRDGVPY 473 TD+ L +S+ V RNL +G+PY Sbjct 590 KTDEFLCSSFFDVKVVRNLDVNGLPY 615 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 279 bits (713), Expect = 9e-83, Method: Compositional matrix adjust. 
Identities = 179/487 (37%), Positives = 265/487 (54%), Gaps = 32/487 (7%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 G+ A L +L YLGYG+F P + T T+ + N + FP Sbjct 145 FGYYRAWLVCIILEYLGYGDFYP---------YIVEAAGGEGATWATRPMLNNLKFSPFP 195 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWK-SDTMFDLKY 119 L AYQKIY DF R++QWER+NPS++N+DY SG + SL D+T + +K S +FD++Y Sbjct 196 LFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQL---DFTVEGFKDSFNLFDMRY 252 Query 120 CNWNKDMLMGILPDSQFGECCYKSI---FETPGGDLKAGFRTTDGKFISAVTNAPLTTEN 176 NW +D+L G +P +Q+GE + + G F T G+ A N +T + Sbjct 253 SNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT--GQDGVAFLNGNVTIQG 310 Query 177 SSSGLSTPGVTSGSTVALKSPLISDLSAL-QSQF--SVLALRQAEALQRWKEISQSGDSD 233 SS L S + + S L S F S+LALR+AEA Q+WKE++ + + D Sbjct 311 SSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEED 370 Query 234 YREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGKGVGAGNG 293 Y QI H+G ++ ++ S++C ++G I+ +L I+EVVNNN+ E + A IAGKG +GNG Sbjct 371 YPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE-NAADIAGKGTMSGNG 429 Query 294 SFTYTT-DEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQI 352 S + ++ +VMC++H +P LDY + +T+ PIPEFD IGME +P+ + Sbjct 430 SINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRG 489 Query 353 FNSPKAS------IVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWF 406 N K NL+ GY P+Y+NWKT LD G F +LK+W+ P + L Sbjct 490 LNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAAD 548 Query 407 GFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNL 466 + D+ N + V FFKV+PSVLD +F V A+S +TDQ L ++ V R+L Sbjct 549 SVDFP-DNPNVEADSV-KAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSL 606 Query 467 SRDGVPY 473 +G+PY Sbjct 607 DPNGLPY 613 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 198 bits (503), Expect = 1e-52, Method: Compositional matrix adjust. 
Identities = 154/502 (31%), Positives = 234/502 (47%), Gaps = 66/502 (13%) Query 11 KLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNN-YVNLFPLLAYQKIYQ 69 KLL LGYGNF P + + K S + N+ Y+++F LLAY KI Sbjct 166 KLLQLLGYGNF----PEQFANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICN 221 Query 70 DFFRWSQWERANPSSYNVDYYSGVSSSLVTV------LPDYTSDYWKSD--TMFDLKYCN 121 D + + QW+ N S NVDY + SSSL+++ +PD D K++ + D+++ N Sbjct 222 DHYLYRQWQPYNASLCNVDYLTPNSSSLLSIDDALLSIPD---DSIKAEKLNLLDMRFSN 278 Query 122 WNKDMLMGILPDSQFG----------ECCYKSIFETPGGDLKAGFRTTDGKF-----ISA 166 D G+LP SQFG ++ +RTT G++ +++ Sbjct 279 LPLDYFTGVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVAS 338 Query 167 VTNAPLTTENSSSGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEI 226 N L +NS+ + T VA+ + L +LS ++ALR A A Q++KEI Sbjct 339 SANGNLKLDNSNGTFISHDHTFSGNVAINTSLSGNLS-------IIALRNALAAQKYKEI 391 Query 227 SQSGDSDYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAVIAGK 286 + D D++ Q+ HFG+ P + +IGG S ++I+E +N NL+ + + A Sbjct 392 QLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLSGD-NKATYGAA 449 Query 287 GVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEV 346 G G+ S +T + VV+ IY P+LD+ G D L TDA IPE D+IGM+ Sbjct 450 PQGNGSASIKFTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQ 509 Query 347 LPMTQIFNSPKASIVNLFNA---------------GYNPRYFNWKTKLDVVNGAFTTTLK 391 ++ + A + F A GY PRY +KT D NGAF +LK Sbjct 510 TFRCEV--AAPAPYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLK 567 Query 392 SWVSPVTESLLSGWFGFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQ 451 SWV+ + F Q++V + F P ++ +F V++ + D DQ Sbjct 568 SWVTGIN---------FDAIQNNVWNTWAGINAPNMFACRPDIVKNLFLVSSTNNSDDDQ 618 Query 452 LLVNSYIGCYVARNLSRDGVPY 473 L V CY RNLSR G+PY Sbjct 619 LYVGMVNMCYATRNLSRYGLPY 640 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 167 bits (422), Expect = 6e-42, Method: Compositional matrix adjust. 
Identities = 138/493 (28%), Positives = 211/493 (43%), Gaps = 85/493 (17%) Query 2 GFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFPL 61 GF Y++L LGYG + S T ST++ K +P F Sbjct 137 GFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMGK-CTP---------------FRG 180 Query 62 LAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDY-WKSDTMFDLKYC 120 LAYQKIY DF+R + +E S+NVD + G S + +P+ DY W F L+Y Sbjct 181 LAYQKIYNDFYRNTTYEEYQLESFNVDMFYG-SGKVKETIPNEPWDYDW-----FTLRYR 234 Query 121 NWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSSSG 180 N KD+L + P F + F T G D + E Sbjct 235 NAQKDLLTNVRPTPLFSIDDFNPQFFTGGSD--------------------IVMEKG--- 271 Query 181 LSTPGVTSGSTVALKSPLI-------SDLSALQSQFSVLALRQAEALQRWKEISQSGDSD 233 P VT G+ S +I + + + ++ SV +R A AL++ ++ Sbjct 272 ---PNVTGGTHEYRDSVVIVGKNLKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKT 328 Query 234 YREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVVNNN---LAAEGDTAV------IA 284 Y+EQ+ HFG+++ + CTYIGG N+ + +V ++ + DT+ Sbjct 329 YKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTT 388 Query 285 GKGVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGM 344 GK G+G+G + EH ++MCIY VP + Y D + + +PEF+N+GM Sbjct 389 GKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGM 448 Query 345 EVLPMTQIF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTE 399 + L I N+ + I NL G+ PRY +KT LD+ +G F V + Sbjct 449 QPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF----------VHQ 498 Query 400 SLLSGWFGFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIG 459 LS W + ++ N FK+NP LD +F VN + T TDQ+ Y Sbjct 499 EPLSYW-----TVARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVFGGCYFN 553 Query 460 CYVARNLSRDGVP 472 ++S DG+P Sbjct 554 IVKVSDMSIDGMP 566 >gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens] gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M] Length=656 Score = 164 bits (416), Expect = 6e-41, Method: Compositional matrix adjust. 
Identities = 148/477 (31%), Positives = 216/477 (45%), Gaps = 70/477 (15%) Query 9 AYKLLSYLGYG----NFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFPLLAY 64 A++LL +LGYG FI ++ +K + Y I+ N+F LLAY Sbjct 219 AFRLLHFLGYGVDNNGFIVDFNASYAAGTGEIVKNVLAKKTYKLPDIK---ANVFRLLAY 275 Query 65 QKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYCNWNK 124 Q+IY DF+R WE A P +NVD+ +S ++ Y M L+Y +W+K Sbjct 276 QRIYNDFYRNDLWEAAQPDVFNVDWCCNNNSLDISDELVY--------KMCQLRYRHWSK 327 Query 125 DMLMGILPDSQFGECCYKSIFETPGG-DLKAGFRTTDGKFISAVTNAPLTTENSSSGLST 183 D + P + + K IFE P + GF TT+ K V N N S L Sbjct 328 DWVTSAYPTASYD----KGIFELPDYINGNTGFATTEVK--RDVVN------NRGSQLEI 375 Query 184 PGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDS-DYREQIRKHF 242 + +GS L S IS +S +R AL++ E +++ + DY QI HF Sbjct 376 KSMDAGS---LGSNNISYISPND-------IRAMFALEKMLERTRAANGLDYSNQIAAHF 425 Query 243 GVNLPQSLSNLCTYIGGISRNLDISEVVNNNLAAEGDTAV-------IAGKGVGAGN-GS 294 G +P+S N ++IGG + ISEVV + + TA + GKG+GA N G Sbjct 426 GFKVPESRKNCASFIGGFDNQISISEVVTTSNGSVDGTASTGSVVGQVFGKGIGAMNSGH 485 Query 295 FTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQI-- 352 +Y EH ++MCIY P +DY D E PEF+N+GM+ + + + Sbjct 486 ISYDVKEHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQPVIQSDLCL 545 Query 353 -FNSPKASIVNLFN--AGYNPRYFNWKTKLDVVNGAFTT--TLKSWVSPVTESLLSGWFG 407 NS K+ + N GY+ RY +KT D++ G F + +L +W +P F Sbjct 546 CINSAKSDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKNNYT----FE 601 Query 408 FGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVAR 464 FG L+ V+P VL+PIF V + + TDQ LVNSY R Sbjct 602 FG------------KLSLPDLLVDPKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIR 646 >gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola] Length=584 Score = 143 bits (361), Expect = 6e-34, Method: Compositional matrix adjust. 
Identities = 145/511 (28%), Positives = 229/511 (45%), Gaps = 104/511 (20%) Query 2 GFSGADLAYKLLSYLGYG-----------NFIPSPPSNSTRWWSTSLKKEASPTGYTQQY 50 GF+ ++ A KLL+ L YG N I ST + + KE S Sbjct 134 GFNYSEGAAKLLNMLNYGVTNKGKFMNLENLI-----TSTSYLPSKDDKEPSS------- 181 Query 51 IQNNYVNLFPLLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWK 110 I V+ F LLAYQKI+ DF+R W ++ S+NVD Y+ S+ +T+ PD + + Sbjct 182 IYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSN--LTIEPDVALKFCQ 239 Query 111 SDTMFDLKYCNWNKDMLMGILPDSQFGECCYKSIFETPGGDLKAGFRTTDGKFISAVTNA 170 ++Y + KD L + P + + IF P +++ N Sbjct 240 ------MRYRPYAKDWLTSMKPTPNYSD----GIFNLP-------------EYVRGNGNV 276 Query 171 PLTTENSSSGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSG 230 LT S S V+L S +S S FSV LR A AL + E ++ Sbjct 277 ILTNNKSGS------------VSLDSGTVS-----PSSFSVNDLRAAFALDKMLEATRRA 319 Query 231 DS-DYREQIRKHFGVNLPQSLSNLCTYIGGISRNLDISEVV--NNNLAAEGDTAVI---A 284 + DY QI HFG +P+S +N ++GG ++ +SEVV N N A++G A I Sbjct 320 NGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLG 379 Query 285 GKGVGA-GNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIG 343 GKG+G+ +G+ + + EH ++MCIY P +Y + D E PEF ++G Sbjct 380 GKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFADLG 439 Query 344 MEVLPMTQI------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLDVVNGAFTT--TLK 391 + L + + N +A S + L N GY RY +KT D+V G F + +L Sbjct 440 YQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLS 499 Query 392 SWVSPVTESLLSGWFGFGYSQDDVNKDTKVVLNYKF-----------FKVNPSVLDPIFG 440 W +P + FG+G ++ + + K +Y+ F +NP++++PIF Sbjct 500 YWCTPRFD------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFL 553 Query 441 VNADSTWDTDQLLVNSYIGCYVARNLSRDGV 471 +A D +VNS++ R +S G+ Sbjct 554 TSA---VQADHFIVNSFLDVKAVRPMSVTGL 581 >gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317] gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108] Length=541 Score = 139 bits (350), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 130/486 (27%), Positives = 203/486 (42%), Gaps = 97/486 (20%) Query 1 LGFSGADLAYKLLSYLGYGNFIPSPPSNSTRWWSTSLKKEASPTGYTQQYIQNNYVNLFP 60 G+ ++ + +L+ LGYG I S K P YT VNLF Sbjct 137 FGYPHSNNSCRLMDLLGYGKPITS-------------SKTPVPLLYTGN------VNLFR 177 Query 61 LLAYQKIYQDFFRWSQWERANPSSYNVDYYSGVSSSLVTVLPDYTSDYWKSDTMFDLKYC 120 LLAY KIY D++R + +E + S+N+D+ G T +P T+D +K +L Y Sbjct 178 LLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKG------TFVP--TADEFKK--YLNLHYR 227 Query 121 NWNKDMLMGILPDSQF--GECCYKSIFETPGGDLKAGFRTTDGKFISAVTNAPLTTENSS 178 N D + P F G + S+ + L+ S Sbjct 228 NAPLDFYTNLRPTPLFTIGSDSFSSVLQ-------------------------LSDPTGS 262 Query 179 SGLSTPGVTSGSTVALKSPLISDLSALQSQFSVLALRQAEALQRWKEISQSGDSDYREQI 238 +G S G + + + + SP + ++SA++S F AL + IS Y EQI Sbjct 263 AGFSADG--NSAKLNMASPDVLNVSAIRSAF---------ALDKLLSISMRAGKTYAEQI 311 Query 239 RKHFGVNLPQSLSNLCTYIGGISRNLDISEV------VNNNLAAEGDTAV------IAGK 286 HFGV + + Y+GG N+ + +V N N++ G+ + I GK Sbjct 312 EAHFGVTVSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGK 371 Query 287 GVGAGNGSFTYTTDEHCVVMCIYHAVPLLDYTITGQDGQLLVTDAESLPIPEFDNIGMEV 346 G G+G G + E V+MCIY VP + Y D + IPEF+N+GM+ Sbjct 372 GTGSGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQ- 430 Query 347 LPMTQIFNSPKASIVNLFNAGYNPRYFNWKTKLDVVNGAFTTTLKSWVSPVTESLLSGWF 406 P+ F S + N + G+ PRY +KT D+ +G F P++ ++ Sbjct 431 -PIVPAFVSLNRAKDNSY--GWQPRYSEYKTAFDINHGQFANG-----EPLSYWSIARAR 482 Query 407 GFGYSQDDVNKDTKVVLNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVARNL 466 G DT N K+NP LD +F VN + T TD + ++ ++ Sbjct 483 G---------SDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDM 533 Query 467 SRDGVP 472 + DG+P Sbjct 534 TEDGMP 539 Lambda K H a alpha 0.317 0.134 0.410 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 3246464580183
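
Each alignment header in this report gives a bit score followed by a raw score in parentheses, for example `313 bits (801)`. The two are related by the standard Karlin-Altschul normalisation S' = (lambda * S - ln K) / ln 2. The sketch below applies that formula with the gapped Lambda (0.267) and K (0.0410) printed in the statistics footer. The function names are hypothetical. Truncating to a whole number is an assumption: with these rounded parameters it reproduces all ten bit scores listed in the report, but it is not confirmed as the program's actual formatting rule. The sketch does not attempt to reproduce the E-values.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score):
    """Convert a raw alignment score S to a bit score S'.

    S' = (lambda * S - ln K) / ln 2
    """
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def displayed_bits(raw_score):
    """Whole-number bit score as it appears in the report.

    Assumption: the value is truncated rather than rounded. This choice was
    made only because it matches the values listed in this report.
    """
    return math.floor(bit_score(raw_score))


# (raw score, reported bit score) pairs copied from the alignment headers.
REPORTED = [(801, 313), (798, 311), (792, 309), (771, 301), (713, 279),
            (503, 198), (422, 167), (416, 164), (361, 143), (350, 139)]

for raw, bits in REPORTED:
    print(raw, round(bit_score(raw), 2), displayed_bits(raw), bits)
```

Because Lambda and K are shown to only three significant figures, a raw score whose bit score lies very close to a whole number (such as 798) could land on either side of it if more precise parameters were used.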