Bit score colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-9_CDS_annotation_glimmer3.pl_2_6
Length=529

                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|490418709|ref|WP_004291032.1| hypothetical protein                  615   0.0
gi|496050829|ref|WP_008775336.1| hypothetical protein                  615   0.0
gi|575094354|emb|CDL65742.1| unnamed protein product                   441   2e-144
gi|494822885|ref|WP_007558293.1| hypothetical protein                  416   1e-134
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein      364   2e-115
gi|575094321|emb|CDL65708.1| unnamed protein product                   200   8e-53
gi|517172762|ref|WP_018361580.1| hypothetical protein                  190   1e-49
gi|494308783|ref|WP_007173938.1| hypothetical protein                  184   1e-47
gi|496521299|ref|WP_009229582.1| capsid protein                        182   5e-47
gi|494306153|ref|WP_007173049.1| hypothetical protein                  182   5e-47

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

 Score = 615 bits (1587), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/538 (58%), Positives = 381/538 (71%), Gaps = 29/538 (5%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 +NTAAFAR+REYYDF+FVPYDLLWNKANT LTQMYDNPQHA+ P L G MP+ Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT 120 Query 61 LSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLYYYAESTSNTFAKRP 120 +I+ Y+N+L++ S K+NYFGYNR+ S KL+E LGYGN Y ++ + P Sbjct 121 SEAIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGN---YESFLTDDWNTAP 177 Query 121 LHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSFNYS-DF 179 L NLN ++F LLAYQKIY+D+YRDSQWERVSPS FNVDY+ S M L YS +F Sbjct 178 LMANLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYL----DGSSMNLDNAYSTEF 233 Query 180 YENYSMFDLRYCNWQKDLFHGVVPNQQYGDVASISMSVPVVAGSSAAlinsritssnstt 239 Y+NY+ FDLRYCNWQKDLFHGV+P+QQYG+ A S++ P V G + Sbjct 234 YQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASIT-PDVTGK-----------LTLSN 281 Query 240 tlrFPTDPAIPDATPLLTHPSF------SILALRQAEFLQKWKEITQSGNKDYKEQVEKH 293 T P T P+F SIL LRQAEFLQKWKEITQSGNKDYK+Q+EKH Sbjct 282 FSTVGTSPTTASGTATKNLPAFDTVGDLSILVLRQAEFLQKWKEITQSGNKDYKDQLEKH 341 Query 294 WNVSPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAADIAGKGTGVSNGVINFNSQGR 353 W VS GDGFSE+CTYLGG+SSS+DINEV+N NITGS AADIAGKG GV+NG INFNS GR Sbjct 342 WGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSAAADIAGKGVGVANGEINFNSNGR 401 Query 354 YGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVPLSYVSNGPLSVLP 413 YG++MCIYHCLPL+DYTTD + P+ +VN+ D+AIPEFDRVGMQ++PL + N PL Sbjct 402 YGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMN-PLRSFA 460 Query 414 LSIPNEIGYAPRYIDYKTDIDTSIGAFKTSLKNWVISYDNQSLANQFGYSVETP--ESPV 471 + +GY PRYIDYKT +D S+G FK +L +WVISY N S+ Q + P E Sbjct 461 NASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSE 520 Query 472 PNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVRNLDTDGLPY 529 P P+ + N+T FKVNP+ L+P+FAV+A +TDQFLCS+FFD+K VRNLDTDGLPY Sbjct 521 PVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 
2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 615 bits (1585), Expect = 0.0, Method: Compositional matrix adjust. Identities = 317/549 (58%), Positives = 391/549 (71%), Gaps = 49/549 (9%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 LNTAAFARMREYYDFYFVPY+LLWNKANT LTQMYDNPQHA P L G MP Sbjct 61 LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVT 120 Query 61 LSSISRYLNSLASNSTAVTN-KANYFGYNRALCSAKLMECLGYGNLYYYAESTSNTFAKR 119 I+ YLN +A + T + + NYFGY+R+L +AKL+E LGYGN Y YA S +NT+ K Sbjct 121 CKGIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYATSKNNTWTKS 180 Query 120 PLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSS----GMMLSFN 175 PL NL ++++ +LAYQKIYAD+ RDSQWE+VSPSCFNVDY+ + S+ M+ Sbjct 181 PLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAMTIDSMITGQG 240 Query 176 YSDFYENYSMFDLRYCNWQKDLFHGVVPNQQYGDVASISMSVPVVAGSSAAlinsritss 235 ++ FY +MFDLRYCNWQKDLFHGV+P QQYGD A++++++ V + + Sbjct 241 FAPFY---NMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQYMVQT------ 291 Query 236 nstttlrFPTDPAIPDATPLLTHP---------------SFSILALRQAEFLQKWKEITQ 280 PD P+ P +F++LALRQAEFLQKWKEITQ Sbjct 292 --------------PDGDPVGGSPFSSTGVNLQTVNGSGTFTVLALRQAEFLQKWKEITQ 337 Query 281 SGNKDYKEQVEKHWNVSPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAADIAGKGTG 340 SGNKDYK+Q+EKHWNVS G+ +SEM YLGG ++SLDINEVVN NITGSNAADIAGKG Sbjct 338 SGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVV 397 Query 341 VSNGVINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVP 400 V NG I+F++ RYG++MCIYH LPL+DYTTD V+P+ T++N+ DFAIPEFDRVGM++VP Sbjct 398 VGNGRISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVP 457 Query 401 LSYVSNGPLSVLPLSIPNEIGYAPRYIDYKTDIDTSIGAFKTSLKNWVISYDNQSLANQF 460 L + N PL + +GYAPRYI YKTD+D+S+GAFKT+LK+WV+SYDNQS+ NQ Sbjct 458 LVSLMN-PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQL 516 Query 461 GYSVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVR 520 Y + SP + NYT FKVNPN ++PLFAV A +SIDTDQFLCS+FFDVKVVR Sbjct 517 
NYQDDPNNSP-----GTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVR 571 Query 521 NLDTDGLPY 529 NLDTDGLPY Sbjct 572 NLDTDGLPY 580 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 441 bits (1134), Expect = 2e-144, Method: Compositional matrix adjust. Identities = 249/575 (43%), Positives = 342/575 (59%), Gaps = 62/575 (11%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 LNT+AFARMREYYDFYFVP++ +WNK ++ +TQM N QHA + + L G MP+ Sbjct 57 LNTSAFARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFT 116 Query 61 LSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLYYYAESTSNTFAKRP 120 I+ YLN A+ + + N FG+NR+ + KL++ LGYG+ Y +S +NT++ +P Sbjct 117 SEQIADYLNDQATAA-----RKNPFGFNRSTLTCKLLQYLGYGD-YNSFDSETNTWSAKP 170 Query 121 LHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSFNYSDFY 180 L YNL +S F LLAYQKIY+D+YR +QWE+ +PS FN+DY+ G+S + + Sbjct 171 LLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYI---KGTSDLQMDLTGLPSD 227 Query 181 ENYSMFDLRYCNWQKDLFHG--------------------VVPNQQYGDVASISMSVPVV 220 +N + FD+RYCN+QKD+FHG V+ N G + S P Sbjct 228 DN-NFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDPGT 286 Query 221 AGSS----------------AAlinsritssnstttlrFPTDPAIPDATPLLTHPSF--- 261 G+S + + S + FP++ + + L +P+ Sbjct 287 PGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNAST--RSLLWENPNLIIE 344 Query 262 -------SILALRQAEFLQKWKEITQSGNKDYKEQVEKHWNVSPGDGFSEMCTYLGGISS 314 ILALRQAEFLQKWKE++ SG +DYK Q+EKHW + D S YLGG ++ Sbjct 345 NNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCAT 404 Query 315 SLDINEVVNQNITGSNAADIAGKGTGVSNGVINFNSQGRYGVVMCIYHCLPLIDYTTDFV 374 SLDINEV+N NITG NAADIAGKGT NG I F S+G YG++MCIYH LP++DY V Sbjct 405 SLDINEVINNNITGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGV 464 Query 375 SPSVTRVNAADFAIPEFDRVGMQTVPLSYVSNGPLSVLPLSIPNEIGYAPRYIDYKTDID 434 S T V+A F IPE D++GM++VPL N S +GYAPRYID+KT +D Sbjct 465 DHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSVD 524 Query 435 TSIGAFKTSLKNWVISYDNQSLANQFGYSVETPESPVPNPANSSWNYTLFKVNPNSLNPL 494 
S+G F SL+ W + ++ L + S+ P +P P + + + FKVNP+ ++PL Sbjct 525 RSVGDFADSLRTWCLPVGDKELTS--ANSLNFPSNPNVEPDSIAAGF--FKVNPSIVDPL 580 Query 495 FAVEADSSIDTDQFLCSTFFDVKVVRNLDTDGLPY 529 FAV ADS++ TD+FLCS+FFDVKVVRNLD +GLPY Sbjct 581 FAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 416 bits (1068), Expect = 1e-134, Method: Compositional matrix adjust. Identities = 238/562 (42%), Positives = 327/562 (58%), Gaps = 49/562 (9%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 LNTAAFARMR Y+DFYFVP+ +WNK T++TQM N HA + V L +P+ Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPYFT 127 Query 61 LSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLYYY----AESTSNTF 116 ++ Y+ SLA + N FGY RA ++E LGYG+ Y Y A T+ Sbjct 128 AEQVADYIVSLA-------DSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGATW 180 Query 117 AKRPLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSFNY 176 A RP+ NL S F L AYQKIYAD+ R +QWER +PS FN+DY+ S + + L F Sbjct 181 ATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYI--SGSADSLQLDFTV 238 Query 177 SDFYENYSMFDLRYCNWQKDLFHGVVPNQQYGDVASI--SMSVPVVAGSSA--------A 226 F +++++FD+RY NWQ+DL HG +P QYG+ +++ S S+ VV G + Sbjct 239 EGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTTGQDG 298 Query 227 linsritssnstttlrFPTDPAIPDATPLLTHPS-------------FSILALRQAEFLQ 273 + + ++ ++ ++ L + + SILALR+AE Q Sbjct 299 VAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQ 358 Query 274 KWKEITQSGNKDYKEQVEKHWNVSPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAAD 333 KWKE+ + +DY Q+E HW S +S+MC +LG I+ L INEVVN NITG NAAD Sbjct 359 KWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGENAAD 418 Query 334 IAGKGTGVSNGVINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDR 393 IAGKGT NG INFN G+YG+VMC++H LP +DY T T N DF IPEFD+ Sbjct 419 IAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDK 478 Query 394 
VGMQTVPLSYVSN------GPLSVLPLSIPNEIGYAPRYIDYKTDIDTSIGAFKTSLKNW 447 +GM+ VP+ N G V P GYAP+Y ++KT +D S+G F+ SLK W Sbjct 479 IGMEQVPVIRGLNPVKPKDGDFKVSPNLY---FGYAPQYYNWKTTLDKSMGEFRRSLKTW 535 Query 448 VISYDNQSLANQFGYSVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQ 507 +I +D+++L SV+ P++ PN S FKV+P+ L+ LFAV+A+S ++TDQ Sbjct 536 IIPFDDEALLA--ADSVDFPDN--PNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQ 591 Query 508 FLCSTFFDVKVVRNLDTDGLPY 529 FLCST FDV VVR+LD +GLPY Sbjct 592 FLCSTLFDVNVVRSLDPNGLPY 613 >gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=573 Score = 364 bits (935), Expect = 2e-115, Method: Compositional matrix adjust. Identities = 228/536 (43%), Positives = 318/536 (59%), Gaps = 30/536 (6%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 +N+AA++R+REYYDFYFVPY LLWN A T T M D P HA D ++ V L P+ Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPD-PHHAADL--VSSVNLSQRHPWFT 117 Query 61 LSSISRYLNSLASNSTAVTN-KANYFGYNRALCSAKLMECL--GYGNLYYYAESTSNTFA 117 I YL +L S S A + N+FG++R S KL+ L G+G Y + S++ Sbjct 118 FFDIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYGFGKDYESVKVPSDS-- 175 Query 118 KRPLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYM-PYSTGSSGMMLSFNY 176 ++ +S F LLAYQKI DY+RD QW+ +P +N+DY+ S+G M SF Sbjct 176 -----DDIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPMSSFT- 229 Query 177 SDFYENYSMFDLRYCNWQKDLFHGVVPNQQYGDVASISMSVPVVAGSSAAlinsritssn 236 +D ++N +MFDL YCN+QKD F G++P QYGDV S++ P+ +S +S Sbjct 230 NDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDV---SVASPIFGDLDIGDSSSLTFASA 286 Query 237 stttlrFPTDPAIPDATPLLTHPSFSILALRQAEFLQKWKEITQSGNKDYKEQVEKHWNV 296 + T S+LALRQAE LQKW+EI QSG DY+ Q++KH+NV Sbjct 287 PQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNV 346 Query 297 SPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAADIAGKGTGVSNG-VINFNSQGRYG 355 SP S C YLGG +S+LDI+EVVN N+TG N ADI GKGTG NG ++F S +G Sbjct 347 SPSATLSGHCKYLGGWTSNLDISEVVNTNLTGDNQADIQGKGTGTLNGNKVDFES-SEHG 405 
Query 356 VVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVPLSYVSNGPLSVLPLS 415 ++MCIYHCLPL+D++ + ++ + D+AIPEFD VGMQ + S + G L LP S Sbjct 406 IIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFG-LEDLP-S 463 Query 416 IPNEI--GYAPRYIDYKTDIDTSIGAFKTSLKNWVISYDNQSLANQFGYSVETPESPVPN 473 P+ I GY PRY D KT ID G+F +L +WV + ++ Y ++ Sbjct 464 DPSSINMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYIS---AYRQACKDAGF-- 518 Query 474 PANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVRNLDTDGLPY 529 ++ + Y FKVNP+ ++ +F V+ADS+I+TDQ L +++FD+K VRN D +GLPY Sbjct 519 -SDITMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 200 bits (508), Expect = 8e-53, Method: Compositional matrix adjust. Identities = 168/597 (28%), Positives = 267/597 (45%), Gaps = 88/597 (15%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHA----LDSSPLNVVKLDGSM 56 L + AF R+RE ++FVPY LW ++ + M N + SS + K+ M Sbjct 64 LQSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQM 123 Query 57 PFTDLSSISRYLNSLASNSTAVTNKANYFGYNRALC-----SAKLMECLGYGNLYYYAES 111 P + ++ YL + ST ++ + +NR C SAKL++ LGYGN + E Sbjct 124 PCVNYKTLHAYLLKFINRSTVGSDGSVGPEFNRG-CYRHAESAKLLQLLGYGN---FPEQ 179 Query 112 TSNTFAKRPLH-----------YNLN--VSLFNLLAYQKIYADYYRDSQWERVSPSCFNV 158 +N H YN + +S+F LLAY KI D+Y QW+ + S NV Sbjct 180 FANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNV 239 Query 159 DYMPYSTGSSGMMLSFNYSDF--------YENYSMFDLRYCNWQKDLFHGVVPNQQYGDV 210 DY+ T +S +LS + + E ++ D+R+ N D F GV+P Q+G Sbjct 240 DYL---TPNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSE 296 Query 211 ASISMSVPVVAGSSA-------------------------AlinsritssnstttlrFPT 245 + +++++ +GS+ A + +++ Sbjct 297 SVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISH 356 Query 246 DPAIPDATPLLTHPS--FSILALRQAEFLQKWKEITQSGNKDYKEQVEKHWNVSPGDGFS 303 D + T S SI+ALR A QK+KEI + + D++ QVE H+ + P D + Sbjct 357 DHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKP-DEKN 415 Query 304 
EMCTYLGGISSSLDINEVVNQNITGSNAADIAGKGTGVSNGVINFNSQGRYGVVMCIYHC 363 E ++GG SS ++INE +NQN++G N A G + I F ++ YGVV+ IY C Sbjct 416 ENSLFIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTAK-TYGVVIGIYRC 474 Query 364 LPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVPLSYVS-----NGPLSVLPLS--- 415 P++D+ + ++ + +A+DF IPE D +GMQ V+ N + Sbjct 475 TPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGS 534 Query 416 ---IPNEIGYAPRYIDYKTDIDTSIGAFKTSLKNWVISYDNQSLANQFGYSVETPESPVP 472 + GYAPRY ++KT D GAF SLK+WV + ++ N + +P Sbjct 535 SPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWNTWAGINAP-- 592 Query 473 NPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVRNLDTDGLPY 529 +F P+ + LF V + ++ D DQ RNL GLPY Sbjct 593 ---------NMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 190 bits (482), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 161/559 (29%), Positives = 249/559 (45%), Gaps = 86/559 (15%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 +N+AAF MR Y+FYFVPY LW+ + +T M D SS + K G P + Sbjct 63 MNSAAFMSMRGVYEFYFVPYKQLWSGFDQFITGMSD-----YKSSFMYAFK--GKTPPSC 115 Query 61 LSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLYY-----YAESTSNT 115 +S + L +TA + G+++ ++++ LGYG Y TS T Sbjct 116 VSFDVQKLVDWCKTNTA----KDIHGFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTT 171 Query 116 FAKRPLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSFN 175 K + F LAYQKIY D+YR++ +E FNVD M Y +G + Sbjct 172 MGK--------CTPFRGLAYQKIYNDFYRNTTYEEYQLESFNVD-MFYGSGKVKETIPNE 222 Query 176 YSDFYENYSMFDLRYCNWQKDLFHGVVP---------NQQY--GDVASISMSVPVVAGSS 224 D Y F LRY N QKDL V P N Q+ G + P V G + Sbjct 223 PWD----YDWFTLRYRNAQKDLLTNVRPTPLFSIDDFNPQFFTGGSDIVMEKGPNVTGGT 278 Query 225 AAlinsritssnstttlrFPTDPAIPDATPLLTHPSFSILALRQAEFLQKWKEITQSGNK 284 +S + + + + S+ +R A L+K +T K Sbjct 279 HEYRDSVVIVGKNLKENGVDSKRTM-----------ISVADIRNAFALEKLASVTMRAGK 327 Query 285 DYKEQVEKHWNVSPGDGFSEMCTYLGGISSSLDINEVVNQNIT----------GSNAADI 334 YKEQ+E H+ 
+S +G CTY+GG S++ + +V + T G Sbjct 328 TYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRT 387 Query 335 AGKGTGVSNGVINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRV 394 GK TG +G I F+++ +G++MCIY +P + Y + V P V ++ DF +PEF+ + Sbjct 388 TGKATGSGSGHIRFDAK-EHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENL 446 Query 395 GMQTV---PLSYVSNGPLSVLPLSIPNEIGYAPRYIDYKTDIDTSIGAF--KTSLKNWVI 449 GMQ + +SY N + + G+ PRY +YKT +D + G F + L W + Sbjct 447 GMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQFVHQEPLSYWTV 506 Query 450 SYDNQSLANQFGYSVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFL 509 A G S+ S++N + FK+NP L+ +FAV + + TDQ Sbjct 507 -------ARARGESM------------SNFNISTFKINPKWLDDVFAVNYNGTELTDQVF 547 Query 510 CSTFFDVKVVRNLDTDGLP 528 +F++ V ++ DG+P Sbjct 548 GGCYFNIVKVSDMSIDGMP 566 >gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis] gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=553 Score = 184 bits (466), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 154/548 (28%), Positives = 254/548 (46%), Gaps = 78/548 (14%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYD-----NPQHALDSSPLNVVKLDGS 55 +NTAAFA MR Y+F+FVPY LW + + +T M D N +SPL V Sbjct 62 MNTAAFASMRGVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQV------ 115 Query 56 MPFTDLSSISRYLNSLASNSTAVTNKANY-FGYNRALCSAKLMECLGYGNLY-YYAESTS 113 P+ ++ S+ LN+ + + T+ Y F Y + +L++ LGYG + + + Sbjct 116 -PYFNVDSVFNSLNTGKESGSGSTDDLQYKFKYG----AFRLLDLLGYGRKFDSFGTAYP 170 Query 114 NTFAKRPLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLS 173 + + + + N S+F +LAY KIY DYYR+S +E FN D G++ + Sbjct 171 DNVSGLKNNLDYNCSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKF-----KGGLVDA 225 Query 174 FNYSDFYENYSMFDLRYCNWQKDLFHGVVPNQQYGDVASISMSVPVVAGSSAAlinsrit 233 +D +F LRY N Q D F + +Q + S + + V + A + + Sbjct 226 KVVAD------LFKLRYRNAQTDYFTNLRQSQLF----SFTTAFEDVDNINIAPRDYVKS 275 Query 234 ssnstttlrFPTDPAIPDATPLLTHPSFSILALRQAEFLQKWKEITQSGNKDYKEQVEKH 293 ++ T + F D + FS+ +LR A + K +T K +++Q+ H Sbjct 276 DGSNFTRVNFGVDTDSSEG-------DFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAH 328 Query 294 WNVSPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAAD----------IAGKGTGVSN 343 + V D YLGG S + +++V +G+ A + +AGKGTG Sbjct 329 YGVEIPDSRDGRVNYLGGFDSDMQVSDVTQ--TSGTTATEYKPEAGYLGRVAGKGTGSGR 386 Query 344 GVINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVPLSY 403 G I F+++ +GV+MCIY +P I Y + P V +++ D+ PEF+ +GMQ + SY Sbjct 387 GRIVFDAK-EHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQPLNSSY 445 Query 404 VSNGPLSVLPLSIPNEI-GYAPRYIDYKTDIDTSIGAFKTS--LKNWVISYDNQSLANQF 460 +S S N + GY PRY +YKT +D + G F S L +W +S +F Sbjct 446 IS----SFCTTDPKNPVLGYQPRYSEYKTALDVNHGQFAQSDALSSWSVS--------RF 493 Query 461 GYSVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVR 520 P+ + + FK++P LN +F V+ + + D F++ V Sbjct 494 RRWTTFPQLEIAD----------FKIDPGCLNSIFPVDYNGTEANDCVYGGCNFNIVKVS 543 Query 521 NLDTDGLP 528 ++ DG+P Sbjct 544 DMSVDGMP 551 >gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. 
oral taxon 317] gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108] Length=541 Score = 182 bits (462), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 158/546 (29%), Positives = 251/546 (46%), Gaps = 87/546 (16%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYDNPQHALDSSPLNVVKLDGSMPFTD 60 +N+AAF MR Y+F+FVPY LW+ + +T M D + ++ SS LD S+P Sbjct 63 MNSAAFISMRGVYEFFFVPYSQLWHPYDQFITSMNDY-RSSVVSSAAGDKALD-SVPNVK 120 Query 61 LSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLYYYAESTSNTFAKRP 120 L+ + +++ T+K + FGY + S +LM+ LGYG + +++ P Sbjct 121 LADMYKFVRER-------TDK-DIFGYPHSNNSCRLMDLLGYG------KPITSSKTPVP 166 Query 121 LHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSFNYSDFY 180 L Y NV+LF LLAY KIY+DYYR++ +E V FN+D+ G + +D + Sbjct 167 LLYTGNVNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDH------KKGTFVP--TADEF 218 Query 181 ENYSMFDLRYCNWQKDLFHGVVPNQQY---GDVASISMSVPVVAGSSAAlinsritssns 237 + Y +L Y N D + + P + D S + + GS+ + N Sbjct 219 KKY--LNLHYRNAPLDFYTNLRPTPLFTIGSDSFSSVLQLSDPTGSAGFSADGNSAKLNM 276 Query 238 tttlrFPTDPAIPDATPLLTHPSFSILALRQAEFLQKWKEITQSGNKDYKEQVEKHWNVS 297 A PD ++ A+R A L K I+ K Y EQ+E H+ V+ Sbjct 277 ----------ASPDV--------LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVT 318 Query 298 PGDGFSEMCTYLGGISSSLDINEVV------NQNITGSNAADIAG-------KGTGVSNG 344 +G YLGG S++ + +V N N++ A +AG KGTG G Sbjct 319 VSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYG 378 Query 345 VINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQTVPLSYV 404 I F+++ GV+MCIY +P + Y + P V + D+ IPEF+ +GMQ + ++V Sbjct 379 EIQFDAK-EPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVPAFV 437 Query 405 SNGPLSVLPLSIPNEIGYAPRYIDYKTDIDTSIGAFKTS--LKNWVISYDNQSLANQFGY 462 S L + N G+ PRY +YKT D + G F L W I+ S Sbjct 438 S------LNRAKDNSYGWQPRYSEYKTAFDINHGQFANGEPLSYWSIARARGS------- 484 Query 463 SVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFFDVKVVRNL 522 +++N K+NP+ L+ +FAV + + TD F+++ V ++ Sbjct 485 -----------DTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDM 533 
Query 523 DTDGLP 528 DG+P Sbjct 534 TEDGMP 539 >gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis] gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=519 Score = 182 bits (461), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 157/554 (28%), Positives = 251/554 (45%), Gaps = 91/554 (16%) Query 1 LNTAAFARMREYYDFYFVPYDLLWNKANTSLTQMYD-----NPQHALDSSPLNVVKLDGS 55 +NTAAFA MR Y+F+FVPY LW + + +T M D N +SPL V Sbjct 29 MNTAAFASMRGVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQV------ 82 Query 56 MPFTDLSSISRYLNSLASNSTAVTNKANYFGYNRALCSAKLMECLGYGNLY-YYAESTSN 114 P+ +L S+ + + S + + F Y + +L++ LGYG + + + + Sbjct 83 -PYFNLESVFKNIIERDSTPSFQDDLQYRFKYG----AFRLLDLLGYGRKFDSFGTAYPD 137 Query 115 TFAKRPLHYNLNVSLFNLLAYQKIYADYYRDSQWERVSPSCFNVDYMPYSTGSSGMMLSF 174 + + + N S+F +LAY KIY DYYR+S +E FN D G++ + Sbjct 138 NVSGLKNNLDYNCSVFRVLAYNKIYQDYYRNSNYENFDTDSFNFDKF-----KGGLVDAK 192 Query 175 NYSDFYENYSMFDLRYCNWQKDLFHGVVPNQ------QYGDVASISMSVPVVAGSSAAli 228 +D +F LRY N Q D F + +Q ++ D ++ A S + Sbjct 193 VVAD------LFKLRYRNAQTDYFTNLRQSQLFTFIPEFSDDEHLNFDRDQYADQSKSNF 246 Query 229 nsritssnstttlrFPTDPAIPDATPLLTHPSFSILALRQAEFLQKWKEITQSGNKDYKE 288 FP D L + FS+ +LR A + K +T K +++ Sbjct 247 TQLN----------FPVD-----VDNNLGY--FSVSSLRSAFAVDKLLSVTMRAGKTFQD 289 Query 289 QVEKHWNVSPGDGFSEMCTYLGGISSSLDINEVVNQNITGSNAAD----------IAGKG 338 Q+ H+ V D YLGG S L +++V +G+ A + IAGKG Sbjct 290 QMRAHYGVEIPDSRDGRVNYLGGFDSDLQVSDVTQ--TSGTTATEYKPEAGYLGRIAGKG 347 Query 339 TGVSNGVINFNSQGRYGVVMCIYHCLPLIDYTTDFVSPSVTRVNAADFAIPEFDRVGMQT 398 TG G I F+++ +GV+MCIY +P I Y + P V +++ DF PEF+ +GMQ Sbjct 348 TGSGRGRIVFDAK-EHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDFFTPEFENLGMQP 406 Query 399 VPLSYVSN--GPLSVLPLSIPNEIGYAPRYIDYKTDIDTSIGAFKT--SLKNWVISYDNQ 454 + SY+S+ P P+ +GY PRY +YKT +D + G F +L +W +S Sbjct 407 LNSSYISSFCTPDPKNPV-----LGYQPRYSEYKTALDINHGQFAQNDALSSWSVS---- 457 Query 455 SLANQFGYSVETPESPVPNPANSSWNYTLFKVNPNSLNPLFAVEADSSIDTDQFLCSTFF 514 +F P+ + + FK++P 
LN +F VE + + TD F
Sbjct 458 ----RFRRWTTFPQLEIAD----------FKIDPGCLNSVFPVEFNGTESTDCVFGGCNF 503

Query 515 DVKVVRNLDTDGLP 528
++ V ++ DG+P
Sbjct 504 NIVKVSDMSVDGMP 517

Lambda      K        H        a         alpha
   0.317    0.133    0.404    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 3784284189360
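The raw scores shown in parentheses in each hit header and the bit scores printed beside them are linked by the gapped Lambda and K values in the footer. The following is a minimal sketch, assuming the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, applied with the rounded footer values. The helper name `bit_score` is hypothetical and not part of BLAST. Because the report uses compositional matrix adjustment and prints rounded parameters, the computed values should only be expected to fall within about one bit of the printed ones.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, printed bit score) pairs copied from the hit headers in this report.
reported = [(1587, 615), (1134, 441), (1068, 416), (935, 364), (508, 200)]

for raw, printed in reported:
    print(f"raw={raw:5d}  computed={bit_score(raw):7.1f}  printed={printed}")
```

The Expect values are deliberately not recomputed here: they also depend on length corrections and per-alignment composition adjustments that the footer alone does not determine.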