bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-9_CDS_annotation_glimmer3.pl_2_6

Length=384

                                                                     Score     E
Sequences producing significant alignments:                         (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein    268    2e-80
gi|496050829|ref|WP_008775336.1|  hypothetical protein                256    2e-75
gi|490418709|ref|WP_004291032.1|  hypothetical protein                255    2e-75
gi|575094354|emb|CDL65742.1|  unnamed protein product                 245    3e-71
gi|494822885|ref|WP_007558293.1|  hypothetical protein                216    3e-60
gi|575094321|emb|CDL65708.1|  unnamed protein product                 152    2e-37
gi|565841287|ref|WP_023924568.1|  hypothetical protein                134    4e-31
gi|517172762|ref|WP_018361580.1|  hypothetical protein                131    2e-30
gi|494306153|ref|WP_007173049.1|  hypothetical protein                122    2e-27
gi|647452987|ref|WP_025792807.1|  hypothetical protein                122    5e-27

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score = 268 bits (686), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 168/393 (43%), Positives = 229/393 (58%), Gaps = 39/393 (10%) Query 1 VDYFTGvspslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPD 60 +DY G S + S + D +K+ TMFDL YCN+ KD G+LP +Q+GDV+V P Sbjct 211 LDYLYGKSSGFHIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVAS-PI 269 Query 61 SGDSNVVLGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQF 120 GD ++ G +S++T +AP N I + + S+ + Sbjct 270 FGDLDI-----------GDSSSLTFASAP-------QQGANTIQSGVLVVNNNSNTTAGL 311 Query 121 TVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEV 180 +VLALRQAE LQ+W+EI+QSG DY+ Q++KHF V LS C Y+GG + NLDISEV Sbjct 312 SVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEV 371 Query 181 VNNNLATEGDTAVIAGKGVGAGNGS-FEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLV 239 VN NL T + A I GKG G NG+ ++ ++EH ++MCIYH +PLLD+++ Q Sbjct 372 VNTNL-TGDNQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFK 430 Query 240 TDAESLPIPEFDNIGMEVL-PMTQVFN-----SPKASIVNLFNAGYNPRYFNWKTKLDVI 293 T IPEFD++GM+ L P +F S +SI N GY PRY + KT +D I Sbjct 431 TTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSI----NMGYVPRYADLKTSIDEI 486 Query 294 NGAFTTTLKSWVSPVTESLLSGW--FCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFG 351 +G+F TL SWVSP+T+S +S + C KD D + M Y FFKVNP ++D IFG Sbjct 487 HGSFIDTLVSWVSPLTDSYISAYRQAC----KDAGFSD--ITMTYNFFKVNPHIVDNIFG 540 Query 352 VNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384 V ADST +TDQLL+NSY VRN +G+PY Sbjct 541 VKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 256 bits (653), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 151/361 (42%), Positives = 209/361 (58%), Gaps = 31/361 (9%) Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAITSKT 87 MFDL+YCNW KD+ GVLP Q+GD A +++ SNV+ + Sbjct 247 MFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV---NLSNVL-------------------S 284 Query 88 APFPLFALDASPENPIPINsklrldlsslks-QFTVLALRQAEALQRWKEISQSGDSDYR 146 A + + D P P +S + S FTVLALRQAE LQ+WKEI+QSG+ DY+ Sbjct 285 AQYMVQTPDGDPVGGSPFSSTGVNLQTVNGSGTFTVLALRQAEFLQKWKEITQSGNKDYK 344 Query 147 EQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVGAGNGSF 206 +QI KH+ V + +A S M Y+GG + +LDI+EVVNNN+ T + A IAGKGV GNG Sbjct 345 DQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI-TGSNAADIAGKGVVVGNGRI 403 Query 207 EYTTTE-HCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFN 265 + E + ++MCIYH++PLLDYT + ++ IPEFD +GME +P+ + N Sbjct 404 SFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN 463 Query 266 SPKASIVNLFNA--GYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNK 323 P S N+ ++ GY PRY ++KT +D GAF TTLKSWV + + + Sbjct 464 -PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNY---Q 519 Query 324 DDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVP 383 DD ++NY FKVNP+ +DP+F V A ++ DTDQ L +S+ VVRNL DG+P Sbjct 520 DDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLP 579 Query 384 Y 384 Y Sbjct 580 Y 580 >gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii] gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697] Length=578 Score = 255 bits (652), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 147/371 (40%), Positives = 202/371 (54%), Gaps = 31/371 (8%) Query 21 DYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIA 80 +++++ FDL+YCNW KD+ GVLP+ Q+G+ AV I + L S+ S+VG + Sbjct 232 EFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL---SNFSTVGTS 288 Query 81 SAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQS 140 S TA L A D + ++L LRQAE LQ+WKEI+QS Sbjct 289 PTTASGTATKNLPAFDTVGD-------------------LSILVLRQAEFLQKWKEITQS 329 Query 141 GDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVG 200 G+ DY++Q+ KH+GV + S +CTY+GGVS ++DI+EV+N N+ T A IAGKGVG Sbjct 330 GNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGSAAADIAGKGVG 388 Query 201 AGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLP 259 NG + + + ++MCIYH +PLLDYT D L ++ IPEFD +GM+ +P Sbjct 389 VANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMP 448 Query 260 MTQVFNSPKASIVNL--FNAGYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWF 317 + Q+ N P S N GY PRY ++KT +D G F TL SWV + Sbjct 449 LVQLMN-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQV 507 Query 318 CFGYNKDDAAPDTKV----IMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYV 373 + P V MN+ FFKVNP LDPIF V A +TDQ L +S+ Sbjct 508 TLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKA 567 Query 374 VRNLSRDGVPY 384 VRNL DG+PY Sbjct 568 VRNLDTDGLPY 578 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 245 bits (626), Expect = 3e-71, Method: Compositional matrix adjust. 
Identities = 151/395 (38%), Positives = 218/395 (55%), Gaps = 48/395 (12%) Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAV------LDIPDSGDSNVV-------------- 67 FD++YCN+ KDM GVLP +Q+G +V L++ +GDS + Sbjct 231 FFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDPGTPGTS 290 Query 68 -------LGTDSHKSSVGIASAITSKTAP-----FPLFALDASP--ENPIPINsklrldl 113 +G D+ V ++ K+A FP A S ENP I Sbjct 291 YVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLI------IE 344 Query 114 sslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSR 173 ++ +LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+K+ LS+ Y+GG + Sbjct 345 NNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCAT 404 Query 174 NLDISEVVNNNLATEGDTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTG 232 +LDI+EV+NNN+ T + A IAGKG GNGS + + E+ ++MCIYH +P++DY +G Sbjct 405 SLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSG 463 Query 233 QDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GYNPRYFNWKTKL 290 D + DA S PIPE D IGME +P+ + N K S + GY PRY +WKT + Sbjct 464 VDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSV 523 Query 291 DVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKD-DAAPDTKVIMNYKFFKVNPSVLDPI 349 D G F +L++W PV + L+ + + + PD+ + FFKVNPS++DP+ Sbjct 524 DRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGFFKVNPSIVDPL 580 Query 350 FGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384 F V ADST TD+ L +S+ VVRNL +G+PY Sbjct 581 FAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 216 bits (549), Expect = 3e-60, Method: Compositional matrix adjust. 
Identities = 139/378 (37%), Positives = 211/378 (56%), Gaps = 26/378 (7%) Query 25 SGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVG------ 78 S +FD++Y NW +D+L G +P +Q+G+ + +P SG VV G + G Sbjct 244 SFNLFDMRYSNWQRDLLHGTIPQAQYGEASA--VPVSGSMQVVEGPTPPAFTTGQDGVAF 301 Query 79 IASAITSKTAPFPLFALDASPENPI-PINsklrldlsslksQF--TVLALRQAEALQRWK 135 + +T + + L A + E+ I N+ + S F ++LALR+AEA Q+WK Sbjct 302 LNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWK 361 Query 136 EISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAVIA 195 E++ + + DY QI H+G + +A S+MC ++G ++ +L I+EVVNNN+ E + A IA Sbjct 362 EVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE-NAADIA 420 Query 196 GKGVGAGNGSFEYTT-TEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIG 254 GKG +GNGS + ++ +VMC++H +P LDY + +T+ PIPEFD IG Sbjct 421 GKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIG 480 Query 255 MEVLPMTQVFNSPKAS------IVNLFNAGYNPRYFNWKTKLDVINGAFTTTLKSWVSPV 308 ME +P+ + N K NL+ GY P+Y+NWKT LD G F +LK+W+ P Sbjct 481 MEQVPVIRGLNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPF 539 Query 309 -TESLLSG-WFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVN 366 E+LL+ F N + A K FFKV+PSVLD +F V A+S +TDQ L + Sbjct 540 DDEALLAADSVDFPDNPNVEADSVKA----GFFKVSPSVLDNLFAVKANSDLNTDQFLCS 595 Query 367 SYIGCYVVRNLSRDGVPY 384 + VVR+L +G+PY Sbjct 596 TLFDVNVVRSLDPNGLPY 613 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 152 bits (384), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 123/394 (31%), Positives = 186/394 (47%), Gaps = 61/394 (15%) Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAV--LDIPDSGDSNVVLGTDSH-----KSSVG-- 78 + D+++ N D GVLP SQFG +V L++ ++ S V+ GT S +++ G Sbjct 271 LLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEW 330 Query 79 -----IASAITSK----TAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE 129 +AS+ + + D + + IN+ L +++ALR A Sbjct 331 EMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAINTSLSG-------NLSIIALRNAL 383 Query 130 ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG 189 A Q++KEI + D D++ Q+ HFG+K P + +IGG S ++I+E +N NL+ G Sbjct 384 AAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLS--G 440 Query 190 DTAVIAGKG-VGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP 248 D G G G+ S ++T + VV+ IY P+LD+ G D L TDA IP Sbjct 441 DNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIP 500 Query 249 EFDNIGMEVLPMTQVFNSPKASIVNLFNA---------------GYNPRYFNWKTKLDVI 293 E D+IGM+ +V + A + F A GY PRY +KT D Sbjct 501 EMDSIGMQQTFRCEV--AAPAPYNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRY 558 Query 294 NGAFTTTLKSWVSPVTESLLSG--WFCF-GYNKDDAAPDTKVIMNYKFFKVNPSVLDPIF 350 NGAF +LKSWV+ + + W + G N AP+ F P ++ +F Sbjct 559 NGAFCHSLKSWVTGINFDAIQNNVWNTWAGIN----APN--------MFACRPDIVKNLF 606 Query 351 GVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 384 V++ + D DQL V CY RNLSR G+PY Sbjct 607 LVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640 >gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens] gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M] Length=656 Score = 134 bits (337), Expect = 4e-31, Method: Compositional matrix adjust. 
Identities = 107/366 (29%), Positives = 166/366 (45%), Gaps = 54/366 (15%) Query 28 MFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAITSKT 87 M L+Y +W+KD + P + + D + ++PD + N T K V + ++ Sbjct 317 MCQLRYRHWSKDWVTSAYPTASY-DKGIFELPDYINGNTGFATTEVKRDV-----VNNRG 370 Query 88 APFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS-DYR 146 + + ++DA I+ D +R AL++ E +++ + DY Sbjct 371 SQLEIKSMDAGSLGSNNISYISPND------------IRAMFALEKMLERTRAANGLDYS 418 Query 147 EQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEGDTAV-------IAGKGV 199 QI HFG K+P++ N ++IGG + ISEVV + + TA + GKG+ Sbjct 419 NQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTTSNGSVDGTASTGSVVGQVFGKGI 478 Query 200 GAGN-GSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVL 258 GA N G Y EH ++MCIY P +DY D E PEF+N+GM+ Sbjct 479 GAMNSGHISYDVKEHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQ-- 536 Query 259 PMTQ-----VFNSPKASIVNLFN--AGYNPRYFNWKTKLDVINGAFTT--TLKSWVSPVT 309 P+ Q NS K+ + N GY+ RY +KT D+I G F + +L +W +P Sbjct 537 PVIQSDLCLCINSAKSDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKN 596 Query 310 ESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYI 369 + F + K + PD V+P VL+PIF V + + TDQ LVNSY Sbjct 597 N------YTFEFGK-LSLPD---------LLVDPKVLEPIFAVKYNGSMSTDQFLVNSYF 640 Query 370 GCYVVR 375 +R Sbjct 641 DVKAIR 646 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 131 bits (329), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 106/378 (28%), Positives = 158/378 (42%), Gaps = 63/378 (17%) Query 29 FDLKYCNWNKDMLMGVLP---------NSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGI 79 F L+Y N KD+L V P N QF DI NV GT ++ SV I Sbjct 229 FTLRYRNAQKDLLTNVRPTPLFSIDDFNPQFF-TGGSDIVMEKGPNVTGGTHEYRDSVVI 287 Query 80 ASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQ 139 EN + S ++ +V +R A AL++ ++ Sbjct 288 VGKNLK--------------ENGV----------DSKRTMISVADIRNAFALEKLASVTM 323 Query 140 SGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLAT---------EGD 190 Y+EQ+ HFG+ + + CTYIGG N+ + +V ++ T G Sbjct 324 RAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGY 383 Query 191 TAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEF 250 GK G+G+G + EH ++MCIY VP + Y D + + +PEF Sbjct 384 LGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEF 443 Query 251 DNIGMEVLPMTQVF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLKSWV 305 +N+GM+ L + N+ + I NL G+ PRY +KT LD+ +G F Sbjct 444 ENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF-------- 495 Query 306 SPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLV 365 V + LS W A ++ N FK+NP LD +F VN + T TDQ+ Sbjct 496 --VHQEPLSYWTV-----ARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVFG 548 Query 366 NSYIGCYVVRNLSRDGVP 383 Y V ++S DG+P Sbjct 549 GCYFNIVKVSDMSIDGMP 566 >gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis] gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=519 Score = 122 bits (306), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 34/277 (12%) Query 120 FTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISE 179 F+V +LR A A+ + ++ +++Q+R H+GV++P + Y+GG +L +S+ Sbjct 262 FSVSSLRSAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDLQVSD 321 Query 180 VVNNN--LATE-----GDTAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTG 232 V + ATE G IAGKG G+G G + EH V+MCIY VP + Y T Sbjct 322 VTQTSGTTATEYKPEAGYLGRIAGKGTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTR 381 Query 233 QDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFN----SPKASIVNLFNAGYNPRYFNWKT 288 D + D PEF+N+GM+ L + + + PK ++ GY PRY +KT Sbjct 382 LDPMVDKLDRFDFFTPEFENLGMQPLNSSYISSFCTPDPKNPVL-----GYQPRYSEYKT 436 Query 289 KLDVINGAFTT--TLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVL 346 LD+ +G F L SW + S W F P ++ FK++P L Sbjct 437 ALDINHGQFAQNDALSSW----SVSRFRRWTTF--------PQLEIAD----FKIDPGCL 480 Query 347 DPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVP 383 + +F V + T TD + V ++S DG+P Sbjct 481 NSVFPVEFNGTESTDCVFGGCNFNIVKVSDMSVDGMP 517 >gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola] Length=584 Score = 122 bits (305), Expect = 5e-27, Method: Compositional matrix adjust. 
Identities = 106/384 (28%), Positives = 178/384 (46%), Gaps = 74/384 (19%) Query 31 LKYCNWNKDMLMGVLPNSQFGDVAVLDIPD--SGDSNVVLGTDSHKSSVGIASAITSKTA 88 ++Y + KD L + P + D + ++P+ G+ NV+L T++ SV + S S ++ Sbjct 240 MRYRPYAKDWLTSMKPTPNYSD-GIFNLPEYVRGNGNVIL-TNNKSGSVSLDSGTVSPSS 297 Query 89 PFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS-DYRE 147 F+V LR A AL + E ++ + DY Sbjct 298 -------------------------------FSVNDLRAAFALDKMLEATRRANGLDYAS 326 Query 148 QIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVV--NNNLATEGDTAVI---AGKGVGA- 201 QI HFG K+P++ +N ++GG ++ +SEVV N N A++G A I GKG+G+ Sbjct 327 QIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKGIGSM 386 Query 202 GNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMT 261 +G+ E+ +TEH ++MCIY P +Y + D E PEF ++G + L + Sbjct 387 SSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFADLGYQALIGS 446 Query 262 QV------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLDVINGAFTT--TLKSWVSPVT 309 + N +A S + L N GY RY +KT D++ G F + +L W +P Sbjct 447 DLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRF 506 Query 310 ESLLSGWFCFGYNKDDAAPDTKVIMNYKF-----------FKVNPSVLDPIFGVNADSTW 358 + F +G + AP+ K +Y+ F +NP++++PIF +A Sbjct 507 D------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLTSA---V 557 Query 359 DTDQLLVNSYIGCYVVRNLSRDGV 382 D +VNS++ VR +S G+ Sbjct 558 QADHFIVNSFLDVKAVRPMSVTGL 581

Lambda      K        H        a         alpha
   0.318    0.136    0.416    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 2389518904266