BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-31_CDS_annotation_glimmer3.pl_2_1
Length=534

                                                                          Score     E
Sequences producing significant alignments:                              (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein        370    1e-117
gi|490418709|ref|WP_004291032.1|  hypothetical protein                    363    1e-114
gi|575094354|emb|CDL65742.1|  unnamed protein product                     363    4e-114
gi|496050829|ref|WP_008775336.1|  hypothetical protein                    353    6e-111
gi|494822885|ref|WP_007558293.1|  hypothetical protein                    318    6e-97
gi|575094321|emb|CDL65708.1|  unnamed protein product                     246    2e-69
gi|494308783|ref|WP_007173938.1|  hypothetical protein                    177    5e-45
gi|496521299|ref|WP_009229582.1|  capsid protein                          170    9e-43
gi|517172762|ref|WP_018361580.1|  hypothetical protein                    158    1e-38
gi|494306153|ref|WP_007173049.1|  hypothetical protein                    154    3e-37

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score = 370 bits (951),  Expect = 1e-117, Method: Compositional matrix adjust.
Identities = 219/551 (40%), Positives = 317/551 (58%), Gaps = 55/551 (10%) Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61 S+ +L+++KN +R+GFDLS K AFTAKVGELLP+ PGDKF+++ Q FTRTQPVN Sbjct 3 SVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPVN 62 Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSNMPCLSADQI 121 ++AY+R+REYYD+++ P LLW AP + + + HA+ SV L P + I Sbjct 63 SAAYSRLREYYDFYFVPYRLLWNMAPTFFTNM-PDPHHAADLVSSVNLSQRHPWFTFFDI 121 Query 122 SQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDV 173 + L L S ++N+FGF R +L+ KLL YL YG G + S D+ Sbjct 122 MEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG---FGKDYESVKVPSDSDDI 178 Query 174 KDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPT 233 LS FP+LAY+K C+DYFR QWQ +APY +N+DY GK + +P Sbjct 179 -------------VLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPM 225 Query 234 SLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNLQ 293 S ++ A F+ T FDL YCN+ KD F G+LP AQ+GD SV +G +L Sbjct 226 SSFTNDA-FKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVASPIFG----------DLD 274 Query 294 SPTNSSVSIGTDDANSKTLIASG---------TNLTLDVLALRRGEALQRFREISLCTPL 344 +SS++ + I SG T L VLALR+ E LQ++REI+ + Sbjct 275 IGDSSSLTFASAPQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAECLQKWREIAQSGKM 334 Query 345 NYRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQS 404 +Y++Q++ HF V A +SG Y+GG S+LDISEVVNTN+T N+A I GKG GT Sbjct 335 DYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGDNQADIQGKGTGTLNG 394 Query 405 SE-SFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYY 463 ++ F + + GI+MCIYH +PLLD+ ++ Q F + T + +PE DS+G++ + Sbjct 395 NKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLY---- 450 Query 464 SNNPTELP-SASGLLSDP-TVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWVAPITQTLWT 521 P+E+ L SDP ++ +GY+PRY KTS+D + G+F T WV+P+T + + Sbjct 451 ---PSEMIFGLEDLPSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYIS 507 Query 522 KYVEAFERFGY 532 Y +A + G+ Sbjct 508 AYRQACKDAGF 518 >gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii] 
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697] Length=578 Score = 363 bits (932), Expect = 1e-114, Method: Compositional matrix adjust. Identities = 205/525 (39%), Positives = 306/525 (58%), Gaps = 46/525 (9%) Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61 ++ +L S++N P R+GFDLS K FTAK GELLPV +PGD F + + FTRTQPVN Sbjct 3 NIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQPVN 62 Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGS--VLLGSNMPCLSAD 119 T+A+ R+REYYD+F+ P LLW A V++Q+ N QHA S D + +L MP ++++ Sbjct 63 TAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMTSE 122 Query 120 QISQSLDQLKS-------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSID 172 I+ ++ L + K NYFG++R+ + KLL+YL YGN + + ++ T+ Sbjct 123 AIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFL----TDDWNTAPL 178 Query 173 VKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILP 232 + + +N IF +LAY+K D++R +QW+ +P +N+DY DG + Sbjct 179 MANLNHN----------IFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSS----MN 224 Query 233 TSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNL 292 S +++ FFDL YCNW KD+F G+LP Q+G+T+V I+ VTG +T N Sbjct 225 LDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGK--LTLSNF 282 Query 293 Q----SPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRS 348 SPT +S GT +K L A T L +L LR+ E LQ+++EI+ +Y+ Sbjct 283 STVGTSPTTAS---GT---ATKNLPAFDTVGDLSILVLRQAEFLQKWKEITQSGNKDYKD 336 Query 349 QIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSESF 408 Q++ H+GV VG S + TY+GG +SS+DI+EV+NTNIT + A IAGKGVG +F Sbjct 337 QLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSAAADIAGKGVGVANGEINF 396 Query 409 YAKD-WGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSNNP 467 + +G++MCIYH +PLLDY DP +T + +PE D +G++ + + N Sbjct 397 NSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPL 456 Query 468 TELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWV 512 +ASGL+ +GY+PRY +KTS+D +G F T WV Sbjct 457 RSFANASGLV------LGYVPRYIDYKTSVDQSVGGFKRTLNSWV 495 
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 363 bits (931), Expect = 4e-114, Method: Compositional matrix adjust. Identities = 216/564 (38%), Positives = 317/564 (56%), Gaps = 79/564 (14%) Query 6 LSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAY 65 ++ +KN P R+GFDLS K FTAK GELLPV + +PGD F++ + FTRTQP+NTSA+ Sbjct 3 MADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAF 62 Query 66 TRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHAS--SFDGSVLLGSNMPCLSADQISQ 123 R+REYYD+++ P +W I+Q+ NVQHAS + D + L MP +++QI+ Sbjct 63 ARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQIAD 122 Query 124 SLDQ--LKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYN-Q 180 L+ +++N FGF+R+ L KLLQYL YG+ S D + T++ + Sbjct 123 YLNDQATAARKNPFGFNRSTLTCKLLQYLGYGDYN-------------SFDSETNTWSAK 169 Query 181 NRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSLTSSAA 240 YN LS FP+LAY+K D++R TQW+ + P +N+DY G + + T L S Sbjct 170 PLLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLPS--- 226 Query 241 YFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVD-------ISYGVTGAPVVTKQ-NL 292 + + FFD+ YCN+ KDMF G+LP AQ+G SVV IS G +G T + Sbjct 227 --DDNNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDP 284 Query 293 QSPTNSSVSIGTD------------------------------DANSKTLIASGTNLTLD 322 +P S V++G + +A++++L+ NL ++ Sbjct 285 GTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIE 344 Query 323 --------VLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIGGEAS 374 +LALR+ E LQ+++E+S+ +Y+SQI+ H+G+ V +S + Y+GG A+ Sbjct 345 NNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCAT 404 Query 375 SLDISEVVNTNITETNEALIAGKGVGTGQSSESFYAK-DWGILMCIYHSVPLLDYVLSAP 433 SLDI+EV+N NIT N A IAGKG TG S F +K ++GI+MCIYH +P++DYV S Sbjct 405 SLDINEVINNNITGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGV 464 Query 434 DPQLFTSENTSFPVPELDSIGLE--PISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYF 491 D + TSFP+PELD IG+E P+ A ++ PSA L GY PRY Sbjct 465 DHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFL-------GYAPRYI 517 Query 492 
AWKTSLDYVLGAFTTTEKEWVAPI 515 WKTS+D +G F + + W P+ Sbjct 518 DWKTSVDRSVGDFADSLRTWCLPV 541 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 353 bits (907), Expect = 6e-111, Method: Compositional matrix adjust. Identities = 211/527 (40%), Positives = 310/527 (59%), Gaps = 41/527 (8%) Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61 ++ +L S++N R+GFDLSSK FTAK GELLPVK +PGDK+S+ + FTRTQP+N Sbjct 3 NIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPLN 62 Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSV--LLGSNMPCLSAD 119 T+A+ R+REYYD+++ P +LLW A V++Q+ N QHA+S+ S L MP ++ Sbjct 63 TAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVTCK 122 Query 120 QISQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSI 171 I+ L+ + ++NYFG+ R+ KLL+YL YGN T Y TS Sbjct 123 GIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYT---------YATS- 172 Query 172 DVKDGTYNQN-RAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTI 230 K+ T+ ++ + N L+I+ +LAY+K D+ R +QW+ +P +N+DY G + Sbjct 173 --KNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAM 230 Query 231 LPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQ 290 S+ + + FDL YCNW KD+F G+LP Q+GDT+ V+++ + V++ Q Sbjct 231 TIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNL----SNVLSAQ 286 Query 291 NL-QSPTNSSVS---IGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNY 346 + Q+P V + N +T+ SG T VLALR+ E LQ+++EI+ +Y Sbjct 287 YMVQTPDGDPVGGSPFSSTGVNLQTVNGSG---TFTVLALRQAEFLQKWKEITQSGNKDY 343 Query 347 RSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSE 406 + QI+ H+ V VG A S MS Y+GG +SLDI+EVVN NIT +N A IAGKGV G Sbjct 344 KDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRI 403 Query 407 SFYAKD-WGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSN 465 SF A + +G++MCIYHS+PLLDY +P +T F +PE D +G+E + + N Sbjct 404 SFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN 463 Query 
466 NPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWV 512 L S+ + S +GY PRY ++KT +D +GAF TT K WV Sbjct 464 ---PLQSSYNVGSS---ILGYAPRYISYKTDVDSSVGAFKTTLKSWV 504 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 318 bits (815), Expect = 6e-97, Method: Compositional matrix adjust. Identities = 187/546 (34%), Positives = 286/546 (52%), Gaps = 50/546 (9%) Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61 ++ ++ SV+N P R+G+DL+ K+ FTAK G L+PV WT +P D + + F RTQP+N Sbjct 10 NIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPLN 69 Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASS--FDGSVLLGSNMPCLSAD 119 T+A+ R+R Y+D+++ P +W P I+Q++ N+ HAS +V L +P +A+ Sbjct 70 TAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPYFTAE 129 Query 120 QISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGN----VRTGVGSNGARNYGTSIDVKD 175 Q++ + L +N FG+ RA L +L+YL YG+ + G GA Sbjct 130 QVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGA----------- 178 Query 176 GTYNQNRAYNH-ALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTS 234 T+ N+ S FP+ AY+K D+ R TQW+ S P +NIDY G L + Sbjct 179 -TWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQLDFT 237 Query 235 LTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDIS------YGVTGAPVVT 288 + F FD+ Y NW +D+ G +P AQ+G+ S V +S G T T Sbjct 238 VEGFKDSF---NLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT 294 Query 289 KQNLQSPTNSSVSIGT-------------------DDANSKTLIASGTNLTLDVLALRRG 329 Q+ + N +V+I ++ NS ++ ++ + +LALRR Sbjct 295 GQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRA 354 Query 330 EALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITET 389 EA Q+++E++L + +Y SQI+AH+G V A S M ++G L I+EVVN NIT Sbjct 355 EAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE 414 Query 390 NEALIAGKGVGTGQSSESF-YAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVP 448 N A IAGKG +G S +F +GI+MC++H +P LDY+ SAP + FP+P Sbjct 415 
NAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIP 474 Query 449 ELDSIGLEPISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508 E D IG+E + V NP + P P + GY P+Y+ WKT+LD +G F + Sbjct 475 EFDKIGMEQVPVI-RGLNPVK-PKDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRSL 532 Query 509 KEWVAP 514 K W+ P Sbjct 533 KTWIIP 538 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 246 bits (628), Expect = 2e-69, Method: Compositional matrix adjust. Identities = 177/574 (31%), Positives = 279/574 (49%), Gaps = 64/574 (11%) Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61 ++ L +KN P R+ FDLS + FTAKVGELLP PGD + +FTRT P+ Sbjct 6 NIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQ 65 Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNV------QHASSFDGSVLLGSNMPC 115 ++A+TR+RE +F+ P LW+ + + +N + ASS G+ + + MPC Sbjct 66 SNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMPC 125 Query 116 LSADQISQSLDQLKSKQNY-----------FGFDRADLAYKLLQYLRYGNVRTGVGS--- 161 ++ + L + ++ G R + KLLQ L YGN + Sbjct 126 VNYKTLHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFKV 185 Query 162 NGARNYGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDY 221 N ++ + + KD TYN N Y LSIF +LAY K C D++ QWQ L N+DY Sbjct 186 NNDKHNQSGQNFKDVTYN-NSPY---LSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDY 241 Query 222 YDGKG----AVTILPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDI 277 ++ S+ + E D+ + N D F G+LP +QFG SVV++ Sbjct 242 LTPNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNL 301 Query 278 SYG-VTGAPVVT-------------------KQNLQSPTNSSVSI----GTDDANSKTL- 312 + G +G+ V+ +Q + S N ++ + GT ++ T Sbjct 302 NLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFS 361 Query 313 --IASGTNLT--LDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTY 368 +A T+L+ L ++ALR A Q+++EI L ++++SQ++AHFG+ S + Sbjct 362 GNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKPDEKNEN-SLF 420 Query 369 IGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSESFYAKDWGILMCIYHSVPLLDY 428 IGG +S ++I+E +N N++ N+A G G +S F AK +G+++ IY 
P+LD+ Sbjct 421 IGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLDF 480 Query 429 VLSAPDPQLFTSENTSFPVPELDSIGL------EPISVAYYSNNPTELPSASGLLSDPTV 482 D LF ++ + F +PE+DSIG+ E + A Y++ G D + Sbjct 481 AHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPDMSE 540 Query 483 TVGYLPRYFAWKTSLDYVLGAFTTTEKEWVAPIT 516 T GY PRY +KTS D GAF + K WV I Sbjct 541 TYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGIN 574 >gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis] gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=553 Score = 177 bits (448), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 143/531 (27%), Positives = 240/531 (45%), Gaps = 73/531 (14%) Query 1 MSLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPV 60 +S+ + + + + R+ FDLS + FTA G LLPV +P D + Q F RT P+ Sbjct 3 VSIPKIKATRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPM 62 Query 61 NTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSN---MPCLS 117 NT+A+ +R Y++F+ P H LW + I+ + N H+S+ + S+ G++ +P + Sbjct 63 NTAAFASMRGVYEFFFVPYHQLWAQFDQFITGM--NDFHSSA-NKSIQGGTSPLQVPYFN 119 Query 118 ADQISQSL-----------DQLKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARN 166 D + SL D L+ K Y A++LL L YG + Sbjct 120 VDSVFNSLNTGKESGSGSTDDLQYKFKY-------GAFRLLDLLGYG--------RKFDS 164 Query 167 YGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKG 226 +GT+ N YN S+F ILAY K QDY+R + +++ +N D + G Sbjct 165 FGTAYPDNVSGLKNNLDYN--CSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGL 222 Query 227 AVTILPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPV 286 + L F L Y N D F + F T+ + + AP Sbjct 223 VDAKVVADL------------FKLRYRNAQTDYFTNLRQSQLFSFTTAFEDVDNINIAPR 270 Query 287 -VTKQNLQSPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLN 345 K + + T + + TD + ++S LR A+ + +++ Sbjct 271 DYVKSDGSNFTRVNFGVDTDSSEGDFSVSS----------LRAAFAVDKLLSVTMRAGKT 320 Query 346 YRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNE--------ALIAGK 397 ++ Q++AH+GV++ + G Y+GG S + +S+V T+ T E +AGK Sbjct 321 
FQDQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGK 380 Query 398 GVGTGQSSESFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEP 457 G G+G+ F AK+ G+LMCIY VP + Y + DP + + + PE +++G++P Sbjct 381 GTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQP 440 Query 458 ISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508 ++ +Y S+ T P +GY PRY +KT+LD G F ++ Sbjct 441 LNSSYISSFCTTDPK--------NPVLGYQPRYSEYKTALDVNHGQFAQSD 483 >gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317] gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108] Length=541 Score = 170 bits (430), Expect = 9e-43, Method: Compositional matrix adjust. Identities = 151/528 (29%), Positives = 237/528 (45%), Gaps = 77/528 (15%) Query 1 MSLFNLSSVK----NHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTR 56 MSL + +K N PR S FDLS K +TA G LLPV M D ++ Q F R Sbjct 1 MSLKKVPQIKPSRANRPR-SAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMR 59 Query 57 TQPVNTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQ-NVQHASSFDGSVLLGSNMPC 115 T P+N++A+ +R Y++F+ P LW + I+ + SS G L S +P Sbjct 60 TMPMNSAAFISMRGVYEFFFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKALDS-VPN 118 Query 116 LSADQISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKD 175 + + + + + ++ ++ FG+ ++ + +L+ L YG T + T + + Sbjct 119 VKLADMYKFVRE-RTDKDIFGYPHSNNSCRLMDLLGYGKPIT--------SSKTPVPL-- 167 Query 176 GTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSL 235 Y +++F +LAY K DY+R T ++ Y +NID+ G T +PT+ Sbjct 168 -------LYTGNVNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKG----TFVPTAD 216 Query 236 TSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQF--GDTSVVDISYGVTGAPVVTKQNLQ 293 E + +L Y N D + + P F G S + L Sbjct 217 -------EFKKYLNLHYRNAPLDFYTNLRPTPLFTIGSDSFSSV------------LQLS 257 Query 294 SPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRSQIKAH 353 PT S+ + D NS L + ++ L+V A+R AL + IS+ Y QI+AH Sbjct 258 DPTGSAGF--SADGNSAKLNMASPDV-LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAH 314 Query 354 FGVDVGAAMSGMSTYIGGEASSLDISEV------VNTNITETNEALIA-------GKGVG 400 
FGV V G Y+GG S++ + +V N N++E A +A GKG G Sbjct 315 FGVTVSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTG 374 Query 401 TGQSSESFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISV 460 +G F AK+ G+LMCIY VP + Y DP + + +PE +++G++PI Sbjct 375 SGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVP 434 Query 461 AYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508 A+ S N + S G+ PRY +KT+ D G F E Sbjct 435 AFVSLNRAKDNS-----------YGWQPRYSEYKTAFDINHGQFANGE 471 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 158 bits (399), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 140/521 (27%), Positives = 235/521 (45%), Gaps = 66/521 (13%) Query 15 RSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVREYYDW 74 R+ FD+S + FTA G LLPV +P D + F RT P+N++A+ +R Y++ Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77 Query 75 FWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSNMPCLSADQISQSLDQLKSK--Q 132 ++ P LW + I+ + + SSF + + C+S D + + +D K+ + Sbjct 78 YFVPYKQLWSGFDQFITGMS---DYKSSFMYAFKGKTPPSCVSFD-VQKLVDWCKTNTAK 133 Query 133 NYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYNQNRAYNHALSIFP 192 + GFD+ Y++L L YG G T++ + F Sbjct 134 DIHGFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMG--------------KCTPFR 179 Query 193 ILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSLTSSAAYFEGDTFFDLEY 252 LAY+K D++R T +++ +N+D + G G V ++ + ++ +F L Y Sbjct 180 GLAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVK---ETIPNEPWDYD---WFTLRY 233 Query 253 CNWNKDMFFGILPDAQFGDTSVVDIS--YGVTGAPVVTKQNLQSPTNSSVSIGTDDANSK 310 N KD+ + P F S+ D + + G+ +V ++ +V+ GT + Sbjct 234 RNAQKDLLTNVRPTPLF---SIDDFNPQFFTGGSDIVMEK------GPNVTGGTHEYRDS 284 Query 311 TLIASGTNLT----------LDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGA 360 +I G NL + V +R AL++ +++ Y+ Q++AHFG+ V Sbjct 285 VVIV-GKNLKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEE 343 Query 361 AMSGMSTYIGGEASSLDISEVVN---TNITETNE-------ALIAGKGVGTGQSSESFYA 410 G TYIGG S++ + +V T +T T + GK G+G F A Sbjct 344 
GRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRFDA 403 Query 411 KDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPI---SVAYYSNNP 467 K+ GILMCIY VP + Y DP + E F VPE +++G++P+ +++Y NN Sbjct 404 KEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPLFAKNISYKYNNN 463 Query 468 TELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508 T L + G+ PRY +KT+LD G F E Sbjct 464 TANSRIKNLGA-----FGWQPRYSEYKTALDINHGQFVHQE 499 >gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis] gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=519 Score = 154 bits (388), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 132/506 (26%), Positives = 225/506 (44%), Gaps = 88/506 (17%) Query 33 LLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVREYYDWFWCPLHLLWRNAPEVISQ 92 LLPV +P D + Q F RT P+NT+A+ +R Y++F+ P H LW + I+ Sbjct 2 LLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEFFFVPYHQLWAQFDQFITG 61 Query 93 IQQNVQHASSFDGSVLLGSN---MPCLSADQISQSL---DQLKSKQNYFGFDRADLAYKL 146 + N H+S+ + S+ G++ +P + + + +++ D S Q+ + A++L Sbjct 62 M--NDFHSSA-NKSIQGGTSPLQVPYFNLESVFKNIIERDSTPSFQDDLQYRFKYGAFRL 118 Query 147 LQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRL 206 L L YG ++GT+ N YN S+F +LAY K QDY+R Sbjct 119 LDLLGYGR--------KFDSFGTAYPDNVSGLKNNLDYN--CSVFRVLAYNKIYQDYYRN 168 Query 207 TQWQDSAPYLWNIDYYDG----------------KGAVTILPTSLTSSAAYFEGDTFFDL 250 + +++ +N D + G + A T T+L S + F D Sbjct 169 SNYENFDTDSFNFDKFKGGLVDAKVVADLFKLRYRNAQTDYFTNLRQSQLFTFIPEFSDD 228 Query 251 EYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNLQSPTNSSVSIGTDDANSK 310 E+ N+++D Q+ D S + + L P + ++G Sbjct 229 EHLNFDRD---------QYADQSKSNFT------------QLNFPVDVDNNLGY------ 261 Query 311 TLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIG 370 V +LR A+ + +++ ++ Q++AH+GV++ + G Y+G Sbjct 262 ----------FSVSSLRSAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLG 311 Query 371 GEASSLDISEVVNTNITETNE--------ALIAGKGVGTGQSSESFYAKDWGILMCIYHS 422 G S L +S+V T+ T E IAGKG G+G+ F AK+ G+LMCIY Sbjct 312 
GFDSDLQVSDVTQTSGTTATEYKPEAGYLGRIAGKGTGSGRGRIVFDAKEHGVLMCIYSL 371 Query 423 VPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSNNPTELPSASGLLSDPTV 482 VP + Y + DP + + F PE +++G++P++ +Y S+ T P Sbjct 372 VPQIQYDCTRLDPMVDKLDRFDFFTPEFENLGMQPLNSSYISSFCTPDPK--------NP 423 Query 483 TVGYLPRYFAWKTSLDYVLGAFTTTE 508 +GY PRY +KT+LD G F + Sbjct 424 VLGYQPRYSEYKTALDINHGQFAQND 449

Lambda      K        H        a         alpha
   0.318    0.133    0.407    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267    0.0410   0.140    1.90      42.6     43.6

Effective search space used: 3834607117410
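Note on the statistics above: the bit scores in this report can be approximately recovered from the raw scores (the numbers in parentheses on each "Score =" line) using the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2 with the gapped Lambda and K. The sketch below is illustrative only and is not part of the BLAST output. The function name `bit_score` is hypothetical, and because Lambda and K are printed rounded, the result can differ from the reported value by up to about one bit.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report trailer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs copied from the "Score =" lines above.
hits = [(951, 370), (932, 363), (815, 318), (448, 177)]

for raw, reported in hits:
    # Print the recomputed value next to the reported one for comparison.
    print(f"raw={raw}  reported={reported}  recomputed={bit_score(raw):.1f}")
```

The recomputed values should land within roughly one bit of the reported ones. The E-values are not reproduced here, since with composition-based statistics they depend on per-alignment adjustments that the report does not print.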