bitscore colors: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-31_CDS_annotation_glimmer3.pl_2_1
Length=534
                                                                     Score     E
Sequences producing significant alignments:                         (Bits)   Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein     370    1e-117
gi|490418709|ref|WP_004291032.1| hypothetical protein                 363    1e-114
gi|575094354|emb|CDL65742.1| unnamed protein product                  363    4e-114
gi|496050829|ref|WP_008775336.1| hypothetical protein                 353    6e-111
gi|494822885|ref|WP_007558293.1| hypothetical protein                 318    6e-97
gi|575094321|emb|CDL65708.1| unnamed protein product                  246    2e-69
gi|494308783|ref|WP_007173938.1| hypothetical protein                 177    5e-45
gi|496521299|ref|WP_009229582.1| capsid protein                       170    9e-43
gi|517172762|ref|WP_018361580.1| hypothetical protein                 158    1e-38
gi|494306153|ref|WP_007173049.1| hypothetical protein                 154    3e-37
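The summary rows above share one layout: a sequence identifier, a free-text description, a bit score, and an E-value. A minimal Python sketch for reading one such row is given below. The names Hit and parse_hit_line are hypothetical and are not part of BLAST or any BLAST library; the sketch assumes the last two whitespace-separated fields are always the bit score and the E-value, which holds for the rows shown here.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split a summary row into identifier, description, bit score, E-value.

    Assumption: the last two whitespace-separated fields are the bit score
    and the E-value, and the first field is the sequence identifier.
    """
    # Split from the right so spaces inside the description are preserved.
    head, bits, evalue = line.strip().rsplit(None, 2)
    # The identifier is everything up to the first space.
    seq_id, _, description = head.partition(" ")
    return Hit(seq_id, description.strip(), float(bits), float(evalue))


if __name__ == "__main__":
    row = "gi|496521299|ref|WP_009229582.1| capsid protein 170 9e-43"
    hit = parse_hit_line(row)
    print(hit.seq_id, hit.description, hit.bits, hit.evalue)
```

Splitting from the right means the same function works whether the columns are separated by single spaces or padded for alignment.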
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 370 bits (951), Expect = 1e-117, Method: Compositional matrix adjust.
Identities = 219/551 (40%), Positives = 317/551 (58%), Gaps = 55/551 (10%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
S+ +L+++KN +R+GFDLS K AFTAKVGELLP+ PGDKF+++ Q FTRTQPVN
Sbjct 3 SVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPVN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSNMPCLSADQI 121
++AY+R+REYYD+++ P LLW AP + + + HA+ SV L P + I
Sbjct 63 SAAYSRLREYYDFYFVPYRLLWNMAPTFFTNM-PDPHHAADLVSSVNLSQRHPWFTFFDI 121
Query 122 SQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDV 173
+ L L S ++N+FGF R +L+ KLL YL YG G + S D+
Sbjct 122 MEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG---FGKDYESVKVPSDSDDI 178
Query 174 KDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPT 233
LS FP+LAY+K C+DYFR QWQ +APY +N+DY GK + +P
Sbjct 179 -------------VLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPM 225
Query 234 SLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNLQ 293
S ++ A F+ T FDL YCN+ KD F G+LP AQ+GD SV +G +L
Sbjct 226 SSFTNDA-FKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVASPIFG----------DLD 274
Query 294 SPTNSSVSIGTDDANSKTLIASG---------TNLTLDVLALRRGEALQRFREISLCTPL 344
+SS++ + I SG T L VLALR+ E LQ++REI+ +
Sbjct 275 IGDSSSLTFASAPQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAECLQKWREIAQSGKM 334
Query 345 NYRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQS 404
+Y++Q++ HF V A +SG Y+GG S+LDISEVVNTN+T N+A I GKG GT
Sbjct 335 DYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGDNQADIQGKGTGTLNG 394
Query 405 SE-SFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYY 463
++ F + + GI+MCIYH +PLLD+ ++ Q F + T + +PE DS+G++ +
Sbjct 395 NKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLY---- 450
Query 464 SNNPTELP-SASGLLSDP-TVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWVAPITQTLWT 521
P+E+ L SDP ++ +GY+PRY KTS+D + G+F T WV+P+T + +
Sbjct 451 ---PSEMIFGLEDLPSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYIS 507
Query 522 KYVEAFERFGY 532
Y +A + G+
Sbjct 508 AYRQACKDAGF 518
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 363 bits (932), Expect = 1e-114, Method: Compositional matrix adjust.
Identities = 205/525 (39%), Positives = 306/525 (58%), Gaps = 46/525 (9%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ +L S++N P R+GFDLS K FTAK GELLPV +PGD F + + FTRTQPVN
Sbjct 3 NIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQPVN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGS--VLLGSNMPCLSAD 119
T+A+ R+REYYD+F+ P LLW A V++Q+ N QHA S D + +L MP ++++
Sbjct 63 TAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMTSE 122
Query 120 QISQSLDQLKS-------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSID 172
I+ ++ L + K NYFG++R+ + KLL+YL YGN + + ++ T+
Sbjct 123 AIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFL----TDDWNTAPL 178
Query 173 VKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILP 232
+ + +N IF +LAY+K D++R +QW+ +P +N+DY DG +
Sbjct 179 MANLNHN----------IFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSS----MN 224
Query 233 TSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNL 292
S +++ FFDL YCNW KD+F G+LP Q+G+T+V I+ VTG +T N
Sbjct 225 LDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGK--LTLSNF 282
Query 293 Q----SPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRS 348
SPT +S GT +K L A T L +L LR+ E LQ+++EI+ +Y+
Sbjct 283 STVGTSPTTAS---GT---ATKNLPAFDTVGDLSILVLRQAEFLQKWKEITQSGNKDYKD 336
Query 349 QIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSESF 408
Q++ H+GV VG S + TY+GG +SS+DI+EV+NTNIT + A IAGKGVG +F
Sbjct 337 QLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSAAADIAGKGVGVANGEINF 396
Query 409 YAKD-WGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSNNP 467
+ +G++MCIYH +PLLDY DP +T + +PE D +G++ + + N
Sbjct 397 NSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPL 456
Query 468 TELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWV 512
+ASGL+ +GY+PRY +KTS+D +G F T WV
Sbjct 457 RSFANASGLV------LGYVPRYIDYKTSVDQSVGGFKRTLNSWV 495
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 363 bits (931), Expect = 4e-114, Method: Compositional matrix adjust.
Identities = 216/564 (38%), Positives = 317/564 (56%), Gaps = 79/564 (14%)
Query 6 LSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAY 65
++ +KN P R+GFDLS K FTAK GELLPV + +PGD F++ + FTRTQP+NTSA+
Sbjct 3 MADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAF 62
Query 66 TRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHAS--SFDGSVLLGSNMPCLSADQISQ 123
R+REYYD+++ P +W I+Q+ NVQHAS + D + L MP +++QI+
Sbjct 63 ARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQIAD 122
Query 124 SLDQ--LKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYN-Q 180
L+ +++N FGF+R+ L KLLQYL YG+ S D + T++ +
Sbjct 123 YLNDQATAARKNPFGFNRSTLTCKLLQYLGYGDYN-------------SFDSETNTWSAK 169
Query 181 NRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSLTSSAA 240
YN LS FP+LAY+K D++R TQW+ + P +N+DY G + + T L S
Sbjct 170 PLLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLPS--- 226
Query 241 YFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVD-------ISYGVTGAPVVTKQ-NL 292
+ + FFD+ YCN+ KDMF G+LP AQ+G SVV IS G +G T +
Sbjct 227 --DDNNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDP 284
Query 293 QSPTNSSVSIGTD------------------------------DANSKTLIASGTNLTLD 322
+P S V++G + +A++++L+ NL ++
Sbjct 285 GTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIE 344
Query 323 --------VLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIGGEAS 374
+LALR+ E LQ+++E+S+ +Y+SQI+ H+G+ V +S + Y+GG A+
Sbjct 345 NNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCAT 404
Query 375 SLDISEVVNTNITETNEALIAGKGVGTGQSSESFYAK-DWGILMCIYHSVPLLDYVLSAP 433
SLDI+EV+N NIT N A IAGKG TG S F +K ++GI+MCIYH +P++DYV S
Sbjct 405 SLDINEVINNNITGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGV 464
Query 434 DPQLFTSENTSFPVPELDSIGLE--PISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYF 491
D + TSFP+PELD IG+E P+ A ++ PSA L GY PRY
Sbjct 465 DHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFL-------GYAPRYI 517
Query 492 AWKTSLDYVLGAFTTTEKEWVAPI 515
WKTS+D +G F + + W P+
Sbjct 518 DWKTSVDRSVGDFADSLRTWCLPV 541
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 353 bits (907), Expect = 6e-111, Method: Compositional matrix adjust.
Identities = 211/527 (40%), Positives = 310/527 (59%), Gaps = 41/527 (8%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ +L S++N R+GFDLSSK FTAK GELLPVK +PGDK+S+ + FTRTQP+N
Sbjct 3 NIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPLN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSV--LLGSNMPCLSAD 119
T+A+ R+REYYD+++ P +LLW A V++Q+ N QHA+S+ S L MP ++
Sbjct 63 TAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVTCK 122
Query 120 QISQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSI 171
I+ L+ + ++NYFG+ R+ KLL+YL YGN T Y TS
Sbjct 123 GIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYT---------YATS- 172
Query 172 DVKDGTYNQN-RAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTI 230
K+ T+ ++ + N L+I+ +LAY+K D+ R +QW+ +P +N+DY G +
Sbjct 173 --KNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDSAM 230
Query 231 LPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQ 290
S+ + + FDL YCNW KD+F G+LP Q+GDT+ V+++ + V++ Q
Sbjct 231 TIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNL----SNVLSAQ 286
Query 291 NL-QSPTNSSVS---IGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNY 346
+ Q+P V + N +T+ SG T VLALR+ E LQ+++EI+ +Y
Sbjct 287 YMVQTPDGDPVGGSPFSSTGVNLQTVNGSG---TFTVLALRQAEFLQKWKEITQSGNKDY 343
Query 347 RSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSE 406
+ QI+ H+ V VG A S MS Y+GG +SLDI+EVVN NIT +N A IAGKGV G
Sbjct 344 KDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRI 403
Query 407 SFYAKD-WGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSN 465
SF A + +G++MCIYHS+PLLDY +P +T F +PE D +G+E + + N
Sbjct 404 SFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMN 463
Query 466 NPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTEKEWV 512
L S+ + S +GY PRY ++KT +D +GAF TT K WV
Sbjct 464 ---PLQSSYNVGSS---ILGYAPRYISYKTDVDSSVGAFKTTLKSWV 504
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 318 bits (815), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 187/546 (34%), Positives = 286/546 (52%), Gaps = 50/546 (9%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ ++ SV+N P R+G+DL+ K+ FTAK G L+PV WT +P D + + F RTQP+N
Sbjct 10 NIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPLN 69
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASS--FDGSVLLGSNMPCLSAD 119
T+A+ R+R Y+D+++ P +W P I+Q++ N+ HAS +V L +P +A+
Sbjct 70 TAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPYFTAE 129
Query 120 QISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGN----VRTGVGSNGARNYGTSIDVKD 175
Q++ + L +N FG+ RA L +L+YL YG+ + G GA
Sbjct 130 QVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGA----------- 178
Query 176 GTYNQNRAYNH-ALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTS 234
T+ N+ S FP+ AY+K D+ R TQW+ S P +NIDY G L +
Sbjct 179 -TWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQLDFT 237
Query 235 LTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDIS------YGVTGAPVVT 288
+ F FD+ Y NW +D+ G +P AQ+G+ S V +S G T T
Sbjct 238 VEGFKDSF---NLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT 294
Query 289 KQNLQSPTNSSVSIGT-------------------DDANSKTLIASGTNLTLDVLALRRG 329
Q+ + N +V+I ++ NS ++ ++ + +LALRR
Sbjct 295 GQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRA 354
Query 330 EALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITET 389
EA Q+++E++L + +Y SQI+AH+G V A S M ++G L I+EVVN NIT
Sbjct 355 EAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGE 414
Query 390 NEALIAGKGVGTGQSSESF-YAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVP 448
N A IAGKG +G S +F +GI+MC++H +P LDY+ SAP + FP+P
Sbjct 415 NAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIP 474
Query 449 ELDSIGLEPISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508
E D IG+E + V NP + P P + GY P+Y+ WKT+LD +G F +
Sbjct 475 EFDKIGMEQVPVI-RGLNPVK-PKDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRSL 532
Query 509 KEWVAP 514
K W+ P
Sbjct 533 KTWIIP 538
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 246 bits (628), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 177/574 (31%), Positives = 279/574 (49%), Gaps = 64/574 (11%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ L +KN P R+ FDLS + FTAKVGELLP PGD + +FTRT P+
Sbjct 6 NIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQ 65
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNV------QHASSFDGSVLLGSNMPC 115
++A+TR+RE +F+ P LW+ + + +N + ASS G+ + + MPC
Sbjct 66 SNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMPC 125
Query 116 LSADQISQSLDQLKSKQNY-----------FGFDRADLAYKLLQYLRYGNVRTGVGS--- 161
++ + L + ++ G R + KLLQ L YGN +
Sbjct 126 VNYKTLHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFKV 185
Query 162 NGARNYGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDY 221
N ++ + + KD TYN N Y LSIF +LAY K C D++ QWQ L N+DY
Sbjct 186 NNDKHNQSGQNFKDVTYN-NSPY---LSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDY 241
Query 222 YDGKG----AVTILPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDI 277
++ S+ + E D+ + N D F G+LP +QFG SVV++
Sbjct 242 LTPNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNL 301
Query 278 SYG-VTGAPVVT-------------------KQNLQSPTNSSVSI----GTDDANSKTL- 312
+ G +G+ V+ +Q + S N ++ + GT ++ T
Sbjct 302 NLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFS 361
Query 313 --IASGTNLT--LDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTY 368
+A T+L+ L ++ALR A Q+++EI L ++++SQ++AHFG+ S +
Sbjct 362 GNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKPDEKNEN-SLF 420
Query 369 IGGEASSLDISEVVNTNITETNEALIAGKGVGTGQSSESFYAKDWGILMCIYHSVPLLDY 428
IGG +S ++I+E +N N++ N+A G G +S F AK +G+++ IY P+LD+
Sbjct 421 IGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLDF 480
Query 429 VLSAPDPQLFTSENTSFPVPELDSIGL------EPISVAYYSNNPTELPSASGLLSDPTV 482
D LF ++ + F +PE+DSIG+ E + A Y++ G D +
Sbjct 481 AHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPDMSE 540
Query 483 TVGYLPRYFAWKTSLDYVLGAFTTTEKEWVAPIT 516
T GY PRY +KTS D GAF + K WV I
Sbjct 541 TYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGIN 574
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 177 bits (448), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 143/531 (27%), Positives = 240/531 (45%), Gaps = 73/531 (14%)
Query 1 MSLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPV 60
+S+ + + + + R+ FDLS + FTA G LLPV +P D + Q F RT P+
Sbjct 3 VSIPKIKATRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPM 62
Query 61 NTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSN---MPCLS 117
NT+A+ +R Y++F+ P H LW + I+ + N H+S+ + S+ G++ +P +
Sbjct 63 NTAAFASMRGVYEFFFVPYHQLWAQFDQFITGM--NDFHSSA-NKSIQGGTSPLQVPYFN 119
Query 118 ADQISQSL-----------DQLKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARN 166
D + SL D L+ K Y A++LL L YG +
Sbjct 120 VDSVFNSLNTGKESGSGSTDDLQYKFKY-------GAFRLLDLLGYG--------RKFDS 164
Query 167 YGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKG 226
+GT+ N YN S+F ILAY K QDY+R + +++ +N D + G
Sbjct 165 FGTAYPDNVSGLKNNLDYN--CSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGL 222
Query 227 AVTILPTSLTSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPV 286
+ L F L Y N D F + F T+ + + AP
Sbjct 223 VDAKVVADL------------FKLRYRNAQTDYFTNLRQSQLFSFTTAFEDVDNINIAPR 270
Query 287 -VTKQNLQSPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLN 345
K + + T + + TD + ++S LR A+ + +++
Sbjct 271 DYVKSDGSNFTRVNFGVDTDSSEGDFSVSS----------LRAAFAVDKLLSVTMRAGKT 320
Query 346 YRSQIKAHFGVDVGAAMSGMSTYIGGEASSLDISEVVNTNITETNE--------ALIAGK 397
++ Q++AH+GV++ + G Y+GG S + +S+V T+ T E +AGK
Sbjct 321 FQDQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGK 380
Query 398 GVGTGQSSESFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEP 457
G G+G+ F AK+ G+LMCIY VP + Y + DP + + + PE +++G++P
Sbjct 381 GTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQP 440
Query 458 ISVAYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508
++ +Y S+ T P +GY PRY +KT+LD G F ++
Sbjct 441 LNSSYISSFCTTDPK--------NPVLGYQPRYSEYKTALDVNHGQFAQSD 483
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 170 bits (430), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 151/528 (29%), Positives = 237/528 (45%), Gaps = 77/528 (15%)
Query 1 MSLFNLSSVK----NHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTR 56
MSL + +K N PR S FDLS K +TA G LLPV M D ++ Q F R
Sbjct 1 MSLKKVPQIKPSRANRPR-SAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMR 59
Query 57 TQPVNTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQ-NVQHASSFDGSVLLGSNMPC 115
T P+N++A+ +R Y++F+ P LW + I+ + SS G L S +P
Sbjct 60 TMPMNSAAFISMRGVYEFFFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKALDS-VPN 118
Query 116 LSADQISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKD 175
+ + + + + ++ ++ FG+ ++ + +L+ L YG T + T + +
Sbjct 119 VKLADMYKFVRE-RTDKDIFGYPHSNNSCRLMDLLGYGKPIT--------SSKTPVPL-- 167
Query 176 GTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSL 235
Y +++F +LAY K DY+R T ++ Y +NID+ G T +PT+
Sbjct 168 -------LYTGNVNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKG----TFVPTAD 216
Query 236 TSSAAYFEGDTFFDLEYCNWNKDMFFGILPDAQF--GDTSVVDISYGVTGAPVVTKQNLQ 293
E + +L Y N D + + P F G S + L
Sbjct 217 -------EFKKYLNLHYRNAPLDFYTNLRPTPLFTIGSDSFSSV------------LQLS 257
Query 294 SPTNSSVSIGTDDANSKTLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRSQIKAH 353
PT S+ + D NS L + ++ L+V A+R AL + IS+ Y QI+AH
Sbjct 258 DPTGSAGF--SADGNSAKLNMASPDV-LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAH 314
Query 354 FGVDVGAAMSGMSTYIGGEASSLDISEV------VNTNITETNEALIA-------GKGVG 400
FGV V G Y+GG S++ + +V N N++E A +A GKG G
Sbjct 315 FGVTVSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTG 374
Query 401 TGQSSESFYAKDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISV 460
+G F AK+ G+LMCIY VP + Y DP + + +PE +++G++PI
Sbjct 375 SGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVP 434
Query 461 AYYSNNPTELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508
A+ S N + S G+ PRY +KT+ D G F E
Sbjct 435 AFVSLNRAKDNS-----------YGWQPRYSEYKTAFDINHGQFANGE 471
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 158 bits (399), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 140/521 (27%), Positives = 235/521 (45%), Gaps = 66/521 (13%)
Query 15 RSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVREYYDW 74
R+ FD+S + FTA G LLPV +P D + F RT P+N++A+ +R Y++
Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77
Query 75 FWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSNMPCLSADQISQSLDQLKSK--Q 132
++ P LW + I+ + + SSF + + C+S D + + +D K+ +
Sbjct 78 YFVPYKQLWSGFDQFITGMS---DYKSSFMYAFKGKTPPSCVSFD-VQKLVDWCKTNTAK 133
Query 133 NYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYNQNRAYNHALSIFP 192
+ GFD+ Y++L L YG G T++ + F
Sbjct 134 DIHGFDKNKGVYRILDLLGYGKYANSAGVPYTNPTSTTMG--------------KCTPFR 179
Query 193 ILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGAVTILPTSLTSSAAYFEGDTFFDLEY 252
LAY+K D++R T +++ +N+D + G G V ++ + ++ +F L Y
Sbjct 180 GLAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVK---ETIPNEPWDYD---WFTLRY 233
Query 253 CNWNKDMFFGILPDAQFGDTSVVDIS--YGVTGAPVVTKQNLQSPTNSSVSIGTDDANSK 310
N KD+ + P F S+ D + + G+ +V ++ +V+ GT +
Sbjct 234 RNAQKDLLTNVRPTPLF---SIDDFNPQFFTGGSDIVMEK------GPNVTGGTHEYRDS 284
Query 311 TLIASGTNLT----------LDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGA 360
+I G NL + V +R AL++ +++ Y+ Q++AHFG+ V
Sbjct 285 VVIV-GKNLKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEE 343
Query 361 AMSGMSTYIGGEASSLDISEVVN---TNITETNE-------ALIAGKGVGTGQSSESFYA 410
G TYIGG S++ + +V T +T T + GK G+G F A
Sbjct 344 GRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRFDA 403
Query 411 KDWGILMCIYHSVPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPI---SVAYYSNNP 467
K+ GILMCIY VP + Y DP + E F VPE +++G++P+ +++Y NN
Sbjct 404 KEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPLFAKNISYKYNNN 463
Query 468 TELPSASGLLSDPTVTVGYLPRYFAWKTSLDYVLGAFTTTE 508
T L + G+ PRY +KT+LD G F E
Sbjct 464 TANSRIKNLGA-----FGWQPRYSEYKTALDINHGQFVHQE 499
>gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis]
gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=519
Score = 154 bits (388), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 132/506 (26%), Positives = 225/506 (44%), Gaps = 88/506 (17%)
Query 33 LLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVREYYDWFWCPLHLLWRNAPEVISQ 92
LLPV +P D + Q F RT P+NT+A+ +R Y++F+ P H LW + I+
Sbjct 2 LLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEFFFVPYHQLWAQFDQFITG 61
Query 93 IQQNVQHASSFDGSVLLGSN---MPCLSADQISQSL---DQLKSKQNYFGFDRADLAYKL 146
+ N H+S+ + S+ G++ +P + + + +++ D S Q+ + A++L
Sbjct 62 M--NDFHSSA-NKSIQGGTSPLQVPYFNLESVFKNIIERDSTPSFQDDLQYRFKYGAFRL 118
Query 147 LQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRL 206
L L YG ++GT+ N YN S+F +LAY K QDY+R
Sbjct 119 LDLLGYGR--------KFDSFGTAYPDNVSGLKNNLDYN--CSVFRVLAYNKIYQDYYRN 168
Query 207 TQWQDSAPYLWNIDYYDG----------------KGAVTILPTSLTSSAAYFEGDTFFDL 250
+ +++ +N D + G + A T T+L S + F D
Sbjct 169 SNYENFDTDSFNFDKFKGGLVDAKVVADLFKLRYRNAQTDYFTNLRQSQLFTFIPEFSDD 228
Query 251 EYCNWNKDMFFGILPDAQFGDTSVVDISYGVTGAPVVTKQNLQSPTNSSVSIGTDDANSK 310
E+ N+++D Q+ D S + + L P + ++G
Sbjct 229 EHLNFDRD---------QYADQSKSNFT------------QLNFPVDVDNNLGY------ 261
Query 311 TLIASGTNLTLDVLALRRGEALQRFREISLCTPLNYRSQIKAHFGVDVGAAMSGMSTYIG 370
V +LR A+ + +++ ++ Q++AH+GV++ + G Y+G
Sbjct 262 ----------FSVSSLRSAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLG 311
Query 371 GEASSLDISEVVNTNITETNE--------ALIAGKGVGTGQSSESFYAKDWGILMCIYHS 422
G S L +S+V T+ T E IAGKG G+G+ F AK+ G+LMCIY
Sbjct 312 GFDSDLQVSDVTQTSGTTATEYKPEAGYLGRIAGKGTGSGRGRIVFDAKEHGVLMCIYSL 371
Query 423 VPLLDYVLSAPDPQLFTSENTSFPVPELDSIGLEPISVAYYSNNPTELPSASGLLSDPTV 482
VP + Y + DP + + F PE +++G++P++ +Y S+ T P
Sbjct 372 VPQIQYDCTRLDPMVDKLDRFDFFTPEFENLGMQPLNSSYISSFCTPDPK--------NP 423
Query 483 TVGYLPRYFAWKTSLDYVLGAFTTTE 508
+GY PRY +KT+LD G F +
Sbjct 424 VLGYQPRYSEYKTALDINHGQFAQND 449
Lambda      K        H        a         alpha
   0.318    0.133    0.407    0.792     4.96
Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6
Effective search space used: 3834607117410
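The gapped Lambda and K values above tie each raw score (the number in parentheses on a "Score =" line) to its bit score through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The Python sketch below applies that formula to a few raw scores from this report. The function name bit_score is hypothetical. Because Lambda is printed to only three decimals and the alignments use compositional matrix adjustment, the computed values should be expected to agree with the reported bit scores only to within about one bit.

```python
import math


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Karlin-Altschul normalized score: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped statistical parameters as printed at the end of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

if __name__ == "__main__":
    # (raw score, reported bit score) pairs copied from the "Score =" lines.
    pairs = [(951, 370), (932, 363), (628, 246), (388, 154)]
    for raw, reported in pairs:
        computed = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
        print(f"raw={raw} reported={reported} computed={computed:.1f}")
```

The sketch deliberately stops at the bit score. The E-values in this report come from composition-based statistics and are not recomputed here.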