bitscore colors: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-14_CDS_annotation_glimmer3.pl_2_4
Length=596
                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value
gi|490418709|ref|WP_004291032.1| hypothetical protein               353   6e-110
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein   352   2e-109
gi|575094354|emb|CDL65742.1| unnamed protein product                350   2e-108
gi|496050829|ref|WP_008775336.1| hypothetical protein               338   3e-104
gi|494822885|ref|WP_007558293.1| hypothetical protein               314   1e-94
gi|575094321|emb|CDL65708.1| unnamed protein product                243   7e-68
gi|494308783|ref|WP_007173938.1| hypothetical protein               187   2e-48
gi|517172762|ref|WP_018361580.1| hypothetical protein               178   4e-45
gi|647452987|ref|WP_025792807.1| hypothetical protein               169   4e-42
gi|496521299|ref|WP_009229582.1| capsid protein                     164   1e-40
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 353 bits (906), Expect = 6e-110, Method: Compositional matrix adjust.
Identities = 238/618 (39%), Positives = 335/618 (54%), Gaps = 65/618 (11%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGT--KVNLKDMHFTRTM 60
+ + S ++ +PSR GFDLS K FTAKAGELLPV K +LPG K+NLK FTRT
Sbjct 2 ANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLK--AFTRTQ 59
Query 61 PVNTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSL--IQNKVVSDEIPCF 118
PVNTAA+ RI+EY+D++FVP L+ N L M D A S+ +N V+S E+P
Sbjct 60 PVNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYM 119
Query 119 DYDTLTSCLKAFNTQHPSYLDIA----GFERVPKTLKLLRYLRYGN---FLYDTGFSTLP 171
+ + S + A +T + D G+ R ++KLL YL YGN FL D ++T P
Sbjct 120 TSEAIASYINALSTAS-ALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFLTDD-WNTAP 177
Query 172 SKNMNYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGG- 230
L A NLN N+ L AYQKIY D++R QWE+ P T+N DY G
Sbjct 178 -------------LMA--NLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSS 222
Query 231 -NILTEYKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHL 289
N+ Y ++ + N F LRY N+ KDLF G+LP Q G A +I+ + + L
Sbjct 223 MNLDNAYS---TEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL 279
Query 290 TNQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMR 349
+N TV + TT + T++L P V + IL R A +Q+ +
Sbjct 280 SN-----FSTVGTSPTTASGTATKNL-PAFDTV---------GDLSILVLRQAEFLQKWK 324
Query 350 EIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk 409
EI Q + YK+QLE W V + S+ CTY+GG SS I+I+EV+N ++ T + ADI
Sbjct 325 EITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGSAAADIA 383
Query 410 gkgvgsgsgsesFETQ-EHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLG 468
GKGVG +G +F + +G++MCIYH +P+LDY D L +TD PE D +G
Sbjct 384 GKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVG 443
Query 469 LEALPYFTFVNDAVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTSLTSWVTPLTI 528
++++P +N + + N + ++GYVPRYI YKT +D G F +L SWV
Sbjct 444 MQSMPLVQLMN-PLRSFANASGL--VLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGN 500
Query 529 DEIVTKISLGSGTGPFTP----------NYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVE 578
++ +++L + P P N+ FKV+P LD IF Q +TDQFL
Sbjct 501 ISVLKQVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCS 560
Query 579 SFFDVKLVQNLDYNGMPY 596
SFFD+K V+NLD +G+PY
Sbjct 561 SFFDIKAVRNLDTDGLPY 578
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 352 bits (902), Expect = 2e-109, Method: Compositional matrix adjust.
Identities = 224/612 (37%), Positives = 324/612 (53%), Gaps = 58/612 (9%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
S + S +K R GFDLS K FTAK GELLP+ K + PG K N++ FTRT PV
Sbjct 2 SSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPV 61
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDT 122
N+AAY+R++EY+D+YFVP RL+ N+ P + + A L+ + +S P F +
Sbjct 62 NSAAYSRLREYYDFYFVPYRLL-WNMAPTFFTNMPDPHHAADLVSSVNLSQRHPWFTFFD 120
Query 123 LTSCLKAFNTQHPSY----LDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYS 178
+ L N+ +Y + GF RV ++KLL YL YG F D +PS +
Sbjct 121 IMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG-FGKDYESVKVPSDSD--- 176
Query 179 SVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSG--GNILTEY 236
++ ++ PL AYQKI DYFR +QW+ A PY YN DY G
Sbjct 177 -----------DIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPM 225
Query 237 KGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVAT-------INISHSSSAGVHL 289
+D F +F L Y N+ KD F G+LP +Q G V+ ++I SSS
Sbjct 226 SSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVASPIFGDLDIGDSSSLTFAS 285
Query 290 TNQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMR 349
Q+G T+ S + V N + T G+S +L+ R A +Q+ R
Sbjct 286 APQQG--ANTIQS--GVLVVNNNSNTTAGLS---------------VLALRQAECLQKWR 326
Query 350 EIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk 409
EI Q Y+ Q++ +NV S LS HC Y+GG +S ++ISEV+N +L T +QADI+
Sbjct 327 EIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNL-TGDNQADIQ 385
Query 410 -gkgvgsgsgsesFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLG 468
FE+ EHGI+MCIYH +P+LD+ + Q T TD PE D++G
Sbjct 386 GKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVG 445
Query 469 LEAL--PYFTFVNDAVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTSLTSWVTPL 526
++ L F + + + P+++ +GYVPRY KT ID + G+F+ +L SWV+PL
Sbjct 446 MQQLYPSEMIFGLEDLPSDPSSIN----MGYVPRYADLKTSIDEIHGSFIDTLVSWVSPL 501
Query 527 TIDEIVT--KISLGSGTGPFTPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVK 584
T I + +G T Y FKV+P+++D+IF + DST++TDQ L+ S+FD+K
Sbjct 502 TDSYISAYRQACKDAGFSDITMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIK 561
Query 585 LVQNLDYNGMPY 596
V+N DYNG+PY
Sbjct 562 AVRNFDYNGLPY 573
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 350 bits (898), Expect = 2e-108, Method: Compositional matrix adjust.
Identities = 236/640 (37%), Positives = 349/640 (55%), Gaps = 76/640 (12%)
Query 7 SYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAA 66
S D+K RPSR GFDLS K FTAKAGELLPV K++LPG N+ FTRT P+NT+A
Sbjct 2 SMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSA 61
Query 67 YTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANS----LIQNKVVSDEIPCFDYDT 122
+ R++EY+D+YFVP + + + M +N+ ++ L N +S +P F +
Sbjct 62 FARMREYYDFYFVPFEQMWNKFDSCITQM--NANVQHASGPTLDDNTPLSGRMPYFTSEQ 119
Query 123 LTSCLKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNF-LYDTGFSTLPSKNMNYSSVK 181
+ L T + + GF R T KLL+YL YG++ +D+ +T +K + Y
Sbjct 120 IADYLNDQATA--ARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAKPLLY---- 173
Query 182 DFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSG-GNILTEYKGDP 240
NL ++ PL AYQKIY D++R+ QWEK P T+N DY G ++ + G P
Sbjct 174 --------NLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLP 225
Query 241 SDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATIN-------ISHSSSAGVHLTNQE 293
SD +N F +RY NY KD+F G+LP +Q GS + + IS+ S + T+
Sbjct 226 SD---DNNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTP 282
Query 294 -------GYLT--GTVASD-------GTTITV--------------KNTRSLTPGISPVL 323
Y+T G + D G+T+ V +TRSL ++
Sbjct 283 DPGTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLI 342
Query 324 RTNFADLNANF--DILSFRIANAIQRMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTY 381
N N F IL+ R A +Q+ +E+ + YK Q+E W +K+S LS Y
Sbjct 343 IEN----NQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARY 398
Query 382 IGGNSSQINISEVLNNSLDTEQSQADIkgkgvgsgsgsesFETQ-EHGILMCIYHAVPVL 440
+GG ++ ++I+EV+NN++ T + ADI GKG +G+GS FE++ E+GI+MCIYH +P++
Sbjct 399 LGGCATSLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIV 457
Query 441 DYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNNVTVKSIIGYVPR 500
DY +G D AT P PELD +G+E++P +N + + + + +GY PR
Sbjct 458 DYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNP--VKESDTPSADTFLGYAPR 515
Query 501 YIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGSGTGP-FTPN---YGLFKVSPY 556
YI +KT +D G F SL +W P+ E+ + SL + P P+ G FKV+P
Sbjct 516 YIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPS 575
Query 557 VLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY 596
++D +F DSTV TD+FL SFFDVK+V+NLD NG+PY
Sbjct 576 IVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 338 bits (867), Expect = 3e-104, Method: Compositional matrix adjust.
Identities = 233/618 (38%), Positives = 334/618 (54%), Gaps = 63/618 (10%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
+ + S ++ + SR GFDLS K FTAK GELLPV +LPG K ++ FTRT P+
Sbjct 2 ANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPL 61
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLI--QNKVVSDEIP---- 116
NTAA+ R++EY+D+YFVP L+ N L M D A S I N+ ++ +P
Sbjct 62 NTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVTC 121
Query 117 --CFDYDTLTSC-LKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSK 173
DY L + + N+ +Y G+ R T KLL YL YGNF ++ SK
Sbjct 122 KGIADYLNLVAPDVTTTNSYEKNYF---GYSRSLGTAKLLEYLGYGNF-----YTYATSK 173
Query 174 NMNYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGG--- 230
N ++ NL +N+ + AYQKIY D+ R QWEK P +N DY SG
Sbjct 174 NNTWTKSP-----LSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDS 228
Query 231 --NILTEYKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVH 288
I + G F N+F LRY N+ KDLF G+LP Q G A +N++ S+
Sbjct 229 AMTIDSMITGQGFAPFY--NMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQ 286
Query 289 LTNQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRT--NFADLNAN--FDILSFRIANA 344
Y+ T DG + G SP T N +N + F +L+ R A
Sbjct 287 ------YMVQT--PDGDPV----------GGSPFSSTGVNLQTVNGSGTFTVLALRQAEF 328
Query 345 IQRMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQS 404
+Q+ +EI Q + YK+Q+E WNV + A S+ Y+GG ++ ++I+EV+NN++ T +
Sbjct 329 LQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI-TGSN 387
Query 405 QADIkgkgvgsgsgsesFETQE-HGILMCIYHAVPVLDY--QLTGPDLQLLNTYATDLPQ 461
ADI GKGV G+G SF+ E +G++MCIYH++P+LDY L P +N+ TD
Sbjct 388 AADIAGKGVVVGNGRISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINS--TDFAI 445
Query 462 PELDNLGLEALPYFTFVNDAVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTSLTS 521
PE D +G+E++P + +N Q + SI+GY PRYI+YKTD+D GAF T+L S
Sbjct 446 PEFDRVGMESVPLVSLMN---PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKS 502
Query 522 WVTPLTIDEIVTKISLGS--GTGPFT-PNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVE 578
WV ++ +++ P T NY FKV+P +D +F +++DTDQFL
Sbjct 503 WVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCS 562
Query 579 SFFDVKLVQNLDYNGMPY 596
SFFDVK+V+NLD +G+PY
Sbjct 563 SFFDVKVVRNLDTDGLPY 580
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 314 bits (804), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 210/625 (34%), Positives = 330/625 (53%), Gaps = 51/625 (8%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
+ + S V+ +P+RAG+DL++K FTAKAG L+PV+W +LP +N F RT P+
Sbjct 9 ANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPL 68
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANS----LIQNKVVSDEIPCF 118
NTAA+ R++ YFD+YFVP R + A+ M ++N+ ++ L N +SDE+P F
Sbjct 69 NTAAFARMRGYFDFYFVPFRQMWNKFPTAITQM--RTNLLHASGPVLADNVPLSDELPYF 126
Query 119 DYDTLTSCLKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYS 178
+ + + + + G+ R +L YL YG+F Y + ++
Sbjct 127 TAEQVADYIVSLADSKNQF----GYYRAWLVCIILEYLGYGDF-YPYIVEAAGGEGATWA 181
Query 179 SVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKG 238
+ N NL + PL AYQKIY D+ R+ QWE++ P T+N DY SG +
Sbjct 182 TRPMLN-----NLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGS--ADSLQL 234
Query 239 DPSDLFLKD--NLFSLRYANYPKDLFMGILPSSQLGSVATINISHS------SSAGVHLT 290
D + KD NLF +RY+N+ +DL G +P +Q G + + +S S + T
Sbjct 235 DFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT 294
Query 291 NQEG--YLTGTVASDGTTITVKNTRSLTPGISPVLRTN--------FADLNANFDILSFR 340
Q+G +L G V G++ ++ S+ G S +LR N D + IL+ R
Sbjct 295 GQDGVAFLNGNVTIQGSSGYLQAQTSV--GESRILRFNNTNSGLIVEGDSSFGVSILALR 352
Query 341 IANAIQRMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLD 400
A A Q+ +E+ + + Y Q+EA W ++ A SD C ++G + ++I+EV+NN++
Sbjct 353 RAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNI- 411
Query 401 TEQSQADIkgkgvgsgsgsesFET-QEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDL 459
T ++ ADI GKG SG+GS +F ++GI+MC++H +P LDY + P T D
Sbjct 412 TGENAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDF 471
Query 460 PQPELDNLGLEALPYFTFVNDAVATQPN-NVTVKSIIGYVPRYIAYKTDIDCVDGAFLTS 518
P PE D +G+E +P +N + V+ GY P+Y +KT +D G F S
Sbjct 472 PIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRS 531
Query 519 LTSWVTPLTIDEIVTKISLGSGTGPFTPNY-------GLFKVSPYVLDSIFVSQCDSTVD 571
L +W+ P + ++ S+ P PN G FKVSP VLD++F + +S ++
Sbjct 532 LKTWIIPFDDEALLAADSVDF---PDNPNVEADSVKAGFFKVSPSVLDNLFAVKANSDLN 588
Query 572 TDQFLVESFFDVKLVQNLDYNGMPY 596
TDQFL + FDV +V++LD NG+PY
Sbjct 589 TDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 243 bits (621), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 197/661 (30%), Positives = 312/661 (47%), Gaps = 92/661 (14%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
S + +K +PSR FDLS + FTAK GELLP + + L PG V + +FTRT P+
Sbjct 5 SNIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPL 64
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNML------DQSNIANSLIQNKVVSDEIP 116
+ A+TR++E ++FVP + K + ++NM D S IA+SL+ N+ V+ ++P
Sbjct 65 QSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMP 124
Query 117 CFDYDTLTSCLKAFNTQHPSYLDIA-------GFERVPKTLKLLRYLRYGNFLYDTGFST 169
C +Y TL + L F + D + G R ++ KLL+ L YGNF
Sbjct 125 CVNYKTLHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNF-------- 176
Query 170 LPSKNMNYSSVKDFNLYAKWNLN---------VNVLPLAAYQKIYCDYFRFEQWEKAQPY 220
P + N+ D + + N +++ L AY KI D++ + QW+
Sbjct 177 -PEQFANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNAS 235
Query 221 TYNFDYYS--GGNILTEYKG-----DPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGS 273
N DY + ++L+ D S K NL +R++N P D F G+LP+SQ GS
Sbjct 236 LCNVDYLTPNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGS 295
Query 274 VATINISHSSSAGVHLTN--------------QEGYLTGTVA-----------SDGTTIT 308
+ +N++ +++G + N E + VA S+GT I+
Sbjct 296 ESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFIS 355
Query 309 VKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQCAGQGYKEQLEARWN 368
+T S I+ L+ N I++ R A A Q+ +EIQ ++ Q+EA +
Sbjct 356 HDHTFSGNVAIN-------TSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFG 408
Query 369 VKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIkgkgvgsgsgsesFETQEHG 428
+K +++ +IGG+SS INI+E +N +L + ++A G+GS S F + +G
Sbjct 409 IKPDEK-NENSLFIGGSSSMININEQINQNLSGD-NKATYGAAPQGNGSASIKFTAKTYG 466
Query 429 ILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNN 488
+++ IY PVLD+ G D L T A+D PE+D++G++ TF + A P N
Sbjct 467 VVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQ----TFRCEVAAPAPYN 522
Query 489 VTVKSI-------------IGYVPRYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKI 535
K+ GY PRY +KT D +GAF SL SWVT + D I +
Sbjct 523 DEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNV 582
Query 536 SLGSGTGPFTPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMP 595
+ G PN +F P ++ ++F+ + D DQ V +NL G+P
Sbjct 583 -WNTWAGINAPN--MFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLP 639
Query 596 Y 596
Y
Sbjct 640 Y 640
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 187 bits (476), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 169/598 (28%), Positives = 266/598 (44%), Gaps = 74/598 (12%)
Query 14 RPSRA--GFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIK 71
RP+R FDLS++ FTA AG LLPV L+P V + F RT+P+NTAA+ ++
Sbjct 12 RPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMR 71
Query 72 EYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFN 131
++++FVP + + + M D + AN IQ ++P F+ D++ + L
Sbjct 72 GVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTGK 131
Query 132 TQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAKWNL 191
D ++ +LL L YG +D+ + P N S +K+ +
Sbjct 132 ESGSGSTDDLQYKFKYGAFRLLDLLGYGR-KFDSFGTAYPD---NVSGLKN-----NLDY 182
Query 192 NVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKGDPSDLFLKDNLFS 251
N +V + AY KIY DY+R +E ++NFD + GG + + D LF
Sbjct 183 NCSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLVDAKVVAD---------LFK 233
Query 252 LRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTNQEGYLTGTVASDGTTITVKN 311
LRY N D F L SQL S T + +++ ++ V SDG+ T
Sbjct 234 LRYRNAQTDYFTN-LRQSQLFSFTT---AFEDVDNINIAPRD-----YVKSDGSNFT--- 281
Query 312 TRSLTPGISPVLRTNFA----DLNANFDILSFRIANAIQRMREIQQCAGQGYKEQLEARW 367
R NF +F + S R A A+ ++ + AG+ +++Q+ A +
Sbjct 282 ------------RVNFGVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHY 329
Query 368 NVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQ-------ADIkgkgvgsgsgse 420
V++ + Y+GG S + +S+V S T + GKG GSG G
Sbjct 330 GVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGRGRI 389
Query 421 sFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVND 480
F+ +EHG+LMCIY VP + Y T D + D PE +NLG++ L ++++
Sbjct 390 VFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQPLNS-SYISS 448
Query 481 AVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTS--LTSW-VTPLTIDEIVTKISL 537
T P N ++GY PRY YKT +D G F S L+SW V+ ++ +
Sbjct 449 FCTTDPKN----PVLGYQPRYSEYKTALDVNHGQFAQSDALSSWSVSRFRRWTTFPQLEI 504
Query 538 GSGTGPFTPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMP 595
FK+ P L+SIF + T D F++ V ++ +GMP
Sbjct 505 AD-----------FKIDPGCLNSIFPVDYNGTEANDCVYGGCNFNIVKVSDMSVDGMP 551
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 178 bits (451), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 172/603 (29%), Positives = 265/603 (44%), Gaps = 73/603 (12%)
Query 14 RPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIKEY 73
RP R FD+S++ FTA AG LLPV LLP V + F RT+P+N+AA+ ++
Sbjct 16 RP-RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGV 74
Query 74 FDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFNTQ 133
+++YFVP + + + + M D + + K + FD L K NT
Sbjct 75 YEFYFVPYKQLWSGFDQFITGMSDYKSSFMYAFKGKTPPSCV-SFDVQKLVDWCKT-NTA 132
Query 134 HPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAKWNLNV 193
DI GF++ ++L L YG + G +P N +++ +
Sbjct 133 K----DIHGFDKNKGVYRILDLLGYGKYANSAG---VPYTNPTSTTMGKCTPFRG----- 180
Query 194 NVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFD-YYSGGNILTEYKGDPSDLFLKDNLFSL 252
AYQKIY D++R +E+ Q ++N D +Y G + +P D + F+L
Sbjct 181 -----LAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVKETIPNEPWDY----DWFTL 231
Query 253 RYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTNQEGYLTGTVAS--DGTTITVK 310
RY N KDL + P+ L S+ N + + + +TG D I K
Sbjct 232 RYRNAQKDLLTNVRPTP-LFSIDDFNPQFFTGGSDIVMEKGPNVTGGTHEYRDSVVIVGK 290
Query 311 NTRSLTPGISPVLRTNFADLNANF-DILSFRIANAIQRMREIQQCAGQGYKEQLEARWNV 369
N L+ N D + R A A++++ + AG+ YKEQ+EA + +
Sbjct 291 N-----------LKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGI 339
Query 370 KLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk---------gkgvgsgsgse 420
+ CTYIGG S I + +V +S T D GK GSGSG
Sbjct 340 SVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHI 399
Query 421 sFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVND 480
F+ +EHGILMCIY VP + Y D + D PE +NLG++ L F +
Sbjct 400 RFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPL----FAKN 455
Query 481 AVATQPNNVTVKSII------GYVPRYIAYKTDIDCVDGAFLTS--LTSWVTPLTIDEIV 532
++ + NN T S I G+ PRY YKT +D G F+ L+ W E +
Sbjct 456 -ISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQFVHQEPLSYWTVARARGESM 514
Query 533 TKISLGSGTGPFTPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYN 592
+ ++ + FK++P LD +F + T TDQ +F++ V ++ +
Sbjct 515 SNFNIST-----------FKINPKWLDDVFAVNYNGTELTDQVFGGCYFNIVKVSDMSID 563
Query 593 GMP 595
GMP
Sbjct 564 GMP 566
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 169 bits (429), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 176/627 (28%), Positives = 274/627 (44%), Gaps = 94/627 (15%)
Query 12 KGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIK 71
K R +R GFDLS + F+AKAG+LLP+ + P RT +NTA+Y R+K
Sbjct 5 KPRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMK 64
Query 72 EYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKV-----VSDEIPCFDYDTLTSC 126
EY+ ++FV R + + + +V + + N + +N + +P FD L +
Sbjct 65 EYYHFFFVSYRSLWQWFDQFIVGTNNPHSALNGVKKNGTTNYNQICSSVPTFDLGKLITR 124
Query 127 LKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYG-----------NFLYDTGFSTLPSKNM 175
LK S +D GF KLL L YG N + T + LPSK+
Sbjct 125 LKT------SDMDSQGFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITSTSY--LPSKDD 176
Query 176 NYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTE 235
S ++YA V+ L AYQKI+ D++R + W + ++N D Y+ + LT
Sbjct 177 KEPS----SIYA---CKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNLTI 229
Query 236 YKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISH--SSSAGVHLTNQE 293
+P D+ LK +RY Y KD + P+ S N+ + V LTN +
Sbjct 230 ---EP-DVALK--FCQMRYRPYAKDWLTSMKPTPNY-SDGIFNLPEYVRGNGNVILTNNK 282
Query 294 GYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQ 353
+G+V+ D T +SP ++F + R A A+ +M E +
Sbjct 283 ---SGSVSLDSGT------------VSP----------SSFSVNDLRAAFALDKMLEATR 317
Query 354 CA-GQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVL--NNSLDTEQSQADI-- 408
A G Y Q+EA + K+ + ++ ++GG + I +SEV+ N + ++ S A I
Sbjct 318 RANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGD 377
Query 409 --kgkgvgsgsgsesFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDN 466
SG+ F++ EHGI+MCIY P +Y + D QPE +
Sbjct 378 LGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFAD 437
Query 467 LGLEALPYFTFVNDAVATQPNNVTVKSI------IGYVPRYIAYKTDIDCVDGAFLT--S 518
LG +AL + + I +GY RY YKT D V G F + S
Sbjct 438 LGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKS 497
Query 519 LTSWVTP---LTIDEIVTKISLGSGTGPFTPNYG--------LFKVSPYVLDSIFVSQCD 567
L+ W TP + KI+ + G G F ++P +++ IF++
Sbjct 498 LSYWCTPRFDFGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLT--- 554
Query 568 STVDTDQFLVESFFDVKLVQNLDYNGM 594
S V D F+V SF DVK V+ + G+
Sbjct 555 SAVQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 164 bits (416), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 166/599 (28%), Positives = 266/599 (44%), Gaps = 92/599 (15%)
Query 14 RPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIKEY 73
RP R+ FDLS+K +TA AG LLPV L+ + ++ F RTMP+N+AA+ ++
Sbjct 16 RP-RSAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMRTMPMNSAAFISMRGV 74
Query 74 FDWYFVPLRLINKNLNPALVNMLD-QSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFNT 132
++++FVP + + + +M D +S++ +S +K + D +P + ++
Sbjct 75 YEFFFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKAL-DSVPNVKLADMYKFVRERTD 133
Query 133 QHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAKWNLN 192
+ DI G+ + +L+ L YG K + S LY N
Sbjct 134 K-----DIFGYPHSNNSCRLMDLLGYG-------------KPITSSKTPVPLLYTG---N 172
Query 193 VNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKGDPSDLFLKDNLFSL 252
VN+ L AY KIY DY+R +E Y++N D+ G + T +D F K +L
Sbjct 173 VNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKGTFVPT------ADEFKK--YLNL 224
Query 253 RYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTNQEGYLTGTVASDGTTITVKNT 312
Y N P D + + P+ + TI S S S+ + L++ G + ++DG N+
Sbjct 225 HYRNAPLDFYTNLRPT----PLFTIG-SDSFSSVLQLSDPTG--SAGFSADG------NS 271
Query 313 RSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQCAGQGYKEQLEARWNVKLS 372
L VL ++ + R A A+ ++ I AG+ Y EQ+EA + V +S
Sbjct 272 AKLNMASPDVL-----------NVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVTVS 320
Query 373 TALSDHCTYIGGNSSQINISEVLNNSLDTEQSQAD------------Ikgkgvgsgsgse 420
Y+GG S + + +V S T + ++ I GKG GSG G
Sbjct 321 EGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYGEI 380
Query 421 sFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTYATDLPQPELDNLGLEAL-PYFTFVN 479
F+ +E G+LMCIY VP + Y D + D PE +NLG++ + P F +N
Sbjct 381 QFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVPAFVSLN 440
Query 480 DAVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTS--LTSWVTPLTIDEIVTKISL 537
A G+ PRY YKT D G F L+ W I+
Sbjct 441 RAKDNS---------YGWQPRYSEYKTAFDINHGQFANGEPLSYW-----------SIAR 480
Query 538 GSGTGPF-TPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMP 595
G+ T N K++P+ LDS+F + T TD + F+++ V ++ +GMP
Sbjct 481 ARGSDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDMTEDGMP 539
Lambda      K        H        a         alpha
   0.320    0.136    0.410    0.792     4.96
Gapped
Lambda      K        H        a         alpha    sigma
   0.267    0.0410   0.140    1.90      42.6     43.6
Effective search space used: 4426883883474
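
The "Score = 353 bits (906)" lines above pair each bit score with a raw score in parentheses. As a minimal sketch, the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2, applied with the gapped Lambda and K printed in the footer, reproduces the reported bit scores to within about one bit. The function name `bit_score` is hypothetical, the parameters are the rounded values as printed, and the search used compositional matrix adjustment, so exact agreement should not be expected. The sketch does not attempt to reproduce the Expect values.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
# They are rounded as printed, so results are approximate.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score) pairs as printed on the "Score =" lines of this report.
REPORTED = [(906, 353), (902, 352), (804, 314)]

if __name__ == "__main__":
    for raw, reported_bits in REPORTED:
        print(f"raw={raw}  computed={bit_score(raw):.1f}  reported={reported_bits}")
```

Passing the ungapped Lambda and K instead would be the wrong choice for these alignments, since every hit in this report is a gapped alignment.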