bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-9_CDS_annotation_glimmer3.pl_2_1
Length=457
Score E
Sequences producing significant alignments: (Bits) Value
gi|490418709|ref|WP_004291032.1| hypothetical protein 258 3e-75
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 249 6e-72
gi|575094354|emb|CDL65742.1| unnamed protein product 248 1e-71
gi|496050829|ref|WP_008775336.1| hypothetical protein 237 1e-67
gi|494822885|ref|WP_007558293.1| hypothetical protein 229 2e-64
gi|575094321|emb|CDL65708.1| unnamed protein product 185 3e-48
gi|494308783|ref|WP_007173938.1| hypothetical protein 150 3e-36
gi|575094339|emb|CDL65730.1| unnamed protein product 135 2e-31
gi|517172762|ref|WP_018361580.1| hypothetical protein 135 3e-31
gi|647452987|ref|WP_025792807.1| hypothetical protein 134 5e-31
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 258 bits (658), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 179/454 (39%), Positives = 250/454 (55%), Gaps = 52/454 (11%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGT--KVNLKDMHFTRTM 60
+ + S ++ +PSR GFDLS K FTAKAGELLPV K +LPG K+NLK FTRT
Sbjct 2 ANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLK--AFTRTQ 59
Query 61 PVNTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSL--IQNKVVSDEIPCF 118
PVNTAA+ RI+EY+D++FVP L+ N L M D A S+ +N V+S E+P
Sbjct 60 PVNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYM 119
Query 119 DYDTLTSCLKAFNTQHPSYLDIA----GFERVPKTLKLLRYLRYGN---FLYDTGFSTLP 171
+ + S + A +T + D G+ R ++KLL YL YGN FL D ++T P
Sbjct 120 TSEAIASYINALSTAS-ALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFLTD-DWNTAP 177
Query 172 SKNMNYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGG- 230
L A NLN N+ L AYQKIY D++R QWE+ P T+N DY G
Sbjct 178 -------------LMA--NLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSS 222
Query 231 -NILTEYKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHL 289
N+ Y ++ + N F LRY N+ KDLF G+LP Q G A +I+ + + L
Sbjct 223 MNLDNAYS---TEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL 279
Query 290 TSQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMR 349
++ TV + TT + T++L P V + IL R A +Q+ +
Sbjct 280 SN-----FSTVGTSPTTASGTATKNL-PAFDTV---------GDLSILVLRQAEFLQKWK 324
Query 350 EIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk 409
EI Q + YK+QLE W V + S+ CTY+GG SS I+I+EV+N ++ T + ADI
Sbjct 325 EITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGSAAADIA 383
Query 410 gkgvgsgsgsesFETQ-EHGILMCIYHAVPVLDY 442
GKGVG +G +F + +G++MCIYH +P+LDY
Sbjct 384 GKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDY 417
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 249 bits (635), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 164/456 (36%), Positives = 233/456 (51%), Gaps = 50/456 (11%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
S + S +K R GFDLS K FTAK GELLP+ K + PG K N++ FTRT PV
Sbjct 2 SSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPV 61
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDT 122
N+AAY+R++EY+D+YFVP RL+ N+ P + + A L+ + +S P F +
Sbjct 62 NSAAYSRLREYYDFYFVPYRLL-WNMAPTFFTNMPDPHHAADLVSSVNLSQRHPWFTFFD 120
Query 123 LTSCLKAFNTQHPSY----LDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYS 178
+ L N+ +Y + GF RV ++KLL YL YG F D +PS +
Sbjct 121 IMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG-FGKDYESVKVPSDSD--- 176
Query 179 SVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSG--GNILTEY 236
++ ++ PL AYQKI DYFR +QW+ A PY YN DY G
Sbjct 177 -----------DIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPM 225
Query 237 KGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVAT-------INISHSSSAGVHL 289
+D F +F L Y N+ KD F G+LP +Q G V+ ++I SSS
Sbjct 226 SSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVASPIFGDLDIGDSSSLTFAS 285
Query 290 TSQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMR 349
Q+G T+ S + V N + T G+S +L+ R A +Q+ R
Sbjct 286 APQQG--ANTIQS--GVLVVNNNSNTTAGLS---------------VLALRQAECLQKWR 326
Query 350 EIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk 409
EI Q Y+ Q++ +NV S LS HC Y+GG +S ++ISEV+N +L T +QADI+
Sbjct 327 EIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNL-TGDNQADIQ 385
Query 410 -gkgvgsgsgsesFETQEHGILMCIYHAVPVLDYQL 444
FE+ EHGI+MCIYH +P+LD+ +
Sbjct 386 GKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSI 421
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 248 bits (634), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 180/488 (37%), Positives = 264/488 (54%), Gaps = 70/488 (14%)
Query 7 SYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAA 66
S D+K RPSR GFDLS K FTAKAGELLPV K++LPG N+ FTRT P+NT+A
Sbjct 2 SMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSA 61
Query 67 YTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANS----LIQNKVVSDEIPCFDYDT 122
+ R++EY+D+YFVP + + + M +N+ ++ L N +S +P F +
Sbjct 62 FARMREYYDFYFVPFEQMWNKFDSCITQM--NANVQHASGPTLDDNTPLSGRMPYFTSEQ 119
Query 123 LTSCLKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNF-LYDTGFSTLPSKNMNYSSVK 181
+ L T + + GF R T KLL+YL YG++ +D+ +T +K + Y
Sbjct 120 IADYLNDQATA--ARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAKPLLY---- 173
Query 182 DFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSG-GNILTEYKGDP 240
NL ++ PL AYQKIY D++R+ QWEK P T+N DY G ++ + G P
Sbjct 174 --------NLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLP 225
Query 241 SDLFLKDNLFSLRYANYPKDLFMGILPSSQLG--SVATIN-----ISHSSSAGVHLTSQE 293
SD +N F +RY NY KD+F G+LP +Q G SV IN IS+ S + TS
Sbjct 226 SD---DNNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTP 282
Query 294 -------GYLT--GTVASD-------GTTITV--------------KNTRSLTPGISPVL 323
Y+T G + D G+T+ V +TRSL ++
Sbjct 283 DPGTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENPNLI 342
Query 324 RTNFADLNANF--DILSFRIANAIQRMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTY 381
N N F IL+ R A +Q+ +E+ + YK Q+E W +K+S LS Y
Sbjct 343 IEN----NQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARY 398
Query 382 IGGNSSQINISEVLNNSLDTEQSQADIkgkgvgsgsgsesFETQ-EHGILMCIYHAVPVL 440
+GG ++ ++I+EV+NN++ T + ADI GKG +G+GS FE++ E+GI+MCIYH +P++
Sbjct 399 LGGCATSLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIV 457
Query 441 DYQLTGPD 448
DY +G D
Sbjct 458 DYVGSGVD 465
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 237 bits (605), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 168/471 (36%), Positives = 246/471 (52%), Gaps = 51/471 (11%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
+ + S ++ + SR GFDLS K FTAK GELLPV +LPG K ++ FTRT P+
Sbjct 2 ANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPL 61
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLI--QNKVVSDEIP---- 116
NTAA+ R++EY+D+YFVP L+ N L M D A S I N+ ++ +P
Sbjct 62 NTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVTC 121
Query 117 --CFDYDTLTSC-LKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSK 173
DY L + + N+ +Y G+ R T KLL YL YGNF ++ SK
Sbjct 122 KGIADYLNLVAPDVTTTNSYEKNYF---GYSRSLGTAKLLEYLGYGNF-----YTYATSK 173
Query 174 NMNYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGG--- 230
N ++ NL +N+ + AYQKIY D+ R QWEK P +N DY SG
Sbjct 174 NNTWTKSP-----LSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVDS 228
Query 231 --NILTEYKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISHSS--SAG 286
I + G F N+F LRY N+ KDLF G+LP Q G A +N++ S+ SA
Sbjct 229 AMTIDSMITGQGFAPFY--NMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNLSNVLSAQ 286
Query 287 VHLTSQEGYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQ 346
+ + +G G T + ++ + F +L+ R A +Q
Sbjct 287 YMVQTPDGDPVGGSPFSSTGVNLQTVNG----------------SGTFTVLALRQAEFLQ 330
Query 347 RMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQA 406
+ +EI Q + YK+Q+E WNV + A S+ Y+GG ++ ++I+EV+NN++ T + A
Sbjct 331 KWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI-TGSNAA 389
Query 407 DIkgkgvgsgsgsesFETQE-HGILMCIYHAVPVLDY--QLTGPDLQLLNT 454
DI GKGV G+G SF+ E +G++MCIYH++P+LDY L P +N+
Sbjct 390 DIAGKGVVVGNGRISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINS 440
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 229 bits (585), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 156/468 (33%), Positives = 250/468 (53%), Gaps = 40/468 (9%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
+ + S V+ +P+RAG+DL++K FTAKAG L+PV+W +LP +N F RT P+
Sbjct 9 ANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPL 68
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNMLDQSNIANS----LIQNKVVSDEIPCF 118
NTAA+ R++ YFD+YFVP R + A+ M ++N+ ++ L N +SDE+P F
Sbjct 69 NTAAFARMRGYFDFYFVPFRQMWNKFPTAITQM--RTNLLHASGPVLADNVPLSDELPYF 126
Query 119 DYDTLTSCLKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYS 178
+ + + + + G+ R +L YL YG+F Y + ++
Sbjct 127 TAEQVADYIVSLADSKNQF----GYYRAWLVCIILEYLGYGDF-YPYIVEAAGGEGATWA 181
Query 179 SVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKG 238
+ N NL + PL AYQKIY D+ R+ QWE++ P T+N DY SG +
Sbjct 182 TRPMLN-----NLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGS--ADSLQL 234
Query 239 DPSDLFLKD--NLFSLRYANYPKDLFMGILPSSQLGSVATINISHS------SSAGVHLT 290
D + KD NLF +RY+N+ +DL G +P +Q G + + +S S + T
Sbjct 235 DFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAFTT 294
Query 291 SQEG--YLTGTVASDGTTITVKNTRSLTPGISPVLRTN--------FADLNANFDILSFR 340
Q+G +L G V G++ ++ S+ G S +LR N D + IL+ R
Sbjct 295 GQDGVAFLNGNVTIQGSSGYLQAQTSV--GESRILRFNNTNSGLIVEGDSSFGVSILALR 352
Query 341 IANAIQRMREIQQCAGQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLD 400
A A Q+ +E+ + + Y Q+EA W ++ A SD C ++G + ++I+EV+NN++
Sbjct 353 RAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNI- 411
Query 401 TEQSQADIkgkgvgsgsgsesFET-QEHGILMCIYHAVPVLDYQLTGP 447
T ++ ADI GKG SG+GS +F ++GI+MC++H +P LDY + P
Sbjct 412 TGENAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAP 459
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 185 bits (470), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 151/507 (30%), Positives = 245/507 (48%), Gaps = 74/507 (15%)
Query 3 SRLFSYGDVKGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPV 62
S + +K +PSR FDLS + FTAK GELLP + + L PG V + +FTRT P+
Sbjct 5 SNIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPL 64
Query 63 NTAAYTRIKEYFDWYFVPLRLINKNLNPALVNML------DQSNIANSLIQNKVVSDEIP 116
+ A+TR++E ++FVP + K + ++NM D S IA+SL+ N+ V+ ++P
Sbjct 65 QSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMP 124
Query 117 CFDYDTLTSCLKAFNTQHPSYLDIA-------GFERVPKTLKLLRYLRYGNFLYDTGFST 169
C +Y TL + L F + D + G R ++ KLL+ L YGNF
Sbjct 125 CVNYKTLHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNF-------- 176
Query 170 LPSKNMNYSSVKDFNLYAKWNLN---------VNVLPLAAYQKIYCDYFRFEQWEKAQPY 220
P + N+ D + + N +++ L AY KI D++ + QW+
Sbjct 177 -PEQFANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNAS 235
Query 221 TYNFDYYSGGNILTEYKGDPSDLFLKD--------NLFSLRYANYPKDLFMGILPSSQLG 272
N DY + N + D + L + D NL +R++N P D F G+LP+SQ G
Sbjct 236 LCNVDYLT-PNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFG 294
Query 273 SVATINISHSSSAGVHL--------------TSQEGYLTGTVA-----------SDGTTI 307
S + +N++ +++G + T+ E + VA S+GT I
Sbjct 295 SESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFI 354
Query 308 TVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQCAGQGYKEQLEARW 367
+ +T S I+ L+ N I++ R A A Q+ +EIQ ++ Q+EA +
Sbjct 355 SHDHTFSGNVAIN-------TSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHF 407
Query 368 NVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIkgkgvgsgsgsesFETQEH 427
+K +++ +IGG+SS INI+E +N +L + ++A G+GS S F + +
Sbjct 408 GIKPDEK-NENSLFIGGSSSMININEQINQNLSGD-NKATYGAAPQGNGSASIKFTAKTY 465
Query 428 GILMCIYHAVPVLDYQLTGPDLQLLNT 454
G+++ IY PVLD+ G D L T
Sbjct 466 GVVIGIYRCTPVLDFAHLGIDRTLFKT 492
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 150 bits (378), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/445 (29%), Positives = 201/445 (45%), Gaps = 55/445 (12%)
Query 14 RPSRA--GFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIK 71
RP+R FDLS++ FTA AG LLPV L+P V + F RT+P+NTAA+ ++
Sbjct 12 RPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMR 71
Query 72 EYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFN 131
++++FVP + + + M D + AN IQ ++P F+ D++ + L
Sbjct 72 GVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTGK 131
Query 132 TQHPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAKWNL 191
D ++ +LL L YG +D+ + P N S +K+ +
Sbjct 132 ESGSGSTDDLQYKFKYGAFRLLDLLGYGR-KFDSFGTAYPD---NVSGLKN-----NLDY 182
Query 192 NVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKGDPSDLFLKDNLFS 251
N +V + AY KIY DY+R +E ++NFD + GG + + D LF
Sbjct 183 NCSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLVDAKVVAD---------LFK 233
Query 252 LRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTSQEGYLTGTVASDGTTITVKN 311
LRY N D F L SQL S T + +++ ++ V SDG+ T
Sbjct 234 LRYRNAQTDYFTN-LRQSQLFSFTT---AFEDVDNINIAPRD-----YVKSDGSNFT--- 281
Query 312 TRSLTPGISPVLRTNFA----DLNANFDILSFRIANAIQRMREIQQCAGQGYKEQLEARW 367
R NF +F + S R A A+ ++ + AG+ +++Q+ A +
Sbjct 282 ------------RVNFGVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHY 329
Query 368 NVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQ-------ADIkgkgvgsgsgse 420
V++ + Y+GG S + +S+V S T + GKG GSG G
Sbjct 330 GVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGRGRI 389
Query 421 sFETQEHGILMCIYHAVPVLDYQLT 445
F+ +EHG+LMCIY VP + Y T
Sbjct 390 VFDAKEHGVLMCIYSLVPQIQYDCT 414
>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588
Score = 135 bits (341), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 128/467 (27%), Positives = 205/467 (44%), Gaps = 71/467 (15%)
Query 16 SRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIKEYFD 75
S+ GFD+S++ FT+ G+LLPV++ L PG K+ + FTRT P+ + A R+ E+ +
Sbjct 15 SKNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQPMKSTAMARLTEHIE 74
Query 76 WYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFNTQHP 135
++FVP + + D + ++SL+++ ++ +P F D +++ L+A T
Sbjct 75 YFFVPFEQMFSLFGSVFYGIDDYN--SSSLVKHNNLT--MPFFKSDAVSAALEAAYTSFS 130
Query 136 SYL-------DIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAK 188
S + D+ G RV L+L L YG+ L + LP +M
Sbjct 131 SSINRKVLTPDMMGQPRVYGILRLSEMLGYGSLLLSNDNNLLPHADM------------- 177
Query 189 WNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTEYKGDPSDLFLKDN 248
+V AYQKI+ D++R + + Q +YN DY G I ++
Sbjct 178 -----SVFLFTAYQKIFNDFYRLDDYTSVQHKSYNVDYAQGQPI------------TDNS 220
Query 249 LFSLRYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTSQEGYLTGTVAS-DGTTI 307
+F L Y + KD F ++P+ SV + SS G L + L+ T + DG+
Sbjct 221 MFELHYRPWKKDYFTNVIPNPYFSSVD----NKSSFGGAGLFDRPVGLSITSFNFDGSDF 276
Query 308 --------TVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQCAGQGY 359
T++N + + + PV T+ + +A + R A ++ I Q AG+ Y
Sbjct 277 LQAPSDLSTMENNQPIFQEL-PVNLTSAS--SAGLSVSDLRYLYATDKLLRITQFAGKHY 333
Query 360 KEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIkgk-------- 411
Q A + ++ +S YIGG S + IS V S T D+ G
Sbjct 334 DAQTLAHFGKRVPQGVSGEVYYIGGQSQPLQISSV--ESTATTFDSGDVVGSVLGELAGK 391
Query 412 --gvgsgsgsesFETQEHGILMCIYHAVPVLDYQLTGPDLQLLNTLC 456
SFE HG+LM IY AVP DY + LNTL
Sbjct 392 GYSQTGNQKDFSFEAPCHGVLMAIYSAVPEADY--LDERIDYLNTLI 436
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 135 bits (339), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 130/442 (29%), Positives = 196/442 (44%), Gaps = 49/442 (11%)
Query 14 RPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIKEY 73
RP R FD+S++ FTA AG LLPV LLP V + F RT+P+N+AA+ ++
Sbjct 16 RP-RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGV 74
Query 74 FDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKVVSDEIPCFDYDTLTSCLKAFNTQ 133
+++YFVP + + + + M D + + K + FD L K NT
Sbjct 75 YEFYFVPYKQLWSGFDQFITGMSDYKSSFMYAFKGKTPPSCV-SFDVQKLVDWCKT-NTA 132
Query 134 HPSYLDIAGFERVPKTLKLLRYLRYGNFLYDTGFSTLPSKNMNYSSVKDFNLYAKWNLNV 193
DI GF++ ++L L YG + G +P N +++ +
Sbjct 133 K----DIHGFDKNKGVYRILDLLGYGKYANSAG---VPYTNPTSTTMGKCTPFRG----- 180
Query 194 NVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFD-YYSGGNILTEYKGDPSDLFLKDNLFSL 252
AYQKIY D++R +E+ Q ++N D +Y G + +P D + F+L
Sbjct 181 -----LAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVKETIPNEPWDY----DWFTL 231
Query 253 RYANYPKDLFMGILPSSQLGSVATINISHSSSAGVHLTSQEGYLTGTVAS--DGTTITVK 310
RY N KDL + P+ L S+ N + + + +TG D I K
Sbjct 232 RYRNAQKDLLTNVRPTP-LFSIDDFNPQFFTGGSDIVMEKGPNVTGGTHEYRDSVVIVGK 290
Query 311 NTRSLTPGISPVLRTNFADLNANF-DILSFRIANAIQRMREIQQCAGQGYKEQLEARWNV 369
N L+ N D + R A A++++ + AG+ YKEQ+EA + +
Sbjct 291 N-----------LKENGVDSKRTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGI 339
Query 370 KLSTALSDHCTYIGGNSSQINISEVLNNSLDTEQSQADIk---------gkgvgsgsgse 420
+ CTYIGG S I + +V +S T D GK GSGSG
Sbjct 340 SVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHI 399
Query 421 sFETQEHGILMCIYHAVPVLDY 442
F+ +EHGILMCIY VP + Y
Sbjct 400 RFDAKEHGILMCIYSLVPDVQY 421
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 134 bits (338), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 130/459 (28%), Positives = 208/459 (45%), Gaps = 72/459 (16%)
Query 12 KGRPSRAGFDLSKKFCFTAKAGELLPVYWKMLLPGTKVNLKDMHFTRTMPVNTAAYTRIK 71
K R +R GFDLS + F+AKAG+LLP+ + P RT +NTA+Y R+K
Sbjct 5 KPRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMK 64
Query 72 EYFDWYFVPLRLINKNLNPALVNMLDQSNIANSLIQNKV-----VSDEIPCFDYDTLTSC 126
EY+ ++FV R + + + +V + + N + +N + +P FD L +
Sbjct 65 EYYHFFFVSYRSLWQWFDQFIVGTNNPHSALNGVKKNGTTNYNQICSSVPTFDLGKLITR 124
Query 127 LKAFNTQHPSYLDIAGFERVPKTLKLLRYLRYG-----------NFLYDTGFSTLPSKNM 175
LK S +D GF KLL L YG N + T + LPSK+
Sbjct 125 LKT------SDMDSQGFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITSTSY--LPSKDD 176
Query 176 NYSSVKDFNLYAKWNLNVNVLPLAAYQKIYCDYFRFEQWEKAQPYTYNFDYYSGGNILTE 235
S ++YA V+ L AYQKI+ D++R + W + ++N D Y+ + LT
Sbjct 177 KEPS----SIYA---CKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNLTI 229
Query 236 YKGDPSDLFLKDNLFSLRYANYPKDLFMGILPSSQLGSVATINISH--SSSAGVHLTSQE 293
+P D+ LK +RY Y KD + P+ S N+ + V LT+ +
Sbjct 230 ---EP-DVALK--FCQMRYRPYAKDWLTSMKPTPNY-SDGIFNLPEYVRGNGNVILTNNK 282
Query 294 GYLTGTVASDGTTITVKNTRSLTPGISPVLRTNFADLNANFDILSFRIANAIQRMREIQQ 353
+G+V+ D T +SP ++F + R A A+ +M E +
Sbjct 283 ---SGSVSLDSGT------------VSP----------SSFSVNDLRAAFALDKMLEATR 317
Query 354 CA-GQGYKEQLEARWNVKLSTALSDHCTYIGGNSSQINISEVL--NNSLDTEQSQADI-- 408
A G Y Q+EA + K+ + ++ ++GG + I +SEV+ N + ++ S A I
Sbjct 318 RANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGD 377
Query 409 --kgkgvgsgsgsesFETQEHGILMCIYHAVPVLDYQLT 445
SG+ F++ EHGI+MCIY P +Y +
Sbjct 378 LGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNAS 416
Lambda K H a alpha
0.320 0.136 0.409 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3109758059016