bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-11_CDS_annotation_glimmer3.pl_2_3
Length=497
Score E
Sequences producing significant alignments: (Bits) Value
gi|496050828|ref|WP_008775335.1| hypothetical protein 848 0.0
gi|490418708|ref|WP_004291031.1| hypothetical protein 612 0.0
gi|575094355|emb|CDL65737.1| unnamed protein product 360 6e-115
gi|547226431|ref|WP_021963494.1| predicted protein 237 8e-68
gi|494822887|ref|WP_007558295.1| hypothetical protein 223 3e-62
gi|494610270|ref|WP_007368516.1| hypothetical protein 125 6e-28
gi|565841285|ref|WP_023924566.1| hypothetical protein 123 3e-27
gi|575094322|emb|CDL65709.1| unnamed protein product 114 4e-24
gi|647452984|ref|WP_025792805.1| hypothetical protein 113 1e-23
gi|496521300|ref|WP_009229583.1| hypothetical protein 110 2e-22
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 848 bits (2192), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/497 (80%), Positives = 448/497 (90%), Gaps = 1/497 (0%)
Query 1 MIKNSFCKCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFIT 60
M++N FCKCLHPK+I+NPYT ESMVVPCGHCQAC LAKNSRYAFQCDLESY AKHTLFIT
Sbjct 1 MVQNPFCKCLHPKRIMNPYTKESMVVPCGHCQACTLAKNSRYAFQCDLESYTAKHTLFIT 60
Query 61 LTYANRYIPRATFVDSLERPFGNDLVDKETGEILGPSDMKQEDIDRLLNKFYLFGDVPYL 120
LTYANR+IPRA FVDS+ERP+G DL+DKETGEILGP+D+ +++ LLNKFYLFGDVPYL
Sbjct 61 LTYANRFIPRAMFVDSIERPYGCDLIDKETGEILGPADLTEDERTNLLNKFYLFGDVPYL 120
Query 121 RKTDLQLFFKRLRYYVSKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQSDEALQVCSEN 180
RKTDLQLF KRLRYYV+KQ PSEKVRYFAVGEYGPVHFRPHYH+LLFLQSDEALQ+CSEN
Sbjct 121 RKTDLQLFLKRLRYYVTKQKPSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQICSEN 180
Query 181 ISQAWTLGRIDCQISKGQCSSYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQ 240
IS+AWT GR+DCQ+SKGQCS+YVASYVNSSCTIPKVFK SSVCPF+VHSQKLGQGFLDCQ
Sbjct 181 ISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKASSVCPFSVHSQKLGQGFLDCQ 240
Query 241 REKIYSSTPQNFVKRSIVLNGKYKEFDVWRSCYAYYFPRCKGFASKSSRERAYSYGIYDT 300
REKIYS TP+NF++ SIVLNGKYKEFDVWRSCY++++PRCKGF +KSSRERAYSY IYDT
Sbjct 241 REKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPRCKGFVTKSSRERAYSYSIYDT 300
Query 301 ARRLFSSSETTFSLAKEIAFYIKHFHFTDDTYLLDLFGHVSDQKSLLDLSNYFLDRDAMV 360
AR LF ++TTFSLAKEIA YI +FH +TYLLDL+G+ SDQ L +LS YF D D ++
Sbjct 301 ARLLFPDAKTTFSLAKEIAIYIYYFHNPKETYLLDLYGYCSDQSKLYELSQYFYDSDVLL 360
Query 361 RPVESDEFNRWVHRIYTELLVSKHFLYFVCTHTTLAERKSKQRMIEEFYSYLDYMHLTTF 420
S EF+R+VHRIYTELL+SKHFLYFVCTH TLAERKSKQR+IEEFYS LDYMHLT F
Sbjct 361 HSFNSGEFSRYVHRIYTELLISKHFLYFVCTHNTLAERKSKQRLIEEFYSRLDYMHLTKF 420
Query 421 FESQQEFYESDLIGDDDLCTDQWENSYYPYFYYNVHTNEP-FEKTPVYRLYASDVKKLFN 479
FE+QQ FYESDLIGDDDLCTD W+NSYYPYFY NV+T+ FEKTPVYRLY+SDVKKLFN
Sbjct 421 FEAQQLFYESDLIGDDDLCTDNWDNSYYPYFYNNVYTDTNLFEKTPVYRLYSSDVKKLFN 480
Query 480 DRIKHKKLNDANKIFID 496
DRIKHKKLNDANK+F +
Sbjct 481 DRIKHKKLNDANKVFFE 497
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 612 bits (1579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 292/422 (69%), Positives = 350/422 (83%), Gaps = 1/422 (0%)
Query 77 LERPFGNDLVDKETGEILGPSDMKQEDIDRLLNKFYLFGDVPYLRKTDLQLFFKRLRYYV 136
+ERP+G+DLVD ETGE LG +D+ ++I+RL KF+LFG +PYLRK DLQLFFKR RYYV
Sbjct 1 MERPYGHDLVDVETGEYLGEADLSIKEIERLQEKFHLFGYLPYLRKFDLQLFFKRFRYYV 60
Query 137 SKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQSDEALQVCSENISQAWTLGRIDCQISK 196
+K+ P EKVRYFA+GEYGPVHFRPHYHILLFLQSDEALQVCS+ +S+AW GR+DCQ+SK
Sbjct 61 AKRFPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQVCSKVVSEAWPFGRVDCQLSK 120
Query 197 GQCSSYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQREKIYSSTPQNFVKRS 256
G+CSSYVA YVNSS +PKV L ++CPF VHSQKLGQGFL +R K+YS TP+ FVKRS
Sbjct 121 GKCSSYVAGYVNSSVLVPKVLTLPTLCPFCVHSQKLGQGFLQSERAKVYSLTPEQFVKRS 180
Query 257 IVLNGKYKEFDVWRSCYAYYFPRCKGFASKSSRERAYSYGIYDTARRLFSSSETTFSLAK 316
IV+NG+YKEFDVWRS YAY+FP+CKGFA KSSRERAYSYG+YDTARRLF S+ETTF+LAK
Sbjct 181 IVINGRYKEFDVWRSAYAYFFPKCKGFADKSSRERAYSYGLYDTARRLFPSAETTFALAK 240
Query 317 EIAFYIKHFHFTDDTYLLDLFGHVSDQKSLLDLSNYFLDRDAMVRPVESDEFNRWVHRIY 376
EI YI +FH DTY LD+FG VSDQ L S YF + + + ++S E R+VHR+Y
Sbjct 241 EIVGYIYYFHNKKDTYCLDIFGEVSDQSDLYQFSQYFFEPEIVNYSLDSIEMCRYVHRVY 300
Query 377 TELLVSKHFLYFVCTHTTLAERKSKQRMIEEFYSYLDYMHLTTFFESQQEFYESDLIGDD 436
TELL+SKHFLYFVC TL+E+K K ++IEEFYS LDYMHL TFFE+QQ FYESDL+GD
Sbjct 301 TELLLSKHFLYFVCDRPTLSEQKRKLKLIEEFYSRLDYMHLKTFFENQQLFYESDLVGDL 360
Query 437 DLCTDQWENSYYPYFYYNVH-TNEPFEKTPVYRLYASDVKKLFNDRIKHKKLNDANKIFI 495
DL +D WENSYYP+FY NV+ ++E ++KTPVYRLY + KLF+DRIKHKKLND NKIF+
Sbjct 361 DLMSDAWENSYYPFFYDNVYFSSEVYKKTPVYRLYDMQISKLFSDRIKHKKLNDLNKIFV 420
Query 496 DE 497
DE
Sbjct 421 DE 422
>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517
Score = 360 bits (925), Expect = 6e-115, Method: Compositional matrix adjust.
Identities = 215/527 (41%), Positives = 309/527 (59%), Gaps = 52/527 (10%)
Query 6 FCKCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYAN 65
F +CL PK++ NPY N+ ++VPCG C+AC +K SRY Q LE+ K +F TLTYAN
Sbjct 8 FIRCLEPKRVFNPYLNDWLLVPCGKCRACQCSKASRYKLQIQLEASQHKFCIFGTLTYAN 67
Query 66 RYIPRATFVDSLERPFGN----DLVDKETGEILGPSDMKQEDIDRLLNKFYLFGDVPYLR 121
YIPR + V ++ FG ++ DKETGE LG D D++ LL+K +LFGDVPYLR
Sbjct 68 TYIPRLSLVPYNDKTFGVVNGYEMCDKETGEYLGYLDSPSYDVESLLDKLHLFGDVPYLR 127
Query 122 KTDLQLFFKRLRYYVSKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQS----------- 170
K DLQLF KRLR +SK + KVRYFA+GEYGPVHFRPHYH LLF
Sbjct 128 KRDLQLFIKRLRKNLSKYSDA-KVRYFAMGEYGPVHFRPHYHFLLFFDEIKFTAPSGHTL 186
Query 171 -----------------DEALQVCSENISQAWTLGRIDCQISKGQCSSYVASYVNSSCTI 213
+ L V I +W GR+D Q SKG + YV+SYV+ S ++
Sbjct 187 GEFPDWAWYDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSYVSGSGSL 246
Query 214 PKVFKLSSVCPFNVHSQKLGQGFLDCQREKIYSSTPQNFVKRSIVLNGKYKEFDVWRSCY 273
PKV+++SS PF++HS+ LGQGFL + EK+Y + ++FVKRS+ LNG K+F++WRSCY
Sbjct 247 PKVYQVSSARPFSLHSRFLGQGFLAHECEKVYETPVRDFVKRSVELNGSNKDFNLWRSCY 306
Query 274 AYYFPRCKGFASKSSRERAYSYGIYDTARRLFSSSETTFSLAKEIAFYIKHFHFTDDTYL 333
+ ++P+CKGF KSS ER Y+Y +YDTA+RLF + LA+E ++ + + +
Sbjct 307 SVFYPKCKGFTRKSSSERLYTYKLYDTAKRLFPYVSSVIELARETMIHLTFYVYGKQHTV 366
Query 334 LDLFGHVSDQKSLLDLSNYFLDRDAMVRPVESDE--FNRWVHRIYTELLVSKHFLYFVCT 391
+L D K L L+ + +V D+ ++ ++ IY ELL+S+HFL F C+
Sbjct 367 AEL---DYDIKRYLLYFRDSLNINEVVLQYGFDDVRIDKCIYLIYNELLLSRHFLEFCCS 423
Query 392 HTTLAERKSKQRMIEEFYSYLDYMHLTTFFESQQEFYESDLIGDDDLCTDQWENSYYPYF 451
+ + + IE FY LDY+ LT FF+SQ+ ++ D DD Y Y
Sbjct 424 GRS---QNFVFKRIEAFYKDLDYLQLTEFFKSQELYFSQDFCDSDD----------YVYM 470
Query 452 YYNVHTN-EPFEKTPVYRLYASDVKKLFNDRIKHKKLNDANKIFIDE 497
Y N + + ++++ Y + +++ +IKHK+LND N++F D+
Sbjct 471 YNNSSFSLDAYKQSMSYLSFEQQTFEIWRSKIKHKELNDLNQLFFDK 517
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 237 bits (604), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 168/513 (33%), Positives = 242/513 (47%), Gaps = 48/513 (9%)
Query 8 KCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYANRY 67
KC HP+ + N YT E + V CG C+AC+ + + +F C +E K+ +F TLTY+N Y
Sbjct 10 KCYHPRHVQNKYTGEVIQVGCGVCKACLKRRADKMSFLCAIEEQSHKYCMFATLTYSNDY 69
Query 68 IPRA--------------TFVDSLERPFGNDLVDKETGEILGPSDMKQEDIDRLLNKFYL 113
+PR ++ D L VD + D + L K L
Sbjct 70 VPRMYPEVDNELRLVRWYSYCDRLNEKGKLMTVDYDYWHKCPSLDTY---VLMLTAKCNL 126
Query 114 FGDVPYLRKTDLQLFFKRLRYYVSKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQSDEA 173
G + Y K D QLF KR+R +SK EK+RY+ V EYGP FR HYH+L F +
Sbjct 127 DGYLSYTSKRDAQLFLKRVRKNLSKY-SDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKT 185
Query 174 LQVCSENISQAWTLGRIDCQISKGQCSSYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLG 233
+V S+ I QAW GR+DC +S+G+C+SYVA YVN + +P+ S PF+ HS +
Sbjct 186 QKVMSKVIRQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLPRFLGDMSTKPFSCHSIRFA 245
Query 234 QGFLDCQREKIYSSTPQNFVKRSIVLNGKYKEFDVWRSCYAYYFPRCKGFASKSSRERAY 293
G Q+E+IY + +F+ +S +NG Y EF WR+ +FP+CKG++ KS E
Sbjct 246 LGIHQSQKEEIYKGSVDDFIYQSGEINGNYVEFMPWRNLSCTFFPKCKGYSRKSDSELWQ 305
Query 294 SYGIYDTARRLFSSS-ETTFSLAKEIAFYIKHFHFTDDTYLLDLFGHVSDQKSLLDLSNY 352
SY I R S T A+ I + F+ D+ G +L + +Y
Sbjct 306 SYNILREVRSAIGYSFNTIIDYARCILDLVVTAKFSCDSR-----GLPCSSPALNKVISY 360
Query 353 FLDRDAMVRPVESDEFNRW-VHRIYTELLVSKHFLYFVCTHTTLAERKSKQRMIEEFYSY 411
F + P SD + + I EL +S+HFL FVC + + ER K +I +F+
Sbjct 361 F-SQGIDTNPYYSDYLADYHTNSIARELYISRHFLTFVCDNDSYHERYRKFTLIRQFWQR 419
Query 412 LDYMHLTTFFESQQEFYESDLIGDDDLCTDQWENSYYPYFYYNVHTNEPFEKTPV----- 466
DY L + SQ E LI + Y ++Y N + F V
Sbjct 420 YDYALLVGMYTSQIE--NRHLISN------------YDWYYINKTPLDSFGNVDVSQLSK 465
Query 467 ---YRLYASDVKKLFNDRIKHKKLNDANKIFID 496
Y+ + + F IKHK NDAN FI+
Sbjct 466 ELFYKRFVIKSDENFEKSIKHKIQNDANGFFIN 498
>gi|494822887|ref|WP_007558295.1| hypothetical protein [Bacteroides plebeius]
gi|198272100|gb|EDY96369.1| hypothetical protein BACPLE_00805 [Bacteroides plebeius DSM 17135]
Length=545
Score = 223 bits (569), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 174/554 (31%), Positives = 275/554 (50%), Gaps = 82/554 (15%)
Query 6 FCKCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYAN 65
F CL P++I N YT E MVV C HC AC +N +Y+ CD ES AK T+F+TLT+ +
Sbjct 5 FVSCLEPQRIKNKYTGEEMVVACKHCVACEQLRNFKYSNLCDFESLTAKKTVFLTLTFDD 64
Query 66 RYIPRATFVDSLERPFGND---LVDKETGEILGPSDMKQEDID----RLLNKFYLFGDVP 118
+++P+ F G+D + D +TGE LG + M + ++ R+ + G P
Sbjct 65 KFVPQFRFYK-----VGDDEYIMRDADTGEYLGRTLMTPQLMNEYQKRVNYRINYKGRFP 119
Query 119 YLRKTDLQLFFKRLRYYVSKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQSDEALQVCS 178
YL K +LQLF KRLR Y+ K +K+R+FA GEYGP+ FRPH+HILLF+ D +L + S
Sbjct 120 YLSKRELQLFMKRLRKYLDKY-EGQKIRFFATGEYGPLSFRPHFHILLFV-DDPSLFLPS 177
Query 179 EN------------------------------ISQAWTLGRIDCQ-ISKGQCSSYVASYV 207
+ I ++W G ID Q + +G CSSYVA YV
Sbjct 178 VHTLGEYPYPYWSKYQKAHCGKGTLLSKLEYYIRESWPFGGIDAQSVEQGSCSSYVAGYV 237
Query 208 NSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQREKIYSSTPQNFVKRSIVLNGKYKEFD 267
NSS +P K+ +V F+ HS+ LG+ + + FV+RS G+Y F
Sbjct 238 NSSVPLPSCLKVDAVKSFSQHSRFLGRKIFGTELIPLLKLKFTEFVQRSFFCRGRYDNFR 297
Query 268 VWRSCYAYYFPRCKGFASKSSRERAYSYGIYDTARRLFSSSETTFSLAKEIAFYIKHFHF 327
+P+CKGFA S +R Y I+ R F+S + +A+ + + F+
Sbjct 298 TPSEMLHSVYPQCKGFALLSHEQRFRVYTIWSRLRYYFNSDKKA-DVARSL---VTSFYS 353
Query 328 TDDTYLLDLFGHVSDQKSLL------DLSNYFLDRDAMVRPVESDEFNRWVHR-IYTELL 380
DT +L + V + L+ +L+ +DR + D+ N + + +Y+ LL
Sbjct 354 WLDTGILRVPERVREDFLLIYTELSQNLNYKRIDRFDYDKFRHDDDLNNQLFQCVYSILL 413
Query 381 VSKHFLY-------FVCTHTTLA----ERKSKQRMIEEFYSYLDYMHLTTFFESQQEFYE 429
S F + ++C+ + + + R +E F+ +Y++L ++ Q+ +++
Sbjct 414 CSSVFEHNAKLWKSYLCSLSLMWCDFDRNELFLRKVERFWKNYEYLNLVDWYRKQEIYFD 473
Query 430 ------SDLI-GDDDLCTDQWENSYYPYFYYNV-HTNEPFEK-TPVYRLYASDVKKLFND 480
SD + G + L D YFY NV + E F+K T Y+ Y+++V+ +
Sbjct 474 KSYSRKSDFLDGKERLSGD------IKYFYNNVPYDVEQFKKRTFAYKAYSANVRFMARQ 527
Query 481 RIKHKKLNDANKIF 494
R+KHK+ ND N IF
Sbjct 528 RMKHKEQNDKNMIF 541
>gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis]
gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis
DSM 16608]
Length=479
Score = 125 bits (313), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 138/299 (46%), Gaps = 17/299 (6%)
Query 9 CLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYANRYI 68
CL PK+I N Y +E++ VPC C C + S ++ + + E + +LF+TLTY N +I
Sbjct 10 CLSPKRIYNKYIDETLYVPCRKCFRCRDSYASDWSRRIENECREHRFSLFVTLTYDNEHI 69
Query 69 P--RATFVDSLERP--FGNDLVDKETGEILGPSDMKQEDIDRLLNKFYLFGDVPYLRKTD 124
P + +D P F N L E+G+ L S + ++ ++ Y K D
Sbjct 70 PLFQPLVMDDGSHPVWFSNRL--SESGKFLSDSVCRSLPPQKMEDEVCF----AYPCKKD 123
Query 125 LQLFFKRLRYYVSKQCPSEK-----VRYFAVGEYGPVHFRPHYHILLFLQSDEALQVCSE 179
+Q +FKRLR V Q K +RYF EYGP FRPHYH +L+ S+E +
Sbjct 124 VQDWFKRLRSAVDYQLNKNKSNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGR 183
Query 180 NISQAWTLGRIDCQISKGQCSSYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDC 239
I + W G + S YVA YVN +P + F++ S+ G+
Sbjct 184 LIRETWKNGNSVFSLVNNSASQYVAKYVNGDTRLPPFLRTEFTSTFHLASKHPYIGYCKA 243
Query 240 QREKIYSSTPQNFVKRSIVL--NGKYKEFDVWRSCYAYYFPRCKGFASKSSRERAYSYG 296
E + S+ +S++ NG+++ RS P+C+G+ S S ER Y
Sbjct 244 DEEALRSNVLDGTYGQSVLNRDNGQFEFVPTPRSLENRLLPKCRGYRSLSHSERIRVYA 302
>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens
CC14M]
Length=484
Score = 123 bits (308), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/251 (33%), Positives = 126/251 (50%), Gaps = 45/251 (18%)
Query 9 CLHPKKIVNPYTNESMVVPCGHCQACMLAKNS----RYAFQCDLESYVAKHTLFITLTYA 64
C HPK+I+NPYT+E + V C C+ C+ K S R A +C L +Y A F+TLTY
Sbjct 9 CEHPKRIINPYTHERVWVACRRCKCCLNKKTSAWSGRVANECKLHAYSA----FVTLTYD 64
Query 65 NRYIPRATFVDSLERPFGNDLVDKETGEILGPSDMKQEDIDRLLNKFYLFGD-------- 116
N ++P L +P + E GE++ S+ RL ++ + G+
Sbjct 65 NEHLP-------LYQPECMN----ERGEMVWTSN-------RLCDEKVIVGNYDFIKVSN 106
Query 117 -----VPYLRKTDLQLFFKRLR----YYVSKQ--CPSEKVRYFAVGEYGPVHFRPHYHIL 165
V Y K+D+ FFKRLR YY K +EK+RYF EYGP RPHYH +
Sbjct 107 SDVQAVAYCCKSDIVKFFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKTLRPHYHAI 166
Query 166 LFLQSDEALQVCSENISQAWTLGRIDCQISKGQCSSYVASYVNSSCTIPKVFKLSSVCPF 225
++ S+E +V + +S +W+ G D + YVA YV+ + +P++ + + F
Sbjct 167 IWFDSEEVARVIEKMLSSSWSNGFTDFEYVNSTAPQYVAKYVSGNSVLPEILQHDACRTF 226
Query 226 NVHSQKLGQGF 236
++ SQ G+
Sbjct 227 HLQSQAPSVGY 237
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 111/375 (30%), Positives = 164/375 (44%), Gaps = 57/375 (15%)
Query 1 MIKNSFCKCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFIT 60
MI+ SF KC P + +P VPCG C AC K S + + LE Y +K+ F+T
Sbjct 1 MIQ-SFVKCFSPLVLRDP-RGYPYQVPCGKCIACHNNKRSSLSLKLRLEEYTSKYCYFLT 58
Query 61 LTYANRYIP---------RATFV-----------DSLERPFGNDLVDKETGEILGPSDMK 100
LTY + +P FV DS F +DL + + + + D
Sbjct 59 LTYDDDNLPLFSVGLDTCATEFVRIYPYSERLRNDSFISDFCSDLHNFD-NDFVDKMDYY 117
Query 101 QEDIDRLLNKF-----YLFGDVPYLRKTDLQLFFKRLRYYVSKQCPSEKVRYFAVGEYGP 155
+ + +K+ Y G L D+QLF KRLR ++ K EK+R++ +GEYG
Sbjct 118 SDYVINYESKYHKSCVYGHGLYALLYYRDIQLFLKRLRKHIYKY-YGEKIRFYIIGEYGT 176
Query 156 VHFRPHYHILLFLQSDEALQV---------------CSENISQAWTLGRIDCQISKGQCS 200
RPH+H LLF S Q C + W G D + + G+
Sbjct 177 KSLRPHWHCLLFFNSSSLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFGICDSKRTNGEAY 236
Query 201 SYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQREKIYSSTPQ---NFVKRSI 257
+YV+SYVN S PK+ L S HS +LGQ + I S+ + +F +R
Sbjct 237 NYVSSYVNQSANFPKLLVLLSNQK-AYHSIQLGQIL---SEQSIVSAIQKGDFSFFERQF 292
Query 258 VLN--GKYKEFDVWRSCYAYYFPRCKGFASKSSRERAYS-YGIYDTARRLFSSSETTFSL 314
L+ G + VWRS Y+ +FP+ +S+ + E+ Y Y+T R LF +
Sbjct 293 YLDTFGAANSYSVWRSYYSRFFPKFTC-SSQLTYEQTYRVLTCYETLRDLFDTDSVGVIC 351
Query 315 AKEIAFYIKHFHFTD 329
+ FY HF + D
Sbjct 352 RR--LFYHYHFGYPD 364
>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480
Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/298 (30%), Positives = 130/298 (44%), Gaps = 13/298 (4%)
Query 9 CLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYANRYI 68
CL P +I N Y E + C C C + S +A + D E +++LF+TLTY N ++
Sbjct 12 CLRPHRIYNRYIGEFLYTNCRKCVRCRSSYASSWANRIDSECSFHRYSLFLTLTYDNDHL 71
Query 69 PRATFVDSLERPFGNDLVDKETGEILGPSDMKQEDIDRLLNKFYLFGDV--PYLRKTDLQ 126
P + +L+ D+ G G DI R + + V Y K D+Q
Sbjct 72 PYYAPLFNLDGS-RTDVWCSNRGCDNG--KFVSSDIARPIPPVGMEDTVCFAYPCKKDVQ 128
Query 127 LFFKRLR----YYVSKQCPSEKVRYFAVGEYGPVHFRPHYHILLFLQSDEALQVCSENIS 182
FFKRLR Y + + ++RYF EYGP FRPHYH +L+ S+ + I
Sbjct 129 DFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELNVLIR 188
Query 183 QAWTLGRIDCQISKGQCSSYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQRE 242
+ W G D + S YVA YVN C +P + F++ S+ G+ E
Sbjct 189 ETWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFLRTEFTSTFHLASKHPCIGYGKDDEE 248
Query 243 KIYSSTPQNFVKRSIVLNGKYKEFDVW---RSCYAYYFPRCKGFASKSSRERAYSYGI 297
+Y + R+ LN EF+ RS P+CKG+ S ER Y +
Sbjct 249 ALYENVINGTYGRN-CLNKSTNEFEFVCPPRSLENRILPKCKGYRRISHSERVRIYAL 305
>gi|496521300|ref|WP_009229583.1| hypothetical protein [Prevotella sp. oral taxon 317]
gi|288330571|gb|EFC69155.1| hypothetical protein HMPREF0670_00478 [Prevotella sp. oral taxon
317 str. F0108]
Length=569
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/344 (28%), Positives = 149/344 (43%), Gaps = 61/344 (18%)
Query 6 FCKCLHPKKIVNPYTNESMVVPCGHCQACMLAKNSRYAFQCDLESYVAKHTLFITLTYAN 65
F CL P + N +T + M VPCG C+AC+ A S+ + + E K+++ TLTY N
Sbjct 8 FGNCLCPVHVHNRWTRDEMFVPCGRCEACVNAAASKQSKRVRNEIMQHKYSVMFTLTYNN 67
Query 66 RYIPR-ATFVDSLE----RPFGN----------DLVDKETGEILGPSDMKQEDIDRLLNK 110
+IPR F+D+ + RP G + DK TG+ D+D L K
Sbjct 68 EFIPRWERFLDNNDCPQLRPIGRCAELFPSCPLNYFDKVTGKW-------SIDLDTFLPK 120
Query 111 F------YLFGDVPYLRKTDLQLFFKRLRYYVSK---QCPSEKVRYFAVGEYGPVHFRPH 161
+F K D+Q F KRLR+ +SK + S K+RY+ EYGP RPH
Sbjct 121 IENDEHTEVFASCC---KKDIQNFLKRLRFNISKLYGKAESRKIRYYVASEYGPTTLRPH 177
Query 162 YHILLFLQSDEALQVCSENISQAWTLGR--------------IDCQISK-------GQCS 200
YH ++F L S I ++W R D +++ +
Sbjct 178 YHGIIFFDDASLLSEISSLIVRSWGFQRRVGGKRNSFIFQPFADISLTQQYVKLCDQNTA 237
Query 201 SYVASYVNSSCTIPKVFKLSSVCPFNVHSQKLGQGFLDCQREKIYSSTPQNF--VKRSIV 258
YVA YV+ + +P+V S PF++ S+ G ++ + V R +
Sbjct 238 YYVAEYVSGNLGLPQVLAYKSTLPFHLCSKSPVIGCFKADYCEVLGRVHRGAYRVGREVF 297
Query 259 --LNGKYKEFDVW--RSCYAYYFPRCKGFASKSSRERAYSYGIY 298
+G++ +D+ R + F +C GF+S S E+ Y Y
Sbjct 298 DEKSGQFMHYDIPLDRDLCSSLFRKCLGFSSLSFNEKLLRYSFY 341
Lambda K H a alpha
0.324 0.138 0.429 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3489190903935