bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-2_CDS_annotation_glimmer3.pl_2_1
Length=611
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226431|ref|WP_021963494.1| predicted protein 96.7 6e-18
gi|496050828|ref|WP_008775335.1| hypothetical protein 89.0 2e-15
gi|575094322|emb|CDL65709.1| unnamed protein product 88.2 4e-15
gi|490418708|ref|WP_004291031.1| hypothetical protein 78.2 5e-12
gi|575094355|emb|CDL65737.1| unnamed protein product 75.1 5e-11
gi|575094340|emb|CDL65724.1| unnamed protein product 74.7 8e-11
gi|565841285|ref|WP_023924566.1| hypothetical protein 72.0 5e-10
gi|517172763|ref|WP_018361581.1| hypothetical protein 65.5 6e-08
gi|647452984|ref|WP_025792805.1| hypothetical protein 61.2 1e-06
gi|490477382|ref|WP_004347759.1| hypothetical protein 61.2 1e-06
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 67/178 (38%), Positives = 95/178 (53%), Gaps = 12/178 (7%)
Query 195 ADCKLQ--IPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLLF 252
A C L + RD +LF KR+R+NL ++ KI YY+VSEYGP+T+R H+H L F
Sbjct 122 AKCNLDGYLSYTSKRDAQLFLKRVRKNLSK--YSDEKIRYYIVSEYGPKTFRAHYHVLFF 179
Query 253 FNSEEITQTLREDISKAWAYGRIDYSLSRGaaasyvasyvnsaaCLPFFYVGQKEIRPRS 312
++ + + + + I +AW +GR+D SLSRG SYVA YVN CLP F +G +P S
Sbjct 180 YDEVKTQKVMSKVIRQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLPRF-LGDMSTKPFS 238
Query 313 FHSKGFGSNKIFPKSSDVSEISK--ISDLFFDGVNVDSNGKVINIRPVRSCELAVFPR 368
HS F + S EI K + D + + NG + P R+ FP+
Sbjct 239 CHSIRFA---LGIHQSQKEEIYKGSVDDFIYQSGEI--NGNYVEFMPWRNLSCTFFPK 291
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/72 (32%), Positives = 38/72 (53%), Gaps = 0/72 (0%)
Query 11 YLFTECLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNMATHFKYCYFVTLTY 70
Y +C P+ + N Y+ +VI CG CK+C+ +++ + KYC F TLTY
Sbjct 6 YPLVKCYHPRHVQNKYTGEVIQVGCGVCKACLKRRADKMSFLCAIEEQSHKYCMFATLTY 65
Query 71 KDIFLPYLSVEV 82
+ ++P + EV
Sbjct 66 SNDYVPRMYPEV 77
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/413 (25%), Positives = 178/413 (43%), Gaps = 38/413 (9%)
Query 201 IPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLLFFNSEEITQ 260
+P L+ D++LF KRLR + S K+ Y+ V EYGP +RPH+H LLF S+E Q
Sbjct 117 VPYLRKTDLQLFLKRLRYYVTKQK-PSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQ 175
Query 261 TLREDISKAWAYGRIDYSLSRGaaasyvasyvnsaaCLPFFYVGQKEIRPRSFHSKGFGS 320
E+ISKAW +GR+D +S+G ++YVASYVNS+ +P + + P S HS+ G
Sbjct 176 ICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKAS-SVCPFSVHSQKLGQ 234
Query 321 NKIFPKSSDVSEISKISDLFFDGVNVDSNGKVINIRPVRSCELAVFPRFSNDFFSDSDTC 380
+ +I ++ F ++ NGK RSC +PR
Sbjct 235 GFL---DCQREKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPR------------ 279
Query 381 CKLFQSVIETPERLVSRGYLGIDTPSF----GSDGFRLSDLVRAYSEYYERNFTDFSFSL 436
CK F + R + Y DT F L+ + Y Y+ + L
Sbjct 280 CKGF---VTKSSRERAYSYSIYDTARLLFPDAKTTFSLAKEIAIYIYYFHNPKETYLLDL 336
Query 437 RFLRGYRTRDYSDELIFREARLFDGYVFNKDMIFGRLYRLFAKVLRCFRFWNLKQYTDSW 496
+++ Y F ++ + + FN ++R++ ++L F ++
Sbjct 337 YGYCSDQSKLYELSQYFYDSDVL-LHSFNSGEFSRYVHRIYTELLISKHFLYFVCTHNTL 395
Query 497 SLKAAVKKIWSYGWEYWKKKEYRFLTTYFEYLEGCNDDERLFLLVRTSG-SGLATDSPHS 555
+ + + +++ E++ + +Y LT +FE ++LF G L TD+ +
Sbjct 396 AERKSKQRLIE---EFYSRLDYMHLTKFFE-------AQQLFYESDLIGDDDLCTDNWDN 445
Query 556 WTYTQREDYVNCLPDDLYKRYMKTLKWLTARTEKVLKDKVKHKEFNDMQGVLL 608
Y Y N D + ++ +K+ D++KHK+ ND V
Sbjct 446 SYYPYF--YNNVYTDTNLFEKTPVYRLYSSDVKKLFNDRIKHKKLNDANKVFF 496
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/94 (32%), Positives = 55/94 (59%), Gaps = 3/94 (3%)
Query 13 FTECLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNMATH-FKYCYFVTLTYK 71
F +CL P+RI+NPY+ + + PCG C++C + K N A+ ++ ++ K+ F+TLTY
Sbjct 6 FCKCLHPKRIMNPYTKESMVVPCGHCQACTLAK-NSRYAFQCDLESYTAKHTLFITLTYA 64
Query 72 DIFLP-YLSVEVVRRSGNRYLFDENFETMVSTSD 104
+ F+P + V+ + R L D+ ++ +D
Sbjct 65 NRFIPRAMFVDSIERPYGCDLIDKETGEILGPAD 98
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 105/377 (28%), Positives = 160/377 (42%), Gaps = 82/377 (22%)
Query 13 FTECLRPQRIVNP--YSHDVIFAPCGRCKSCIMNK-SNFATAYAMNMATHFKYCYFVTLT 69
F +C P + +P Y + V PCG+C +C NK S+ + + T KYCYF+TLT
Sbjct 5 FVKCFSPLVLRDPRGYPYQV---PCGKCIACHNNKRSSLSLKLRLEEYTS-KYCYFLTLT 60
Query 70 YKDIFLPYLSVEVVRRSGNRYLFDENFETMVSTSDPRLLTPEYYHDRDLSLDPAQNEVEQ 129
Y D LP SV LD E +
Sbjct 61 YDDDNLPLFSV--------------------------------------GLDTCATEFVR 82
Query 130 VFDIGFQSIPRDVSVKSKGSFRFRSFDDEPLKFCIPMKLTELQDILIKANGRYDYGKKKV 189
++ + R+ S S +FD++ + K+ D +I +Y K
Sbjct 83 IYP--YSERLRNDSFISDFCSDLHNFDNDFVD-----KMDYYSDYVINYESKY---HKSC 132
Query 190 VYPSLADCKLQIPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHC 249
VY +L RDI+LF KRLR+++ + + KI +Y++ EYG ++ RPHWHC
Sbjct 133 VYGHGL-----YALLYYRDIQLFLKRLRKHI--YKYYGEKIRFYIIGEYGTKSLRPHWHC 185
Query 250 LLFFNSEEITQTLREDISKA---------------WAYGRIDYSLSRGaaasyvasyvns 294
LLFFNS ++Q + ++ W +G D + G A +YV+SYVN
Sbjct 186 LLFFNSSSLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFGICDSKRTNGEAYNYVSSYVNQ 245
Query 295 aaCLPFFYVGQKEIRPRSFHSKGFGSNKIFPKSSDVSEISKISDLFFD-GVNVDSNGKVI 353
+A P V +++HS G +I + S VS I K FF+ +D+ G
Sbjct 246 SANFPKLLVLLSN--QKAYHSIQLG--QILSEQSIVSAIQKGDFSFFERQFYLDTFGAAN 301
Query 354 NIRPVRSCELAVFPRFS 370
+ RS FP+F+
Sbjct 302 SYSVWRSYYSRFFPKFT 318
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 35/82 (43%), Positives = 53/82 (65%), Gaps = 1/82 (1%)
Query 201 IPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLLFFNSEEITQ 260
+P L+ D++LFFKR R + + F K+ Y+ + EYGP +RPH+H LLF S+E Q
Sbjct 41 LPYLRKFDLQLFFKRFRYYV-AKRFPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQ 99
Query 261 TLREDISKAWAYGRIDYSLSRG 282
+ +S+AW +GR+D LS+G
Sbjct 100 VCSKVVSEAWPFGRVDCQLSKG 121
>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 96/196 (49%), Gaps = 34/196 (17%)
Query 201 IPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLLFFNSEEIT- 259
+P L+ RD++LF KRLR+NL ++ +K+ Y+ + EYGP +RPH+H LLFF+ + T
Sbjct 123 VPYLRKRDLQLFIKRLRKNLSK--YSDAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFTA 180
Query 260 ---QTLRE------------------------DISKAWAYGRIDYSLSRGaaasyvasyv 292
TL E I +W +GR+D S+G AA YV+SYV
Sbjct 181 PSGHTLGEFPDWAWYDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSYV 240
Query 293 nsaaCLPFFYVGQKEIRPRSFHSKGFGSNKIFPKSSDVSEISKISDLFFDGVNVDSNGKV 352
+ + LP Y RP S HS+ G + + V E + + D V ++ + K
Sbjct 241 SGSGSLPKVY-QVSSARPFSLHSRFLGQGFLAHECEKVYE-TPVRDFVKRSVELNGSNKD 298
Query 353 INIRPVRSCELAVFPR 368
N+ RSC +P+
Sbjct 299 FNL--WRSCYSVFYPK 312
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/73 (37%), Positives = 44/73 (60%), Gaps = 0/73 (0%)
Query 8 LTKYLFTECLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNMATHFKYCYFVT 67
L + F CL P+R+ NPY +D + PCG+C++C +K++ A+ K+C F T
Sbjct 3 LKDFPFIRCLEPKRVFNPYLNDWLLVPCGKCRACQCSKASRYKLQIQLEASQHKFCIFGT 62
Query 68 LTYKDIFLPYLSV 80
LTY + ++P LS+
Sbjct 63 LTYANTYIPRLSL 75
>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 80/324 (25%), Positives = 130/324 (40%), Gaps = 53/324 (16%)
Query 14 TECLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMN-MATHFKYCYFVTLTYKD 72
T C ++ N Y + CG C SC+ K+N + +N + + FVTLTY +
Sbjct 4 TMCTNRIKVTNKYVGRSFYVDCGHCPSCLQRKANKSCCKIINEYGRPYSFMCFVTLTYDN 63
Query 73 IFLPYLSVEVVRRSGNRYLFDENFETMVSTSDPRLLTPEYYHDRDLSLDPAQNEVEQVFD 132
+PY+ + T L + Y+ R ++FD
Sbjct 64 EHIPYIHPD--------------------TDYSHLYVGKSYYVRH----------SRIFD 93
Query 133 I-GFQSIPRDVSVKSKGSFRFRSFDDEPLKFCIPMKLTELQDILIKANGRYDYGKKKVVY 191
G +++P V +R+ F M ++ L G + VV
Sbjct 94 KDGVENLPLGV---------YRNGKLIDTVFLPEMPKEVFRNYLCNTTGIVTKSRNGVV- 143
Query 192 PSLADCKLQIPVLQSRDIELFFKRLRRNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLL 251
L ++ +L +D F KRLR NL + KI Y+ SEYGP T RPH+H +
Sbjct 144 --LERDDNKVGILYDKDFVNFVKRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIF 201
Query 252 FFNSEEIT-QTLREDISKAWAYGRID-----YSLSRGaaasyvasyvnsaaCLPFFYVGQ 305
+F+S ++ + R + ++W D ++R A + + P F
Sbjct 202 WFDSRALSFDSFRSAVVESWKMCDKDKQYENVEIAREPATYVASYVNCLTSVPPLFLF-- 259
Query 306 KEIRPRSFHSKGFG-SNKIFPKSS 328
K +RP+ HSKGFG +N +F S+
Sbjct 260 KGLRPKHSHSKGFGFANNLFSFSA 283
>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens
CC14M]
Length=484
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 80/155 (52%), Gaps = 22/155 (14%)
Query 138 IPRDVSVKSKGSFRFRSFDDEPLKFCIPMKLTELQDILIKAN---------GRYDYGKKK 188
+ + + + +F ++D+E L P + E +++ +N G YD+ K
Sbjct 46 VANECKLHAYSAFVTLTYDNEHLPLYQPECMNERGEMVWTSNRLCDEKVIVGNYDFIK-- 103
Query 189 VVYPSLADCKLQ-IPVLQSRDIELFFKRLRRNLD-----SHGFTSSKICYYVVSEYGPQT 242
+++ +Q + DI FFKRLR L H T+ KI Y+V SEYGP+T
Sbjct 104 -----VSNSDVQAVAYCCKSDIVKFFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKT 158
Query 243 YRPHWHCLLFFNSEEITQTLREDISKAWAYGRIDY 277
RPH+H +++F+SEE+ + + + +S +W+ G D+
Sbjct 159 LRPHYHAIIWFDSEEVARVIEKMLSSSWSNGFTDF 193
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (53%), Gaps = 0/72 (0%)
Query 16 CLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNMATHFKYCYFVTLTYKDIFL 75
C P+RI+NPY+H+ ++ C RCK C+ K++ + N Y FVTLTY + L
Sbjct 9 CEHPKRIINPYTHERVWVACRRCKCCLNKKTSAWSGRVANECKLHAYSAFVTLTYDNEHL 68
Query 76 PYLSVEVVRRSG 87
P E + G
Sbjct 69 PLYQPECMNERG 80
>gi|517172763|ref|WP_018361581.1| hypothetical protein [Prevotella nanceiensis]
Length=598
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 48/79 (61%), Gaps = 4/79 (5%)
Query 197 CK--LQIPVLQSRDIELFFKRLRRNLDSHGFTSS--KICYYVVSEYGPQTYRPHWHCLLF 252
CK LQ + +DI+ F KRLR+ +D + KI Y++ SEYGP+TYRPH+H +LF
Sbjct 142 CKNNLQFATVSKKDIQNFLKRLRKKIDKLNIPQNEKKIRYFIASEYGPKTYRPHYHGVLF 201
Query 253 FNSEEITQTLREDISKAWA 271
+S + ++ I ++W
Sbjct 202 IDSPTVLSKIKAFIVESWG 220
>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (61%), Gaps = 3/76 (4%)
Query 207 RDIELFFKRLRRNLD---SHGFTSSKICYYVVSEYGPQTYRPHWHCLLFFNSEEITQTLR 263
+D++ FFKRLR +D +I Y++ SEYGP T+RPH+H +L+++SE + L
Sbjct 125 KDVQDFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELN 184
Query 264 EDISKAWAYGRIDYSL 279
I + W G D+SL
Sbjct 185 VLIRETWKNGNTDFSL 200
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 6/67 (9%)
Query 16 CLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNM---ATHFKYCYFVTLTYKD 72
CLRP RI N Y + ++ C +C C +S++A+++A + + +Y F+TLTY +
Sbjct 12 CLRPHRIYNRYIGEFLYTNCRKCVRC---RSSYASSWANRIDSECSFHRYSLFLTLTYDN 68
Query 73 IFLPYLS 79
LPY +
Sbjct 69 DHLPYYA 75
>gi|490477382|ref|WP_004347759.1| hypothetical protein [Prevotella buccalis]
gi|281300711|gb|EFA93042.1| hypothetical protein HMPREF0650_1078 [Prevotella buccalis ATCC
35310]
Length=582
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 45/72 (63%), Gaps = 3/72 (4%)
Query 203 VLQSRDIELFFKRLR---RNLDSHGFTSSKICYYVVSEYGPQTYRPHWHCLLFFNSEEIT 259
V+ +DI+ F KRLR + + SKI YY+ SEYGP TYRPH+H +LFF+S++I
Sbjct 140 VVCKKDIQNFLKRLRWRISKIPNITKDESKIRYYISSEYGPTTYRPHYHGILFFDSKKIL 199
Query 260 QTLREDISKAWA 271
++ I +W
Sbjct 200 DKIKSLIVMSWG 211
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/95 (31%), Positives = 45/95 (47%), Gaps = 5/95 (5%)
Query 16 CLRPQRIVNPYSHDVIFAPCGRCKSCIMNKSNFATAYAMNMATHFKYCYFVTLTYKDIFL 75
CL P+++ NP H ++ C +C +C+ K+ + A KY F TLTY + L
Sbjct 13 CLNPRKVYNPSLHGWMYCSCDKCTACLNQKATTLSNRARAEIEQHKYSVFFTLTYDNEHL 72
Query 76 PYLSV-----EVVRRSGNRYLFDENFETMVSTSDP 105
P V EV++ L D++ M+S S P
Sbjct 73 PKYEVFQDSNEVIQYRPIGRLVDDSSSDMLSNSCP 107
Lambda K H a alpha
0.325 0.140 0.439 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4577117499429