bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-11_CDS_annotation_glimmer3.pl_2_5
Length=559
Score E
Sequences producing significant alignments: (Bits) Value
gi|496050828|ref|WP_008775335.1| hypothetical protein 105 7e-21
gi|575094298|emb|CDL65688.1| unnamed protein product 89.7 1e-15
gi|547226431|ref|WP_021963494.1| predicted protein 89.0 2e-15
gi|575094340|emb|CDL65724.1| unnamed protein product 85.9 2e-14
gi|575094355|emb|CDL65737.1| unnamed protein product 84.0 8e-14
gi|490418708|ref|WP_004291031.1| hypothetical protein 81.6 3e-13
gi|575094322|emb|CDL65709.1| unnamed protein product 79.7 2e-12
gi|575095229|emb|CDL66433.1| unnamed protein product 73.6 1e-10
gi|490477382|ref|WP_004347759.1| hypothetical protein 72.8 3e-10
gi|517172763|ref|WP_018361581.1| hypothetical protein 70.9 1e-09
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 94/328 (29%), Positives = 141/328 (43%), Gaps = 73/328 (22%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI 73
C H I N YT + V CG C C + + + + K++ F+TLTY N I
Sbjct 9 CLHPKRIMNPYTKESMVVPCGHCQACTLAKNSRYAFQCDLESYTAKHTLFITLTYANRFI 68
Query 74 PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQ---GTV 130
P R +F + ++ G
Sbjct 69 P--------------------------------------------RAMFVDSIERPYGCD 84
Query 131 PYDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQN 190
D+E E + D L+ D + ++K G +P+L D+Q
Sbjct 85 LIDKETGEILGPAD---LTEDERTNLLNKFYLFGD--------------VPYLRKTDLQL 127
Query 191 YIKRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWK 250
++KRLR Y+ K S E + ++AVGEYGPVHFRPH+HLLLF SDE ++ + K+W
Sbjct 128 FLKRLRYYVTKQKPS-EKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQICSENISKAWT 186
Query 251 FGRSDFQRsaggsasyvssyvNSLCSAPLLYRS---CRAFRPKSRASVGFFEKGCDFVED 307
FGR D Q S G ++YV+SYVNS C+ P ++++ C + GF + + +
Sbjct 187 FGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKASSVCPFSVHSQKLGQGFLDCQREKIYS 246
Query 308 EDPYAQIEKKIDSVVNGRCYNFNGVSVW 335
P I I V+NG+ F+ VW
Sbjct 247 LTPENFIRSSI--VLNGKYKEFD---VW 269
>gi|575094298|emb|CDL65688.1| unnamed protein product [uncultured bacterium]
Length=478
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/242 (27%), Positives = 111/242 (46%), Gaps = 29/242 (12%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI 73
C ++ I N+YTG ++ V CG+C C+ ++A ++ +++ S+ +FVTL YDN HI
Sbjct 2 CINKREIRNKYTGQKLYVSCGKCPACLQEKANASAYKIRNNQSSELSCFFVTLNYDNNHI 61
Query 74 PLMRCKVLHSEYEDVVGISGDIHFGDEYHHY-IPVSEYQCDDSSALRHIFFEQVQGTVP- 131
P++ H Y S HF +E +PV Y +G P
Sbjct 62 PVI---FKHDVYN--YNSSDVYHFDEERKELCLPVDLY----------------RGVCPA 100
Query 132 YDREIKEY-VPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQN 190
+ +I + P+ LS D + S + + KT + + + D+Q
Sbjct 101 FSNKIDTFNFPLNR---LSTDVVSSLDNHCGVVVKTKNHKPVLFNEE-IFSVCYTKDIQL 156
Query 191 YIKRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVA-EVLRQCHDKSW 249
+ KRLR+ L++ G + ++ EYGP +R HFHL +F E++ + R+ K+W
Sbjct 157 FFKRLRQSLYRKFGFRPFIQYFQTSEYGPTTYRAHFHLCIFVKRSEISFDSFRKACVKAW 216
Query 250 KF 251
F
Sbjct 217 PF 218
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/242 (28%), Positives = 103/242 (43%), Gaps = 49/242 (20%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI 73
C H + N+YTG I V CG C C+ +RA K S + KY F TLTY N+++
Sbjct 11 CYHPRHVQNKYTGEVIQVGCGVCKACLKRRADKMSFLCAIEEQSHKYCMFATLTYSNDYV 70
Query 74 PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYD 133
P M Y +V ++ Y + CD + + TV YD
Sbjct 71 PRM--------YPEV---DNELRLVRWYSY--------CDRLNEKGKLM------TVDYD 105
Query 134 REIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIK 193
+ HK +L + + D + + + D Q ++K
Sbjct 106 ----------------------YWHKCPSLDTYVLMLTAKCNLDGYLSYTSKRDAQLFLK 143
Query 194 RLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGR 253
R+RK L K S E + +Y V EYGP FR H+H+L F + + +V+ + ++W+FGR
Sbjct 144 RVRKNLSKY--SDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVIRQAWQFGR 201
Query 254 SD 255
D
Sbjct 202 VD 203
>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 125/290 (43%), Gaps = 29/290 (10%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMR-VKTAGSAFKYSYFVTLTYDNEH 72
C +R +TN+Y G VDCG C C+ ++A K+ + + G + + FVTLTYDNEH
Sbjct 6 CTNRIKVTNKYVGRSFYVDCGHCPSCLQRKANKSCCKIINEYGRPYSFMCFVTLTYDNEH 65
Query 73 IPLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPY 132
IP IH +Y H Y S E + V
Sbjct 66 IPY-------------------IHPDTDYSHLYVGKSYYVRHSRIFDKDGVENLPLGVYR 106
Query 133 DREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYI 192
+ ++ + V + + + + R+++ T + DN + L D N++
Sbjct 107 NGKLIDTVFLPE---MPKEVFRNYLCNTTGIVTKSRNGVVLERDDNKVGILYDKDFVNFV 163
Query 193 KRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVA-EVLRQCHDKSWKF 251
KRLR L + + ++ EYGP RPHFH + + +S ++ + R +SWK
Sbjct 164 KRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESWKM 223
Query 252 GRSDFQ----RsaggsasyvssyvNSLCSAPLLYRSCRAFRPKSRASVGF 297
D Q A A+YV+SYVN L S P L+ + RPK S GF
Sbjct 224 CDKDKQYENVEIAREPATYVASYVNCLTSVPPLFLF-KGLRPKHSHSKGF 272
>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 97/348 (28%), Positives = 143/348 (41%), Gaps = 89/348 (26%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI 73
C + N Y + V CG+C C +A + ++++ S K+ F TLTY N +I
Sbjct 11 CLEPKRVFNPYLNDWLLVPCGKCRACQCSKASRYKLQIQLEASQHKFCIFGTLTYANTYI 70
Query 74 PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYD 133
P + +P ++ F V G D
Sbjct 71 PRLSL--------------------------VPYNDKT-----------FGVVNGYEMCD 93
Query 134 REIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIK 193
+E EY+ D+ S D + S + K G +P+L D+Q +IK
Sbjct 94 KETGEYLGYLDS--PSYD-VESLLDKLHLFGD--------------VPYLRKRDLQLFIK 136
Query 194 RLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNS------------------- 234
RLRK L K S + ++A+GEYGPVHFRPH+H LLF +
Sbjct 137 RLRKNLSKY--SDAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFTAPSGHTLGEFPDWAW 194
Query 235 ---------DEVAEVLRQCHDKSWKFGRSDFQRsaggsasyvssyvNSLCSAPLLYR--S 283
++ V+ C SWKFGR D Q S G +A YVSSYV+ S P +Y+ S
Sbjct 195 YDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSYVSGSGSLPKVYQVSS 254
Query 284 CRAFRPKSR-ASVGFFEKGCDFVEDEDPYAQIEKKIDSVVNGRCYNFN 330
R F SR GF C+ V + +++ ++ +NG +FN
Sbjct 255 ARPFSLHSRFLGQGFLAHECEKVYETPVRDFVKRSVE--LNGSNKDFN 300
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/190 (34%), Positives = 95/190 (50%), Gaps = 15/190 (8%)
Query 157 IHKTQALGKTDYPVAE------QYGRDNLIPFLNYVDVQNYIKRLRKYLFKVLGSYESLH 210
+ + LG+ D + E ++ +P+L D+Q + KR R Y+ K E +
Sbjct 12 VETGEYLGEADLSIKEIERLQEKFHLFGYLPYLRKFDLQLFFKRFRYYVAKRFPK-EKVR 70
Query 211 FYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGRSDFQRsaggsasyvssy 270
++A+GEYGPVHFRPH+H+LLF SDE +V + ++W FGR D Q S G +SYV+ Y
Sbjct 71 YFAIGEYGPVHFRPHYHILLFLQSDEALQVCSKVVSEAWPFGRVDCQLSKGKCSSYVAGY 130
Query 271 vNSLCSAP---LLYRSCRAFRPKSRASVGFFEKGCDFVEDEDPYAQIEKKIDSVVNGRCY 327
VNS P L C + GF + V P +++ I V+NGR
Sbjct 131 VNSSVLVPKVLTLPTLCPFCVHSQKLGQGFLQSERAKVYSLTPEQFVKRSI--VINGRYK 188
Query 328 NFNGVSVWST 337
F+ VW +
Sbjct 189 EFD---VWRS 195
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 106/248 (43%), Gaps = 51/248 (21%)
Query 26 GARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHIPLMRCKVLHSEY 85
G V CG+C C + + S++++ KY YF+TLTYD++++PL
Sbjct 19 GYPYQVPCGKCIACHNNKRSSLSLKLRLEEYTSKYCYFLTLTYDDDNLPLFS-------- 70
Query 86 EDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYDREIKEYVPVKDN 145
VG+ E+ P SE +DS + +D + + + +
Sbjct 71 ---VGLDT---CATEFVRIYPYSERLRNDS-----FISDFCSDLHNFDNDFVDKMDYYSD 119
Query 146 WFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIKRLRKYLFKVLGS 205
+ ++ + S HK+ G L L Y D+Q ++KRLRK+++K G
Sbjct 120 YVINYE---SKYHKSCVYGH------------GLYALLYYRDIQLFLKRLRKHIYKYYG- 163
Query 206 YESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKS---------------WK 250
E + FY +GEYG RPH+H LLF NS +++ C + W+
Sbjct 164 -EKIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDCVNVGTTSRPCSCPRFLRPFWQ 222
Query 251 FGRSDFQR 258
FG D +R
Sbjct 223 FGICDSKR 230
>gi|575095229|emb|CDL66433.1| unnamed protein product [uncultured bacterium]
Length=510
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/278 (25%), Positives = 124/278 (45%), Gaps = 35/278 (13%)
Query 20 ITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGS-AFKYSYFVTLTYDNEHIPLMRC 78
I NRY CG+C C+ ++ K + S A F+ LTYD EH+PL+R
Sbjct 20 IGNRYFA------CGRCSACLLAKSNKNRYNLTLELSNATTKCCFIMLTYDKEHLPLVR- 72
Query 79 KVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYDREIKE 138
IS H D ++ P+++ + + + FF Q+ Y++++ +
Sbjct 73 ------------ISK--HDFDAMYYKKPINKPEYEKRN-----FFCQLS----YEKQLSK 109
Query 139 YVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIKRLRKY 198
+ + + L ++ Y + ++P L YVDV ++KRLR
Sbjct 110 ITSLSNRKVFKSAYSSQSGYSMSTLFESGYNNSVHTDCYYMLPTLRYVDVSGFLKRLRTR 169
Query 199 LFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGRSDFQR 258
+ + +G ++ F A GEYGP FRPH+H+++ S+ + + + + W +G S +
Sbjct 170 VQREIGE-SNIRFAACGEYGPRGFRPHYHIIVICQSEAARQSVMRNYRTCWLYGLSS-AK 227
Query 259 saggsasyvssyvNSLCSAPLLYR--SCRAFRPKSRAS 294
S + N + +PLL + + + FRP R+S
Sbjct 228 LYIKSKNSADYVSNYVTCSPLLPKLYTYKPFRPFFRSS 265
>gi|490477382|ref|WP_004347759.1| hypothetical protein [Prevotella buccalis]
gi|281300711|gb|EFA93042.1| hypothetical protein HMPREF0650_1078 [Prevotella buccalis ATCC
35310]
Length=582
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 111/258 (43%), Gaps = 45/258 (17%)
Query 5 PDLLKAADHCQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFV 64
P LK +C + + N + C +C C++++A S R + KYS F
Sbjct 4 PSHLKIVGNCLNPRKVYNPSLHGWMYCSCDKCTACLNQKATTLSNRARAEIEQHKYSVFF 63
Query 65 TLTYDNEHIPLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFE 124
TLTYDNEH+P +YE + D +E Y P+ DDSS+ +
Sbjct 64 TLTYDNEHLP---------KYE----VFQD---SNEVIQYRPIGRL-VDDSSS------D 100
Query 125 QVQGTVPYDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLN 184
+ + P I+ ++ + Q T P E Y +
Sbjct 101 MLSNSCP------------------INKYNNYENLYQFDESTFIPPIENYEDIYHFGVVC 142
Query 185 YVDVQNYIKRLRKYLFKV--LGSYES-LHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVL 241
D+QN++KRLR + K+ + ES + +Y EYGP +RPH+H +LF +S ++ + +
Sbjct 143 KKDIQNFLKRLRWRISKIPNITKDESKIRYYISSEYGPTTYRPHYHGILFFDSKKILDKI 202
Query 242 RQCHDKSW-KFGRSDFQR 258
+ SW K+ R +R
Sbjct 203 KSLIVMSWGKYERQQGER 220
>gi|517172763|ref|WP_018361581.1| hypothetical protein [Prevotella nanceiensis]
Length=598
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/240 (25%), Positives = 98/240 (41%), Gaps = 50/240 (21%)
Query 14 CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI 73
C S+I N+ TG V C C YC++ A K S RV+ +S TLTYDN +I
Sbjct 26 CLSPSYIYNKNTGHYETVPCHNCTYCVNVEASKQSRRVREEIKQHLFSVMFTLTYDNVYI 85
Query 74 PLMRCKV-LHSEYE-DVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVP 131
P M H E + +G + D+H ++ +Y+ +D + +
Sbjct 86 PRMEAFAGKHGEMQLKPIGRTADLHDSCPFNSKNYNGDYRFNDDTRI------------- 132
Query 132 YDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNY 191
+I + K + A +D +QN+
Sbjct 133 -----------------------PWIENNKIYCKNNLQFATVSKKD----------IQNF 159
Query 192 IKRLRKYLFK--VLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSW 249
+KRLRK + K + + + + ++ EYGP +RPH+H +LF +S V ++ +SW
Sbjct 160 LKRLRKKIDKLNIPQNEKKIRYFIASEYGPKTYRPHYHGVLFIDSPTVLSKIKAFIVESW 219
Lambda K H a alpha
0.325 0.139 0.427 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4086221757660