bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-6_CDS_annotation_glimmer3.pl_2_3
Length=357
Score E
Sequences producing significant alignments: (Bits) Value
gi|496050831|ref|WP_008775338.1| predicted protein 359 4e-118
gi|490418711|ref|WP_004291034.1| hypothetical protein 288 1e-90
gi|547226428|ref|WP_021963491.1| putative uncharacterized protein 114 3e-25
gi|494822881|ref|WP_007558289.1| hypothetical protein 84.7 3e-15
gi|575094358|emb|CDL65740.1| unnamed protein product 74.3 1e-11
gi|575094319|emb|CDL65706.1| unnamed protein product 58.5 2e-06
gi|492501772|ref|WP_005867312.1| hypothetical protein 41.2 0.82
gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 40.4 1.1
gi|575094301|emb|CDL65691.1| unnamed protein product 39.3 4.0
gi|565841291|ref|WP_023924572.1| hypothetical protein 38.9 4.6
>gi|496050831|ref|WP_008775338.1| predicted protein [Bacteroides sp. 2_2_4]
gi|229448895|gb|EEO54686.1| hypothetical protein BSCG_01611 [Bacteroides sp. 2_2_4]
Length=381
Score = 359 bits (921), Expect = 4e-118, Method: Compositional matrix adjust.
Identities = 196/330 (59%), Positives = 254/330 (77%), Gaps = 9/330 (3%)
Query 34 IAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNSASAQRE 93
IAQ NN FNE+MLQKQMDYNT Y QQ+ DQW FYNDAKQN+WDMFNA N+YNSASAQRE
Sbjct 41 IAQMNNEFNERMLQKQMDYNTLAYDQQVSDQWSFYNDAKQNAWDMFNATNEYNSASAQRE 100
Query 94 RLEAAGLNPYLMMSGGNagtataqsspqasspsaqgVTPPTATPYSADYSGITQGLGMVL 153
R EAAGLNPY+MM+ G+AGTA A S+ A++P+ QG+TPPTA+PYSADYSGI QGLG +
Sbjct 101 RYEAAGLNPYVMMNTGSAGTAAATSATSATAPTKQGITPPTASPYSADYSGIMQGLGQAI 160
Query 154 DKIATQPDRDVKSAEADNLRIEGKYKAAKTIAEIVQMRTNAKTQEGRLALDKLIYSIDKD 213
D++++ PD+ AE NL+IEGKYKAA+ IA I ++ + +++ ++AL+KL+YSI KD
Sbjct 161 DQLSSIPDKAKTIAETGNLKIEGKYKAAEAIARIANIKADTHSKKEQVALNKLMYSIQKD 220
Query 214 LKSSQMDVNRESIANMQAERKLTNVQTLLVDKQLSWMDAQSKMDLAQKAADIQLKYAQGA 273
L SS M VN ++IANM+AE K N+QTL+ DKQLS+MDA KM+LA+KAA+IQLK AQGA
Sbjct 221 LASSTMAVNSQNIANMRAEEKFKNIQTLIADKQLSFMDATQKMELAEKAANIQLKLAQGA 280
Query 274 LTRKQVDHEIAKIAETEVRTSLDIQEQTTNVLKQ--------QGMRQENSFNEATFDNRV 325
LTR Q HEI KI+ETE RT+L I EQT+ ++Q Q RQ+N F+ T++ RV
Sbjct 281 LTRNQAAHEIKKISETEARTTL-INEQTSLTIEQNTGQQLQNQAQRQQNRFDADTYNVRV 339
Query 326 KSVKESLWNLMHEADSYGLSKTIGRVIRPL 355
K+++ESL+N++ E D G KT+G+ IR +
Sbjct 340 KTLEESLFNIVFETDKLGAVKTVGKGIRAV 369
>gi|490418711|ref|WP_004291034.1| hypothetical protein [Bacteroides eggerthii]
gi|217986638|gb|EEC52972.1| hypothetical protein BACEGG_02723 [Bacteroides eggerthii DSM
20697]
Length=368
Score = 288 bits (737), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 223/341 (65%), Gaps = 30/341 (9%)
Query 28 NKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSW----------- 76
N+ N IAQ NNAFNEKM KQ+ YN +MYQ QLGDQW+FY+D K N+W
Sbjct 31 NQANKEIAQMNNAFNEKMFDKQIAYNKEMYQTQLGDQWKFYDDQKANAWKLYEDNKAYQT 90
Query 77 DMFNAANDYNSASAQRERLEAAGLNPYLMMSGGN-----agtataqsspqasspsaqgVT 131
+M+N N+YN SAQR RLEAAGLNPY+MM+GG+ + + T S+P A SPSAQGV
Sbjct 91 EMWNKQNEYNDPSAQRARLEAAGLNPYMMMNGGSAGVAGSVSGTQGSAPSAGSPSAQGVQ 150
Query 132 PPTATPYSADYSGITQGLGMVLDKIATQPDRDVKSAEADNLRIEGKYKAAKTIAEIVQMR 191
PPTATPYSADYSG+ QGLG +D I T R++++A+ADNLRIEGKY A+K IAE+ +
Sbjct 151 PPTATPYSADYSGVMQGLGHAIDTIMTGSQRNIQNAQADNLRIEGKYIASKAIAELYKTY 210
Query 192 TNAKTQEGRLALDKLIYSIDKDLKSSQMDVNRESIANMQAERKLTNVQTLLVDKQLSWMD 251
AK + R+A+ +++ SI KDL +SQ+ VN E++ +QA+ K+ + LL ++QL ++
Sbjct 211 NEAKNDDERVAIQRVLSSIQKDLSASQVAVNNENVRQIQAQTKIAVTENLLREQQLKFLP 270
Query 252 AQSKMDLAQKAADIQLKYAQGALTRKQVDHEIAKIAETEVRTSLDIQEQTTNVLKQQGMR 311
+ + LA AADI LKYAQ LT KQ HEI K+AET VR + G
Sbjct 271 YEQRTQLALGAADIALKYAQKNLTEKQARHEIEKLAETIVRAN--------------GQA 316
Query 312 QENSFNEATFDNRVKSVKESLWNLMHEADSYGLSKTIGRVI 352
+N ++ T+ +RVK VKESL+N +++ D G+ KT+ R
Sbjct 317 MQNQYDAETYRDRVKLVKESLFNAIYDTDKVGIFKTMSRAF 357
>gi|547226428|ref|WP_021963491.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103380|emb|CCY83991.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=416
Score = 114 bits (285), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/368 (31%), Positives = 174/368 (47%), Gaps = 56/368 (15%)
Query 28 NKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLG------DQWQFYNDAKQNSWDMFNA 81
NK N IAQ NN +NE+M KQ++YN M+ QQ+ +Q +N QN + A
Sbjct 30 NKTNLQIAQMNNEYNERMFNKQLEYNQDMFNQQVEYDQKKMEQQNNFNARMQN--EAIGA 87
Query 82 ANDYNSASAQRERLEAAGLNPYLMMSGGNagtataqsspqasspsaqg--VTPPTAT--- 136
YNSA AQR RLEAAGLNPYLMMSGGNAG +A S S S V PPTA+
Sbjct 88 QQVYNSAKAQRARLEAAGLNPYLMMSGGNAGAVSAVSGSSGSGGSPSPMGVNPPTASSAV 147
Query 137 --PYSADYSGITQGLGMVLDKIATQPDRDVKS----AEADNLRIEGKYKAAKTIAEIVQM 190
+ D+SG+T + +LD A + RD ++ +A +IE KYKA K + +I
Sbjct 148 MQAFRPDFSGVTGIIQTLLDIQAQKGVRDAQAFSLGEQASGFKIENKYKAEKLLWDIYNS 207
Query 191 RTNAKTQEGRLALDKLIYSIDKDLKSSQMDVNRESIANMQAERKLTNVQT-------LLV 243
+ + + + +L+ + ++ + + SS + + N Q +L QT LL
Sbjct 208 KADYNLKNSQESLNNMSFARLQAMFSSDVSKAQREAENAQFTGELIRAQTACQQLQGLLG 267
Query 244 DKQLSWMDAQSKMDLAQKAADIQLKYAQGALTRKQVDHEIAKIAETEVRTSLDIQEQTTN 303
K+L + D + +LA +A A G + Q A + +L++ EQ
Sbjct 268 AKELKYYDQKVLQELAIMSAQQYSLVAAGKASEAQ--------ARQAIENALNLVEQ--- 316
Query 304 VLKQQGMRQENSFNEATFDNRVKSVKE----SLWN------------LMHEADSYGLSKT 347
++G++ +N + T + +K+ + S WN + ++ S G +K
Sbjct 317 ---REGIKVDNYVKQKTANALIKTARNNCNTSYWNSKTAHNQSLRPSVFEDSFSQGFNKF 373
Query 348 IGRVIRPL 355
I I PL
Sbjct 374 INTYIAPL 381
>gi|494822881|ref|WP_007558289.1| hypothetical protein [Bacteroides plebeius]
gi|198272097|gb|EDY96366.1| hypothetical protein BACPLE_00802 [Bacteroides plebeius DSM 17135]
Length=344
Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/319 (27%), Positives = 147/319 (46%), Gaps = 41/319 (13%)
Query 28 NKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNS 87
N+ N IAQ +N +N + L++Q++ WDM+NA N+YNS
Sbjct 44 NQANIQIAQMSNEYNREQLERQIE----------------------QEWDMWNAENEYNS 81
Query 88 ASAQRERLEAAGLNPYLMMSGGNagtataqsspqasspsaqgVTPPTATPYSADYSGITQ 147
AS+QR+RLE AGLNPY+MM GG+AG+A++ +SP A + T P AD SG++
Sbjct 82 ASSQRKRLEEAGLNPYMMMDGGSAGSASSMTSPAAQPAVVPQMQGATMQP--ADMSGLSG 139
Query 148 GLGMVLDKIAT-QPDRDVKSAEADN--LRIEGKYKAAKTIAEIVQMRTNAKTQEGRLALD 204
G+ + IAT + D++ + N IE +YKA K +A++ + RT + +
Sbjct 140 LRGIASEFIATLKAQEDIRGQQLINEGQEIENQYKADKLLADLEKTRTESGFVRSQTKGQ 199
Query 205 KLIYSIDKDLKSSQMDVNRESIANMQAERKLTNVQTLLVDKQLSWMDAQSKMDLAQKAAD 264
++ ++ SS++ + Q + L + + Q K + ++
Sbjct 200 DIMNRFRPEMLSSEIRQRKTDTMFTQLRAHGQMLANLSAYQWYKVLPQQIKQTINEQMVR 259
Query 265 IQLKYAQGALTRKQVDHEIAKIAETEVRTSLDIQEQTTNVLKQQGMRQENSFNEATFDNR 324
I QG LT+ Q++ EI K T +K +Q+ F ++ +R
Sbjct 260 INNMKLQGNLTQAQINTEINKA--------------VTEFMKGAREQQQFDFESDSYKDR 305
Query 325 VKSVKESLWNLMHEADSYG 343
+ +K L + ++ + G
Sbjct 306 LDQIKADLRHAIYNSGPEG 324
>gi|575094358|emb|CDL65740.1| unnamed protein product [uncultured bacterium]
Length=328
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 124/260 (48%), Gaps = 29/260 (11%)
Query 28 NKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNS 87
N N IAQ NN ++E+M++KQM YNT+M+++ DYNS
Sbjct 27 NSTNMQIAQMNNEWSERMMEKQMAYNTEMWEK----------------------VADYNS 64
Query 88 ASAQRERLEAAGLNPYLMMSGGNagtataqsspqasspsaqgVTPPTATPYSADYSGITQ 147
+ ++ AG+NPY+ +SG G+ +A S+ S PS V A P D+S ++
Sbjct 65 LPNKMQQARDAGVNPYMALSGNAFGSISAPSANSVSLPSPSQV---QAQPAQYDFSSVSN 121
Query 148 GL--GMVLDKIA--TQPDRDVKSAEADNLRIEGKYKAAKTIAEIVQMRTNAKTQEGRLAL 203
+ GM L + A + + A D LRIE KY A K ++EI + N K + +
Sbjct 122 SIIAGMDLFQKAQLMKSQQSNIDASTDQLRIENKYHAMKLVSEIAEKMANTKDSQAKAVY 181
Query 204 DKLIYSIDKDLKSSQMDVNRESIANMQAERKLTNVQTLLVDKQLSWMDAQSKMDLAQKAA 263
++I + + +++ ++++NM+ + ++ + +QL + Q + L A+
Sbjct 182 QQIINEYAEQGIKTDLEIKNQTLSNMKETFRGLVLENAMTSEQLRFFPEQVRAQLGLTAS 241
Query 264 DIQLKYAQGALTRKQVDHEI 283
I L + L+++++ I
Sbjct 242 QILLNQSNSKLSQQKMVESI 261
>gi|575094319|emb|CDL65706.1| unnamed protein product [uncultured bacterium]
Length=396
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/336 (25%), Positives = 125/336 (37%), Gaps = 89/336 (26%)
Query 28 NKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNS 87
N+ N IA QNN FNE+M +N N+YN
Sbjct 60 NQANREIADQNNKFNERM---------------------------------WNLQNEYNR 86
Query 88 ASAQRERLEAAGLNPYLMMSGGNagtataqsspqasspsaqgVTPPTATPYSADYSGITQ 147
QR RLEAAGLNPYLMM GG+ A S + S + P + Y Q
Sbjct 87 PDMQRARLEAAGLNPYLMMDGGS---AGIAESAPTADTSGTQIAPDIGNTIAGGY----Q 139
Query 148 GLGMVLDKIATQPDRDVKSAEADNLRIEGKYKAAKTIAEIVQMRTNAKTQEGRLALDKLI 207
+G + A+Q + D+L+ K AKT+AE + E R
Sbjct 140 AMGNSISSAASQI---AQMTFQDDLQ---KANVAKTVAEAKNAHLQNQFDELRNEFAVAN 193
Query 208 YSIDKDLKSSQMDVN-------RESIANMQAERK------------------LTNVQTLL 242
+ ++ LK Q D++ R+S+ + K LT+VQ +
Sbjct 194 FLVNLRLKQKQGDISDYEANYLRDSMQDRLDSVKFQNTLSGSQSSYYSQMAGLTDVQRQI 253
Query 243 VDKQLSWMDAQSKMDLAQKAADIQLKYAQGALTRKQVDHEIAKIAETEVRTSLDIQEQTT 302
L W+ + + LA +I+ ++ L Q + A +
Sbjct 254 EQTNLDWLPQEKQAGLAATLQNIRTMVSEMGLNYAQAKNAFAMA--------------SL 299
Query 303 NVLKQQGMRQENSFNEATFDNRVKSVKESL----WN 334
N ++G+R +N E+TFD VK K ++ WN
Sbjct 300 NYANEEGLRIDNRLKESTFDLSVKLAKNTVNSEYWN 335
>gi|492501772|ref|WP_005867312.1| hypothetical protein [Parabacteroides distasonis]
gi|409230405|gb|EKN23269.1| hypothetical protein HMPREF1059_03254 [Parabacteroides distasonis
CL09T03C24]
Length=288
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/70 (39%), Positives = 37/70 (53%), Gaps = 0/70 (0%)
Query 40 AFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNSASAQRERLEAAG 99
A N K +Q N ++ + Q Q Q A Q S +M+N N+YNS + Q R+ AAG
Sbjct 22 AMNNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLNMWNLQNEYNSPTQQMARIRAAG 81
Query 100 LNPYLMMSGG 109
LNP L+ G
Sbjct 82 LNPNLVYGNG 91
>gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649557306|gb|KDS63785.1| hypothetical protein M095_3404 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649559158|gb|KDS65545.1| hypothetical protein M096_4689 [Parabacteroides distasonis str.
3999B T(B) 6]
gi|649560567|gb|KDS66875.1| hypothetical protein M095_2448 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561016|gb|KDS67303.1| hypothetical protein M095_2410 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649562727|gb|KDS68911.1| hypothetical protein M096_3341 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=288
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/70 (39%), Positives = 36/70 (51%), Gaps = 0/70 (0%)
Query 40 AFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDYNSASAQRERLEAAG 99
A N K +Q N ++ + Q Q Q A Q S M+N N+YNS + Q R+ AAG
Sbjct 22 AMNNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLKMWNLQNEYNSPTQQMARIRAAG 81
Query 100 LNPYLMMSGG 109
LNP L+ G
Sbjct 82 LNPNLVYGNG 91
>gi|575094301|emb|CDL65691.1| unnamed protein product [uncultured bacterium]
Length=437
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 22/29 (76%), Gaps = 0/29 (0%)
Query 78 MFNAANDYNSASAQRERLEAAGLNPYLMM 106
M+ NDYN+ AQ++RLE AG+NPY+ M
Sbjct 69 MWKDTNDYNTPIAQKQRLEQAGMNPYVNM 97
>gi|565841291|ref|WP_023924572.1| hypothetical protein [Prevotella nigrescens]
gi|564729909|gb|ETD29853.1| hypothetical protein HMPREF1173_00035 [Prevotella nigrescens
CC14M]
Length=396
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 88/209 (42%), Gaps = 32/209 (15%)
Query 26 AGNKGNASIAQQNNAFNEKMLQKQMDYNTQMYQQQLGDQWQFYNDAKQNSWDMFNAANDY 85
+ N N IA++ NA N +M+Q Q ++N +M +Q N+Y
Sbjct 29 SANSTNLRIARETNAANFQMMQYQNEFNQKMLDKQ----------------------NEY 66
Query 86 NSASAQRERLEAAGLNPYLMMSGGNagtataqsspqasspsaqgVTPPTATPYSADYSGI 145
QR+R E AG+NPY +S ++GT P+ P A +
Sbjct 67 ALPINQRKRFEDAGINPYFALSQISSGTPQGALQSAQGHPAVAAQVQPVTAFGDALRDSV 126
Query 146 TQGL---GMVLDKIATQPDRDVKSAEADNLRIEGKYKAAKTIAEIVQMRTNAKTQEGRLA 202
+ G+ G ++ TQ +A+ +E ++KAA ++ I + K+ +
Sbjct 127 SHGVNTYGQLMQAKYTQQ-------QAEGQSLENRFKAATLLSRIDGEKAKNKSLTYNMM 179
Query 203 LDKLIYSIDKDLKSSQMDVNRESIANMQA 231
+D L + K + ++M + S+A M+A
Sbjct 180 MDGLRADLMKYVNGNEMKKSDLSVAQMEA 208
Lambda K H a alpha
0.311 0.125 0.341 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2134211136096