bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-25_CDS_annotation_glimmer3.pl_2_5
Length=242
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 180 3e-49
gi|575094354|emb|CDL65742.1| unnamed protein product 177 5e-48
gi|490418709|ref|WP_004291032.1| hypothetical protein 174 9e-47
gi|496050829|ref|WP_008775336.1| hypothetical protein 162 8e-43
gi|494822885|ref|WP_007558293.1| hypothetical protein 153 2e-39
gi|575094321|emb|CDL65708.1| unnamed protein product 114 3e-25
gi|494308783|ref|WP_007173938.1| hypothetical protein 85.9 9e-16
gi|647452987|ref|WP_025792807.1| hypothetical protein 84.7 2e-15
gi|575094297|emb|CDL65693.1| unnamed protein product 81.6 2e-14
gi|565841287|ref|WP_023924568.1| hypothetical protein 81.3 4e-14
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 180 bits (457), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 142/244 (58%), Gaps = 25/244 (10%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
S+ +L+++KN +R+GFDLS K AFTAKVGELLP+ PGDKF+++ Q FTRTQPVN
Sbjct 3 SVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPVN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSNMPCLSADQI 121
++AY+R+REYYD+++ P LLW AP + + + HA+ SV L P + I
Sbjct 63 SAAYSRLREYYDFYFVPYRLLWNMAPTFFTNM-PDPHHAADLVSSVNLSQRHPWFTFFDI 121
Query 122 SQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDV 173
+ L L S ++N+FGF R +L+ KLL YL YG G + S D+
Sbjct 122 MEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG---FGKDYESVKVPSDSDDI 178
Query 174 KDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGCCYYLAY 233
LS FP+LAY+K C+DYFR QWQ +APY +N+DY GK +++
Sbjct 179 -------------VLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGFHIPM 225
Query 234 KSYV 237
S+
Sbjct 226 SSFT 229
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 177 bits (450), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 94/226 (42%), Positives = 138/226 (61%), Gaps = 18/226 (8%)
Query 4 FNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTS 63
+++ +KN P R+GFDLS K FTAK GELLPV + +PGD F++ + FTRTQP+NTS
Sbjct 1 MSMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTS 60
Query 64 AYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHAS--SFDGSVLLGSNMPCLSADQI 121
A+ R+REYYD+++ P +W I+Q+ NVQHAS + D + L MP +++QI
Sbjct 61 AFARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQI 120
Query 122 SQSLDQ--LKSKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSIDVKDGTYN 179
+ L+ +++N FGF+R+ L KLLQYL YG+ S D + T++
Sbjct 121 ADYLNDQATAARKNPFGFNRSTLTCKLLQYLGYGDYN-------------SFDSETNTWS 167
Query 180 -QNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDG 224
+ YN LS FP+LAY+K D++R TQW+ + P +N+DY G
Sbjct 168 AKPLLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKG 213
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 174 bits (440), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 94/243 (39%), Positives = 144/243 (59%), Gaps = 23/243 (9%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ +L S++N P R+GFDLS K FTAK GELLPV +PGD F + + FTRTQPVN
Sbjct 3 NIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQPVN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGS--VLLGSNMPCLSAD 119
T+A+ R+REYYD+F+ P LLW A V++Q+ N QHA S D + +L MP ++++
Sbjct 63 TAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMTSE 122
Query 120 QISQSLDQLKS-------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSID 172
I+ ++ L + K NYFG++R+ + KLL+YL YGN + + ++ T+
Sbjct 123 AIASYINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFL----TDDWNTAPL 178
Query 173 VKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGCCYYLA 232
+ + +N IF +LAY+K D++R +QW+ +P +N+DY DG A
Sbjct 179 MANLNHN----------IFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSMNLDNA 228
Query 233 YKS 235
Y +
Sbjct 229 YST 231
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 162 bits (411), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 95/234 (41%), Positives = 143/234 (61%), Gaps = 23/234 (10%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ +L S++N R+GFDLSSK FTAK GELLPVK +PGDK+S+ + FTRTQP+N
Sbjct 3 NIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPLN 62
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSV--LLGSNMPCLSAD 119
T+A+ R+REYYD+++ P +LLW A V++Q+ N QHA+S+ S L MP ++
Sbjct 63 TAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVTCK 122
Query 120 QISQSLDQLKS--------KQNYFGFDRADLAYKLLQYLRYGNVRTGVGSNGARNYGTSI 171
I+ L+ + ++NYFG+ R+ KLL+YL YGN T Y TS
Sbjct 123 GIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYT---------YATS- 172
Query 172 DVKDGTYNQNR-AYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDG 224
K+ T+ ++ + N L+I+ +LAY+K D+ R +QW+ +P +N+DY G
Sbjct 173 --KNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSG 224
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 153 bits (386), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/239 (35%), Positives = 130/239 (54%), Gaps = 19/239 (8%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ ++ SV+N P R+G+DL+ K+ FTAK G L+PV WT +P D + + F RTQP+N
Sbjct 10 NIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPLN 69
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASS--FDGSVLLGSNMPCLSAD 119
T+A+ R+R Y+D+++ P +W P I+Q++ N+ HAS +V L +P +A+
Sbjct 70 TAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPYFTAE 129
Query 120 QISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGN----VRTGVGSNGARNYGTSIDVKD 175
Q++ + L +N FG+ RA L +L+YL YG+ + G GA
Sbjct 130 QVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGA----------- 178
Query 176 GTYNQNRAYNH-ALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDGKGCCYYLAY 233
T+ N+ S FP+ AY+K D+ R TQW+ S P +NIDY G L +
Sbjct 179 -TWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSADSLQLDF 236
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 114 bits (284), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/241 (34%), Positives = 118/241 (49%), Gaps = 26/241 (11%)
Query 2 SLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVN 61
++ L +KN P R+ FDLS + FTAKVGELLP PGD + +FTRT P+
Sbjct 6 NIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQ 65
Query 62 TSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQH------ASSFDGSVLLGSNMPC 115
++A+TR+RE +F+ P LW+ + + +N ASS G+ + + MPC
Sbjct 66 SNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMPC 125
Query 116 LSADQISQSLDQLKSKQNYFGFD------------RADLAYKLLQYLRYGNVRTGVGS-- 161
++ + L + ++ G D R + KLLQ L YGN +
Sbjct 126 VNYKTLHAYLLKFINRST-VGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFK 184
Query 162 -NGARNYGTSIDVKDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNID 220
N ++ + + KD TYN N Y LSIF +LAY K C D++ QWQ L N+D
Sbjct 185 VNNDKHNQSGQNFKDVTYN-NSPY---LSIFRLLAYHKICNDHYLYRQWQPYNASLCNVD 240
Query 221 Y 221
Y
Sbjct 241 Y 241
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 85.9 bits (211), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 113/231 (49%), Gaps = 20/231 (9%)
Query 1 MSLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPV 60
+S+ + + + + R+ FDLS + FTA G LLPV +P D + Q F RT P+
Sbjct 3 VSIPKIKATRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPM 62
Query 61 NTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGSN---MPCLS 117
NT+A+ +R Y++F+ P H LW + I+ + N H+S+ + S+ G++ +P +
Sbjct 63 NTAAFASMRGVYEFFFVPYHQLWAQFDQFITGM--NDFHSSA-NKSIQGGTSPLQVPYFN 119
Query 118 ADQISQSLDQLKSKQNYFGFDRADLAYKLLQYLRYGNVR----TGVGSNGARNYGTSIDV 173
D + SL+ K + DL YK +YG R G G ++GT+
Sbjct 120 VDSVFNSLNTGKESGSG---STDDLQYK----FKYGAFRLLDLLGYGRK-FDSFGTAYPD 171
Query 174 KDGTYNQNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYYDG 224
N YN S+F ILAY K QDY+R + +++ +N D + G
Sbjct 172 NVSGLKNNLDYN--CSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKG 220
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 110/223 (49%), Gaps = 20/223 (9%)
Query 13 PR--RSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVRE 70
PR R+GFDLSS+ F+AK G+LLP+ P + F Q RT +NT++Y R++E
Sbjct 6 PRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMKE 65
Query 71 YYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDGSVLLGS---NMPCLSAD--QISQSL 125
YY +F+ LW+ + I + N H S+ +G G+ N C S + + +
Sbjct 66 YYHFFFVSYRSLWQWFDQFI--VGTNNPH-SALNGVKKNGTTNYNQICSSVPTFDLGKLI 122
Query 126 DQLK-SKQNYFGFDRADLAYKLLQYLRYGNVRTGVGSN-----GARNYGTSIDVKDGTYN 179
+LK S + GF+ ++ A KLL L YG G N + +Y S D K+ +
Sbjct 123 TRLKTSDMDSQGFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITSTSYLPSKDDKEPS-- 180
Query 180 QNRAYNHALSIFPILAYKKFCQDYFRLTQWQDSAPYLWNIDYY 222
Y +S F +LAY+K D++R W S +N+D Y
Sbjct 181 --SIYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDY 221
>gi|575094297|emb|CDL65693.1| unnamed protein product [uncultured bacterium]
Length=630
Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/213 (30%), Positives = 101/213 (47%), Gaps = 15/213 (7%)
Query 15 RSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPVNTSAYTRVREYYDW 74
R+ FD+S + FT+ VG+LLPV + + PGDK S+K T+TQP+ + + +V E D+
Sbjct 18 RNVFDMSQTLGFTSSVGQLLPVFYDVLNPGDKISIKSLFVTKTQPMQSDNFAKVTENVDY 77
Query 75 FWCPLHLLWRNAPEVISQIQQ-NVQHASSFDGSV-LLGSNMPCLSADQISQSLDQLKSKQ 132
F+ P ++ QI N S G++ L +++P S D +S L S+
Sbjct 78 FFVPFEQIYSLFGSFFYQIADFNSSLFSKKGGALDLTSTHLPLASFDGLSYEL--FSSQY 135
Query 133 NYFGFDRADLAYKLLQYLRYG-----NVRTGVGSNGARNYGTSIDVKDGTYNQNRAYNHA 187
+ + D + + YG N + G NY TS +Q + +
Sbjct 136 DIYSDDDDHIIFPNNTLDEYGVPNYFNHLRLMQLFGMSNYFTS------DASQPDQFKPS 189
Query 188 LSIFPILAYKKFCQDYFRLTQWQDSAPYLWNID 220
+++F LAY+K DY+RL W P +NID
Sbjct 190 INLFLPLAYQKIFNDYYRLDDWTAPDPTSYNID 222
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 60/105 (57%), Gaps = 0/105 (0%)
Query 1 MSLFNLSSVKNHPRRSGFDLSSKVAFTAKVGELLPVKWTLTMPGDKFSLKEQHFTRTQPV 60
MSLF L + + R+G+DLSS+ F+A G LLP+ PG+KF + Q R QP+
Sbjct 1 MSLFKLPAPHPNLNRNGYDLSSRRIFSAPAGALLPIATWEANPGEKFRISVQDLVRAQPL 60
Query 61 NTSAYTRVREYYDWFWCPLHLLWRNAPEVISQIQQNVQHASSFDG 105
NT+A+ R +EYY +F+ P LW+++ + + + S DG
Sbjct 61 NTAAFARCKEYYHFFFVPYKSLWQHSDRFFTGVTEGDSAFSKPDG 105
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 46/89 (52%), Gaps = 8/89 (9%)
Query 143 AYKLLQYLRYGNVRTG--VGSNGARNYGTSIDVKDGTYNQN-RAYNHALSIFPILAYKKF 199
A++LL +L YG G V N + GT VK+ + + + ++F +LAY++
Sbjct 219 AFRLLHFLGYGVDNNGFIVDFNASYAAGTGEIVKNVLAKKTYKLPDIKANVFRLLAYQRI 278
Query 200 CQDYFRLTQWQDSAPYLWNIDYYDGKGCC 228
D++R W+ + P ++N+D+ CC
Sbjct 279 YNDFYRNDLWEAAQPDVFNVDW-----CC 302
Lambda K H a alpha
0.322 0.136 0.438 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1029557279646