bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-19_CDS_annotation_glimmer3.pl_2_9
Length=228
Score E
Sequences producing significant alignments: (Bits) Value
gi|490418709|ref|WP_004291032.1| hypothetical protein 166 3e-44
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 166 4e-44
gi|575094354|emb|CDL65742.1| unnamed protein product 160 6e-42
gi|496050829|ref|WP_008775336.1| hypothetical protein 144 1e-36
gi|494822885|ref|WP_007558293.1| hypothetical protein 140 5e-35
gi|575094321|emb|CDL65708.1| unnamed protein product 105 2e-22
gi|494308783|ref|WP_007173938.1| hypothetical protein 100 1e-20
gi|575094339|emb|CDL65730.1| unnamed protein product 97.1 1e-19
gi|496521299|ref|WP_009229582.1| capsid protein 94.7 7e-19
gi|647452987|ref|WP_025792807.1| hypothetical protein 87.4 2e-16
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 166 bits (421), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 133/223 (60%), Gaps = 20/223 (9%)
Query 1 MSSLFSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQP 60
M+++ S I+N P R+GFDLS K FTAK GELLPV K LP D F I+ + F RTQP
Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAASSSASISF--NRNLPWID 118
V+T+AF RIREYY++FFVP LL+ +N + M + P +A S + +F + +P++
Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT 120
Query 119 LATINTAIGNVQSSASP-----NNFFGVLRSEGFKKLVSYLGYG----------ETSPEK 163
I + I N S+AS +N+FG RS+ KL+ YLGYG T+P
Sbjct 121 SEAIASYI-NALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNYESFLTDDWNTAP-- 177
Query 164 YVDNLRCSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWNG 206
+ NL + L AYQKIY D+YR SQWE+ P T+N D+ +G
Sbjct 178 LMANLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDG 220
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 166 bits (419), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 92/220 (42%), Positives = 134/220 (61%), Gaps = 12/220 (5%)
Query 1 MSSLFSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQP 60
MSS+ S +KN+ +R+GFDLS K AFTAKVGELLP+ K P DKFNI + F RTQP
Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAASSSASISFNRNLPWIDLA 120
V+++A++R+REYY+++FVP LL+ + +M + P++AA +S++ ++ PW
Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPD-PHHAADLVSSVNLSQRHPWFTFF 119
Query 121 TINTAIGNVQSSASP-----NNFFGVLRSEGFKKLVSYLGYGETSPEKYV------DNLR 169
I +GN+ S + NFFG R E KL++YL YG + V D++
Sbjct 120 DIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYGFGKDYESVKVPSDSDDIV 179
Query 170 CSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWNGEDS 209
S PL AYQKI +DY+R QW+ + P+ YN D+ G+ S
Sbjct 180 LSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSS 219
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 160 bits (404), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/214 (43%), Positives = 123/214 (57%), Gaps = 13/214 (6%)
Query 5 FSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPVDTS 64
S DIKN P R+GFDLS K FTAK GELLPV K LP D FNI+ F RTQP++TS
Sbjct 1 MSMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTS 60
Query 65 AFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAASSSA--SISFNRNLPWIDLATI 122
AF R+REYY+++FVP ++ + I M +A+ + + + +P+ I
Sbjct 61 AFARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQI 120
Query 123 NTAIGNVQSSASPNNFFGVLRSEGFKKLVSYLGYGET----------SPEKYVDNLRCSA 172
+ N Q++A+ N FG RS KL+ YLGYG+ S + + NL S
Sbjct 121 ADYL-NDQATAARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAKPLLYNLELSP 179
Query 173 LPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWNG 206
PL AYQKIY D+YR++QWEK+ P T+N D+ G
Sbjct 180 FPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKG 213
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 144 bits (364), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/224 (39%), Positives = 127/224 (57%), Gaps = 18/224 (8%)
Query 1 MSSLFSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQP 60
M+++ S ++N R+GFDLS+K FTAK GELLPV LP DK++I + F RTQP
Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQP 60
Query 61 VDTSAFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAAS--SSASISFNRNLPWID 118
++T+AF R+REYY+++FVP +LL+ +N + M + P +A S SA+ + +P +
Sbjct 61 LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNPQHATSYIPSANQALAGVMPNVT 120
Query 119 LATI----NTAIGNVQSSAS-PNNFFGVLRSEGFKKLVSYLGYGET-----------SPE 162
I N +V ++ S N+FG RS G KL+ YLGYG +
Sbjct 121 CKGIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYATSKNNTWTKS 180
Query 163 KYVDNLRCSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWNG 206
NL+ + + AYQKIY D+ R SQWEK P +N D+ +G
Sbjct 181 PLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSG 224
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 140 bits (353), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 77/223 (35%), Positives = 122/223 (55%), Gaps = 20/223 (9%)
Query 1 MSSLFSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQP 60
M+++ S ++N P R+G+DL+ K FTAK G L+PV+W LP D N + + F RTQP
Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67
Query 61 VDTSAFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAASS--SASISFNRNLPWID 118
++T+AF R+R Y++++FVP ++ AI M +A+ + ++ + LP+
Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNVPLSDELPYF- 126
Query 119 LATINTAIGNVQSSASPNNFFGVLRSEGFKKLVSYLGYGETSP---------------EK 163
T + S A N FG R+ ++ YLGYG+ P
Sbjct 127 --TAEQVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYPYIVEAAGGEGATWATRP 184
Query 164 YVDNLRCSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWNG 206
++NL+ S PL+AYQKIY D+ R++QWE+S P T+N D+ +G
Sbjct 185 MLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISG 227
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 105 bits (262), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/266 (31%), Positives = 121/266 (45%), Gaps = 46/266 (17%)
Query 2 SSLFSYGDIKNTPRRSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPV 61
S++ +KN P R+ FDLS++ FTAKVGELLP + + P D +S +F RT P+
Sbjct 5 SNIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPL 64
Query 62 DTSAFTRIREYYEWFFVPLHLLYRNSNEAIMSMENQPNYAASSSASISFNRN------LP 115
++AFTR+RE ++FFVP L++ + +++M N S + S N +P
Sbjct 65 QSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMP 124
Query 116 WIDLAT--------INTAIGNVQSSASPNNFFGVLRSEGFKKLVSYLGYGETSPEKY--- 164
++ T IN + S P G R KL+ LGYG PE++
Sbjct 125 CVNYKTLHAYLLKFINRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNF-PEQFANF 183
Query 165 -VDNLR------------------CSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCDFWN 205
V+N + S L AY KI D+Y + QW+ YN N
Sbjct 184 KVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQP-----YNASLCN 238
Query 206 GEDSTPVASSL----DLFSQNPNDSV 227
+ TP +SSL D P+DS+
Sbjct 239 VDYLTPNSSSLLSIDDALLSIPDDSI 264
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 100 bits (248), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 112/229 (49%), Gaps = 17/229 (7%)
Query 16 RSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPVDTSAFTRIREYYEW 75
R+ FDLS + FTA G LLPV +P D I+ + F RT P++T+AF +R YE+
Sbjct 17 RNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEF 76
Query 76 FFVPLHLLYRNSNEAIMSMENQPNYAASSSASISFNRNLPWIDLATINTAI--GNVQSSA 133
FFVP H L+ ++ I M + + A S + +P+ ++ ++ ++ G S
Sbjct 77 FFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTGKESGSG 136
Query 134 SPNNFFGVLRSEGFKKLVSYLGYG---ETSPEKYVDNL---------RCSALPLYAYQKI 181
S ++ + F +L+ LGYG ++ Y DN+ CS + AY KI
Sbjct 137 STDDLQYKFKYGAF-RLLDLLGYGRKFDSFGTAYPDNVSGLKNNLDYNCSVFRILAYNKI 195
Query 182 YQDYYRHSQWEKSKPWTYNCDFWNGE--DSTPVASSLDLFSQNPNDSVF 228
YQDYYR+S +E ++N D + G D+ VA L +N F
Sbjct 196 YQDYYRNSNYENFDTDSFNFDKFKGGLVDAKVVADLFKLRYRNAQTDYF 244
>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588
Score = 97.1 bits (240), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 101/206 (49%), Gaps = 15/206 (7%)
Query 16 RSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPVDTSAFTRIREYYEW 75
++GFD+S + FT+ VG+LLPV++ + P DK IS F RTQP+ ++A R+ E+ E+
Sbjct 16 KNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQPMKSTAMARLTEHIEY 75
Query 76 FFVPLHLLYRNSNEAIMSMENQPNYAASSSASISFNRNLPWIDLATINTAIGNVQSSASP 135
FFVP ++ +++ SSS N +P+ ++ A+ +S S
Sbjct 76 FFVPFEQMFSLFGSVFYGIDDY----NSSSLVKHNNLTMPFFKSDAVSAALEAAYTSFSS 131
Query 136 N--------NFFGVLRSEGFKKLVSYLGYGETSPEKYVDNLRCSALPLY---AYQKIYQD 184
+ + G R G +L LGYG + L + + ++ AYQKI+ D
Sbjct 132 SINRKVLTPDMMGQPRVYGILRLSEMLGYGSLLLSNDNNLLPHADMSVFLFTAYQKIFND 191
Query 185 YYRHSQWEKSKPWTYNCDFWNGEDST 210
+YR + + +YN D+ G+ T
Sbjct 192 FYRLDDYTSVQHKSYNVDYAQGQPIT 217
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 94.7 bits (234), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 99/196 (51%), Gaps = 9/196 (5%)
Query 16 RSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPVDTSAFTRIREYYEW 75
RS FDLS K +TA G LLPV + D I + F RT P++++AF +R YE+
Sbjct 18 RSAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMRTMPMNSAAFISMRGVYEF 77
Query 76 FFVPLHLLYRNSNEAIMSMENQPNYAASSSASISFNRNLPWIDLATINTAIGNVQSSASP 135
FFVP L+ ++ I SM + + SS+A ++P + LA + + +
Sbjct 78 FFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKALDSVPNVKLADMYKFV----RERTD 133
Query 136 NNFFGVLRSEGFKKLVSYLGYGE--TSPEKYVDNLRCSALPLY---AYQKIYQDYYRHSQ 190
+ FG S +L+ LGYG+ TS + V L + L+ AY KIY DYYR++
Sbjct 134 KDIFGYPHSNNSCRLMDLLGYGKPITSSKTPVPLLYTGNVNLFRLLAYNKIYSDYYRNTT 193
Query 191 WEKSKPWTYNCDFWNG 206
+E +++N D G
Sbjct 194 YEGVDVYSFNIDHKKG 209
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 87.4 bits (215), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/220 (30%), Positives = 111/220 (50%), Gaps = 37/220 (17%)
Query 14 PR--RSGFDLSNKCAFTAKVGELLPVYWKFCLPADKFNISQEWFARTQPVDTSAFTRIRE 71
PR R+GFDLS++ F+AK G+LLP+ P++ F S + RT ++T+++ R++E
Sbjct 6 PRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMKE 65
Query 72 YYEWFFVPLHLLYRNSNEAIMSMENQPNYA---ASSSASISFNR---NLPWIDLATINTA 125
YY +FFV L++ ++ I+ N P+ A + + ++N+ ++P DL + T
Sbjct 66 YYHFFFVSYRSLWQWFDQFIVGT-NNPHSALNGVKKNGTTNYNQICSSVPTFDLGKLITR 124
Query 126 IGNVQSSASPNNFFGVLRSEGFKKLVSYLGYGETSPEKY--VDNL--------------- 168
+ + N+ SEG KL++ L YG T+ K+ ++NL
Sbjct 125 LKTSDMDSQGFNY-----SEGAAKLLNMLNYGVTNKGKFMNLENLITSTSYLPSKDDKEP 179
Query 169 ------RCSALPLYAYQKIYQDYYRHSQWEKSKPWTYNCD 202
+ S L AYQKI+ D+YR+ W S ++N D
Sbjct 180 SSIYACKVSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVD 219
Lambda K H a alpha
0.318 0.132 0.419 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 897149307900