bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-2_CDS_annotation_glimmer3.pl_2_1
Length=537
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226431|ref|WP_021963494.1| predicted protein 133 1e-30
gi|496050828|ref|WP_008775335.1| hypothetical protein 112 2e-23
gi|575094340|emb|CDL65724.1| unnamed protein product 108 4e-22
gi|490418708|ref|WP_004291031.1| hypothetical protein 98.6 6e-19
gi|575094322|emb|CDL65709.1| unnamed protein product 89.0 1e-15
gi|496521300|ref|WP_009229583.1| hypothetical protein 88.6 3e-15
gi|575094298|emb|CDL65688.1| unnamed protein product 81.3 4e-13
gi|494610270|ref|WP_007368516.1| hypothetical protein 80.9 5e-13
gi|647452984|ref|WP_025792805.1| hypothetical protein 78.6 3e-12
gi|565841285|ref|WP_023924566.1| hypothetical protein 76.6 1e-11
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 133 bits (335), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 149/330 (45%), Gaps = 58/330 (18%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCALISNMSSHFKHAYFFTLTYSDEFVPRVSLE 60
V N+YT E + V CG C +CL R++ L + K+ F TLTYS+++VPR+ E
Sbjct 17 VQNKYTGEVIQVGCGVCKACLKRRADKMSFLCAIEEQSHKYCMFATLTYSNDYVPRMYPE 76
Query 61 VVERCDAESEIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSETEDS 120
V D+ ++ Y S C R+++ G++
Sbjct 77 V-------------------------DNELRLVRWY---SYCDRLNEKGKLMTVD----- 103
Query 121 YQFLHTFSGKEIRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFKRLRKL 180
Y + H + L++++ D + + D LF KR+RK
Sbjct 104 YDYWHKCPSLDTYVLMLTAKCNLD---------------GYLSYTSKRDAQLFLKRVRKN 148
Query 181 ISERYDEKICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSKSWSYGRTDCSLSR 240
+S+ DEKI YY+VSEYG +T+R H+H + F++ + +++ ++W +GR DCSLSR
Sbjct 149 LSKYSDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVIRQAWQFGRVDCSLSR 208
Query 241 GSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLSVNSLFSRTSDISQVDEVAASCF 300
G YVA Y+N LP F KP S HS F+ SQ +E+
Sbjct 209 GKCNSYVARYVNCNYCLPRFLG-DMSTKPFSCHS------IRFALGIHQSQKEEIYKGSV 261
Query 301 DGF---SVPINGEYVTVKPSRSYEHTVFPR 327
D F S ING YV P R+ T FP+
Sbjct 262 DDFIYQSGEINGNYVEFMPWRNLSCTFFPK 291
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/331 (29%), Positives = 145/331 (44%), Gaps = 70/331 (21%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSN---IQCALISNMSSHFKHAYFFTLTYSDEFVPRV 57
+ N YT E++ V CG C +C + +++ QC L S + KH F TLTY++ F+PR
Sbjct 15 IMNPYTKESMVVPCGHCQACTLAKNSRYAFQCDLESYTA---KHTLFITLTYANRFIPR- 70
Query 58 SLEVVERCDAESEIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSET 117
A DS R PY GC D +
Sbjct 71 ---------------AMFVDSIER--PY---------------GC----------DLIDK 88
Query 118 EDSYQFLHTFSGKEIRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFKRL 177
E +G+ + ++ + R + K +F ++ L D LF KRL
Sbjct 89 E---------TGEILGPADLTEDERTNLLNKFYLF-------GDVPYLRKTDLQLFLKRL 132
Query 178 RKLIS-ERYDEKICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSKSWSYGRTDC 236
R ++ ++ EK+ Y+ V EYG +RPH+H +LF SD E +SK+W++GR DC
Sbjct 133 RYYVTKQKPSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQICSENISKAWTFGRVDC 192
Query 237 SLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLSVNSLFSRTSDISQVDEVA 296
+S+G + YVASY+NS +P F + + P S HS+ L L + I +
Sbjct 193 QVSKGQCSNYVASYVNSSCTIPKVF-KASSVCPFSVHSQKLGQGFLDCQREKIY---SLT 248
Query 297 ASCFDGFSVPINGEYVTVKPSRSYEHTVFPR 327
F S+ +NG+Y RS +PR
Sbjct 249 PENFIRSSIVLNGKYKEFDVWRSCYSFFYPR 279
>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486
Score = 108 bits (271), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 95/298 (32%), Positives = 136/298 (46%), Gaps = 43/298 (14%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCA-LISNMSSHFKHAYFFTLTYSDEFVPRVSL 59
VTN+Y + +V CG CPSCL ++N C +I+ + F TLTY +E +P +
Sbjct 12 VTNKYVGRSFYVDCGHCPSCLQRKANKSCCKIINEYGRPYSFMCFVTLTYDNEHIPYIH- 70
Query 60 EVVERCDAESEIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGC----FRVHDSGRVRDFS 115
D+D HL S Y + + G V+ +G++ D
Sbjct 71 ----------------PDTDYSHLYVGKSYYVRHSRIFDKDGVENLPLGVYRNGKLIDTV 114
Query 116 ETEDSYQFLHTFSGKEIRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFK 175
FL + R+ L ++ G SR VV D N++ +L D F K
Sbjct 115 -------FLPEMPKEVFRNYLCNTTGIVTKSRNGVVLERDD---NKVGILYDKDFVNFVK 164
Query 176 RLRKLISERY--DEKICYYLVSEYGGRTYRPHWHGILFFNSDALT-SSICELVSKSWSYG 232
RLR ++ Y + KI Y+ SEYG T RPH+HGI +F+S AL+ S V +SW
Sbjct 165 RLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESWKMC 224
Query 233 RTD-----CSLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLS-VNSLFS 284
D ++R A YVASY+N +P F K ++PK HSKG N+LFS
Sbjct 225 DKDKQYENVEIAR-EPATYVASYVNCLTSVPPLF-LFKGLRPKHSHSKGFGFANNLFS 280
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 98.6 bits (244), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 86/164 (52%), Gaps = 5/164 (3%)
Query 165 LNPYDQNLFFKRLRKLISERY-DEKICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICE 223
L +D LFFKR R +++R+ EK+ Y+ + EYG +RPH+H +LF SD +
Sbjct 44 LRKFDLQLFFKRFRYYVAKRFPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQVCSK 103
Query 224 LVSKSWSYGRTDCSLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLSVNSLF 283
+VS++W +GR DC LS+G + YVA Y+NS V +P + P HS+ L L
Sbjct 104 VVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLTL-PTLCPFCVHSQKLGQGFL- 161
Query 284 SRTSDISQVDEVAASCFDGFSVPINGEYVTVKPSRSYEHTVFPR 327
S+ ++V + F S+ ING Y RS FP+
Sbjct 162 --QSERAKVYSLTPEQFVKRSIVINGRYKEFDVWRSAYAYFFPK 203
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 146/338 (43%), Gaps = 63/338 (19%)
Query 12 VRCGTCPSCLVH-RSNIQCAL-ISNMSSHFKHAYFFTLTYSDEFVPRVSLEVVERCDAES 69
V CG C +C + RS++ L + +S K+ YF TLTY D+ +P S+ + + C E
Sbjct 24 VPCGKCIACHNNKRSSLSLKLRLEEYTS--KYCYFLTLTYDDDNLPLFSVGL-DTCATEF 80
Query 70 EIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSETEDSYQFLHTFSG 129
R PY + R+ + + DF D + F + F
Sbjct 81 ----------VRIYPYSE----------------RLRNDSFISDF--CSDLHNFDNDFVD 112
Query 130 KE--IRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFKRLRKLISERYDE 187
K D +++ +Y + CV + +L D LF KRLRK I + Y E
Sbjct 113 KMDYYSDYVINYESKYH--KSCVYGHGL------YALLYYRDIQLFLKRLRKHIYKYYGE 164
Query 188 KICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSKS---------------WSYG 232
KI +Y++ EYG ++ RPHWH +LFFNS +L+ + + V+ W +G
Sbjct 165 KIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFG 224
Query 233 RTDCSLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLSVNSLFSRTSDISQV 292
D + G A YV+SY+N + P K+YHS + + + S S +S +
Sbjct 225 ICDSKRTNGEAYNYVSSYVNQSANFPKLLVLLSN--QKAYHS--IQLGQILSEQSIVSAI 280
Query 293 DEVAASCFD-GFSVPINGEYVTVKPSRSYEHTVFPRIS 329
+ S F+ F + G + RSY FP+ +
Sbjct 281 QKGDFSFFERQFYLDTFGAANSYSVWRSYYSRFFPKFT 318
>gi|496521300|ref|WP_009229583.1| hypothetical protein [Prevotella sp. oral taxon 317]
gi|288330571|gb|EFC69155.1| hypothetical protein HMPREF0670_00478 [Prevotella sp. oral taxon
317 str. F0108]
Length=569
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/285 (29%), Positives = 122/285 (43%), Gaps = 75/285 (26%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCALISNMSSHFKHAYFFTLTYSDEFVPRVSLE 60
V NR+T + +FV CG C +C+ ++ Q + N K++ FTLTY++EF+PR
Sbjct 17 VHNRWTRDEMFVPCGRCEACVNAAASKQSKRVRNEIMQHKYSVMFTLTYNNEFIPR---- 72
Query 61 VVERCDAESEIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSETEDS 120
+ ++ ++D L P C + S + F +
Sbjct 73 ----------WERFLDNNDCPQLR-------------PIGRCAELFPSCPLNYFDKVTGK 109
Query 121 YQF-LHTFSGKEIRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFKRLRK 179
+ L TF K D VF S C+ +I QN F KRLR
Sbjct 110 WSIDLDTFLPKIEND------------EHTEVFASC--CKKDI-------QN-FLKRLRF 147
Query 180 LISERYDE----KICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSKSWSYGR-- 233
IS+ Y + KI YY+ SEYG T RPH+HGI+FF+ +L S I L+ +SW + R
Sbjct 148 NISKLYGKAESRKIRYYVASEYGPTTLRPHYHGIIFFDDASLLSEISSLIVRSWGFQRRV 207
Query 234 ------------TDCSLSR-------GSAAGYVASYINSFVDLPD 259
D SL++ + A YVA Y++ + LP
Sbjct 208 GGKRNSFIFQPFADISLTQQYVKLCDQNTAYYVAEYVSGNLGLPQ 252
>gi|575094298|emb|CDL65688.1| unnamed protein product [uncultured bacterium]
Length=478
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 142/340 (42%), Gaps = 59/340 (17%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCALISNMSSHFKHAYFFTLTYSDEFVPRVSLE 60
+ N+YT + L+V CG CP+CL ++N I N S +F TL Y + +P +
Sbjct 8 IRNKYTGQKLYVSCGKCPACLQEKANASAYKIRNNQSSELSCFFVTLNYDNNHIPVIFKH 67
Query 61 VVERCDAESEIDAYMSDSDPRHL--PYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSETE 118
V ++ D Y D + + L P D R G FS
Sbjct 68 DVYNYNSS---DVYHFDEERKELCLPVDLYR-------------------GVCPAFSNKI 105
Query 119 DSYQFLHTFSGKEIRDLLVSSNGRYDFSR--KCVVFPSIDECRNEIL-VLNPYDQNLFFK 175
D++ F ++ L + G ++ K V+F EI V D LFFK
Sbjct 106 DTFNFPLNRLSTDVVSSLDNHCGVVVKTKNHKPVLF------NEEIFSVCYTKDIQLFFK 159
Query 176 RLRKLISERYDEK--ICYYLVSEYGGRTYRPHWHGILFFNSDALT-SSICELVSKSWSYG 232
RLR+ + ++ + I Y+ SEYG TYR H+H +F ++ S + K+W +
Sbjct 160 RLRQSLYRKFGFRPFIQYFQTSEYGPTTYRAHFHLCIFVKRSEISFDSFRKACVKAWPFC 219
Query 233 RT-----DCSLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHS-------KGLSVN 280
+ ++R S + Y+ASY+N ++P F N KE K K HS LS N
Sbjct 220 SKKQMFRNVEIAR-SPSAYIASYVNCRANVPLFLNL-KEAKAKHTHSLYFGHNNDKLSFN 277
Query 281 SLFSR---------TSDISQVDEVAASCFDGFSVPINGEY 311
S+ +R ++S VD V + F IN Y
Sbjct 278 SIVNRFETQGTTLYPRELSSVDGVPQTSFLPLPRYINAYY 317
>gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis]
gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis
DSM 16608]
Length=479
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 66/267 (25%), Positives = 111/267 (42%), Gaps = 67/267 (25%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCALISNMSSHFKHAYFFTLTYSDEFVPRVSLE 60
+ N+Y ETL+V C C C ++ I N + + F TLTY +E +P
Sbjct 16 IYNKYIDETLYVPCRKCFRCRDSYASDWSRRIENECREHRFSLFVTLTYDNEHIPLFQPL 75
Query 61 VVERCDAESEIDAYMSDSDPRHLPYDDSRYQIAATYLPRSGCFRVHDSGRVRDFSETEDS 120
V++ D H + +R + +L S C
Sbjct 76 VMD---------------DGSHPVWFSNRLSESGKFLSDSVC------------------ 102
Query 121 YQFLHTFSGKEIRDLLVSSNGRYDFSRKCVVFPSIDECRNEILVLNPYDQNLFFKRLRKL 180
+ +++ D + C +P C+ ++ +FKRLR
Sbjct 103 ----RSLPPQKMEDEV------------CFAYP----CKKDV--------QDWFKRLRSA 134
Query 181 ISERYDE------KICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSKSWSYGRT 234
+ + ++ +I Y++ SEYG RT+RPH+H IL+++S+ L +I L+ ++W G +
Sbjct 135 VDYQLNKNKSNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGRLIRETWKNGNS 194
Query 235 DCSLSRGSAAGYVASYINSFVDLPDFF 261
SL SA+ YVA Y+N LP F
Sbjct 195 VFSLVNNSASQYVAKYVNGDTRLPPFL 221
>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/94 (44%), Positives = 58/94 (62%), Gaps = 5/94 (5%)
Query 173 FFKRLR-----KLISERYDEKICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELVSK 227
FFKRLR KL + +I Y++ SEYG T+RPH+H IL+++S+ L + + L+ +
Sbjct 130 FFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELNVLIRE 189
Query 228 SWSYGRTDCSLSRGSAAGYVASYINSFVDLPDFF 261
+W G TD SL SA+ YVA Y+N DLP F
Sbjct 190 TWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFL 223
>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens
CC14M]
Length=484
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/164 (33%), Positives = 84/164 (51%), Gaps = 13/164 (8%)
Query 173 FFKRLRKLISERY-------DEKICYYLVSEYGGRTYRPHWHGILFFNSDALTSSICELV 225
FFKRLR +S + +EKI Y++ SEYG +T RPH+H I++F+S+ + I +++
Sbjct 123 FFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVIEKML 182
Query 226 SKSWSYGRTDCSLSRGSAAGYVASYINSFVDLPDFFNRHKEIKPKSYHSKGLSVNSLFSR 285
S SWS G TD +A YVA Y++ LP+ +H + S+ SV R
Sbjct 183 SSSWSNGFTDFEYVNSTAPQYVAKYVSGNSVLPEIL-QHDACRTFHLQSQAPSVG---YR 238
Query 286 TSDISQVD-EVAASCFDGFSV-PINGEYVTVKPSRSYEHTVFPR 327
+ D + + EV C+ F + V V+P + E FP+
Sbjct 239 SDDYEKFEKEVIDGCYGHFEYDSSSQSSVFVQPPGTLETRCFPK 282
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/55 (33%), Positives = 30/55 (55%), Gaps = 0/55 (0%)
Query 1 VTNRYTHETLFVRCGTCPSCLVHRSNIQCALISNMSSHFKHAYFFTLTYSDEFVP 55
+ N YTHE ++V C C CL +++ ++N ++ F TLTY +E +P
Sbjct 15 IINPYTHERVWVACRRCKCCLNKKTSAWSGRVANECKLHAYSAFVTLTYDNEHLP 69
Lambda K H a alpha
0.323 0.138 0.430 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3864800874240