bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-31_CDS_annotation_glimmer3.pl_2_6
Length=236
Score E
Sequences producing significant alignments: (Bits) Value
gi|496050828|ref|WP_008775335.1| hypothetical protein 90.1 3e-17
gi|490418708|ref|WP_004291031.1| hypothetical protein 79.7 6e-14
gi|547226431|ref|WP_021963494.1| predicted protein 79.7 8e-14
gi|517172763|ref|WP_018361581.1| hypothetical protein 69.7 2e-10
gi|575094322|emb|CDL65709.1| unnamed protein product 68.6 3e-10
gi|575094355|emb|CDL65737.1| unnamed protein product 68.2 5e-10
gi|575094340|emb|CDL65724.1| unnamed protein product 67.8 6e-10
gi|565841285|ref|WP_023924566.1| hypothetical protein 65.9 3e-09
gi|647452984|ref|WP_025792805.1| hypothetical protein 61.2 9e-08
gi|496521300|ref|WP_009229583.1| hypothetical protein 61.2 1e-07
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 90.1 bits (222), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/121 (43%), Positives = 76/121 (63%), Gaps = 2/121 (2%)
Query 35 QIPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT 94
+P +R D +LF+KRLR + +Q P E++RY+ V EYGP RPH+HLLLF S +
Sbjct 116 DVPYLRKTDLQLFLKRLRYYVTKQK-PSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEAL 174
Query 95 QAIRENVCKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFG 154
Q EN+ KAW++G D +S+G ++YVASYVNS ++P ++ + P HS+ G
Sbjct 175 QICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCPFSVHSQKLG 233
Query 155 Q 155
Q
Sbjct 234 Q 234
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 79.7 bits (195), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 71/121 (59%), Gaps = 2/121 (2%)
Query 36 IPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQ 95
+P +R D +LF KR R + ++ P E++RY+ + EYGP RPH+H+LLF S + Q
Sbjct 41 LPYLRKFDLQLFFKRFRYYVAKR-FPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQ 99
Query 96 AIRENVCKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFGQ 155
+ V +AW +G D LS+G +SYVA YVNS +P + T + P C HS+ GQ
Sbjct 100 VCSKVVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLT-LPTLCPFCVHSQKLGQ 158
Query 156 N 156
Sbjct 159 G 159
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 79.7 bits (195), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 68/113 (60%), Gaps = 3/113 (3%)
Query 42 RDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIRENV 101
RD +LF+KR+R NL + E+IRYY VSEYGP+T R H+H+L F+D + + + + +
Sbjct 136 RDAQLFLKRVRKNLSK--YSDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVI 193
Query 102 CKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFG 154
+AW +G D SLSRG SYVA YVN LP + G +P HS F
Sbjct 194 RQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLP-RFLGDMSTKPFSCHSIRFA 245
>gi|517172763|ref|WP_018361581.1| hypothetical protein [Prevotella nanceiensis]
Length=598
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/74 (41%), Positives = 46/74 (62%), Gaps = 2/74 (3%)
Query 34 LQIPVIRNRDFELFMKRLRSNLKQQGLPHEE--IRYYCVSEYGPQTLRPHWHLLLFFDSP 91
LQ + +D + F+KRLR + + +P E IRY+ SEYGP+T RPH+H +LF DSP
Sbjct 146 LQFATVSKKDIQNFLKRLRKKIDKLNIPQNEKKIRYFIASEYGPKTYRPHYHGVLFIDSP 205
Query 92 QLTQAIRENVCKAW 105
+ I+ + ++W
Sbjct 206 TVLSKIKAFIVESW 219
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 52/92 (57%), Gaps = 19/92 (21%)
Query 42 RDFELFMKRLRSNL-KQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIREN 100
RD +LF+KRLR ++ K G E+IR+Y + EYG ++LRPHWH LLFF+S L+QA +
Sbjct 145 RDIQLFLKRLRKHIYKYYG---EKIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDC 201
Query 101 V--------CKA-------WSYGNCDVSLSRG 117
V C W +G CD + G
Sbjct 202 VNVGTTSRPCSCPRFLRPFWQFGICDSKRTNG 233
>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517
Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/149 (35%), Positives = 71/149 (48%), Gaps = 31/149 (21%)
Query 35 QIPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT 94
+P +R RD +LF+KRLR NL + ++RY+ + EYGP RPH+H LLFFD + T
Sbjct 122 DVPYLRKRDLQLFIKRLRKNLSKYS--DAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFT 179
Query 95 QAI--------------RENVC--------------KAWSYGNCDVSLSRGaaasyvasy 126
+N C +W +G D S+G AA YV+SY
Sbjct 180 APSGHTLGEFPDWAWYDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSY 239
Query 127 vnsvasLPYLYTGHKEIRPRCFHSKGFGQ 155
V+ SLP +Y RP HS+ GQ
Sbjct 240 VSGSGSLPKVYQV-SSARPFSLHSRFLGQ 267
>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486
Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 87/182 (48%), Gaps = 14/182 (8%)
Query 1 MTDDVIKDILRRANGKYSYSLRKVVYPPASEWKLQIPVIRNRDFELFMKRLRSNLKQQGL 60
M +V ++ L G + S VV ++ ++ ++DF F+KRLR NL +
Sbjct 119 MPKEVFRNYLCNTTGIVTKSRNGVVLERDDN---KVGILYDKDFVNFVKRLRINLTRNYN 175
Query 61 PHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT-QAIRENVCKAWSYGNCD-----VSL 114
+I Y+ SEYGP T RPH+H + +FDS L+ + R V ++W + D V +
Sbjct 176 YEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESWKMCDKDKQYENVEI 235
Query 115 SRGaaasyvasyvnsvasLP-YLYTGHKEIRPRCFHSKGFG-QNKSFVNRPVFQKFEKYP 172
+R A + + P +L+ G +RP+ HSKGFG N F VF F
Sbjct 236 AREPATYVASYVNCLTSVPPLFLFKG---LRPKHSHSKGFGFANNLFSFSAVFTNFMAQR 292
Query 173 LT 174
LT
Sbjct 293 LT 294
>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens
CC14M]
Length=484
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/74 (43%), Positives = 47/74 (64%), Gaps = 5/74 (7%)
Query 43 DFELFMKRLRSNL-----KQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAI 97
D F KRLRS L K + +E+IRY+ SEYGP+TLRPH+H +++FDS ++ + I
Sbjct 119 DIVKFFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVI 178
Query 98 RENVCKAWSYGNCD 111
+ + +WS G D
Sbjct 179 EKMLSSSWSNGFTD 192
>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480
Score = 61.2 bits (147), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 4/105 (4%)
Query 14 NGKY-SYSLRKVVYPPASEWKLQIPVIRNRDFELFMKRLRSNLKQQGLPHEE---IRYYC 69
NGK+ S + + + P E + +D + F KRLRS + + P IRY+
Sbjct 96 NGKFVSSDIARPIPPVGMEDTVCFAYPCKKDVQDFFKRLRSKIDYKLKPRGNEYRIRYFI 155
Query 70 VSEYGPQTLRPHWHLLLFFDSPQLTQAIRENVCKAWSYGNCDVSL 114
SEYGP T RPH+H +L++DS L + + + W GN D SL
Sbjct 156 CSEYGPNTFRPHYHAILWYDSEILHNELNVLIRETWKNGNTDFSL 200
>gi|496521300|ref|WP_009229583.1| hypothetical protein [Prevotella sp. oral taxon 317]
gi|288330571|gb|EFC69155.1| hypothetical protein HMPREF0670_00478 [Prevotella sp. oral taxon
317 str. F0108]
Length=569
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/68 (43%), Positives = 42/68 (62%), Gaps = 2/68 (3%)
Query 42 RDFELFMKRLRSNLKQQGLPHE--EIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIRE 99
+D + F+KRLR N+ + E +IRYY SEYGP TLRPH+H ++FFD L I
Sbjct 136 KDIQNFLKRLRFNISKLYGKAESRKIRYYVASEYGPTTLRPHYHGIIFFDDASLLSEISS 195
Query 100 NVCKAWSY 107
+ ++W +
Sbjct 196 LIVRSWGF 203
Lambda K H a alpha
0.326 0.141 0.441 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 981586889820