bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-37_CDS_annotation_glimmer3.pl_2_1
Length=238
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226431|ref|WP_021963494.1| predicted protein 102 1e-21
gi|496050828|ref|WP_008775335.1| hypothetical protein 95.5 3e-19
gi|490418708|ref|WP_004291031.1| hypothetical protein 88.6 4e-17
gi|494610270|ref|WP_007368516.1| hypothetical protein 83.6 4e-15
gi|647452984|ref|WP_025792805.1| hypothetical protein 82.8 7e-15
gi|575094322|emb|CDL65709.1| unnamed protein product 80.1 5e-14
gi|565841285|ref|WP_023924566.1| hypothetical protein 79.3 1e-13
gi|494822887|ref|WP_007558295.1| hypothetical protein 77.0 7e-13
gi|575094340|emb|CDL65724.1| unnamed protein product 63.9 1e-08
gi|575094355|emb|CDL65737.1| unnamed protein product 63.5 2e-08
>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498
Score = 102 bits (254), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/191 (32%), Positives = 98/191 (51%), Gaps = 14/191 (7%)
Query 2 QLFFKRLNQNIRSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQFVSKSWR 61
QLF KR+ +N+ ++EKI YY+V EYGP TFR H+H+L F+D + ++ + + + ++W+
Sbjct 139 QLFLKRVRKNLSKYSDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVIRQAWQ 198
Query 62 FGDTDTQPVWSSASCYVAGYVNSTACLPDFYKNFSHIKPFG----RFSMHFAESAFNEVF 117
FG D + YVA YVN CLP F + S KPF RF++ +S E++
Sbjct 199 FGRVDCSLSRGKCNSYVARYVNCNYCLPRFLGDMS-TKPFSCHSIRFALGIHQSQKEEIY 257
Query 118 KPQEDEEIFSLFYDGRMLELNGKPTLVRPKRSHINRLYPRLNKSKHASVDDDIRVATALS 177
K D+ I+ + E+NG P R+ +P K K S D + + +
Sbjct 258 KGSVDDFIY------QSGEINGNYVEFMPWRNLSCTFFP---KCKGYSRKSDSELWQSYN 308
Query 178 NIPHVLAKFGF 188
+ V + G+
Sbjct 309 ILREVRSAIGY 319
>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497
Score = 95.5 bits (236), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/205 (35%), Positives = 99/205 (48%), Gaps = 15/205 (7%)
Query 1 MQLFFKRLNQNI-RSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQFVSKS 59
+QLF KRL + + +EK+ Y+ VGEYGP FRPH+H+LLF S E Q + +SK+
Sbjct 125 LQLFLKRLRYYVTKQKPSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQICSENISKA 184
Query 60 WRFGDTDTQPVWSSASCYVAGYVNSTACLPDFYKNFSHIKPFGRFSMHFAESAFNEVFKP 119
W FG D Q S YVA YVNS+ +P +K S + PF S + F
Sbjct 185 WTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCPFSVHSQKLGQG-----FLD 238
Query 120 QEDEEIFSLFYDGRM---LELNGKPTLVRPKRSHINRLYPRLNKSKHASVDDDIRVATAL 176
+ E+I+SL + + + LNGK RS + YPR S + A
Sbjct 239 CQREKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPRCKGFVTKSSRE-----RAY 293
Query 177 SNIPHVLAKFGFIDEVTDFEMSKRI 201
S + A+ F D T F ++K I
Sbjct 294 SYSIYDTARLLFPDAKTTFSLAKEI 318
>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM
20697]
Length=422
Score = 88.6 bits (218), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/90 (48%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query 1 MQLFFKRLNQNI-RSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQFVSKS 59
+QLFFKR + + EK+ Y+ +GEYGP FRPH+HILLF S E Q + VS++
Sbjct 49 LQLFFKRFRYYVAKRFPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQVCSKVVSEA 108
Query 60 WRFGDTDTQPVWSSASCYVAGYVNSTACLP 89
W FG D Q S YVAGYVNS+ +P
Sbjct 109 WPFGRVDCQLSKGKCSSYVAGYVNSSVLVP 138
>gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis]
gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis
DSM 16608]
Length=479
Score = 83.6 bits (205), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/187 (33%), Positives = 98/187 (52%), Gaps = 19/187 (10%)
Query 1 MQLFFKRLNQNI-----RSVTNE-KIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQ 54
+Q +FKRL + ++ +NE +I Y++ EYGP TFRPH+H +L++DS+EL+++I +
Sbjct 124 VQDWFKRLRSAVDYQLNKNKSNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGR 183
Query 55 FVSKSWRFGDTDTQPVWSSASCYVAGYVNSTACLPDFYKN-FSHIKPFGRFSMHFAESAF 113
+ ++W+ G++ V +SAS YVA YVN LP F + F+ + H A
Sbjct 184 LIRETWKNGNSVFSLVNNSASQYVAKYVNGDTRLPPFLRTEFTS-------TFHLASKHP 236
Query 114 NEVFKPQEDEEIFSLFYDGR-----MLELNGKPTLVRPKRSHINRLYPRLNKSKHASVDD 168
+ ++E + S DG + NG+ V RS NRL P+ + S +
Sbjct 237 YIGYCKADEEALRSNVLDGTYGQSVLNRDNGQFEFVPTPRSLENRLLPKCRGYRSLSHSE 296
Query 169 DIRVATA 175
IRV A
Sbjct 297 RIRVYAA 303
>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480
Score = 82.8 bits (203), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 59/186 (32%), Positives = 91/186 (49%), Gaps = 24/186 (13%)
Query 1 MQLFFKRLNQNI----RSVTNE-KIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQF 55
+Q FFKRL I + NE +I Y++ EYGP TFRPH+H +L++DS+ L +
Sbjct 127 VQDFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELNVL 186
Query 56 VSKSWRFGDTDTQPVWSSASCYVAGYVNSTACLPDFYKNFSHIKPFGRFSMHFAESAFNE 115
+ ++W+ G+TD V SSAS YVA YVN LP F + F+ F ++ +
Sbjct 187 IRETWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFLRT--------EFTSTFHLASKHP 238
Query 116 VFKPQEDEEIFSLFYDGRMLELNGKPTL---------VRPKRSHINRLYPRLNKSKHASV 166
+D+E Y+ + G+ L V P RS NR+ P+ + S
Sbjct 239 CIGYGKDDE--EALYENVINGTYGRNCLNKSTNEFEFVCPPRSLENRILPKCKGYRRISH 296
Query 167 DDDIRV 172
+ +R+
Sbjct 297 SERVRI 302
>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499
Score = 80.1 bits (196), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 86/190 (45%), Gaps = 17/190 (9%)
Query 1 MQLFFKRLNQNIRSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKELRQ---------- 50
+QLF KRL ++I EKI +Y++GEYG + RPH+H LLF +S L Q
Sbjct 147 IQLFLKRLRKHIYKYYGEKIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDCVNVGT 206
Query 51 -----SIRQFVSKSWRFGDTDTQPVWSSASCYVAGYVNSTACLPDFYKNFSHIKPFGRFS 105
S +F+ W+FG D++ A YV+ YVN +A P S+ K +
Sbjct 207 TSRPCSCPRFLRPFWQFGICDSKRTNGEAYNYVSSYVNQSANFPKLLVLLSNQKAYHSIQ 266
Query 106 MHFAESAFNEVFKPQEDEEIFSLFYDGRMLELNGKPTLVRPKRSHINRLYPRLNKSKHAS 165
+ S + V Q+ + FS F L+ G RS+ +R +P+ S +
Sbjct 267 LGQILSEQSIVSAIQKGD--FSFFERQFYLDTFGAANSYSVWRSYYSRFFPKFTCSSQLT 324
Query 166 VDDDIRVATA 175
+ RV T
Sbjct 325 YEQTYRVLTC 334
>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens
CC14M]
Length=484
Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 58/95 (61%), Gaps = 7/95 (7%)
Query 4 FFKRLNQNI-------RSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKELRQSIRQFV 56
FFKRL + +TNEKI Y+V EYGP T RPH+H +++ DS+E+ + I + +
Sbjct 123 FFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVIEKML 182
Query 57 SKSWRFGDTDTQPVWSSASCYVAGYVNSTACLPDF 91
S SW G TD + V S+A YVA YV+ + LP+
Sbjct 183 SSSWSNGFTDFEYVNSTAPQYVAKYVSGNSVLPEI 217
>gi|494822887|ref|WP_007558295.1| hypothetical protein [Bacteroides plebeius]
gi|198272100|gb|EDY96369.1| hypothetical protein BACPLE_00805 [Bacteroides plebeius DSM 17135]
Length=545
Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/270 (29%), Positives = 108/270 (40%), Gaps = 45/270 (17%)
Query 1 MQLFFKRLNQNIRSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDSKEL------------ 48
+QLF KRL + + +KI ++ GEYGP +FRPHFHILLF D L
Sbjct 126 LQLFMKRLRKYLDKYEGQKIRFFATGEYGPLSFRPHFHILLFVDDPSLFLPSVHTLGEYP 185
Query 49 -----------------RQSIRQFVSKSWRFGDTDTQPV-WSSASCYVAGYVNSTACLPD 90
+ ++ +SW FG D Q V S S YVAGYVNS+ LP
Sbjct 186 YPYWSKYQKAHCGKGTLLSKLEYYIRESWPFGGIDAQSVEQGSCSSYVAGYVNSSVPLPS 245
Query 91 FYKNFSHIKPFGRFSMHFAESAFNEVFKPQEDEEIFSLFYDGRMLELNGKPTLVRPKRSH 150
K +K F + S F P + F+ F R G+ R
Sbjct 246 CLK-VDAVKSFSQHSRFLGRKIFGTELIPLLKLK-FTEFVQ-RSFFCRGRYDNFRTPSEM 302
Query 151 INRLYPRLNKSKHASVDDDIRVATALSNIPHVLAKFGFIDEVTDFEMSKRIYYLIRRYLE 210
++ +YP+ S + RV T S + + D+ D S + Y
Sbjct 303 LHSVYPQCKGFALLSHEQRFRVYTIWSRLRYYFNS----DKKADVARS----LVTSFYSW 354
Query 211 IDHTLKYAPEQLR----LIYNYLSFVVVYK 236
+D + PE++R LIY LS + YK
Sbjct 355 LDTGILRVPERVREDFLLIYTELSQNLNYK 384
>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/130 (37%), Positives = 61/130 (47%), Gaps = 13/130 (10%)
Query 4 FFKRLNQNIRSVTN--EKIYYYVVGEYGPTTFRPHFHILLFHDSKELR-QSIRQFVSKSW 60
F KRL N+ N KI Y+ EYGPTT RPHFH + + DS+ L S R V +SW
Sbjct 162 FVKRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESW 221
Query 61 RFGDTDTQ----PVWSSASCYVAGYVNSTACLPDFY------KNFSHIKPFGRFSMHFAE 110
+ D D Q + + YVA YVN +P + SH K FG + F+
Sbjct 222 KMCDKDKQYENVEIAREPATYVASYVNCLTSVPPLFLFKGLRPKHSHSKGFGFANNLFSF 281
Query 111 SAFNEVFKPQ 120
SA F Q
Sbjct 282 SAVFTNFMAQ 291
>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 81/189 (43%), Gaps = 37/189 (20%)
Query 1 MQLFFKRLNQNIRSVTNEKIYYYVVGEYGPTTFRPHFHILLFHDS--------------- 45
+QLF KRL +N+ ++ K+ Y+ +GEYGP FRPH+H LLF D
Sbjct 131 LQLFIKRLRKNLSKYSDAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFTAPSGHTLGEFP 190
Query 46 -------------KELRQSIRQFVSKSWRFGDTDTQPVWSSASCYVAGYVNSTACLPDFY 92
++ + + SW+FG D Q A+ YV+ YV+ + LP Y
Sbjct 191 DWAWYDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSYVSGSGSLPKVY 250
Query 93 KNFSHIKPFGRFSMHFAESAFNEVFKPQEDEEIFSL---FYDGRMLELNGKPTLVRPKRS 149
+ S +PF S + F E E+++ + R +ELNG RS
Sbjct 251 Q-VSSARPFSLHSRFLGQG-----FLAHECEKVYETPVRDFVKRSVELNGSNKDFNLWRS 304
Query 150 HINRLYPRL 158
+ YP+
Sbjct 305 CYSVFYPKC 313
Lambda K H a alpha
0.326 0.140 0.429 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1002696285300