bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-1_CDS_annotation_glimmer3.pl_2_2
Length=298
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094436|emb|CDL65809.1| unnamed protein product 266 1e-83
gi|575094418|emb|CDL65793.1| unnamed protein product 191 4e-54
gi|575094569|emb|CDL65925.1| unnamed protein product 166 8e-45
gi|575094494|emb|CDL65868.1| unnamed protein product 156 3e-41
gi|575096060|emb|CDL66943.1| unnamed protein product 146 2e-37
gi|575096096|emb|CDL66976.1| unnamed protein product 143 8e-37
gi|575094487|emb|CDL65854.1| unnamed protein product 142 3e-36
gi|492501778|ref|WP_005867316.1| hypothetical protein 139 2e-35
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 138 6e-35
gi|313766924|gb|ADR80651.1| putative replication initiation protein 135 7e-34
>gi|575094436|emb|CDL65809.1| unnamed protein product [uncultured bacterium]
Length=340
Score = 266 bits (681), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 145/302 (48%), Positives = 190/302 (63%), Gaps = 15/302 (5%)
Query 1 MYQKNIMLIPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKT 60
++++++MLIPCG+CIGCRIR ++DW TR+ELEAR Y K WF+TLTY D+ +P +I T
Sbjct 41 IHRQDVMLIPCGKCIGCRIRAKQDWATRLELEARAY-KGRAWFVTLTYRDDTIPLLIRNT 99
Query 61 GEIMRKVQYVWKPGEKRPESVQTLLYTDVQKFLKRLRK-----AYRGK-LRYFVAGEYGE 114
GE++ +W G PE + TL DV KF KRLRK GK LRYF AGEYGE
Sbjct 100 GELIEGGVSMWSRGADVPEQINTLNMDDVTKFWKRLRKYQTTEPDMGKELRYFYAGEYGE 159
Query 115 QTARPHYHMILYGWQPTDLEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPETYRYVAG 174
QT RPHYH I++G + DL+ + N Y+ S L +WG G + IA + P TY YVAG
Sbjct 160 QTGRPHYHAIIFGLEIPDLKKV--PGRNQYYKSAILEKIWGKGNVTIAYSEPGTYNYVAG 217
Query 175 YVTKKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQCTNGKH 234
YVTKKMY G Y LG P+ACMS KPG+G + +++ ++W Q YIQ GK
Sbjct 218 YVTKKMY---GNDTKEYQNLGLTAPYACMSRKPGIGMPWLEQNLDKLWEQDYIQLA-GKT 273
Query 235 AQIPRYYEKMMEAENPQRLWRIKQNRQAAAIAENRLKYENTD--FAEQCKTKERVIKKQM 292
A IPR ++KM+EA +P+RLW+ KQ RQ +AI TD EQ +TK+RV+ K
Sbjct 274 APIPRAFDKMLEATDPERLWKKKQARQKSAINGALQAMSQTDQTLLEQYETKDRVLMKSF 333
Query 293 KK 294
K
Sbjct 334 AK 335
>gi|575094418|emb|CDL65793.1| unnamed protein product [uncultured bacterium]
Length=367
Score = 191 bits (485), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 110/260 (42%), Positives = 157/260 (60%), Gaps = 23/260 (9%)
Query 3 QKNIM-----LIPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMI 57
+KNI+ L PCGQC+ CRI+ +W R ELE +Y K+ + F+TLTYD+EHVP +
Sbjct 33 KKNILEDKWVLTPCGQCLACRIQYAANWAARCELET-NYHKQSI-FLTLTYDEEHVPVLN 90
Query 58 VKTGEIMRKV----QYVWKPGEKRPESVQTLLYTDVQKFLKRLRKAYRGK-----LRYFV 108
+TGEI R V +YV +R T+ DVQKF+KRLRKA + + Y++
Sbjct 91 KETGEIYRGVRNPAEYVAGVTLER----MTVYKPDVQKFIKRLRKAAEKEGLTDHIMYYL 146
Query 109 AGEYGEQTARPHYHMILYGWQPTDLEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPET 168
+GEYG++T RPHYH+I+YG + D EH+ + FTS+WL +WGMG I+I E+
Sbjct 147 SGEYGDKTGRPHYHLIVYGLEVPDAEHIGSRRGYDRFTSEWLKGIWGMGLIEIGSVTYES 206
Query 169 YRYVAGYVTKKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQ 228
+YVA YV KK G++A Y + G F MSLKP +G Y++EHK EI+ I
Sbjct 207 CQYVARYVIKKR---KGKEAKEYKDAGIMPEFVQMSLKPAIGQRYWEEHKDEIYSLDQIN 263
Query 229 CTNGKHAQIPRYYEKMMEAE 248
+G+ + PRY++K+ + E
Sbjct 264 LASGRTVKPPRYFDKLEDQE 283
>gi|575094569|emb|CDL65925.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 166 bits (420), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 103/278 (37%), Positives = 143/278 (51%), Gaps = 30/278 (11%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKTGEIMRKVQ 68
IPCG+CI CR R WT R+ LE +D+ +E FITLTYDD+H+
Sbjct 70 IPCGKCISCRRRYAALWTDRLMLELQDH--KESCFITLTYDDDHICC------------- 114
Query 69 YVWKPGEKRPESVQTLLYTDVQKFLKRLRKAY------RGKLRYFVAGEYGEQTARPHYH 122
V P E+ S+ TL +Q F KRLR+ ++RYF GEYG+ T RPHYH
Sbjct 115 -VDSPIEENV-SMYTLNKVHLQCFWKRLRQYLVRHVEPEKRIRYFACGEYGDTTFRPHYH 172
Query 123 MILYGWQPTDLEHLYK-IQHNGYFTSKWLADLWGMGQIQIAQAVPETYRYVAGYVTKKMY 181
IL+GW+PTDL K Q++ + SK LA +W G + + PE+ RYVA Y KK
Sbjct 173 AILFGWRPTDLIQFKKNFQNDTLYLSKSLASIWQNGNVMVGDVTPESCRYVARYCLKKAT 232
Query 182 EIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQCTN---GKHAQIP 238
D + Y LG F MS KPG+ Y+ +H EI + I + G QIP
Sbjct 233 GFDSE---IYERLGVLPEFVTMSRKPGIARKYFDDHYDEIIKYKTINLSTLKGGMSMQIP 289
Query 239 RYYEKMMEAENPQRLWRIKQNRQAAAIAENRLKYENTD 276
Y+ +++E + + IK++ + AA+ +NTD
Sbjct 290 PYFIRLIEDIDSELFKEIKRSNKQAALNHQEALMKNTD 327
>gi|575094494|emb|CDL65868.1| unnamed protein product [uncultured bacterium]
Length=348
Score = 156 bits (395), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 101/280 (36%), Positives = 148/280 (53%), Gaps = 39/280 (14%)
Query 4 KNIMLIPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVP---GMIVKT 60
++ ++IPCG+C+GCR+ W R LE+ + +F+TLTYDD+++P + T
Sbjct 57 RDYIIIPCGKCVGCRLAYSRQWADRCMLESSYHTHS--YFLTLTYDDDNLPLSESINQDT 114
Query 61 GEIMRKVQYVWKPGEKRPESVQTLLYTDVQKFLKRLRKAYRGKL------RYFVAGEYGE 114
GEI TL+ D+Q F+KRLR+ + +YF AGEYG
Sbjct 115 GEINYNA---------------TLVKKDIQDFIKRLRRFCEYNIDDNLHIKYFCAGEYGS 159
Query 115 QTARPHYHMILYGWQPTDLEHLYKIQHNG--YFTSKWLADLWGMGQIQIAQAVPETYRYV 172
QT RPHYHMILYG+ DL+ LYK+ +G Y+ S + LW G + I + +T Y
Sbjct 160 QTFRPHYHMILYGFPINDLK-LYKMSLDGYNYYNSATIDKLWKKGFVVIGEVTWDTCAYT 218
Query 173 AGYVTKKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQC-TN 231
A Y+ KK Y G A Y + F CMS KP + YY+++K +I+ YI T
Sbjct 219 ARYILKKQY---GSGAQIYKDYNILPEFTCMSTKPAIAREYYEDNKDKIFDSDYIFLGTK 275
Query 232 GKHAQI--PRYYEKMMEAENPQRLWRIKQNRQA-AAIAEN 268
K Q+ P+Y+EK++E EN K+ R A++AE+
Sbjct 276 EKSIQMKPPKYFEKLLEKENEDVF---KERRDLHASLAED 312
>gi|575096060|emb|CDL66943.1| unnamed protein product [uncultured bacterium]
Length=339
Score = 146 bits (368), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/302 (33%), Positives = 145/302 (48%), Gaps = 36/302 (12%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKTGEIMRKVQ 68
+PCGQCIGCRI W R LE +D+ + +F T TYD++HVP E
Sbjct 50 LPCGQCIGCRIDYSRQWANRCMLELQDH--DSAFFCTFTYDNDHVPISYYADKET----- 102
Query 69 YVWKPGEKRPESVQTLLYTDVQKFLKRLRKAYRGK-LRYFVAGEYGEQTARPHYHMILYG 127
GE +P TL D Q +KR+RK + +R+F AGEYG QT RPHYH I+YG
Sbjct 103 -----GEAKPS--LTLRKRDFQLLMKRIRKHFSDDHIRFFAAGEYGGQTLRPHYHAIIYG 155
Query 128 WQPTDLEHLYKIQHNG----YFTSKWLADLW------GMGQIQIAQAVPETYRYVAGYVT 177
DL ++ G Y+ S L W +G + + E+ Y A YV
Sbjct 156 LHLNDLVPYKTVKEGGVLYTYYNSPSLQKCWLDSDGKPIGFVVVGAVTWESCAYTARYVL 215
Query 178 KKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQCTN---GKH 234
KK G+ + Y E + F MS KPG+ +YY H ++++ +I + G+
Sbjct 216 KKQ---KGEASTVYQEFNLEPEFTLMSRKPGIARNYYDTH-PDLFQSDFINISTLKGGRK 271
Query 235 AQIPRYYEKMMEAENPQRLWRIKQNRQAA---AIAENRLKYENTDFAEQCKTKERVIKKQ 291
+ PRY+EK+ E + P+ + + R+AA A+A +L N D +ER +
Sbjct 272 FRPPRYFEKLFELDFPEEAAKRSEVRKAAGSNAMAA-KLAKTNLDPLSMLAVEERNFTDR 330
Query 292 MK 293
+K
Sbjct 331 IK 332
>gi|575096096|emb|CDL66976.1| unnamed protein product [uncultured bacterium]
Length=296
Score = 143 bits (361), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 93/271 (34%), Positives = 136/271 (50%), Gaps = 40/271 (15%)
Query 2 YQKNIMLIPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKTG 61
+ + +L+PCGQC+ CR+ + +W R E + + K F+TLTY+D+++P
Sbjct 29 FSHDYVLVPCGQCLECRLHRASEWALRCCHELKSHDKG--IFLTLTYNDDNLP------- 79
Query 62 EIMRKVQYVWKPGEKRPESVQTLLYTDVQKFLKRLRK--AYRG---KLRYFVAGEYGEQT 116
P TL+ VQ F+KRLR+ Y G K+RY AGEYG+ +
Sbjct 80 ----------------PNG--TLVKKHVQDFIKRLRRHIDYYGDCTKIRYLCAGEYGDLS 121
Query 117 ARPHYHMILYGWQPTD---LEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPETYRYVA 173
RPHYH++++G+ P+D L L KI N FTS L LWG G I E+ RY
Sbjct 122 LRPHYHLLVFGYYPSDPRLLHGLQKIGKNSLFTSPTLTKLWGKGHISFGAITFESARYTC 181
Query 174 GYVTKKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQCTNGK 233
Y KK G+ ++ Y + G F S + GLG + H ++ +GY+ NGK
Sbjct 182 QYALKKQ---TGEHSHYYVDRGVIPEFMICSNRNGLGYDFAVSHD-NMFERGYLT-MNGK 236
Query 234 HAQIPRYYEKMMEAENPQRLWRIKQNRQAAA 264
IPRYY+K+ E E P K+ R+ +A
Sbjct 237 KIGIPRYYQKICEREIPDYYASFKEMRRMSA 267
>gi|575094487|emb|CDL65854.1| unnamed protein product [uncultured bacterium]
Length=332
Score = 142 bits (359), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 141/284 (50%), Gaps = 28/284 (10%)
Query 3 QKNIMLIPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIV---K 59
Q ++++PC QC+GCR+ + +W R+ +E + E WF+TLTY+DEH+P
Sbjct 45 QGRLLMLPCRQCVGCRLSKSREWANRVVMEQLYHV--ESWFLTLTYNDEHLPRSFPVDEA 102
Query 60 TGEIMRKVQYVWKPGEKRPESVQ-TLLYTDVQKFLKRLRKAYRGKLRYFVAGEYGEQTAR 118
TGEI+ SV TL+ D+QKFLKRLRK KLR+F AGEYG R
Sbjct 103 TGEIL---------------SVHGTLVKEDLQKFLKRLRKNSGQKLRFFAAGEYGSLNMR 147
Query 119 PHYHMILYGWQPTDLEHLYKIQ-HNGYFTSKWLADLWGMGQIQIAQAVPETYRYVAGYVT 177
PHYH++++G DL+ L K + Y+TS L W G + + ++ YVA Y
Sbjct 148 PHYHLLIFGLHLEDLQLLRKSPLGDEYYTSSLLEKCWPFGFHILGRVTWQSAAYVARYTM 207
Query 178 KKMYEIDGQKANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWRQGYIQCTN---GKH 234
KK + G + Y + Q F MS +PGL YY++H +I+R + G+
Sbjct 208 KKASK--GYDKDLYKKAALQPEFQVMSNRPGLARQYYEDH-PDIFRYLSFNVSTPQGGRK 264
Query 235 AQIPRYYEKMMEAENPQRLWRIKQNRQAAAIAENRLKYENTDFA 278
Y+ K+ + + L+ + EN LK TD +
Sbjct 265 MYPSEYFRKLYRDGHERELFERSLRTREELEVENHLKNMLTDLS 308
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 139 bits (350), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/263 (35%), Positives = 140/263 (53%), Gaps = 38/263 (14%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKTGEIMRKVQ 68
+PCG+C+ CR +R+ W R++ EA +YP F+TLTYDDEH+P ++ GE + K
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYPFS--LFVTLTYDDEHMPTAMI--GEDLFK-- 68
Query 69 YVWKPGEKRPESVQTLLYTDVQKFLKRLRKAY-RGKLRYFVAGEYGEQTARPHYHMILYG 127
+V + D+Q F+KRLRK Y + +LRYF+ EYG Q RPHYHMIL+G
Sbjct 69 ----------STVGVVSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFG 118
Query 128 WQPTDLEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGQK 187
+ T +H G LA+ W G QA P T + +A YVTK MYE
Sbjct 119 FPFTG-------KHGG----DLLAECWKNG---FVQAHPLTTKEIA-YVTKYMYE-KSMV 162
Query 188 ANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWR---QGYIQCTNGKHAQIPRYYEKM 244
+ ++ + +PF S PG+G H+ +E + +R + Y++ NG +PRYY
Sbjct 163 PDILKDVKEYQPFMLCSRIPGIGYHFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADK 222
Query 245 MEAENPQRLWRIKQNRQAAAIAE 267
+ ++ + +K+ R+A I +
Sbjct 223 LYDDDMKEY--LKELREAFFINQ 243
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 138 bits (347), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 95/289 (33%), Positives = 150/289 (52%), Gaps = 42/289 (15%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKEEVWFITLTYDDEHVPGMIVKTGEIMRKVQ 68
+PCG+C+ CR +R+ W R++ EA +YP F+TLTYDDEH+P ++ GE + K
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYPFS--LFVTLTYDDEHIPTAMI--GEDLFKT- 69
Query 69 YVWKPGEKRPESVQTLLYTDVQKFLKRLRKAY-RGKLRYFVAGEYGEQTARPHYHMILYG 127
+V + D+Q F+KRLRK Y + +LRYF+ EYG Q RPHYHMIL+G
Sbjct 70 -----------TVGVVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFG 118
Query 128 WQPTDLEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGQK 187
+ T +H G LA+ W G QA P T + ++ YVTK MYE
Sbjct 119 FPFTG-------KHGG----DLLAECWKNG---FVQAHPLTTKEIS-YVTKYMYE-KSMI 162
Query 188 ANAYYELGQQKPFACMSLKPGLGDHYYQEHKAEIWR---QGYIQCTNGKHAQIPRYY-EK 243
+ + + +PF S PG+G H+ +E + +R + Y++ NG +PRYY +K
Sbjct 163 PDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADK 222
Query 244 MMEAENPQRLWRIKQNRQAAAIAENRLKYENTD-----FAEQCKTKERV 287
+ + + + L +++ + + Y NT A+Q +T+ ++
Sbjct 223 LYDDDMKEYLKELREAFFINQMQQEWYHYINTSPRLRYIADQLETESKL 271
>gi|313766924|gb|ADR80651.1| putative replication initiation protein [Uncultured Microviridae]
Length=285
Score = 135 bits (340), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 93/284 (33%), Positives = 138/284 (49%), Gaps = 53/284 (19%)
Query 7 MLIPCGQCIGCRIRQREDWTTRIELEARDYPKEE-VWFITLTYDDEHVPGMIVKTGEIMR 65
M + C QCIGCR+ W +RIE E+ Y FITLTYD+EH+P
Sbjct 1 MEVACSQCIGCRLDHAGMWASRIEHESSLYDDSNGNCFITLTYDEEHLP----------- 49
Query 66 KVQYVWKPGEKRPESVQTLLYTDVQKFLKRLRKAYRGKLRYFVAGEYGE----------- 114
W +L + QKF+KRLRK Y K+RY+ GEYGE
Sbjct 50 ---QDW-----------SLDKSHFQKFMKRLRKRYPQKIRYYHCGEYGENCRHGIHTTLC 95
Query 115 ---QTARPHYHMILYGWQPTDLEHLYKIQHNGYFTSKWLADLWGMGQIQIAQAVPETYRY 171
RPHYH IL+ D + + + +FTS L ++WG G Q+ ++ Y
Sbjct 96 PGCNVGRPHYHAILFNIDFHDRVLVGQSKGIPHFTSDTLTEIWGHGFTQVGDLTAQSAGY 155
Query 172 VAGYVTKKMYEIDGQKANAYY--------ELGQQKP-FACMSLKPGLGDHYYQEHKAEIW 222
VA Y KK + G +A +Y E+ +P +A MS KPG+G +Y+++K +++
Sbjct 156 VARYALKK---VTGTQAEDHYRSIDLTTGEVTYVRPEYATMSRKPGIGKEWYEKYKKDMY 212
Query 223 RQGYIQCTNGKHAQ-IPRYYEKMMEAENPQRLWRIKQNRQAAAI 265
G IPR+Y+K+ME E+P++L +K+ R+ A+
Sbjct 213 PSNQTPSVGGGVKNGIPRFYDKLMEKEDPEQLEIVKEKRKEFAL 256
Lambda K H a alpha
0.321 0.136 0.431 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1564156586088