bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-23_CDS_annotation_glimmer3.pl_2_4
Length=298
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094436|emb|CDL65809.1| unnamed protein product 256 1e-79
gi|575094418|emb|CDL65793.1| unnamed protein product 189 3e-53
gi|575094569|emb|CDL65925.1| unnamed protein product 161 5e-43
gi|575094494|emb|CDL65868.1| unnamed protein product 155 6e-41
gi|575096060|emb|CDL66943.1| unnamed protein product 146 2e-37
gi|575096096|emb|CDL66976.1| unnamed protein product 143 1e-36
gi|492501778|ref|WP_005867316.1| hypothetical protein 140 1e-35
gi|575094487|emb|CDL65854.1| unnamed protein product 140 2e-35
gi|530695371|gb|AGT39925.1| replication initiator 138 9e-35
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 137 1e-34
>gi|575094436|emb|CDL65809.1| unnamed protein product [uncultured bacterium]
Length=340
Score = 256 bits (654), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 141/282 (50%), Positives = 182/282 (65%), Gaps = 14/282 (5%)
Query 6 VMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMR 65
VMLIPCG+CIGCRIR ++DW TR+ELEAR Y K WF+TLTY DD +P +I TGE++
Sbjct 46 VMLIPCGKCIGCRIRAKQDWATRLELEARAY-KGRAWFVTLTYRDDTIPLLIRNTGELIE 104
Query 66 KVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRK-----AYRGK-LRYFVAGEYGEQTARP 119
W G P + L +D+ KF KRLRK GK LRYF AGEYGEQT RP
Sbjct 105 GGVSMWSRGADVPEQINTLNMDDVTKFWKRLRKYQTTEPDMGKELRYFYAGEYGEQTGRP 164
Query 120 HYHMILYGWKPTDLENLYKIQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKK 179
HYH I++G + DL+ + N YY S L+ +WG G + IA + P TY YVAGYVTKK
Sbjct 165 HYHAIIFGLEIPDLKKV--PGRNQYYKSAILEKIWGKGNVTIAYSEPGTYNYVAGYVTKK 222
Query 180 MYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTNGKRAQIPR 239
MY D K+ Y LG T P+ACMS KPG+G + +++ ++W Q YIQ GK A IPR
Sbjct 223 MYGNDTKE---YQNLGLTAPYACMSRKPGIGMPWLEQNLDKLWEQDYIQLA-GKTAPIPR 278
Query 240 YYEKQMEAENPQRLWRIKQNRQKNAMQQKRLQIENQDYKTVL 281
++K +EA +P+RLW+ KQ RQK+A+ LQ +Q +T+L
Sbjct 279 AFDKMLEATDPERLWKKKQARQKSAINGA-LQAMSQTDQTLL 319
>gi|575094418|emb|CDL65793.1| unnamed protein product [uncultured bacterium]
Length=367
Score = 189 bits (479), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 106/250 (42%), Positives = 155/250 (62%), Gaps = 16/250 (6%)
Query 7 MLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRK 66
+L PCGQC+ CRI+ +W R ELE +Y K + F+TLTYD++HVP + +TGEI R
Sbjct 42 VLTPCGQCLACRIQYAANWAARCELET-NYHKQSI-FLTLTYDEEHVPVLNKETGEIYRG 99
Query 67 VQYTWKPGEKRPSSV--QILLYE-DIQKFLKRLRKAYRGK-----LRYFVAGEYGEQTAR 118
V+ P E ++ +Y+ D+QKF+KRLRKA + + Y+++GEYG++T R
Sbjct 100 VR---NPAEYVAGVTLERMTVYKPDVQKFIKRLRKAAEKEGLTDHIMYYLSGEYGDKTGR 156
Query 119 PHYHMILYGWKPTDLENLYKIQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTK 178
PHYH+I+YG + D E++ + +TSEWLKG+WGMG I+I E+ +YVA YV K
Sbjct 157 PHYHLIVYGLEVPDAEHIGSRRGYDRFTSEWLKGIWGMGLIEIGSVTYESCQYVARYVIK 216
Query 179 KMYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTNGKRAQIP 238
K GK+A Y + G F MSLKP +G +Y++EHK EI+ I +G+ + P
Sbjct 217 KR---KGKEAKEYKDAGIMPEFVQMSLKPAIGQRYWEEHKDEIYSLDQINLASGRTVKPP 273
Query 239 RYYEKQMEAE 248
RY++K + E
Sbjct 274 RYFDKLEDQE 283
>gi|575094569|emb|CDL65925.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 161 bits (408), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/278 (36%), Positives = 141/278 (51%), Gaps = 30/278 (11%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRKVQ 68
IPCG+CI CR R WT R+ LE +D+ E FITLTYDDDH+ +
Sbjct 70 IPCGKCISCRRRYAALWTDRLMLELQDHK--ESCFITLTYDDDHICCVD----------- 116
Query 69 YTWKPGEKRPSSVQILLYEDIQKFLKRLRKAY------RGKLRYFVAGEYGEQTARPHYH 122
P E+ S+ L +Q F KRLR+ ++RYF GEYG+ T RPHYH
Sbjct 117 ---SPIEEN-VSMYTLNKVHLQCFWKRLRQYLVRHVEPEKRIRYFACGEYGDTTFRPHYH 172
Query 123 MILYGWKPTDLENLYK-IQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKMY 181
IL+GW+PTDL K Q++ Y S+ L +W G + + PE+ RYVA Y KK
Sbjct 173 AILFGWRPTDLIQFKKNFQNDTLYLSKSLASIWQNGNVMVGDVTPESCRYVARYCLKKA- 231
Query 182 EIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTN---GKRAQIP 238
G + Y LG F MS KPG+ KY+ +H EI + I + G QIP
Sbjct 232 --TGFDSEIYERLGVLPEFVTMSRKPGIARKYFDDHYDEIIKYKTINLSTLKGGMSMQIP 289
Query 239 RYYEKQMEAENPQRLWRIKQNRQKNAMQQKRLQIENQD 276
Y+ + +E + + IK++ ++ A+ + ++N D
Sbjct 290 PYFIRLIEDIDSELFKEIKRSNKQAALNHQEALMKNTD 327
>gi|575094494|emb|CDL65868.1| unnamed protein product [uncultured bacterium]
Length=348
Score = 155 bits (393), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 98/257 (38%), Positives = 137/257 (53%), Gaps = 35/257 (14%)
Query 7 MLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP---GMIVKTGEI 63
++IPCG+C+GCR+ W R LE+ + +F+TLTYDDD++P + TGEI
Sbjct 60 IIIPCGKCVGCRLAYSRQWADRCMLESSYHT--HSYFLTLTYDDDNLPLSESINQDTGEI 117
Query 64 MRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYRGKL------RYFVAGEYGEQTA 117
L+ +DIQ F+KRLR+ + +YF AGEYG QT
Sbjct 118 NYNAT---------------LVKKDIQDFIKRLRRFCEYNIDDNLHIKYFCAGEYGSQTF 162
Query 118 RPHYHMILYGWKPTDLENLYKIQHNG--YYTSEWLKGLWGMGQIQIAQAVPETYRYVAGY 175
RPHYHMILYG+ DL+ LYK+ +G YY S + LW G + I + +T Y A Y
Sbjct 163 RPHYHMILYGFPINDLK-LYKMSLDGYNYYNSATIDKLWKKGFVVIGEVTWDTCAYTARY 221
Query 176 VTKKMYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQC-TNGKR 234
+ KK Y G A Y + F CMS KP + +YY+++K +I+ YI T K
Sbjct 222 ILKKQY---GSGAQIYKDYNILPEFTCMSTKPAIAREYYEDNKDKIFDSDYIFLGTKEKS 278
Query 235 AQI--PRYYEKQMEAEN 249
Q+ P+Y+EK +E EN
Sbjct 279 IQMKPPKYFEKLLEKEN 295
>gi|575096060|emb|CDL66943.1| unnamed protein product [uncultured bacterium]
Length=339
Score = 146 bits (368), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 139/290 (48%), Gaps = 36/290 (12%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRKVQ 68
+PCGQCIGCRI W R LE +D+ D +F T TYD+DHVP E
Sbjct 50 LPCGQCIGCRIDYSRQWANRCMLELQDH--DSAFFCTFTYDNDHVPISYYADKET----- 102
Query 69 YTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYRGK-LRYFVAGEYGEQTARPHYHMILYG 127
GE +PS L D Q +KR+RK + +R+F AGEYG QT RPHYH I+YG
Sbjct 103 -----GEAKPSLT--LRKRDFQLLMKRIRKHFSDDHIRFFAAGEYGGQTLRPHYHAIIYG 155
Query 128 WKPTDLENLYKIQHNG----YYTSEWLKGLW------GMGQIQIAQAVPETYRYVAGYVT 177
DL ++ G YY S L+ W +G + + E+ Y A YV
Sbjct 156 LHLNDLVPYKTVKEGGVLYTYYNSPSLQKCWLDSDGKPIGFVVVGAVTWESCAYTARYVL 215
Query 178 KKMYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTN---GKR 234
KK G+ + Y E F MS KPG+ YY H ++++ +I + G++
Sbjct 216 KKQ---KGEASTVYQEFNLEPEFTLMSRKPGIARNYYDTH-PDLFQSDFINISTLKGGRK 271
Query 235 AQIPRYYEKQMEAENPQRLWRIKQNRQ---KNAMQQKRLQIENQDYKTVL 281
+ PRY+EK E + P+ + + R+ NAM K L N D ++L
Sbjct 272 FRPPRYFEKLFELDFPEEAAKRSEVRKAAGSNAMAAK-LAKTNLDPLSML 320
>gi|575096096|emb|CDL66976.1| unnamed protein product [uncultured bacterium]
Length=296
Score = 143 bits (361), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/271 (34%), Positives = 138/271 (51%), Gaps = 40/271 (15%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTG 61
++ +L+PCGQC+ CR+ + +W R E + + D+ F+TLTY+DD++P G
Sbjct 29 FSHDYVLVPCGQCLECRLHRASEWALRCCHELKSH--DKGIFLTLTYNDDNLP----PNG 82
Query 62 EIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA--YRG---KLRYFVAGEYGEQT 116
+++K +Q F+KRLR+ Y G K+RY AGEYG+ +
Sbjct 83 TLVKK---------------------HVQDFIKRLRRHIDYYGDCTKIRYLCAGEYGDLS 121
Query 117 ARPHYHMILYGWKPTD---LENLYKIQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVA 173
RPHYH++++G+ P+D L L KI N +TS L LWG G I E+ RY
Sbjct 122 LRPHYHLLVFGYYPSDPRLLHGLQKIGKNSLFTSPTLTKLWGKGHISFGAITFESARYTC 181
Query 174 GYVTKKMYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTNGK 233
Y KK G+ ++ Y + G F S + GLG + H ++ +GY+ NGK
Sbjct 182 QYALKKQ---TGEHSHYYVDRGVIPEFMICSNRNGLGYDFAVSHD-NMFERGYLT-MNGK 236
Query 234 RAQIPRYYEKQMEAENPQRLWRIKQNRQKNA 264
+ IPRYY+K E E P K+ R+ +A
Sbjct 237 KIGIPRYYQKICEREIPDYYASFKEMRRMSA 267
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 140 bits (353), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 142/265 (54%), Gaps = 37/265 (14%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRKVQ 68
+PCG+C+ CR +R+ W R++ EA +YP F+TLTYDD+H+P ++ GE + K
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYPFS--LFVTLTYDDEHMPTAMI--GEDLFK-- 68
Query 69 YTWKPGEKRPSSVQILLYEDIQKFLKRLRKAY-RGKLRYFVAGEYGEQTARPHYHMILYG 127
S+V ++ DIQ F+KRLRK Y + +LRYF+ EYG Q RPHYHMIL+G
Sbjct 69 ----------STVGVVSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFG 118
Query 128 WKPTDLENLYKIQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGKK 187
+ T G + + L W G QA P T + +A YVTK MYE
Sbjct 119 FPFT-----------GKHGGDLLAECWKNG---FVQAHPLTTKEIA-YVTKYMYE-KSMV 162
Query 188 ANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWR---QGYIQCTNGKRAQIPRYY-EK 243
+ ++ + +PF S PG+G + +E + +R + Y++ NG R +PRYY +K
Sbjct 163 PDILKDVKEYQPFMLCSRIPGIGYHFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADK 222
Query 244 QMEAENPQRLWRIKQNRQKNAMQQK 268
+ + + L +++ N MQQ+
Sbjct 223 LYDDDMKEYLKELREAFFINQMQQE 247
>gi|575094487|emb|CDL65854.1| unnamed protein product [uncultured bacterium]
Length=332
Score = 140 bits (353), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 144/277 (52%), Gaps = 30/277 (11%)
Query 5 KVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIV---KTG 61
+++++PC QC+GCR+ + +W R+ +E + E WF+TLTY+D+H+P TG
Sbjct 47 RLLMLPCRQCVGCRLSKSREWANRVVMEQLYHV--ESWFLTLTYNDEHLPRSFPVDEATG 104
Query 62 EIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYRGKLRYFVAGEYGEQTARPHY 121
EI+ S L+ ED+QKFLKRLRK KLR+F AGEYG RPHY
Sbjct 105 EIL--------------SVHGTLVKEDLQKFLKRLRKNSGQKLRFFAAGEYGSLNMRPHY 150
Query 122 HMILYGWKPTDLENLYKIQ-HNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKM 180
H++++G DL+ L K + YYTS L+ W G + + ++ YVA Y KK
Sbjct 151 HLLIFGLHLEDLQLLRKSPLGDEYYTSSLLEKCWPFGFHILGRVTWQSAAYVARYTMKKA 210
Query 181 YEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWR--QGYIQCTNGKRAQIP 238
+ G + Y + F MS +PGL +YY++H +I+R + G R P
Sbjct 211 SK--GYDKDLYKKAALQPEFQVMSNRPGLARQYYEDH-PDIFRYLSFNVSTPQGGRKMYP 267
Query 239 RYYEKQMEAENPQRLWRIKQNRQKNAMQQKRLQIENQ 275
Y +++ + +R + +++ ++ L++EN
Sbjct 268 SEYFRKLYRDGHER-----ELFERSLRTREELEVENH 299
>gi|530695371|gb|AGT39925.1| replication initiator [Marine gokushovirus]
Length=316
Score = 138 bits (348), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 93/268 (35%), Positives = 135/268 (50%), Gaps = 38/268 (14%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRKVQ 68
IPC QCIGCR+ + W R EA+ Y + FITLTY+ DH+P
Sbjct 49 IPCNQCIGCRLEKSRQWALRCTHEAKLYKNNS--FITLTYNSDHLPLT------------ 94
Query 69 YTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYRGK-LRYFVAGEYGEQTARPHYHMILYG 127
+S+ L Q FLKRLRK Y K +R++ GEYG+ RPHYH +L+
Sbjct 95 ---------NNSLPTLNLRHFQLFLKRLRKKYSNKTIRFYHCGEYGDMNHRPHYHALLFN 145
Query 128 WKPTDLENLYKIQHN-GYYTSEWLKGLW-------GMGQIQIAQAVPETYRYVAGYVTKK 179
D + L+KI + YYTSE L GLW MG I ++ YVA Y KK
Sbjct 146 HDFED-KKLWKIHKDQNYYTSEVLDGLWTDPKTKSNMGFSTIGDLTFDSAAYVARYCLKK 204
Query 180 MYEIDGKKANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTNGKRAQIPR 239
I GK A YY+ G+ +A MS +PG+G+ + + K++++ G+I G++ Q P+
Sbjct 205 ---ITGKNAEDYYQ-GRVPEYATMSRRPGIGNGWLDKFKSDVYPSGFI-IHEGQKMQPPK 259
Query 240 YYEKQMEAENPQRLWRIKQNRQKNAMQQ 267
YY++ + + + R K R + A +
Sbjct 260 YYDRVTNETDEKAVRRSKILRMQEAKKH 287
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 137 bits (346), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 141/265 (53%), Gaps = 37/265 (14%)
Query 9 IPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVPGMIVKTGEIMRKVQ 68
+PCG+C+ CR +R+ W R++ EA +YP F+TLTYDD+H+P ++ GE + K
Sbjct 15 VPCGRCVNCRKNKRQSWVYRLQAEADEYPFS--LFVTLTYDDEHIPTAMI--GEDLFK-- 68
Query 69 YTWKPGEKRPSSVQILLYEDIQKFLKRLRKAY-RGKLRYFVAGEYGEQTARPHYHMILYG 127
++V ++ DIQ F+KRLRK Y + +LRYF+ EYG Q RPHYHMIL+G
Sbjct 69 ----------TTVGVVSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFG 118
Query 128 WKPTDLENLYKIQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGKK 187
+ T G + + L W G QA P T + ++ YVTK MYE
Sbjct 119 FPFT-----------GKHGGDLLAECWKNG---FVQAHPLTTKEIS-YVTKYMYE-KSMI 162
Query 188 ANAYYELGQTKPFACMSLKPGLGDKYYQEHKAEIWR---QGYIQCTNGKRAQIPRYY-EK 243
+ + + +PF S PG+G + +E + +R + Y++ NG R +PRYY +K
Sbjct 163 PDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADK 222
Query 244 QMEAENPQRLWRIKQNRQKNAMQQK 268
+ + + L +++ N MQQ+
Sbjct 223 LYDDDMKEYLKELREAFFINQMQQE 247
Lambda K H a alpha
0.320 0.137 0.435 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1564156586088