bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-21_CDS_annotation_glimmer3.pl_2_4
Length=322
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312922|ref|WP_022044634.1| putative replication initiation... 116 2e-26
gi|547920048|ref|WP_022322419.1| putative replication protein 97.1 6e-20
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 95.1 2e-19
gi|492501778|ref|WP_005867316.1| hypothetical protein 94.7 4e-19
gi|609718275|emb|CDN73649.1| conserved hypothetical protein 92.8 1e-18
gi|575094374|emb|CDL65755.1| unnamed protein product 83.6 1e-14
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 71.2 4e-11
gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 72.4 5e-11
gi|575094557|emb|CDL65915.1| unnamed protein product 67.0 3e-09
gi|575094569|emb|CDL65925.1| unnamed protein product 59.3 1e-06
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 116 bits (290), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 131/270 (49%), Gaps = 41/270 (15%)
Query 2 CLYPKLILNKRYCSTKKNK---------GVIPPCPDERLRYVTAACGECYECRKQKQRAW 52
C PK+I+N+RY + + G P PD L CG C+ C+K +
Sbjct 3 CEQPKVIVNRRYANMTNTEIVNYAKVYYGCFWP-PDYILE---VPCGYCHSCQKSYNNQY 58
Query 53 VTRMTEELKQNPNAG--FYTLTIDDEHYKKLSKECKSKDENTIATYALRMFLERIRKKLK 110
R+ EL++ P F TLT +D+ +K SK+ A+R+FL+R RK
Sbjct 59 RIRLLYELRKYPPGTCLFVTLTFNDDSLEKFSKDTNK---------AVRLFLDRFRKVYG 109
Query 111 KSVKHWCITELGHEKSERLHLHGIFWGC-------------GIESIIREKWQNGFIFTGN 157
K ++HW + E G R H HGI + G ++ W+ GF+F G
Sbjct 110 KQIRHWFVCEFGTLHG-RPHYHGILFNVPQALIDGYDSDMPGHHPLLASCWKYGFVFVG- 167
Query 158 YVSQRTINYITKYMLKTDLDHPKFKGKVLCSAGIGKGYEKTTNAKNNKFKGEDTNETYRL 217
YVS T +YITKY+ K+ ++ K + +V+ S GIG Y T + +K G + + +
Sbjct 168 YVSDETCSYITKYVTKS-INGDKVRPRVISSFGIGSNYLNTEESSLHKL-GNQRYQPFMV 225
Query 218 PNGAKINLPIYYRNKIYSEKEREALFLSKV 247
NG + +P YY NKI+S+ +++ + + ++
Sbjct 226 LNGFQQAMPRYYYNKIFSDVDKQNMVVDRL 255
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 97.1 bits (240), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/269 (28%), Positives = 127/269 (47%), Gaps = 27/269 (10%)
Query 36 AACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHY--KKLSKECKSKDENTI 93
CG C CR+ K+++WV R+ E K+ P + F TLT DDEH +++ + + +
Sbjct 10 VPCGWCVNCRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAVV 69
Query 94 ATYALRMFLERIRKKLKKSVKHWCITELGHEKSERLHLHGIFWGCGIE-----SIIREKW 148
+ +++F++R+RKK + + +T K+ R H H I +G ++ E W
Sbjct 70 SKRDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAGDLLAECW 129
Query 149 QNGFIFTGNYVSQRTINYITKYMLKTDL------DHPKFKGKVLCS--AGIGKGYEKTTN 200
QNGF+ + ++ + I Y+ KYM + + D K+K +LCS GIG G+ K
Sbjct 130 QNGFV-QAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPGIGFGFMK--- 185
Query 201 AKNNKFKGEDTNETYRLPNGAKINLPIYYRNKIYSE-------KEREALFLSKVEKGIVW 253
A +F + R G K+ +P YY +K+Y + + RE F K+ +
Sbjct 186 ADIIEFYRRHPRDYVRAWAGHKMAMPRYYADKLYDDDMKAFLKEMREEFFRHKMFNEWID 245
Query 254 ICG-EKCLIDDYDSYKNLLEYHRNRASRL 281
C E ++ D + EY + RL
Sbjct 246 YCARENPILTDLMQLEQREEYEKRMNERL 274
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 95.1 bits (235), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 119/244 (49%), Gaps = 40/244 (16%)
Query 35 TAACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKK--LSKECKSKDENT 92
CG C CRK K+++WV R+ E + P + F TLT DDEH + ++
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGV 73
Query 93 IATYALRMFLERIRKKLKK-SVKHWCITELGHEKSERLHLHGIFWGCGIES-----IIRE 146
++ +++F++R+RKK + ++++ +E G + R H H I +G ++ E
Sbjct 74 VSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGG-RPHYHMILFGFPFTGKHGGDLLAE 132
Query 147 KWQNGFIFTGNYVSQRTINYITKYMLKTDLDHPKFKGK------VLCSAGIGKGYEKTTN 200
W+NGF+ + ++ + I+Y+TKYM + + KG +LCS G GY
Sbjct 133 CWKNGFV-QAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYH---- 187
Query 201 AKNNKFKGEDTNETYRLP--------NGAKINLPIYYRNKIYSE-------KEREALFLS 245
F E + YRL NG ++ +P YY +K+Y + + REA F++
Sbjct 188 -----FLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFIN 242
Query 246 KVEK 249
++++
Sbjct 243 QMQQ 246
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 94.7 bits (234), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 120/244 (49%), Gaps = 40/244 (16%)
Query 35 TAACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKK--LSKECKSKDENT 92
CG C CRK K+++WV R+ E + P + F TLT DDEH + ++
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHMPTAMIGEDLFKSTVGV 73
Query 93 IATYALRMFLERIRKKLKK-SVKHWCITELGHEKSERLHLHGIFWGCGIE-----SIIRE 146
++ +++F++R+RKK + ++++ +E G + R H H I +G ++ E
Sbjct 74 VSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGG-RPHYHMILFGFPFTGKHGGDLLAE 132
Query 147 KWQNGFIFTGNYVSQRTINYITKYMLKTDL------DHPKFKGKVLCSAGIGKGYEKTTN 200
W+NGF+ + ++ + I Y+TKYM + + D +++ +LCS G GY
Sbjct 133 CWKNGFV-QAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIPGIGYH---- 187
Query 201 AKNNKFKGEDTNETYRLP--------NGAKINLPIYYRNKIYSE-------KEREALFLS 245
F E + YRL NG ++ +P YY +K+Y + + REA F++
Sbjct 188 -----FLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFIN 242
Query 246 KVEK 249
++++
Sbjct 243 QMQQ 246
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 92.8 bits (229), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 19/210 (9%)
Query 38 CGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKKLSKECKSKDENTIATYA 97
CG+C ECRK + +W R+TEELK + +A F TLT D + S D
Sbjct 25 CGKCLECRKARTNSWFARLTEELKVSKSAHFVTLTYSDVYLPYSDNGLISLDYRD----- 79
Query 98 LRMFLERIRKKLKKSVKHWCITELGHEKSERLHLHGIFWGC-GIESIIREKWQNGFIFTG 156
++F++R RK K +K++ + E G ++ R H H I +G I++ + E W+ G + G
Sbjct 80 FQLFMKRARKLQKSKIKYFLVGEYG-AQTYRPHYHAIVFGVENIDAFLGE-WRMGNVHAG 137
Query 157 NYVSQRTINYITKYMLKTDLDHPKFKG-------KVLCSAGIGKGYEKTTNAKNNKFKGE 209
V+ ++I Y KY K+ + P K L S G+G + + K K +
Sbjct 138 T-VTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLTESMIKYYK---D 193
Query 210 DTNETYRLPNGAKINLPIYYRNKIYSEKER 239
D + ++ L G I LP YYR+K++S+ E+
Sbjct 194 DVSRSFSLLGGTTIALPRYYRDKVFSDIEK 223
>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/154 (32%), Positives = 73/154 (47%), Gaps = 14/154 (9%)
Query 35 TAACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKKLSKECKSKDENTIA 94
CG CY+C+ K W R +EEL N + FYTLT+D
Sbjct 25 VVPCGHCYDCKSAKTTDWQVRCSEELNNNSQSYFYTLTLDPRFIDTYGTLPDGSPRYVFN 84
Query 95 TYALRMFLERIRKKLKK---SVKHWCITELGHEKSERLHLHGIFWGCG------IESIIR 145
+++FL+R+RK L K S+K+ + ELG E + R H H IF+ ++R
Sbjct 85 KRHIQLFLKRLRKALSKYNISLKYVIVGELG-ETTHRPHYHAIFYLSSSVNPFKFRIMVR 143
Query 146 EKWQNGFIFTGN----YVSQRTINYITKYMLKTD 175
W GFI +G+ ++ ++Y+ KYM KTD
Sbjct 144 NSWSLGFIKSGDNNGIILNNDAVSYVIKYMHKTD 177
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/223 (27%), Positives = 107/223 (48%), Gaps = 40/223 (18%)
Query 56 MTEELKQNPNAGFYTLTIDDEHYKK--LSKECKSKDENTIATYALRMFLERIRKKLKK-S 112
M E + P + F TLT DDEH + ++ ++ +++F++R+RKK +
Sbjct 1 MQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGVVSKRDIQLFMKRLRKKYAQYR 60
Query 113 VKHWCITELGHEKSERLHLHGIFWGCGIES-----IIREKWQNGFIFTGNYVSQRTINYI 167
++++ +E G + R H H I +G ++ E W+NGF+ + ++ + I+Y+
Sbjct 61 LRYFLTSEYGSQGG-RPHYHMILFGFPFTGKHGGDLLAECWKNGFV-QAHPLTTKEISYV 118
Query 168 TKYMLKTDLDHPKFKGK------VLCSAGIGKGYEKTTNAKNNKFKGEDTNETYRLP--- 218
TKYM + + KG +LCS G GY F E + YRL
Sbjct 119 TKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYH---------FLREQILDFYRLHPRD 169
Query 219 -----NGAKINLPIYYRNKIYSE--KE-----REALFLSKVEK 249
NG ++ +P YY +K+Y + KE REA F++++++
Sbjct 170 YVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQ 212
>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345
Score = 72.4 bits (176), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/240 (28%), Positives = 108/240 (45%), Gaps = 29/240 (12%)
Query 27 PDERLRYVTAACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKKLSKECK 86
P L V CG C C++++ +WV R+ +E Q+ NA F TLT D
Sbjct 11 PKAALEKVPVPCGRCPPCKRRRVDSWVFRLLQEELQHENASFVTLTYDTRFVPISKNGFM 70
Query 87 SKDENTIATYALRMFLERIRKKLK-KSVKHWCITELGHEKSERLHLHGIFWGCGIESIIR 145
+ D Y ++R+RK + + +K++ E G ++ R H H I +G +S+
Sbjct 71 TLDRGEFPRY-----MKRLRKLVPGRKLKYYMCGEYGSQRF-RPHYHAIIFGVPQDSLFA 124
Query 146 EKWQ-NG---FIFTGNYVSQRTINYITKYMLKT--------DLDHPKFKGKVLCSAGIGK 193
+ W NG V+ ++I Y KY+ K+ D P+F L S G+G
Sbjct 125 DAWTLNGDSLGGVVVGTVTGKSIAYTMKYIDKSTWKQKHGRDDRVPEFS---LMSKGMGV 181
Query 194 GYEKTTNAKNNKFKGEDTNETY-RLPNGAKINLPIYYRNKIYSE---KEREALFLSKVEK 249
Y + ++ ED + + G++I +P YYR KIYS+ K++ L VE+
Sbjct 182 SY---LTPQMVEYHKEDISRLFCTREGGSRIAMPRYYRQKIYSDDDLKKQVVLIAESVER 238
>gi|575094557|emb|CDL65915.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/217 (25%), Positives = 94/217 (43%), Gaps = 39/217 (18%)
Query 1 MCLYPKLILNKRYCSTKKNKGVIPPCPDERLRYVTAACGECYECRKQKQRAWVTRMTEEL 60
+C +P + +K + P P + LR V CG+C CR + +R W +R+ E+
Sbjct 2 LCRHP-------FVRDRKGESFTSPHPKDWLRGVPFGCGKCLACRVKTRREWTSRLILEM 54
Query 61 KQNPNAGFYTLTIDDEHYKKLSKECKSKDENTIATYALRMFLERIRKKL------KKSVK 114
+ + F TLT +++ T++ L++FL+R+R+ L K ++
Sbjct 55 LGHDSGAFVTLTYSEDYV-----PVTESGHRTLSLRDLQLFLKRLRRNLEERKRSKHPIR 109
Query 115 HWCITELGHEKSERLHLHGIFWGCG------IESIIREKW-------------QNGFIFT 155
++ E G ++R H H IF+G I+S+ W Q G I T
Sbjct 110 YYACGEYGTRGTQRPHYHIIFFGVSDLDLDFIKSVY-AAWSEPAKYGQKGQTPQFGNI-T 167
Query 156 GNYVSQRTINYITKYMLKTDLDHPKFKGKVLCSAGIG 192
++ +T+ Y Y +K + K V+ SA IG
Sbjct 168 IEPLNAKTVAYTAGYNMKKLISPKKVHKVVVSSAEIG 204
>gi|575094569|emb|CDL65925.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 77/169 (46%), Gaps = 34/169 (20%)
Query 33 YVTAACGECYECRKQKQRAWVTRMTEELKQNPNAGFYTLTIDDEHYKKLSKECKSKDENT 92
++ CG+C CR++ W R+ EL+ + + F TLT DD+H + S E
Sbjct 67 FIEIPCGKCISCRRRYAALWTDRLMLELQDHKESCFITLTYDDDHICCVD----SPIEEN 122
Query 93 IATYA-----LRMFLERIRKKL------KKSVKHWCITELGHEKSERLHLHGIFWGCGIE 141
++ Y L+ F +R+R+ L +K ++++ E G + + R H H I +G
Sbjct 123 VSMYTLNKVHLQCFWKRLRQYLVRHVEPEKRIRYFACGEYG-DTTFRPHYHAILFGWRPT 181
Query 142 SIIREK-----------------WQNGFIFTGNYVSQRTINYITKYMLK 173
+I+ K WQNG + G+ V+ + Y+ +Y LK
Sbjct 182 DLIQFKKNFQNDTLYLSKSLASIWQNGNVMVGD-VTPESCRYVARYCLK 229
Lambda K H a alpha
0.319 0.135 0.419 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1793877651450