bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-22_CDS_annotation_glimmer3.pl_2_6
Length=345
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312922|ref|WP_022044634.1| putative replication initiation... 115 6e-26
gi|609718275|emb|CDN73649.1| conserved hypothetical protein 87.8 8e-17
gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 75.1 3e-12
gi|492501778|ref|WP_005867316.1| hypothetical protein 74.3 6e-12
gi|547920048|ref|WP_022322419.1| putative replication protein 71.6 5e-11
gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 67.8 2e-09
gi|575094374|emb|CDL65755.1| unnamed protein product 68.2 2e-09
gi|313766930|gb|ADR80656.1| putative replication initiation protein 53.9 8e-05
gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 51.6 3e-04
gi|575094557|emb|CDL65915.1| unnamed protein product 48.5 0.004
>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii
CAG:68]
Length=320
Score = 115 bits (288), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/271 (32%), Positives = 136/271 (50%), Gaps = 43/271 (16%)
Query 2 CLYPKLIPNKRYLPTKKN----------GGVPPVCPDERLRYVTAACGDCYECRKQKQRQ 51
C PK+I N+RY G P PD L CG C+ C+K Q
Sbjct 3 CEQPKVIVNRRYANMTNTEIVNYAKVYYGCFWP--PDYILE---VPCGYCHSCQKSYNNQ 57
Query 52 WVVRMSEENRQTP--NAYFLTLTIddksykqlkqkyklkdnndIATKAIRLCLERVRKLT 109
+ +R+ E R+ P F+TLT +D S ++ + KA+RL L+R RK+
Sbjct 58 YRIRLLYELRKYPPGTCLFVTLTFNDDSLEKFSKD---------TNKAVRLFLDRFRKVY 108
Query 110 GKSVKHWFITELGHEKTERLHLHGIVWGL-------------GNGEKITNNWKYGITFTG 156
GK ++HWF+ E G R H HGI++ + G+ + + WKYG F G
Sbjct 109 GKQIRHWFVCEFG-TLHGRPHYHGILFNVPQALIDGYDSDMPGHHPLLASCWKYGFVFVG 167
Query 157 YFVNEKTINYITKYMLKIDEKHPKFRGKVLCSAGIGSGYLKREDAKRHVYIPGKTNESYR 216
Y V+++T +YITKY+ K K R +V+ S GIGS YL E++ H + + + +
Sbjct 168 Y-VSDETCSYITKYVTK-SINGDKVRPRVISSFGIGSNYLNTEESSLHK-LGNQRYQPFM 224
Query 217 MKNGGKLNLPIYYRNKIFTEEEREKLFLDKI 247
+ NG + +P YY NKIF++ +++ + +D++
Sbjct 225 VLNGFQQAMPRYYYNKIFSDVDKQNMVVDRL 255
>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265
Score = 87.8 bits (216), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/209 (31%), Positives = 100/209 (48%), Gaps = 17/209 (8%)
Query 38 CGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlkqkyklkdnndIATKA 97
CG C ECRK + W R++EE + + +A+F+TLT Y + Y + +
Sbjct 25 CGKCLECRKARTNSWFARLTEELKVSKSAHFVTLT-----YSDVYLPYSDNGLISLDYRD 79
Query 98 IRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLGNGEKITNNWKYGITFTGY 157
+L ++R RKL +K++ + E G +T R H H IV+G+ N + W+ G G
Sbjct 80 FQLFMKRARKLQKSKIKYFLVGEYG-AQTYRPHYHAIVFGVENIDAFLGEWRMGNVHAGT 138
Query 158 FVNEKTINYITKYMLK-IDE------KHPKFRGKVLCSAGIGSGYLKREDAKRHVYIPGK 210
V K+I Y KY K I E + K L S G+G +L K Y
Sbjct 139 -VTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLTESMIK---YYKDD 194
Query 211 TNESYRMKNGGKLNLPIYYRNKIFTEEER 239
+ S+ + G + LP YYR+K+F++ E+
Sbjct 195 VSRSFSLLGGTTIALPRYYRDKVFSDIEK 223
>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str.
3999B T(B) 4]
gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str.
3999B T(B) 4]
Length=284
Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/237 (26%), Positives = 112/237 (47%), Gaps = 26/237 (11%)
Query 35 TAACGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlk--qkyklkdnnd 92
CG C CRK K++ WV R+ E + P + F+TLT DD+ +
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGV 73
Query 93 IATKAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLG-----NGEKITNN 147
++ + I+L ++R+RK + +F+T + R H H I++G G+ +
Sbjct 74 VSKRDIQLFMKRLRKKYAQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAEC 133
Query 148 WKYGITFTGYFVNEKTINYITKYMLK------IDEKHPKFRGKVLCS--AGIGSGYLKRE 199
WK G + + K I+Y+TKYM + I + +++ +LCS GIG +L+ +
Sbjct 134 WKNGFV-QAHPLTTKEISYVTKYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYHFLREQ 192
Query 200 DAKRHVYIPGKTNESYRMKNGGKLNLPIYYRNKIFTE-------EEREKLFLDKIEK 249
+ P + R NG ++ +P YY +K++ + E RE F++++++
Sbjct 193 ILDFYRLHP---RDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQ 246
>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis
CL09T03C24]
Length=284
Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/237 (26%), Positives = 111/237 (47%), Gaps = 26/237 (11%)
Query 35 TAACGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlk--qkyklkdnnd 92
CG C CRK K++ WV R+ E + P + F+TLT DD+ +
Sbjct 14 AVPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDDEHMPTAMIGEDLFKSTVGV 73
Query 93 IATKAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGL-----GNGEKITNN 147
++ + I+L ++R+RK + +F+T + R H H I++G G+ +
Sbjct 74 VSKRDIQLFMKRLRKKYDQYRLRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAEC 133
Query 148 WKYGITFTGYFVNEKTINYITKYMLK------IDEKHPKFRGKVLCS--AGIGSGYLKRE 199
WK G + + K I Y+TKYM + I + +++ +LCS GIG +L+ +
Sbjct 134 WKNGFV-QAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIPGIGYHFLREQ 192
Query 200 DAKRHVYIPGKTNESYRMKNGGKLNLPIYYRNKIFTE-------EEREKLFLDKIEK 249
+ P + R NG ++ +P YY +K++ + E RE F++++++
Sbjct 193 ILDFYRLHP---RDYVRAFNGMRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQ 246
>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 108/235 (46%), Gaps = 32/235 (14%)
Query 38 CGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlk--qkyklkdnndIAT 95
CG C CR+ K++ WV R+ E ++ P + F+TLT DD+ + + ++
Sbjct 12 CGWCVNCRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDDEHLPIERIGSDLFQTNVAVVSK 71
Query 96 KAIRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWGLG-----NGEKITNNWKY 150
+ ++L ++R+RK +F+T K R H H I++G G+ + W+
Sbjct 72 RDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRPHYHMILFGFPFTGKMAGDLLAECWQN 131
Query 151 GITFTGYFVNEKTINYITKYML------KIDEKHPKFRGKVLCS--AGIGSGYLKR---E 199
G + + K I Y+ KYM +I K++ +LCS GIG G++K E
Sbjct 132 GFV-QAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPGIGFGFMKADIIE 190
Query 200 DAKRHVYIPGKTNESYRMKNGGKLNLPIYYRNKI-------FTEEEREKLFLDKI 247
+RH + R G K+ +P YY +K+ F +E RE+ F K+
Sbjct 191 FYRRH------PRDYVRAWAGHKMAMPRYYADKLYDDDMKAFLKEMREEFFRHKM 239
>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/231 (27%), Positives = 105/231 (45%), Gaps = 26/231 (11%)
Query 25 VCPDERLRYVTAACGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlkqk 84
V P L V CG C C++++ WV R+ +E Q NA F+TLT D + K
Sbjct 9 VLPKAALEKVPVPCGRCPPCKRRRVDSWVFRLLQEELQHENASFVTLTYDTRFVPISKNG 68
Query 85 yklkdnndIATKAIRLCLERVRKLT-GKSVKHWFITELGHEKTERLHLHGIVWGLGNGEK 143
+ D + ++R+RKL G+ +K++ E G ++ R H H I++G+
Sbjct 69 FMTLDRGEFPR-----YMKRLRKLVPGRKLKYYMCGEYGSQRF-RPHYHAIIFGVPQDSL 122
Query 144 ITNNWKYG----ITFTGYFVNEKTINYITKYMLKI--------DEKHPKFRGKVLCSAGI 191
+ W V K+I Y KY+ K D++ P+F L S G+
Sbjct 123 FADAWTLNGDSLGGVVVGTVTGKSIAYTMKYIDKSTWKQKHGRDDRVPEFS---LMSKGM 179
Query 192 GSGYLKREDAKRHVYIPGKTNESYRMKNGG-KLNLPIYYRNKIFTEEEREK 241
G YL + + H + + + GG ++ +P YYR KI+++++ +K
Sbjct 180 GVSYLTPQMVEYH---KEDISRLFCTREGGSRIAMPRYYRQKIYSDDDLKK 227
>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 68/155 (44%), Gaps = 14/155 (9%)
Query 35 TAACGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlkqkyklkdnndIA 94
CG CY+C+ K W VR SEE +YF TLT+D +
Sbjct 25 VVPCGHCYDCKSAKTTDWQVRCSEELNNNSQSYFYTLTLDPRFIDTYGTLPDGSPRYVFN 84
Query 95 TKAIRLCLERVRKLTGK---SVKHWFITELGHEKTERLHLHGIVWGLGNGEK------IT 145
+ I+L L+R+RK K S+K+ + ELG E T R H H I + + +
Sbjct 85 KRHIQLFLKRLRKALSKYNISLKYVIVGELG-ETTHRPHYHAIFYLSSSVNPFKFRIMVR 143
Query 146 NNWKYGITFT----GYFVNEKTINYITKYMLKIDE 176
N+W G + G +N ++Y+ KYM K D
Sbjct 144 NSWSLGFIKSGDNNGIILNNDAVSYVIKYMHKTDS 178
>gi|313766930|gb|ADR80656.1| putative replication initiation protein [Uncultured Microviridae]
Length=402
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 68/152 (45%), Gaps = 24/152 (16%)
Query 38 CGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlkqkyklkdnndIATKA 97
CG C+ CR Q R+W +R E + + F+TLTI + + + K
Sbjct 130 CGQCWGCRLQHSREWAIRCMHEAQMHDHNCFITLTI------NPETLERRPRPWSLEKKE 183
Query 98 IRLCLERVRKLTGKSVKHWFITELGHEKTERLHLHGIVWG------------LGN----G 141
+ + R+R+ GK +K++ E G E +R H H I++G LGN
Sbjct 184 FQEFVHRLRRKIGKKIKYFHCGEYGDE-NKRPHYHAIIFGYDFPDKQLWERKLGNELYIS 242
Query 142 EKITNNWKYGITFTGYFVNEKTINYITKYMLK 173
++ N W +G G E + +Y+ +Y++K
Sbjct 243 PELENLWPHGYHRIGACTYE-SAHYVARYVMK 273
>gi|649562725|gb|KDS68909.1| hypothetical protein M096_3339 [Parabacteroides distasonis str.
3999B T(B) 6]
Length=250
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 100/216 (46%), Gaps = 26/216 (12%)
Query 56 MSEENRQTPNAYFLTLTIddksykqlk--qkyklkdnndIATKAIRLCLERVRKLTGKSV 113
M E + P + F+TLT DD+ + ++ + I+L ++R+RK +
Sbjct 1 MQAEADEYPFSLFVTLTYDDEHIPTAMIGEDLFKTTVGVVSKRDIQLFMKRLRKKYAQYR 60
Query 114 KHWFITELGHEKTERLHLHGIVWGLG-----NGEKITNNWKYGITFTGYFVNEKTINYIT 168
+F+T + R H H I++G G+ + WK G + + K I+Y+T
Sbjct 61 LRYFLTSEYGSQGGRPHYHMILFGFPFTGKHGGDLLAECWKNGFV-QAHPLTTKEISYVT 119
Query 169 KYMLK------IDEKHPKFRGKVLCS--AGIGSGYLKREDAKRHVYIPGKTNESYRMKNG 220
KYM + I + +++ +LCS GIG +L+ + + P + R NG
Sbjct 120 KYMYEKSMIPDILKGVKEYQPFMLCSKMPGIGYHFLREQILDFYRLHP---RDYVRAFNG 176
Query 221 GKLNLPIYYRNKIFTE-------EEREKLFLDKIEK 249
++ +P YY +K++ + E RE F++++++
Sbjct 177 MRMAMPRYYADKLYDDDMKEYLKELREAFFINQMQQ 212
>gi|575094557|emb|CDL65915.1| unnamed protein product [uncultured bacterium]
Length=354
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/190 (26%), Positives = 80/190 (42%), Gaps = 28/190 (15%)
Query 27 PDERLRYVTAACGDCYECRKQKQRQWVVRMSEENRQTPNAYFLTLTIddksykqlkqkyk 86
P + LR V CG C CR + +R+W R+ E + F+TLT Y +
Sbjct 21 PKDWLRGVPFGCGKCLACRVKTRREWTSRLILEMLGHDSGAFVTLT-----YSEDYVPVT 75
Query 87 lkdnndIATKAIRLCLERV------RKLTGKSVKHWFITELGHEKTERLHLHGIVWGLGN 140
+ ++ + ++L L+R+ RK + ++++ E G T+R H H I +G+ +
Sbjct 76 ESGHRTLSLRDLQLFLKRLRRNLEERKRSKHPIRYYACGEYGTRGTQRPHYHIIFFGVSD 135
Query 141 GE-----KITNNW----KYGI--------TFTGYFVNEKTINYITKYMLKIDEKHPKFRG 183
+ + W KYG T +N KT+ Y Y +K K
Sbjct 136 LDLDFIKSVYAAWSEPAKYGQKGQTPQFGNITIEPLNAKTVAYTAGYNMKKLISPKKVHK 195
Query 184 KVLCSAGIGS 193
V+ SA IGS
Sbjct 196 VVVSSAEIGS 205
Lambda K H a alpha
0.320 0.138 0.427 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2030999409975