bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: all_orgs
14,240,465 sequences; 5,121,972,263 total letters
Query= Contig-23_CDS_annotation_glimmer3.pl_2_4
Length=298
Score E
Sequences producing significant alignments: (Bits) Value
cpt:CpB0227 hypothetical protein 71.2 8e-13
cpj:CPj0222 hypothetical protein 71.2 8e-13
cpa:CP0543 hypothetical protein 71.2 8e-13
cpn:CPn0222 hypothetical protein 71.2 8e-13
clp:CPK_ORF00729 hypothetical protein 71.2 1e-12
lso:CKC_03470 hypothetical protein 48.5 0.001
laa:WSI_05020 hypothetical protein 47.8 0.002
lso:CKC_03515 hypothetical protein 42.4 0.083
lso:CKC_03540 hypothetical protein 38.1 0.38
fno:Fnod_1782 rpoB; DNA-directed RNA polymerase subunit beta 39.7 0.72
edi:EDI_201410 hypothetical protein 39.7 0.76
mvo:Mvol_0445 hypothetical protein 38.5 1.4
edi:EDI_196330 hypothetical protein 38.9 1.5
ehi:EHI_148560 1.t00098; 4-alpha-glucanotransferase 37.4 4.0
mhal:N220_06095 replication protein 36.6 7.8
mhao:J451_01365 replication protein 36.6 7.8
mhae:F382_02015 replication protein 36.6 7.8
mhx:MHH_c18950 repA; replication protein A (EC:3.1.-.-) 36.6 7.8
mhq:D650_12680 Replication protein A 36.6 7.8
tmb:Thimo_1353 c-type cytochrome biogenesis protein CcmF 36.2 9.2
> cpt:CpB0227 hypothetical protein
Length=113
Score = 71.2 bits (173), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 62/127 (49%), Gaps = 32/127 (25%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP--GMIVK 59
Y + +L+PC +C CR + + W+ R EA Y K+ F+TLTYDD H+P G +VK
Sbjct 15 YRNRWVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEKN--CFLTLTYDDKHLPQYGSLVK 72
Query 60 TGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYR-GKLRYFVAGEYGEQTAR 118
+Q FLKRLRK K+RYF G YG + R
Sbjct 73 L---------------------------HLQLFLKRLRKMISPHKIRYFECGAYGTKLQR 105
Query 119 PHYHMIL 125
PHYH++L
Sbjct 106 PHYHLLL 112
> cpj:CPj0222 hypothetical protein
Length=113
Score = 71.2 bits (173), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 62/127 (49%), Gaps = 32/127 (25%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP--GMIVK 59
Y + +L+PC +C CR + + W+ R EA Y K+ F+TLTYDD H+P G +VK
Sbjct 15 YRNRWVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEKN--CFLTLTYDDKHLPQYGSLVK 72
Query 60 TGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYR-GKLRYFVAGEYGEQTAR 118
+Q FLKRLRK K+RYF G YG + R
Sbjct 73 L---------------------------HLQLFLKRLRKMISPHKIRYFECGAYGTKLQR 105
Query 119 PHYHMIL 125
PHYH++L
Sbjct 106 PHYHLLL 112
> cpa:CP0543 hypothetical protein
Length=113
Score = 71.2 bits (173), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 62/127 (49%), Gaps = 32/127 (25%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP--GMIVK 59
Y + +L+PC +C CR + + W+ R EA Y K+ F+TLTYDD H+P G +VK
Sbjct 15 YRNRWVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEKN--CFLTLTYDDKHLPQYGSLVK 72
Query 60 TGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYR-GKLRYFVAGEYGEQTAR 118
+Q FLKRLRK K+RYF G YG + R
Sbjct 73 L---------------------------HLQLFLKRLRKMISPHKIRYFECGAYGTKLQR 105
Query 119 PHYHMIL 125
PHYH++L
Sbjct 106 PHYHLLL 112
> cpn:CPn0222 hypothetical protein
Length=113
Score = 71.2 bits (173), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 62/127 (49%), Gaps = 32/127 (25%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP--GMIVK 59
Y + +L+PC +C CR + + W+ R EA Y K+ F+TLTYDD H+P G +VK
Sbjct 15 YRNRWVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEKN--CFLTLTYDDKHLPQYGSLVK 72
Query 60 TGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKAYR-GKLRYFVAGEYGEQTAR 118
+Q FLKRLRK K+RYF G YG + R
Sbjct 73 L---------------------------HLQLFLKRLRKMISPHKIRYFECGAYGTKLQR 105
Query 119 PHYHMIL 125
PHYH++L
Sbjct 106 PHYHLLL 112
> clp:CPK_ORF00729 hypothetical protein
Length=121
Score = 71.2 bits (173), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/130 (35%), Positives = 65/130 (50%), Gaps = 32/130 (25%)
Query 2 YNPKVMLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYDDDHVP--GMIVK 59
Y + +L+PC +C CR + + W+ R EA Y K+ F+TLTYDD H+P G +VK
Sbjct 15 YRNRWVLMPCLKCRFCRTQHAKVWSYRCVHEASLYEKN--CFLTLTYDDKHLPQYGSLVK 72
Query 60 TGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLR-KAYRGKLRYFVAGEYGEQTAR 118
+Q FLKRLR + K+RYF GEYG + R
Sbjct 73 L---------------------------HLQLFLKRLRDRISPHKIRYFGCGEYGTKLQR 105
Query 119 PHYHMILYGW 128
PHYH++++ +
Sbjct 106 PHYHLLIFNY 115
> lso:CKC_03470 hypothetical protein
Length=424
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/219 (24%), Positives = 88/219 (40%), Gaps = 44/219 (20%)
Query 7 MLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYD-DDHVPGMIVKTGEIMR 65
+L C +C C + W R + E WFITLT+ +H+ + G+ +
Sbjct 223 ILARCRRCSVCCKSRGMFWLRRAQTEVM--RSSRTWFITLTFSPSNHIKNYALTIGQYVE 280
Query 66 KVQY---TWKPGEKRPSSVQILLYEDIQK-----------------------FLKRLRKA 99
+ + G+K+ ++ EDI+ FLKRLRK
Sbjct 281 SLSIEDRNFFYGKKKYGTII----EDIRSLNISDVDLKFRLLCKGFGDKIVLFLKRLRKN 336
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKIQHNGYYTSEWLKGLWGMGQI 159
K RYF+ E ++ PH HM+++ +L +IQ EW++ G +
Sbjct 337 TSKKFRYFIVFE-KHKSGNPHAHMLIHQKSGEELLKKAEIQ------EEWIRE--GFSHV 387
Query 160 QIAQAVPETYRYVAGYVTKKMYEIDGKKANAYYELGQTK 198
++ + T RYV Y+ K+ + G + A + G K
Sbjct 388 RLLREDLNTARYVCKYLLKE--DSKGIRVRASFCYGSMK 424
> laa:WSI_05020 hypothetical protein
Length=405
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 84/215 (39%), Gaps = 36/215 (17%)
Query 7 MLIPCGQCIGCRIRQREDWTTRIELEARDYPKDEVWFITLTYD-DDHVP---GMIVKTGE 62
+++PC C C + W R +E + WF+TLT +H M+ +
Sbjct 202 LILPCRSCSSCYKNRGLFWLRRAYIEVK--RSTRTWFVTLTMTPANHFANHRSMVFNYID 259
Query 63 IMRKVQYTWKPGEKRPSSVQILLYEDI------------------QKFLKRLRKAYRGKL 104
+ + RP+ + ++ +DI FLKRLRK K
Sbjct 260 SFPPHERDLLNVDGRPTEIHLMRKKDIFGENVLFSLLCKGFGNKVSLFLKRLRKNTGKKF 319
Query 105 RYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKIQHNGYYTSEWLKGLWGMGQIQIAQA 164
RYF E ++ PH HM+++ +NL K EW + G +++ +
Sbjct 320 RYFFVFE-KHKSGDPHVHMLIH----QQCDNLLK---KAEVQEEWSRE--GFSHVRLLKE 369
Query 165 VPETYRYVAGYVTKKMYEIDGKKANAYYELGQTKP 199
T RYV Y+ K+ + G + A ++ G P
Sbjct 370 DLFTARYVCKYLMKE--GMKGIRVRASFQYGALTP 402
> lso:CKC_03515 hypothetical protein
Length=432
Score = 42.4 bits (98), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 46/180 (26%), Positives = 75/180 (42%), Gaps = 34/180 (19%)
Query 42 WFITLTYD-DDHVPGMIVKTGEIM-------RKVQYTWKPGEKRPSSVQILLYEDIQK-- 91
WF+TLT+ +H+ + G+ + R + Y K + IL D+
Sbjct 259 WFVTLTFSPSNHIKNYALVVGQYVDSLSTEDRDLFYGKKKYGTIFEDLTILNITDVDLKF 318
Query 92 -------------FLKRLRKAYRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYK 138
FLKRLRK K RYFV E ++ PH HM+++ +L +
Sbjct 319 RLLCKGFGDKIVLFLKRLRKNTSKKFRYFVVFE-RHKSGDPHAHMLIHQKPGDELIKKAE 377
Query 139 IQHNGYYTSEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGKKANAYYELGQTK 198
IQ EW++ G +++ + +T RYV Y+ K+ + G + A + G K
Sbjct 378 IQ------EEWMRE--GFSHVRLLREDLKTARYVCKYLLKE--DSKGIRVRASFRYGSLK 427
> lso:CKC_03540 hypothetical protein
Length=107
Score = 38.1 bits (87), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/112 (28%), Positives = 51/112 (46%), Gaps = 11/112 (10%)
Query 87 EDIQKFLKRLRKAYRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKIQHNGYYT 146
+ I FLKRLRK K RYF E ++ H HM+++ +L ++Q
Sbjct 7 DKIVLFLKRLRKNTSKKFRYFFVFE-KHKSGNLHAHMLIHQEIGDELLKKAEVQ------ 59
Query 147 SEWLKGLWGMGQIQIAQAVPETYRYVAGYVTKKMYEIDGKKANAYYELGQTK 198
EW++ G +++ + T RYV Y+ K+ + G + A + G K
Sbjct 60 EEWMRE--GFSHVRLLKEDLNTARYVCKYLLKE--DAKGIRVRASFRYGSMK 107
> fno:Fnod_1782 rpoB; DNA-directed RNA polymerase subunit beta
Length=1182
Score = 39.7 bits (91), Expect = 0.72, Method: Composition-based stats.
Identities = 30/114 (26%), Positives = 52/114 (46%), Gaps = 7/114 (6%)
Query 191 YYELGQTKPFACMSLKPGLGDKYYQEHKAEIWRQGYIQCTNGKRAQIPRYYEKQMEAENP 250
Y EL + F + K + Y+ E + E+ G + N K R Y ++E NP
Sbjct 308 YKELRPGEVFRYNAAKSYWNNLYFNEERFELSEVGRYK-MNKKLTNAYRKYLIEIEGRNP 366
Query 251 QRLWRIKQNRQKNAMQQ------KRLQIENQDYKTVLETKERVTKKRTKKSGIL 298
+ + RI+ + NA+ R+ +E + + L+TK+ ++ KR K G L
Sbjct 367 KSVERIEYEEKSNALTPIDIILVIRMLLETEKHPETLDTKDHLSNKRVKTVGEL 420
> edi:EDI_201410 hypothetical protein
Length=1370
Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 26/67 (39%), Positives = 39/67 (58%), Gaps = 11/67 (16%)
Query 210 GDKYYQEHKAEIWRQGYIQCTN----GKRAQIPRYYEKQMEAENPQRLWR--IKQ---NR 260
G KY Q+H + + IQC N +R IP++ E Q+ EN +R+WR IKQ N+
Sbjct 929 GIKYIQQHHY--FEEIKIQCINVQSVMRRIYIPQFKENQLCCENVERIWRCEIKQRQFNK 986
Query 261 QKNAMQQ 267
Q+N++ Q
Sbjct 987 QRNSIIQ 993
> mvo:Mvol_0445 hypothetical protein
Length=493
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 51/119 (43%), Gaps = 27/119 (23%)
Query 26 TTRIELEARDYPK---------DEVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEK 76
RI RD+ K D+ F+TLT D P + E+ + +K
Sbjct 194 VNRINANLRDFNKKFDAMSNRHDKAVFLTLTSD----PSLFDSIQEMADNLHKNYKK--- 246
Query 77 RPSSVQILLYEDIQKFLKRLRKAYRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLEN 135
+++ IQK + +RK R KL Y + E+ + A PH H++++G DL +
Sbjct 247 --------IFDRIQK--RFIRKTNR-KLEYIYSFEFSPKKALPHLHVVIFGTDFLDLRD 294
> edi:EDI_196330 hypothetical protein
Length=898
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 35/75 (47%), Gaps = 6/75 (8%)
Query 222 WRQGYIQCTNGKRAQIPRYYEKQMEAENPQRLWRIKQNRQKNAMQQKRLQIENQDYKTVL 281
WR Y N +A P Y E+P+ L I N++ QK +E DY+ VL
Sbjct 303 WRDSYPYNPNSVQALHPMYLRLSSLTEDPEILAEI------NSVAQKLNALEQIDYEAVL 356
Query 282 ETKERVTKKRTKKSG 296
+ KE +T+K K G
Sbjct 357 KEKETITRKIFSKVG 371
> ehi:EHI_148560 1.t00098; 4-alpha-glucanotransferase
Length=898
Score = 37.4 bits (85), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 35/75 (47%), Gaps = 6/75 (8%)
Query 222 WRQGYIQCTNGKRAQIPRYYEKQMEAENPQRLWRIKQNRQKNAMQQKRLQIENQDYKTVL 281
WR Y N +A P Y ++P+ L I N++ QK +E DY+ VL
Sbjct 303 WRDSYPYNPNSVQALHPMYLRLSSLTQDPEILAEI------NSVAQKLNALEQIDYEAVL 356
Query 282 ETKERVTKKRTKKSG 296
+ KE +T+K K G
Sbjct 357 KEKETITRKIFSKVG 371
> mhal:N220_06095 replication protein
Length=786
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 10/100 (10%)
Query 40 EVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA 99
E WFITLT + M+ +T + K W P+ Q+ L K +L +
Sbjct 381 EGWFITLTAPSKY-HAMLSRTSSVNPK----WNGAS--PAETQVYLVNTWAKIRAKLNRE 433
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKI 139
+ VA + + T PH+H+IL+ +P D+E L ++
Sbjct 434 GVMAYGFRVAEPHADAT--PHWHLILFT-RPEDMEKLRRV 470
> mhao:J451_01365 replication protein
Length=786
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 10/100 (10%)
Query 40 EVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA 99
E WFITLT + M+ +T + K W P+ Q+ L K +L +
Sbjct 381 EGWFITLTAPSKY-HAMLSRTSSVNPK----WNGAS--PAETQVYLVNTWAKIRAKLNRE 433
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKI 139
+ VA + + T PH+H+IL+ +P D+E L ++
Sbjct 434 GVMAYGFRVAEPHADAT--PHWHLILFT-RPEDMEKLRRV 470
> mhae:F382_02015 replication protein
Length=786
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 10/100 (10%)
Query 40 EVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA 99
E WFITLT + M+ +T + K W P+ Q+ L K +L +
Sbjct 381 EGWFITLTAPSKY-HAMLSRTSSVNPK----WNGAS--PAETQVYLVNTWAKIRAKLNRE 433
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKI 139
+ VA + + T PH+H+IL+ +P D+E L ++
Sbjct 434 GVMAYGFRVAEPHADAT--PHWHLILFT-RPEDMEKLRRV 470
> mhx:MHH_c18950 repA; replication protein A (EC:3.1.-.-)
Length=786
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 10/100 (10%)
Query 40 EVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA 99
E WFITLT + M+ +T + K W P+ Q+ L K +L +
Sbjct 381 EGWFITLTAPSKY-HAMLSRTSSVNPK----WNGAS--PAETQVYLVNTWAKIRAKLNRE 433
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKI 139
+ VA + + T PH+H+IL+ +P D+E L ++
Sbjct 434 GVMAYGFRVAEPHADAT--PHWHLILFT-RPEDMEKLRRV 470
> mhq:D650_12680 Replication protein A
Length=786
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 10/100 (10%)
Query 40 EVWFITLTYDDDHVPGMIVKTGEIMRKVQYTWKPGEKRPSSVQILLYEDIQKFLKRLRKA 99
E WFITLT + M+ +T + K W P+ Q+ L K +L +
Sbjct 381 EGWFITLTAPSKY-HAMLSRTSSVNPK----WNGAS--PAETQVYLVNTWAKIRAKLNRE 433
Query 100 YRGKLRYFVAGEYGEQTARPHYHMILYGWKPTDLENLYKI 139
+ VA + + T PH+H+IL+ +P D+E L ++
Sbjct 434 GVMAYGFRVAEPHADAT--PHWHLILFT-RPEDMEKLRRV 470
> tmb:Thimo_1353 c-type cytochrome biogenesis protein CcmF
Length=661
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 37/93 (40%), Gaps = 6/93 (6%)
Query 157 GQIQIAQAVPETYRYVAGYVTKKMYEID-GKKANAYYELGQTKPFACMSLKPGLGDKYYQ 215
G Q+A VPE YVA T ID G + Y LG+ SL+ YY+
Sbjct 563 GDRQVALLVPEKRTYVASTQTMTEASIDAGLLRDIYVSLGEPLEVGAWSLR-----IYYK 617
Query 216 EHKAEIWRQGYIQCTNGKRAQIPRYYEKQMEAE 248
+ IW G + T G A R Y +A
Sbjct 618 PYVRWIWLGGLLMATGGVLAITDRRYRAATQAS 650
Lambda K H a alpha
0.319 0.136 0.427 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 505086094880