bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-31_CDS_annotation_glimmer3.pl_2_3
Length=315
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094560|emb|CDL65924.1| unnamed protein product 578 0.0
gi|530695361|gb|AGT39916.1| replication initiator 186 1e-52
gi|495507506|ref|WP_008232152.1| hypothetical protein 186 1e-52
gi|530695350|gb|AGT39906.1| replication initiator 182 4e-51
gi|313766924|gb|ADR80651.1| putative replication initiation protein 170 6e-47
gi|530695371|gb|AGT39925.1| replication initiator 166 7e-45
gi|313766930|gb|ADR80656.1| putative replication initiation protein 167 2e-44
gi|47566147|ref|YP_022485.1| nonstructural protein 164 4e-44
gi|77020121|ref|YP_338244.1| putative replication protein 161 4e-43
gi|9791179|ref|NP_063900.1| conserved hypothetical protein 161 6e-43
>gi|575094560|emb|CDL65924.1| unnamed protein product [uncultured bacterium]
Length=320
Score = 578 bits (1491), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 292/315 (93%), Positives = 304/315 (97%), Gaps = 0/315 (0%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M CYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA
Sbjct 6 MACYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 65
Query 61 HHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRLRKRGVSLRYMACGEYGSAFG 120
HHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRLRKRGVSLRYMACGEYGS +G
Sbjct 66 HHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRLRKRGVSLRYMACGEYGSTYG 125
Query 121 RPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWPFGFHTINSVSFETCAYVARYV 180
RPHYHAI+FNLP +ELKQIGTT T FPT+IS+VISECW GFHT+N VSF+TCAYVARYV
Sbjct 126 RPHYHAIIFNLPPLELKQIGTTSTGFPTFISDVISECWSLGFHTLNPVSFQTCAYVARYV 185
Query 181 TKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPGIGHDYFQKYWRDFYKIDCCLINN 240
TKKILGDGKQVYEKFDP+TGEVDCRVKEFSRWSTKPGIGHDYF KYWRDFYKIDCCLINN
Sbjct 186 TKKILGDGKQVYEKFDPVTGEVDCRVKEFSRWSTKPGIGHDYFMKYWRDFYKIDCCLINN 245
Query 241 KKFKIPRYYDRLLLRENPDVFEIVKQKRILSAQSYRLTPDAQKSrllvreevkrlrverl 300
KKFKIPRYYDRLLLR++PDVFEIVKQKRILSAQ YRLTPDAQKSRLLVREEVKRLR ERL
Sbjct 246 KKFKIPRYYDRLLLRDHPDVFEIVKQKRILSAQDYRLTPDAQKSRLLVREEVKRLRAERL 305
Query 301 lrPFEAQISEYLENV 315
LRP+EAQI+EYLENV
Sbjct 306 LRPYEAQITEYLENV 320
>gi|530695361|gb|AGT39916.1| replication initiator [Marine gokushovirus]
Length=289
Score = 186 bits (471), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/281 (41%), Positives = 160/281 (57%), Gaps = 21/281 (7%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M CYHP+ AY K P F +A F +PCGQCIGCRLD S A+R
Sbjct 1 MPCYHPLVAYKCDG-KVVFDKP---FAFARG------FNLPCGQCIGCRLDYSRQWAIRC 50
Query 61 HHESLLYDRNYFLTLTYSPEHLPPFG---SLIPRDLTLFWKRLRKR-GVSLRYMACGEYG 116
HE+ ++ N F+TLT+ EH+ SL + F KRLRK+ +R+ CGEYG
Sbjct 51 VHEAQTHEDNCFITLTFDNEHIAKRKNPESLDNTEFQRFMKRLRKKYPHKIRFFHCGEYG 110
Query 117 SAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWPFGFHTINSVSFETCAYV 176
RPHYHA+LF + K++ + F ++S ++E WP+GFHTI +VSF+T AY
Sbjct 111 DQNKRPHYHALLFGHD-FKDKKLWSNKGDFKLFVSQELAELWPYGFHTIGAVSFDTAAYC 169
Query 177 ARYVTKKILGDGKQV-YEKFDPITGEVDCRVK-EFSRWSTKPGIGHDYFQKY-WRDFYKI 233
ARYV KK+ GD Y + D TGEV +K E+ S PGIG++++QKY + D +K
Sbjct 170 ARYVMKKVTGDAAASHYREVDLETGEVINEIKPEYCTMSRMPGIGYEWYQKYGYHDCHKH 229
Query 234 DCCLINNKKFKIPRYYDRLLLRENPDVFEIVKQKRILSAQS 274
D +IN K + PRYYD+L + ++F +K+ R+ +A
Sbjct 230 DYIVINGYKVRPPRYYDKLC---DEEMFAQIKETRVANADE 267
>gi|495507506|ref|WP_008232152.1| hypothetical protein [Richelia intracellularis]
gi|471331139|emb|CCH66547.1| hypothetical protein RINTHH_3920 [Richelia intracellularis HH01]
Length=306
Score = 186 bits (472), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 110/280 (39%), Positives = 161/280 (58%), Gaps = 26/280 (9%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M C+ P+ A+ RT K ++ + ++ DA+ E +++PCG C GC L+RS AVR
Sbjct 1 MPCFSPVQAW--RTDKGEI----VFWRRHDAETE---YKLPCGHCEGCLLERSRQWAVRC 51
Query 61 HHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRLRKR------------GVS-- 106
HE+ L++RN F+TLTY E PP+ SL D F KRLRKR G S
Sbjct 52 MHEAQLWERNCFVTLTY--EETPPWNSLRHSDFQKFMKRLRKRFKGHKENIDVRTGKSSY 109
Query 107 -LRYMACGEYGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWPFGFHTI 165
+RY GEYG+ GRPHYHA LFN +++ + T++ Y S + WP GF ++
Sbjct 110 PIRYYMAGEYGTHGGRPHYHACLFNFAFEDIEFLRRTNSGSNLYRSAQLESLWPHGFSSV 169
Query 166 NSVSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPGIGHDYFQK 225
V+FE+ AYVARYV KK+ + + ++ + TGEV R+ E+++ S KPGIG ++ K
Sbjct 170 GDVTFESAAYVARYVMKKMNKEAIEKGQEINWETGEVMPRLPEYNKMSLKPGIGANFIDK 229
Query 226 YWRDFYKIDCCLINNKKFKIPRYYDRLLLRENPDVFEIVK 265
Y D + D ++N K K PRYY + L + PD++E V+
Sbjct 230 YQSDVFPNDYVIVNGHKAKPPRYYFKRLKQAAPDLYEQVE 269
>gi|530695350|gb|AGT39906.1| replication initiator [Marine gokushovirus]
Length=330
Score = 182 bits (463), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 110/287 (38%), Positives = 156/287 (54%), Gaps = 27/287 (9%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M C+ P+ A+ ++ G + ++ D + EFQ+PCG C GC L+RS AVR
Sbjct 1 MPCFSPLNAWRTQ------GGEIVFWRRQDV---VQEFQLPCGGCEGCLLERSRQWAVRC 51
Query 61 HHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRLRKR------------GVS-- 106
HE+ ++RN F+TLTY E PP+ SL D F KRLRK+ G S
Sbjct 52 MHEASQWERNCFITLTY--EETPPWNSLRHGDFQKFMKRLRKKFYGHKEYIDVRTGKSSH 109
Query 107 -LRYMACGEYGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWPFGFHTI 165
+R+ GEYG+ GRPHYHA +FN +LK + T++ Y S + WP GF ++
Sbjct 110 PIRFYMAGEYGTQRGRPHYHACIFNFSFEDLKFLRRTNSGSNLYRSTQLEGLWPHGFSSV 169
Query 166 NSVSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPGIGHDYFQK 225
V+FE+ AYVARYV KK + + DP TGE R E++R S KPGIG + K
Sbjct 170 GDVTFESAAYVARYVMKKQYQEADHLL-VVDPYTGECVERHPEYNRMSLKPGIGAGFLDK 228
Query 226 YWRDFYKIDCCLINNKKFKIPRYYDRLLLRENPDVFEIVKQKRILSA 272
Y D + D ++N K K PRYY + L +++PD++E V+ R +
Sbjct 229 YKSDVFPNDFVVVNGHKAKPPRYYYKRLEQQDPDLYEQVEYSRAMKG 275
>gi|313766924|gb|ADR80651.1| putative replication initiation protein [Uncultured Microviridae]
Length=285
Score = 170 bits (431), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 101/252 (40%), Positives = 137/252 (54%), Gaps = 22/252 (9%)
Query 38 FQIPCGQCIGCRLDRSLDSAVRAHHESLLYDR---NYFLTLTYSPEHLPPFGSLIPRDLT 94
++ C QCIGCRLD + A R HES LYD N F+TLTY EHLP SL
Sbjct 1 MEVACSQCIGCRLDHAGMWASRIEHESSLYDDSNGNCFITLTYDEEHLPQDWSLDKSHFQ 60
Query 95 LFWKRLRKR-GVSLRYMACGEYGS--------------AFGRPHYHAILFNLPAIELKQI 139
F KRLRKR +RY CGEYG GRPHYHAILFN+ + +
Sbjct 61 KFMKRLRKRYPQKIRYYHCGEYGENCRHGIHTTLCPGCNVGRPHYHAILFNIDFHDRVLV 120
Query 140 GTTHTRFPTYISNVISECWPFGFHTINSVSFETCAYVARYVTKKILG-DGKQVYEKFDPI 198
G + P + S+ ++E W GF + ++ ++ YVARY KK+ G + Y D
Sbjct 121 GQSKG-IPHFTSDTLTEIWGHGFTQVGDLTAQSAGYVARYALKKVTGTQAEDHYRSIDLT 179
Query 199 TGEVDCRVKEFSRWSTKPGIGHDYFQKYWRDFYKIDC--CLINNKKFKIPRYYDRLLLRE 256
TGEV E++ S KPGIG ++++KY +D Y + + K IPR+YD+L+ +E
Sbjct 180 TGEVTYVRPEYATMSRKPGIGKEWYEKYKKDMYPSNQTPSVGGGVKNGIPRFYDKLMEKE 239
Query 257 NPDVFEIVKQKR 268
+P+ EIVK+KR
Sbjct 240 DPEQLEIVKEKR 251
>gi|530695371|gb|AGT39925.1| replication initiator [Marine gokushovirus]
Length=316
Score = 166 bits (420), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 148/286 (52%), Gaps = 34/286 (12%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M CY PI A+ S +TF A +L QIPC QCIGCRL++S A+R
Sbjct 18 MACYFPIDAFRSHA-------GGVTFSRKGAYLDL-PIQIPCNQCIGCRLEKSRQWALRC 69
Query 61 HHESLLYDRNYFLTLTYSPEHLPPFGSLIP----RDLTLFWKRLRKR--GVSLRYMACGE 114
HE+ LY N F+TLTY+ +HLP + +P R LF KRLRK+ ++R+ CGE
Sbjct 70 THEAKLYKNNSFITLTYNSDHLPLTNNSLPTLNLRHFQLFLKRLRKKYSNKTIRFYHCGE 129
Query 115 YGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWP-------FGFHTINS 167
YG RPHYHA+LFN E K++ H Y S V+ W GF TI
Sbjct 130 YGDMNHRPHYHALLFN-HDFEDKKLWKIHKDQNYYTSEVLDGLWTDPKTKSNMGFSTIGD 188
Query 168 VSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPGIGHDYFQKYW 227
++F++ AYVARY KKI G + Y + RV E++ S +PGIG+ + K+
Sbjct 189 LTFDSAAYVARYCLKKITGKNAEDYYQ---------GRVPEYATMSRRPGIGNGWLDKFK 239
Query 228 RDFYKIDCCLINNKKFKIPRYYDRLLLRENPDVFEIVKQKRILSAQ 273
D Y + +K + P+YYDR+ N + V++ +IL Q
Sbjct 240 SDVYPSGFIIHEGQKMQPPKYYDRVT---NETDEKAVRRSKILRMQ 282
>gi|313766930|gb|ADR80656.1| putative replication initiation protein [Uncultured Microviridae]
Length=402
Score = 167 bits (422), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 108/280 (39%), Positives = 151/280 (54%), Gaps = 24/280 (9%)
Query 1 MTCYHPITAYWSRTLKTKLGTPAITFKYADADPELGEFQIPCGQCIGCRLDRSLDSAVRA 60
M CYHP+ AY K P F YA F +PCGQC GCRL S + A+R
Sbjct 99 MACYHPLLAY-KCDGKVVFNKP---FAYAKG------FNLPCGQCWGCRLQHSREWAIRC 148
Query 61 HHESLLYDRNYFLTLTYSPEHL---PPFGSLIPRDLTLFWKRLRKR-GVSLRYMACGEYG 116
HE+ ++D N F+TLT +PE L P SL ++ F RLR++ G ++Y CGEYG
Sbjct 149 MHEAQMHDHNCFITLTINPETLERRPRPWSLEKKEFQEFVHRLRRKIGKKIKYFHCGEYG 208
Query 117 SAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECWPFGFHTINSVSFETCAYV 176
RPHYHAI+F + KQ+ YIS + WP G+H I + ++E+ YV
Sbjct 209 DENKRPHYHAIIFGYDFPD-KQLWERKLGNELYISPELENLWPHGYHRIGACTYESAHYV 267
Query 177 ARYVTKKILGDGKQVYEKFDPITGEV----DCRVKEFSRWSTKP---GIGHDYFQKY-WR 228
ARYV K+ G+G + +P TGEV D + SR + K GIG+ ++ KY W
Sbjct 268 ARYVMKRAKGEGPPE-QYINPETGEVEYDLDNQYATMSRGNKKQPQNGIGNQWYWKYGWT 326
Query 229 DFYKIDCCLINNKKFKIPRYYDRLLLRENPDVFEIVKQKR 268
D + D + + K K+PRYYD+ L + +P+ F+ +K KR
Sbjct 327 DAHCHDYIVHDGIKMKVPRYYDKELEKYDPEYFQELKAKR 366
>gi|47566147|ref|YP_022485.1| nonstructural protein [Chlamydia phage 3]
gi|47522482|emb|CAD79483.1| nonstructural protein [Chlamydia phage 3]
Length=315
Score = 164 bits (415), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 93/235 (40%), Positives = 131/235 (56%), Gaps = 16/235 (7%)
Query 41 PCGQCIGCRLDRSLDSAVRAHHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKRL 100
PC +C CR+ + + R HE+ LY +N FLTLTY HLP GSL+ LF +RL
Sbjct 54 PCLKCRFCRVQNAKIWSYRCMHEASLYSQNCFLTLTYEDRHLPENGSLVRDHPRLFLRRL 113
Query 101 RKRGV--SLRYMACGEYGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISECW 158
R+ +RY CGEYGS RPHYH +++N + K + P ++S + + W
Sbjct 114 REHIYPHKIRYFGCGEYGSKLQRPHYHLLIYNYDFPDKKLLSKKRGN-PLFVSEKLMQLW 172
Query 159 PFGFHTINSVSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPGI 218
PFGF T+ SV+ ++ YVARY KK+ GD Q R+ EF S KPGI
Sbjct 173 PFGFSTVGSVTRQSAGYVARYSLKKVNGDSSQ---------DHYGQRLPEFLMCSLKPGI 223
Query 219 GHDYFQKYWRDFYKIDCCLINNK----KFKIPRYYDRLLLRENPDVFEIVKQKRI 269
G D+++KY RD Y D ++ +K K + PRYYD+L R +P+ E +KQ+R+
Sbjct 224 GADWYEKYKRDVYPQDYLVVQDKGKSFKTRPPRYYDKLHSRFDPEEMEEIKQRRV 278
>gi|77020121|ref|YP_338244.1| putative replication protein [Chlamydia phage 4]
gi|59940020|gb|AAX12549.1| putative replication protein [Chlamydia phage 4]
Length=315
Score = 161 bits (407), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 135/236 (57%), Gaps = 16/236 (7%)
Query 40 IPCGQCIGCRLDRSLDSAVRAHHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKR 99
+PC +C CR+ + + R HE+ LY +N FLTLTY +HLP GSL+ TLF +R
Sbjct 53 MPCRRCKFCRVQNAKIWSYRCMHEASLYSQNCFLTLTYEDQHLPENGSLVRNHPTLFLRR 112
Query 100 LRKRGV--SLRYMACGEYGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISEC 157
LR+ +RY CGEYGS RPHYH +++N + K + P ++S + +
Sbjct 113 LREHISPHKIRYFGCGEYGSKLQRPHYHLLIYNYDFPDKKLLSKKRGN-PLFVSEKLMQL 171
Query 158 WPFGFHTINSVSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPG 217
WP+GF T+ SV+ ++ YVARY KK+ D Q + G+ R+ EF S KPG
Sbjct 172 WPYGFSTVGSVTRQSAGYVARYSLKKVSRDISQDH------YGQ---RLPEFLMCSLKPG 222
Query 218 IGHDYFQKYWRDFYKIDCCLINNK----KFKIPRYYDRLLLRENPDVFEIVKQKRI 269
IG D+++KY RD Y D ++ +K + PRYYD+L R +P+ + VKQKR+
Sbjct 223 IGADWYEKYKRDVYPQDYLVVQDKGKSFTTRPPRYYDKLHSRFDPEEMDEVKQKRV 278
>gi|9791179|ref|NP_063900.1| conserved hypothetical protein [Chlamydia pneumoniae phage CPAR39]
gi|7190960|gb|AAF39720.1| conserved hypothetical protein [Chlamydia pneumoniae phage CPAR39]
Length=327
Score = 161 bits (407), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 133/236 (56%), Gaps = 16/236 (7%)
Query 40 IPCGQCIGCRLDRSLDSAVRAHHESLLYDRNYFLTLTYSPEHLPPFGSLIPRDLTLFWKR 99
+PC +C CR+ + + R HE+ LY +N FLTLTY +LP GSL+ LF R
Sbjct 65 MPCLKCRFCRVRNAKIWSYRCMHEASLYSQNCFLTLTYEDRYLPENGSLVRNHPRLFLMR 124
Query 100 LRKRGV--SLRYMACGEYGSAFGRPHYHAILFNLPAIELKQIGTTHTRFPTYISNVISEC 157
LRK +RY CGEYGS RPHYH +++N + K + P ++S +
Sbjct 125 LRKEIYPHKIRYFGCGEYGSKLQRPHYHLLIYNYDFPDKKLLSKKRGN-PLFVSEKLMRL 183
Query 158 WPFGFHTINSVSFETCAYVARYVTKKILGDGKQVYEKFDPITGEVDCRVKEFSRWSTKPG 217
WPFGF T+ SV+ ++ YVARY KK+ GD Q + G+ R+ EF S KPG
Sbjct 184 WPFGFSTVGSVTRQSAGYVARYSLKKVNGDISQDH------YGQ---RLPEFLMCSLKPG 234
Query 218 IGHDYFQKYWRDFYKIDCCLINNK----KFKIPRYYDRLLLRENPDVFEIVKQKRI 269
IG D+++KY RD Y D ++ +K K + PRYYD+L R +P+ E +KQKR+
Sbjct 235 IGADWYEKYKRDVYPQDYLVVQDKGKSFKTRPPRYYDKLHSRFDPEEMEEIKQKRV 290
Lambda K H a alpha
0.324 0.141 0.455 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1740253685184