
BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-40_CDS_annotation_glimmer3.pl_2_1
Length=254
                                                           Score      E
Sequences producing significant alignments:               (Bits)  Value
gi|575096056|emb|CDL66947.1|  unnamed protein product        422  5e-142
gi|575094492|emb|CDL65859.1|  unnamed protein product        420  1e-141
gi|575094544|emb|CDL65904.1|  unnamed protein product        418  1e-140
gi|575094572|emb|CDL65928.1|  unnamed protein product        406  5e-136
gi|575094496|emb|CDL65862.1|  unnamed protein product        376  6e-124
gi|575094431|emb|CDL65804.1|  unnamed protein product        296  6e-93
gi|9634949|ref|NP_054647.1|  structural protein              290  1e-90
gi|530695385|gb|AGT39938.1|  major capsid protein            287  4e-90
gi|575094415|emb|CDL65790.1|  unnamed protein product        288  5e-90
gi|47566141|ref|YP_022479.1|  structural protein             288  6e-90
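The hit summary above can be read programmatically. The following is a minimal sketch, not part of the BLAST distribution: the helper name `parse_hit_line` and the regular expression are assumptions made for illustration, and they expect each line to hold a sequence identifier, a free-text description, a bit score, and an E-value separated by whitespace.

```python
import re

# Hypothetical pattern for one hit-summary line:
# <identifier> <description ...> <bit score> <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[+-]?\d+)?)$",
    re.IGNORECASE,
)

def parse_hit_line(line):
    """Return a dict of fields for a hit line, or None if it does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None  # header lines and blank lines end up here
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }

if __name__ == "__main__":
    example = "gi|9634949|ref|NP_054647.1| structural protein 290 1e-90"
    print(parse_hit_line(example))
```

Header lines such as "Sequences producing significant alignments:" do not match the pattern, so the helper returns None for them and they can be skipped.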
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 422 bits (1085), Expect = 5e-142, Method: Compositional matrix adjust.
Identities = 198/255 (78%), Positives = 216/255 (85%), Gaps = 9/255 (4%)
Query 1 MAFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSAT 60
MAFQIQK YEK ARGGSRY E+++S FGVTSPDARLQR EYLGGNR+PININQV+QQS T
Sbjct 324 MAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGT 383
Query 61 ASGETA-QGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRF 119
S T QGTV GMS TTDTHSDFTKSFTEHGF+IGVM ARYDHTYQQG++R WSRKD+F
Sbjct 384 GSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKF 443
Query 120 DYYWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEM 179
DYYWPVF+NIGEQA+KNKEI+AQG DD+VFGYQEAWA+YRYKPSRVTGEM
Sbjct 444 DYYWPVFSNIGEQAIKNKEIYAQG--------NATDDEVFGYQEAWAEYRYKPSRVTGEM 495
Query 180 RSQYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTT 239
RS YAQSLDVWHLADDYS LP LSD WIRED ++RVLAV+ SNQ FADIY+KN T
Sbjct 496 RSSYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCT 555
Query 240 RPMPMYSIPGLIDHH 254
RPMPMYSIPGLIDHH
Sbjct 556 RPMPMYSIPGLIDHH 570
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 420 bits (1080), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 199/253 (79%), Positives = 218/253 (86%), Gaps = 10/253 (4%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
AFQIQKLYE+DARGG+RYIEILKSHFGVTSPDARLQRPEYLGG+RVPININQV+Q S T
Sbjct 309 AFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQSSET- 367
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
G T QG S+TTD+HS+FTKSF EHGF+IG+MVARYDH+YQQGL+RFWSRKDRFDY
Sbjct 368 -GATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSRKDRFDY 426
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
YWPVFAN+GE AVKNKEIFAQG V DD+VFGYQEAWADYRYKPS VTGEMRS
Sbjct 427 YWPVFANLGEMAVKNKEIFAQGTDV--------DDEVFGYQEAWADYRYKPSVVTGEMRS 478
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
QYAQSLD+WHLADDY LP LSDSWIRED + V+RVLAV+ SVS QLF DIYI+ TRP
Sbjct 479 QYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRP 538
Query 242 MPMYSIPGLIDHH 254
MP+YSIPGLIDHH
Sbjct 539 MPLYSIPGLIDHH 551
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 418 bits (1074), Expect = 1e-140, Method: Compositional matrix adjust.
Identities = 197/253 (78%), Positives = 222/253 (88%), Gaps = 9/253 (4%)
Query 1 MAFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSAT 60
+AFQIQ+LYE+DARGG+RYIEILKSHFGVTSPDARLQRPEYLGGNR+PININQV+QQS T
Sbjct 307 LAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPININQVLQQSET 366
Query 61 ASGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFD 120
S + QG G S+TTDT++DF KSF EHGFVIG+MVARYDHTYQQGLERFWSRKDRFD
Sbjct 367 TS-TSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYDHTYQQGLERFWSRKDRFD 425
Query 121 YYWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMR 180
YYWPVFA+IGEQAV NKEI+ T+G +DD+VFGYQEA+ADYRYKPSRVTGEMR
Sbjct 426 YYWPVFAHIGEQAVLNKEIY--------TSGTAVDDEVFGYQEAYADYRYKPSRVTGEMR 477
Query 181 SQYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTR 240
S QSLDVWHLADDY++LP LSDSWIRE + VDRVLAV+S+VS QLF DIYI+NR+TR
Sbjct 478 SAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFCDIYIQNRSTR 537
Query 241 PMPMYSIPGLIDH 253
PMPMYS+PGLIDH
Sbjct 538 PMPMYSVPGLIDH 550
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 406 bits (1043), Expect = 5e-136, Method: Compositional matrix adjust.
Identities = 190/254 (75%), Positives = 218/254 (86%), Gaps = 9/254 (4%)
Query 1 MAFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSAT 60
+AFQ+QKLYEKDARGG+RY EI++SHFGV SPD+RLQRPEYLGGNR+PIN+NQ++QQS +
Sbjct 312 LAFQLQKLYEKDARGGTRYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQSQS 371
Query 61 ASGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFD 120
++ G + GMSVTTD +SDF KSF EHG++IG++VARYDHTYQQGL+R WSRKDRFD
Sbjct 372 TE-QSPLGALAGMSVTTDKNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFD 430
Query 121 YYWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMR 180
+YWPV ANIGEQAV NKEI+ G DT DD+VFGYQEAWA+YRYKP+RV GEMR
Sbjct 431 FYWPVLANIGEQAVLNKEIYIDG---SDT-----DDEVFGYQEAWAEYRYKPNRVCGEMR 482
Query 181 SQYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTR 240
S QSLDVWHL DDYS+LP LSDSWIREDK NVDRVLAVTSSVS+QLFADIYI N+ TR
Sbjct 483 SSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVDRVLAVTSSVSDQLFADIYICNKATR 542
Query 241 PMPMYSIPGLIDHH 254
PMPMYSIPGLIDHH
Sbjct 543 PMPMYSIPGLIDHH 556
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 376 bits (965), Expect = 6e-124, Method: Compositional matrix adjust.
Identities = 180/254 (71%), Positives = 203/254 (80%), Gaps = 9/254 (4%)
Query 1 MAFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSAT 60
MAFQIQKLYEKDAR GSRY E+++SHF VT DAR+Q PEYLGGNR+PININQVVQ S T
Sbjct 324 MAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTSQT 383
Query 61 ASGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFD 120
S + QG V G S+T+D+H DF KSFTEHG +IGV VARYDHTYQQG+ + WSRK RFD
Sbjct 384 -SDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFD 442
Query 121 YYWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMR 180
YYWPV ANIGEQAV NKEI+AQG D++VFGYQEAWA+YRYKPS VTGEMR
Sbjct 443 YYWPVLANIGEQAVLNKEIYAQG--------TAQDEEVFGYQEAWAEYRYKPSIVTGEMR 494
Query 181 SQYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTR 240
S SLD WH ADDY++LP LS WI+EDK N+DRVLAV+SSVSNQ FAD YI+N TTR
Sbjct 495 SSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTR 554
Query 241 PMPMYSIPGLIDHH 254
+P YSIPGLIDHH
Sbjct 555 ALPFYSIPGLIDHH 568
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 296 bits (757), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 145/253 (57%), Positives = 174/253 (69%), Gaps = 11/253 (4%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
AFQ+QKL EKDARGG+RY EILK+HFGVT+ DAR+Q PEYLGG +VPIN++QVVQ SA+
Sbjct 319 AFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTSAST 378
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
+ QG +SVT + S FTKSF EHGF+IGV AR +YQQG+ER WSRKDR DY
Sbjct 379 DA-SPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDY 437
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
Y+PV ANIGEQA+ NKEI+AQ G DD+ FGYQEAWADYRYKP+ + G RS
Sbjct 438 YFPVLANIGEQAILNKEIYAQ--------GNAKDDEAFGYQEAWADYRYKPNTICGRFRS 489
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
QSLD WH DY LP LS W+ + + R LAV + A+ +T R
Sbjct 490 NAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEP--DFIANFRFNCKTVRV 547
Query 242 MPMYSIPGLIDHH 254
MP+YSIPGLIDH+
Sbjct 548 MPLYSIPGLIDHN 560
>gi|9634949|ref|NP_054647.1| structural protein [Chlamydia phage 2]
gi|7406589|emb|CAB85589.1| structural protein [Chlamydia phage 2]
Length=565
Score = 290 bits (742), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 137/252 (54%), Positives = 178/252 (71%), Gaps = 4/252 (2%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
AFQ+QKLYE+DARGG+RYIEI++SHF V SPDARLQR EYLGG+ P+NI+ + Q S+T
Sbjct 317 AFQLQKLYERDARGGTRYIEIIRSHFNVQSPDARLQRAEYLGGSSTPVNISPIPQTSSTD 376
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
S + QG + + FTKSFTEHG ++G+ R D YQQGL+R WSR+ R+D+
Sbjct 377 S-TSPQGNLAAYGTAIGSKRVFTKSFTEHGVILGLASVRADLNYQQGLDRMWSRRTRWDF 435
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
YWP +++GEQAV NKEI+ QGP VK++ G ++DDQVFGYQE +A+YRYK S++TG+ RS
Sbjct 436 YWPALSHLGEQAVLNKEIYCQGPSVKNSGGEIVDDQVFGYQERFAEYRYKTSKITGKFRS 495
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
SLD WHLA ++ LP LS +I E+ +DRVLAV S D + R RP
Sbjct 496 NATSSLDSWHLAQEFENLPTLSPEFIEENPP-MDRVLAV--STEPDFLLDGWFSLRCARP 552
Query 242 MPMYSIPGLIDH 253
MP+YS+PG IDH
Sbjct 553 MPVYSVPGFIDH 564
>gi|530695385|gb|AGT39938.1| major capsid protein [Marine gokushovirus]
Length=514
Score = 287 bits (734), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 143/252 (57%), Positives = 180/252 (71%), Gaps = 12/252 (5%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
AFQIQ+LYEKDARGG+RY E+++SHFGVTSPDARLQRPEYLGG + ININ + Q S+T
Sbjct 274 AFQIQRLYEKDARGGTRYTEVIQSHFGVTSPDARLQRPEYLGGGKDRININPIAQTSST- 332
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
T QG ++G T T F KSFTEH V+G+ D TYQQGL R +SR+ R+D+
Sbjct 333 DATTPQGNLSGYGTTGFTGHRFNKSFTEHSVVLGLACVFADLTYQQGLPRHFSRQTRWDF 392
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
YWP A++GEQAV NKEI+AQ G D+ VFGYQE +A+YRYKPS +TG+MRS
Sbjct 393 YWPALAHLGEQAVLNKEIYAQ--------GTTDDNNVFGYQERYAEYRYKPSSITGQMRS 444
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
+AQSLD+WHLA D+ +LP+L+ S+I E+ VDRV AV + + L D+Y K + RP
Sbjct 445 NFAQSLDIWHLAQDFGSLPVLNSSFIEENPP-VDRVTAVQNYPN--LILDMYFKLKCARP 501
Query 242 MPMYSIPGLIDH 253
MP Y +PGLIDH
Sbjct 502 MPTYGVPGLIDH 513
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 288 bits (737), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 134/249 (54%), Positives = 175/249 (70%), Gaps = 9/249 (4%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
A +Q + E DARGG+RY+EILK+ FGV+SPDARLQR EY+GG R+PIN++QV+Q SA+
Sbjct 327 AIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSASD 386
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
+ + QG S+TT ++ S EHG+++G+ R DH+YQQGL R W+R DRF Y
Sbjct 387 T-TSPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSY 445
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
Y P+ AN+GEQAV N+EI+AQG D +VFGYQEAWADYRY+ + +TGEMRS
Sbjct 446 YHPMLANLGEQAVLNQEIYAQG--------TTADTEVFGYQEAWADYRYRTNMITGEMRS 497
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
YAQSLD WH D Y+ LP LS+ WI+E + N+DR LAV S S+Q ++Y RP
Sbjct 498 TYAQSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRP 557
Query 242 MPMYSIPGL 250
MP+YS+PGL
Sbjct 558 MPIYSVPGL 566
>gi|47566141|ref|YP_022479.1| structural protein [Chlamydia phage 3]
gi|47522476|emb|CAD79477.1| structural protein [Chlamydia phage 3]
Length=565
Score = 288 bits (737), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 135/252 (54%), Positives = 179/252 (71%), Gaps = 4/252 (2%)
Query 2 AFQIQKLYEKDARGGSRYIEILKSHFGVTSPDARLQRPEYLGGNRVPININQVVQQSATA 61
AFQ+QKLYE+DARGG+RYIEI++SHF V SPDARLQR EYLGG+ P+NI+ + Q S+T
Sbjct 317 AFQLQKLYERDARGGTRYIEIIRSHFNVQSPDARLQRAEYLGGSSTPVNISPIPQTSSTD 376
Query 62 SGETAQGTVTGMSVTTDTHSDFTKSFTEHGFVIGVMVARYDHTYQQGLERFWSRKDRFDY 121
S + QG + + FTKSFTEHG ++G+ R D YQQGL+R WSR+ R+D+
Sbjct 377 S-TSPQGNLAAYGTAIGSKRVFTKSFTEHGVILGLASVRADLNYQQGLDRMWSRRTRWDF 435
Query 122 YWPVFANIGEQAVKNKEIFAQGPGVKDTAGAVIDDQVFGYQEAWADYRYKPSRVTGEMRS 181
YWP +++GEQAV NKEI+ QGP VK++ G ++D+QVFGYQE +A+YRYK S++TG+ RS
Sbjct 436 YWPALSHLGEQAVLNKEIYCQGPSVKNSGGEIVDEQVFGYQERFAEYRYKTSKITGKFRS 495
Query 182 QYAQSLDVWHLADDYSALPMLSDSWIREDKANVDRVLAVTSSVSNQLFADIYIKNRTTRP 241
SLD WHLA ++ LP LS +I E+ +DRVLAV++ D + R RP
Sbjct 496 NATSSLDSWHLAQEFENLPTLSPEFIEENPP-MDRVLAVSNEP--HFLLDGWFSLRCARP 552
Query 242 MPMYSIPGLIDH 253
MP+YS+PG IDH
Sbjct 553 MPVYSVPGFIDH 564
Lambda      K        H        a         alpha
   0.318    0.133    0.401    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6
Effective search space used: 1155625517970
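The gapped Lambda and K values relate the raw scores shown in parentheses on each "Score =" line to the reported bit scores through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The sketch below applies that formula to a few raw scores from this report. The function name `bit_score` is an assumption for illustration. Because Lambda and K are printed with only three significant digits, the computed values are expected to agree with the report only to within about one bit.

```python
import math

def bit_score(raw_score, lam, k):
    """Normalize a raw alignment score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Gapped parameters as printed in the report footer (rounded values).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410

if __name__ == "__main__":
    # (raw score, reported bit score) pairs copied from the "Score =" lines.
    for raw, reported in [(1085, 422), (965, 376), (757, 296), (742, 290)]:
        computed = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        print(f"raw={raw}  computed={computed:.1f}  reported={reported}")
```

The reported E-values are not reproduced here: with the "Compositional matrix adjust" method they are not expected to follow from the bit score and the effective search space alone.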