bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-18_CDS_annotation_glimmer3.pl_2_7
Length=346
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094431|emb|CDL65804.1| unnamed protein product 305 4e-95
gi|575096056|emb|CDL66947.1| unnamed protein product 297 4e-92
gi|575094544|emb|CDL65904.1| unnamed protein product 285 2e-87
gi|575094572|emb|CDL65928.1| unnamed protein product 285 2e-87
gi|575094492|emb|CDL65859.1| unnamed protein product 283 1e-86
gi|575094496|emb|CDL65862.1| unnamed protein product 272 2e-82
gi|557745632|ref|YP_008798242.1| major capsid protein 249 8e-74
gi|313766927|gb|ADR80653.1| putative major coat protein 247 2e-73
gi|575094415|emb|CDL65790.1| unnamed protein product 244 9e-72
gi|530695351|gb|AGT39907.1| major capsid protein 242 3e-71
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 305 bits (781), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 143/249 (57%), Positives = 177/249 (71%), Gaps = 2/249 (1%)
Query 95 AATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMN 154
AAT+NQLRQAF VQ E ARGG+RYRE ++ FGV+ SD +QIPEYLGG + +N++
Sbjct 310 AATVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVS 369
Query 155 QIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLER 214
Q+VQTS S +P G T A+SVTP ++S FTKSF+EHGF+IGV R SYQQG+ER
Sbjct 370 QVVQTSA--STDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIER 427
Query 215 FWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKM 274
WSR DRLDYYFP AN+GEQ + KEI G + D+E FGYQEAWADYR KPN + G+
Sbjct 428 MWSRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRF 487
Query 275 RSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRC 334
RSNA+ +LD WHY +Y +PTLS +WM++ E+ RTL V+ EP F R KT R
Sbjct 488 RSNAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRV 547
Query 335 MPLYSVPGL 343
MPLYS+PGL
Sbjct 548 MPLYSIPGL 556
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 297 bits (761), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 138/249 (55%), Positives = 176/249 (71%), Gaps = 2/249 (1%)
Query 97 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 156
TINQLR AF +Q +YE ARGGSRY E +R+ FGV+ D +Q EYLGG R +N+NQ+
Sbjct 318 TINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQV 377
Query 157 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 216
+Q SG S TP G MS T S FTKSF EHGF+IGVMC R+DH+YQQG++R W
Sbjct 378 IQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMW 437
Query 217 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 276
SR D+ DYY+P F+N+GEQ +K KEI G +TD+E FGYQEAWA+YR KP+RV+G+MRS
Sbjct 438 SRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRS 497
Query 277 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRC 334
+ +LD WH AD+Y+ +P+LS EW++E + R L V +N QFF I V N TR
Sbjct 498 SYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRP 557
Query 335 MPLYSVPGL 343
MP+YS+PGL
Sbjct 558 MPMYSIPGL 566
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 285 bits (729), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 137/260 (53%), Positives = 181/260 (70%), Gaps = 4/260 (2%)
Query 86 LGTDLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLG 145
L +L N AA+INQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLG
Sbjct 290 LVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLG 349
Query 146 GGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHD 205
G R +N+NQ++Q S E+ +P G S+T + F KSF EHGFVIG+M R+D
Sbjct 350 GNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYD 407
Query 206 HSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRM 265
H+YQQGLERFWSR DR DYY+P FA++GEQ V KEI +G + D+E FGYQEA+ADYR
Sbjct 408 HTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRY 467
Query 266 KPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFG 323
KP+RV+G+MRS A +LD WH AD+YA++P+LS W++E + + R L V + Q F
Sbjct 468 KPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFC 527
Query 324 AIRVMNKTTRCMPLYSVPGL 343
I + N++TR MP+YSVPGL
Sbjct 528 DIYIQNRSTRPMPMYSVPGL 547
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 285 bits (729), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 137/266 (52%), Positives = 179/266 (67%), Gaps = 9/266 (3%)
Query 85 WLGTDLSNIEA-----ATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQ 139
W T++ +E+ ATINQLR AF +Q YE ARGG+RY E +R+ FGV D +Q
Sbjct 289 WTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFGVVSPDSRLQ 348
Query 140 IPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGV 199
PEYLGG R +N+NQI+Q S +S +P+G MSVT S F KSF EHG++IG+
Sbjct 349 RPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSFVEHGYIIGL 406
Query 200 MCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEA 259
+ R+DH+YQQGL+R WSR DR D+Y+P AN+GEQ V KEI + G TD+E FGYQEA
Sbjct 407 VVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTDDEVFGYQEA 466
Query 260 WADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN-- 317
WA+YR KPNRV G+MRS+A +LD WH D+Y+++P LS W++E K + R L V +
Sbjct 467 WAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVDRVLAVTSSV 526
Query 318 EPQFFGAIRVMNKTTRCMPLYSVPGL 343
Q F I + NK TR MP+YS+PGL
Sbjct 527 SDQLFADIYICNKATRPMPMYSIPGL 552
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 283 bits (723), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 139/263 (53%), Positives = 176/263 (67%), Gaps = 8/263 (3%)
Query 86 LGTDLS---NIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPE 142
L DLS ++ ATINQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PE
Sbjct 288 LWADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPE 347
Query 143 YLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCV 202
YLGG R +N+NQ++Q+S + TP G A S+T + S FTKSF EHGF+IG+M
Sbjct 348 YLGGSRVPININQVIQSSETGA---TPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVA 404
Query 203 RHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWAD 262
R+DHSYQQGL+RFWSR DR DYY+P FANLGE VK KEI G D+E FGYQEAWAD
Sbjct 405 RYDHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWAD 464
Query 263 YRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQ 320
YR KP+ V+G+MRS +LD WH AD+Y +P+LS W++E + + R L V + Q
Sbjct 465 YRYKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQ 524
Query 321 FFGAIRVMNKTTRCMPLYSVPGL 343
F I + TR MPLYS+PGL
Sbjct 525 LFCDIYIRCLATRPMPLYSIPGL 547
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 272 bits (696), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 131/251 (52%), Positives = 170/251 (68%), Gaps = 4/251 (2%)
Query 95 AATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMN 154
A TINQLR AF +Q YE AR GSRYRE +R+ F V+ D +Q+PEYLGG R +N+N
Sbjct 316 ATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININ 375
Query 155 QIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLER 214
Q+VQTS Q S+ +P G S+T + F KSF EHG +IGV R+DH+YQQG+ +
Sbjct 376 QVVQTS-QTSDV-SPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSK 433
Query 215 FWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKM 274
WSR R DYY+P AN+GEQ V KEI G + DEE FGYQEAWA+YR KP+ V+G+M
Sbjct 434 LWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEM 493
Query 275 RSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEP--QFFGAIRVMNKTT 332
RS+A +LD WH+AD+Y ++P LS +W+KE K I R L V + Q+F + N+TT
Sbjct 494 RSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETT 553
Query 333 RCMPLYSVPGL 343
R +P YS+PGL
Sbjct 554 RALPFYSIPGL 564
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 249 bits (635), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 133/285 (47%), Positives = 171/285 (60%), Gaps = 2/285 (1%)
Query 58 NNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYEALARG 117
N GNT +N D+ L DLS +ATINQLR AFA Q + E ARG
Sbjct 252 NIGNTHRFLNSASTNVYPGDENTDEARRLYADLSEATSATINQLRLAFATQKFLEIQARG 311
Query 118 GSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMS 177
GSRY E ++ F V+ D +Q PEYLGGG VN++ + QTS ++ TP G A+
Sbjct 312 GSRYIEVIKNHFNVTSPDARLQRPEYLGGGSSPVNISPVAQTSSTDAT--TPQGNLSAIG 369
Query 178 VTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPV 237
T ++ SFTKSF EH VIG++ VR D +YQQGL R +SR DYY+P + +GEQ V
Sbjct 370 TTVLSGHSFTKSFTEHTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAV 429
Query 238 KKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTL 297
K KEI G + DE TFGYQE +A+YR KP+ V+GK RSNA GTL+ WHYA YA++P L
Sbjct 430 KNKEIYAQGSAADETTFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLL 489
Query 298 SQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPG 342
W++ + RTL V +EPQF + TR MP+ S+PG
Sbjct 490 GDSWIQVTDTNVQRTLAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 247 bits (631), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 126/255 (49%), Positives = 168/255 (66%), Gaps = 4/255 (2%)
Query 89 DLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGR 148
DLSN AATINQLR+AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG +
Sbjct 264 DLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQK 323
Query 149 YHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSY 208
V M + QTS +S +P G A+ T + F+KSF EHG +IG+ CV D +Y
Sbjct 324 TEVMMQTVPQTSSTDST--SPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTY 380
Query 209 QQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPN 268
QQG+ R WSR DR D+Y+P A+LGEQ V +EI G S D +TFGYQE +A+YR KP+
Sbjct 381 QQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPS 440
Query 269 RVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVM 328
+++GKMRSNA GTLD WH A ++ +P L+ +++E + R + V +EP+F
Sbjct 441 QITGKMRSNATGTLDAWHLAQDFTALPALNASFIEENP-PVDRVIAVPSEPEFIWDWYFD 499
Query 329 NKTTRCMPLYSVPGL 343
KTTR MP+YSVPGL
Sbjct 500 LKTTRPMPVYSVPGL 514
>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569
Score = 244 bits (623), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 159/252 (63%), Gaps = 4/252 (2%)
Query 97 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 156
+IN LRQA A+QH EA ARGG+RY E ++ FGVS D +Q EY+GG R +N++Q+
Sbjct 320 SINDLRQAIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQV 379
Query 157 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 216
+Q+S ++ +P G A S+T + S EHG+++G+ +R DHSYQQGL R W
Sbjct 380 IQSSASDTT--SPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMW 437
Query 217 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 276
+RSDR YY P ANLGEQ V +EI G + D E FGYQEAWADYR + N ++G+MRS
Sbjct 438 TRSDRFSYYHPMLANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRS 497
Query 277 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRC 334
+LD WHY D Y +P LS +W+KEG+ I RTL V EN QF + R
Sbjct 498 TYAQSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRP 557
Query 335 MPLYSVPGLEKL 346
MP+YSVPGL +
Sbjct 558 MPIYSVPGLSMI 569
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 242 bits (617), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/299 (47%), Positives = 182/299 (61%), Gaps = 20/299 (7%)
Query 53 SVNKNNNGN-TAPL--VNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQH 109
SV+K NGN + P VN QY D N L DLS AATIN +RQ+F +Q
Sbjct 249 SVSKEANGNMSVPTGSVNAQY-------DPN---GSLVADLSTATAATINAIRQSFQIQR 298
Query 110 YYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQ-ESNYGT 168
E ARGG+RY E VR+ FGV D +Q PEYLGGG + +N + Q S S T
Sbjct 299 LLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDT 358
Query 169 PIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQ 228
P+G GA+ + F SF EHG V+G+ VR D +YQQGL R +SRS R D++FP
Sbjct 359 PLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFPV 418
Query 229 FANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYA 288
F++LGEQP+ KE+ TG STD++ FGYQEAWA+YR KP++V+G MRS A GTLD WH A
Sbjct 419 FSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDAWHLA 478
Query 289 DNYATVPTLSQEWMKEGKNEIARTLIVENEP---QF-FGAIRVMNKTTRCMPLYSVPGL 343
N+ ++PTL+ ++ E + R + V +E QF F A +N R MP+YSVPGL
Sbjct 479 QNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQFIFDAFFDINM-ARPMPMYSVPGL 535
Lambda K H a alpha
0.316 0.132 0.392 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2041309051650