BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-43_CDS_annotation_glimmer3.pl_2_1
Length=306
                                                              Score     E
Sequences producing significant alignments:                   (Bits)  Value
gi|575094431|emb|CDL65804.1|      unnamed protein product       293   4e-91
gi|575096056|emb|CDL66947.1|      unnamed protein product       286   4e-88
gi|575094572|emb|CDL65928.1|      unnamed protein product       272   4e-83
gi|575094544|emb|CDL65904.1|      unnamed protein product       270   2e-82
gi|575094492|emb|CDL65859.1|      unnamed protein product       265   2e-80
gi|575094496|emb|CDL65862.1|      unnamed protein product       261   6e-79
gi|557745632|ref|YP_008798242.1|  major capsid protein          240   5e-71
gi|444298010|dbj|GAC77834.1|      major capsid protein          235   8e-70
gi|530695351|gb|AGT39907.1|       major capsid protein          237   9e-70
gi|313766927|gb|ADR80653.1|       putative major coat protein   232   5e-68
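Each summary line above carries a sequence identifier, a free-text description, a bit score, and an E-value, in that order. The following is a minimal sketch of how such a line could be split into fields. The names `Hit` and `parse_hit_line` are hypothetical helpers introduced here for illustration and are not part of BLAST. The sketch assumes the identifier contains no whitespace and that the last two whitespace-separated tokens are the score and the E-value.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    seq_id: str
    description: str
    bit_score: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split a summary line into identifier, description, score and E-value.

    Assumption: the first token is the identifier and the last two tokens
    are the bit score and the E-value; everything in between is description.
    """
    parts = line.split()
    if len(parts) < 3:
        raise ValueError(f"not a hit summary line: {line!r}")
    return Hit(
        seq_id=parts[0],
        description=" ".join(parts[1:-2]),
        bit_score=float(parts[-2]),
        evalue=float(parts[-1]),
    )


# Example using the first hit reported above.
hit = parse_hit_line(
    "gi|575094431|emb|CDL65804.1| unnamed protein product 293 4e-91"
)
print(hit.seq_id, hit.description, hit.bit_score, hit.evalue)
```

Splitting on whitespace rather than on fixed columns keeps the sketch tolerant of differences in column padding between copies of the report.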
>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560
Score = 293 bits (750), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 138/249 (55%), Positives = 176/249 (71%), Gaps = 2/249 (1%)
Query 55 SATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNIN 114
+AT+NQLRQA VQ+ E ARGG+RYRE ++ + V SD +Q+PEYLGG + +N++
Sbjct 310 AATVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVS 369
Query 115 QIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLER 174
Q+VQTS +++ +P G T A+SVTP ++S FTKSF+EHGF+IGV R +SYQQG+ER
Sbjct 370 QVVQTSA--STDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIER 427
Query 175 FWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLM 234
WSRKDRLDYY P AN+GEQ + KEI G+ DDE FGYQEAWADYR KPN + G
Sbjct 428 MWSRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRF 487
Query 235 RSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAIRIANKTTRR 294
RSNA +L+ WHY +Y K+PTLS +WM + E+ RTL V+ EP F R KT R
Sbjct 488 RSNAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRV 547
Query 295 MPLYSVPGL 303
MPLYS+PGL
Sbjct 548 MPLYSIPGL 556
>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570
Score = 286 bits (731), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 134/249 (54%), Positives = 174/249 (70%), Gaps = 2/249 (1%)
Query 57 TINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNINQI 116
TINQLR A +Q++YE ARGGSRY E IR+ + V D +Q EYLGG R +NINQ+
Sbjct 318 TINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQV 377
Query 117 VQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLERFW 176
+Q SG +++ TP G MS T S FTKSF EHGF+IGV C R++ +YQQG++R W
Sbjct 378 IQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMW 437
Query 177 SRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLMRS 236
SRKD+ DYY P F+N+GEQ +K KEI G+ TDDE FGYQEAWA+YR KP+RV+G MRS
Sbjct 438 SRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRS 497
Query 237 NATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDE--PQFFGAIRIANKTTRR 294
+ +L+ WH AD+Y+K+P+LS EW+ E + + R L V D+ QFF I + N TR
Sbjct 498 SYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRP 557
Query 295 MPLYSVPGL 303
MP+YS+PGL
Sbjct 558 MPMYSIPGL 566
>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556
Score = 272 bits (696), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 138/280 (49%), Positives = 177/280 (63%), Gaps = 9/280 (3%)
Query 31 VLLGKDAGGV--STWVPME---ARLDNATSATINQLRQAISVQQYYEALARGGSRYREQI 85
V +G D G+ + W P + ATINQLR A +Q+ YE ARGG+RY E I
Sbjct 275 VEVGSDGTGIGQNFWTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEII 334
Query 86 RAIWDVIISDKTVQVPEYLGGGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESS 145
R+ + V+ D +Q PEYLGG R +N+NQI+Q S Q++ +P+G MSVT S
Sbjct 335 RSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSD 392
Query 146 FTKSFEEHGFVIGVCCVRHNRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLT 205
F KSF EHG++IG+ R++ +YQQGL+R WSRKDR D+Y P AN+GEQ V KEI +
Sbjct 393 FIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYID 452
Query 206 GDTTDDETFGYQEAWADYRMKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEG 265
G TDDE FGYQEAWA+YR KPNRV G MRS+A +L+ WH D+Y+ +P LS W+ E
Sbjct 453 GSDTDDEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIRED 512
Query 266 KEEIARTLIVED--EPQFFGAIRIANKTTRRMPLYSVPGL 303
K + R L V Q F I I NK TR MP+YS+PGL
Sbjct 513 KTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL 552
>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 270 bits (690), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 135/260 (52%), Positives = 175/260 (67%), Gaps = 4/260 (2%)
Query 46 MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG 105
+ A L NAT+A+INQLR A +Q+ YE ARGG+RY E +++ + V D +Q PEYLG
Sbjct 290 LVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLG 349
Query 106 GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN 165
G R +NINQ++Q S +T++ +P G S+T + F KSF EHGFVIG+ R++
Sbjct 350 GNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYD 407
Query 166 RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM 225
+YQQGLERFWSRKDR DYY P FA++GEQ V KEI +G DDE FGYQEA+ADYR
Sbjct 408 HTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRY 467
Query 226 KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFFG 283
KP+RV+G MRS A +L+ WH AD+YA +P+LS W+ E + R L V Q F
Sbjct 468 KPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFC 527
Query 284 AIRIANKTTRRMPLYSVPGL 303
I I N++TR MP+YSVPGL
Sbjct 528 DIYIQNRSTRPMPMYSVPGL 547
>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551
Score = 265 bits (678), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 137/261 (52%), Positives = 167/261 (64%), Gaps = 8/261 (3%)
Query 48 ARLDNATS---ATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYL 104
A L AT ATINQLR A +Q+ YE ARGG+RY E +++ + V D +Q PEYL
Sbjct 290 ADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYL 349
Query 105 GGGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRH 164
GG R +NINQ++Q+S + TP G A S+T + S FTKSF EHGF+IG+ R+
Sbjct 350 GGSRVPININQVIQSS---ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARY 406
Query 165 NRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYR 224
+ SYQQGL+RFWSRKDR DYY P FANLGE VK KEI G DDE FGYQEAWADYR
Sbjct 407 DHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYR 466
Query 225 MKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFF 282
KP+ V+G MRS +L+ WH AD+Y +P+LS W+ E + R L V D Q F
Sbjct 467 YKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLF 526
Query 283 GAIRIANKTTRRMPLYSVPGL 303
I I TR MPLYS+PGL
Sbjct 527 CDIYIRCLATRPMPLYSIPGL 547
>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568
Score = 261 bits (668), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 166/254 (65%), Gaps = 4/254 (2%)
Query 52 NATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHV 111
+ T+ TINQLR A +Q+ YE AR GSRYRE IR+ + V D +QVPEYLGG R +
Sbjct 313 SGTATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPI 372
Query 112 NINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQG 171
NINQ+VQTS QTS+ +P G S+T + F KSF EHG +IGV R++ +YQQG
Sbjct 373 NINQVVQTS--QTSDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQG 430
Query 172 LERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVS 231
+ + WSRK R DYY P AN+GEQ V KEI G D+E FGYQEAWA+YR KP+ V+
Sbjct 431 VSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVT 490
Query 232 GLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVED--EPQFFGAIRIAN 289
G MRS+A +L+ WH+AD+Y +P LS +W+ E K I R L V Q+F I N
Sbjct 491 GEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIEN 550
Query 290 KTTRRMPLYSVPGL 303
+TTR +P YS+PGL
Sbjct 551 ETTRALPFYSIPGL 564
>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538
Score = 240 bits (612), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 127/257 (49%), Positives = 158/257 (61%), Gaps = 2/257 (1%)
Query 46 MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG 105
+ A L ATSATINQLR A + Q++ E ARGGSRY E I+ ++V D +Q PEYLG
Sbjct 280 LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVTSPDARLQRPEYLG 339
Query 106 GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN 165
GG VNI+ + QTS T TP G A+ T ++ SFTKSF EH VIG+ VR +
Sbjct 340 GGSSPVNISPVAQTS--STDATTPQGNLSAIGTTVLSGHSFTKSFTEHTIVIGMVSVRTD 397
Query 166 RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM 225
+YQQGL R +SR+ DYY P + +GEQ VK KEI G D+ TFGYQE +A+YR
Sbjct 398 LTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADETTFGYQERYAEYRY 457
Query 226 KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAI 285
KP+ V+G RSNATGTLE WHYA YA +P L W+ + RTL V EPQF
Sbjct 458 KPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRTLAVASEPQFIFDS 517
Query 286 RIANKTTRRMPLYSVPG 302
+ TR MP+ S+PG
Sbjct 518 LFKLRCTRPMPVNSIPG 534
>gi|444298010|dbj|GAC77834.1| major capsid protein [uncultured marine virus]
Length=480
Score = 235 bits (600), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 124/304 (41%), Positives = 175/304 (58%), Gaps = 18/304 (6%)
Query 2 TSNGANHLVPASGNTLGAPKTDEDNGKPRVLLGKDAGGVSTWVPMEARLDNATSATINQL 61
T + ANH+ A+ + + ++ G P + A L NAT+ATINQL
Sbjct 189 TVSYANHIESATAASFAFEEDPDNAGFPNI---------------RADLTNATAATINQL 233
Query 62 RQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLGGGRYHVNINQIVQT-- 119
RQA +Q+ E ARGG+RY E IRA + V+ D +Q PEYLGGG ++NI I QT
Sbjct 234 RQAFQIQKLLERDARGGTRYTEIIRAHFSVLSPDSRLQRPEYLGGGSSNINITPIAQTQR 293
Query 120 SGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHNRSYQQGLERFWSRK 179
S T ++TP G A+ + + FTKSF EHG+++G+C VR + +YQQG++R WSR
Sbjct 294 SDTTTPDETPQGNLAAIGTSAFSGHGFTKSFTEHGYILGLCEVRADLTYQQGIDRLWSRD 353
Query 180 DRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRMKPNRVSGLMRSNAT 239
R D+Y P +++GEQ V KEI D++ FGYQE +A+YR KP+R+S L RSNA
Sbjct 354 TRYDFYWPALSHIGEQAVLSKEIFADATAGDEDVFGYQERFAEYRYKPSRISSLFRSNAA 413
Query 240 GTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAIRIANKTTRRMPLYS 299
+L+ WH + ++A P L+ ++ E I R + V DEP + R +PLY
Sbjct 414 ASLDVWHLSQDFAARPVLNSTFI-EDTPPIDRVIAVTDEPHILLDAYFKLRCARPLPLYG 472
Query 300 VPGL 303
VPGL
Sbjct 473 VPGL 476
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 237 bits (604), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 120/262 (46%), Positives = 163/262 (62%), Gaps = 5/262 (2%)
Query 46 MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG 105
+ A L AT+ATIN +RQ+ +Q+ E ARGG+RY E +R+ + VI D +Q PEYLG
Sbjct 275 LVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLG 334
Query 106 GGRYHVNINQIVQTSGQQTS-NDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRH 164
GG + +N + Q S S DTP+G GA+ + F SF EHG V+G+C VR
Sbjct 335 GGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRA 394
Query 165 NRSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYR 224
+ +YQQGL R +SR R D++ P F++LGEQP+ KE+ TG +TDD+ FGYQEAWA+YR
Sbjct 395 DLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYR 454
Query 225 MKPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEP---QF 281
KP++V+GLMRS A GTL+ WH A N+ +PTL+ ++ E + R + V E QF
Sbjct 455 YKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQF 513
Query 282 FGAIRIANKTTRRMPLYSVPGL 303
R MP+YSVPGL
Sbjct 514 IFDAFFDINMARPMPMYSVPGL 535
>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533
Score = 232 bits (591), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 120/258 (47%), Positives = 166/258 (64%), Gaps = 4/258 (2%)
Query 46 MEARLDNATSATINQLRQAISVQQYYEALARGGSRYREQIRAIWDVIISDKTVQVPEYLG 105
M A L NAT+ATINQLR+A +Q+ YE ARGG+RY E +++ + V D +Q PEYLG
Sbjct 261 MFADLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLG 320
Query 106 GGRYHVNINQIVQTSGQQTSNDTPIGETGAMSVTPINESSFTKSFEEHGFVIGVCCVRHN 165
G + V + + QTS T + +P G A+ T + F+KSF EHG +IG+ CV +
Sbjct 321 GQKTEVMMQTVPQTS--STDSTSPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFAD 377
Query 166 RSYQQGLERFWSRKDRLDYYVPQFANLGEQPVKKKEIMLTGDTTDDETFGYQEAWADYRM 225
+YQQG+ R WSR+DR D+Y P A+LGEQ V +EI G + D +TFGYQE +A+YR
Sbjct 378 LTYQQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRY 437
Query 226 KPNRVSGLMRSNATGTLEFWHYADNYAKVPTLSQEWMAEGKEEIARTLIVEDEPQFFGAI 285
KP++++G MRSNATGTL+ WH A ++ +P L+ ++ E + R + V EP+F
Sbjct 438 KPSQITGKMRSNATGTLDAWHLAQDFTALPALNASFIEE-NPPVDRVIAVPSEPEFIWDW 496
Query 286 RIANKTTRRMPLYSVPGL 303
KTTR MP+YSVPGL
Sbjct 497 YFDLKTTRPMPVYSVPGL 514
Lambda   K        H        a        alpha
0.315    0.132    0.395    0.792    4.96
Gapped
Lambda   K        H        a        alpha    sigma
0.267    0.0410   0.140    1.90     42.6     43.6
Effective search space used: 1647025809192
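The gapped Lambda and K values reported above relate the raw scores (the numbers in parentheses on each "Score =" line) to bit scores through the standard normalisation S' = (Lambda * S - ln K) / ln 2. The sketch below applies that formula to the raw scores in this report. The function name `bit_score` is a hypothetical helper, and because the printed Lambda and K are rounded, the result is expected to match the reported bit score only to within about one bit.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Convert a raw alignment score to a bit score.

    Standard normalisation: S' = (lambda * S - ln K) / ln 2.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the "Score = ... bits (raw)" lines of the first two hits.
for reported_bits, raw in [(293, 750), (286, 731)]:
    approx = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
    print(f"raw={raw} reported={reported_bits} recomputed={approx:.1f}")
```

The sketch stops at bit scores and does not attempt to reproduce the E-values, which also depend on length adjustments and on the composition-based statistics named in the "Method:" field.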