Bit-score color key: <40, 40-50, 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-8_CDS_annotation_glimmer3.pl_2_6
Length=745
                                                               Score  E
Sequences producing significant alignments:                   (Bits)  Value

gi|575094326|emb|CDL65712.1|  unnamed protein product            398  3e-123
gi|639237429|ref|WP_024568106.1|  hypothetical protein           129  2e-28
gi|649569140|gb|KDS75238.1|  capsid family protein               124  5e-27
gi|649555287|gb|KDS61824.1|  capsid family protein               125  7e-27
gi|609718276|emb|CDN73650.1|  conserved hypothetical protein     122  6e-26
gi|547920049|ref|WP_022322420.1|  capsid protein VP1             120  3e-25
gi|492501782|ref|WP_005867318.1|  hypothetical protein           108  1e-21
gi|575096093|emb|CDL66973.1|  unnamed protein product            103  9e-20
gi|649557305|gb|KDS63784.1|  capsid family protein              98.6  2e-19
gi|444298000|dbj|GAC77839.1|  major capsid protein               101  2e-19

>gi|575094326|emb|CDL65712.1| unnamed protein product [uncultured bacterium]
Length=758
Score = 398 bits (1022), Expect = 3e-123, Method: Compositional matrix adjust.
Identities = 202/429 (47%), Positives = 283/429 (66%), Gaps = 16/429 (4%)
Query 318 YPFRTQKNDTAEHPKLLAYRFRAYESVYNAYYRDIRNNPFVVNGRPVYNKWLPTMKGGAD 377
YP+ N + + KL AY FRAYE++YNAY R+ RNNPFV+NG+ YN+W+ T GG+D
Sbjct 345 YPYYGSANMSDKAIKLSAYPFRAYEAIYNAYIRNTRNNPFVLNGKKTYNRWITTDAGGSD 404
Query 378 S-TLYQLHQCNWERDFLTTAVPNPQQGMNTPLVGLTIGDVVTRSEDGTLSVQKQTVLVDE 436
+ T L NW+ D TTA+ PQQG+ PLVGLT ++ + ++ G T +VDE
Sbjct 405 TLTPRDLRFANWQSDAYTTALTAPQQGV-APLVGLTTYEIRSVNDAGHEVTTVNTAIVDE 463
Query 437 DGSKYGVSYRVSEDGERLVGVDYDPVSEKTPVTAINSYaelaalaaeQSAGFTIETLRYV 496
+G+ Y V + +GE L GV+Y P+ V +L + ++G +I R V
Sbjct 464 EGNAYKVDFE--SNGEALKGVNYTPLKAGEAVNM-------QSLVSPVTSGISINDFRNV 514
Query 497 NAYQKFLELNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQG 556
NAYQ++LELN +GFSYK+I++GR+D+++R+D L MPE++GGI+R++ + + QTV+ G
Sbjct 515 NAYQRYLELNQFRGFSYKEIIEGRFDVNVRYDALNMPEYLGGITRDIVVNPITQTVETTG 574
Query 557 SSSQGQYAEALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTY 616
S G Y +LGS++G+A +GNT +I VFCDEES ++G++ V P+P+Y LLPK TY
Sbjct 575 S---GSYVGSLGSQSGLATCFGNTDGSISVFCDEESIVMGIMYVMPMPVYDSLLPKWLTY 631
Query 617 NGLLDHYQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHG 676
LD + PEFD IG+QPI KE+ PM FGYQRPWYEYVAK D AHG
Sbjct 632 RERLDSFNPEFDHIGYQPIYAKELGPMQCVQDDIDP--NTVFGYQRPWYEYVAKPDRAHG 689
Query 677 LFRTDMKNFVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTDKIFGYVKFNATARLPI 736
LF + ++NF+M RSF +P+LGQ F ++ P +VN VFSVTE +DKI G + F+ TA+LPI
Sbjct 690 LFLSSLRNFIMFRSFDNVPELGQSFTVMQPGSVNNVFSVTEVSDKILGQIHFDCTAQLPI 749
Query 737 SRVAIPRLD 745
SRV +PRL+
Sbjct 750 SRVVVPRLE 758
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/131 (54%), Positives = 88/131 (67%), Gaps = 7/131 (5%)
Query 4 NVFDATFDANNRIDVNSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPM 63
+VF+ D N + NSFDWSH NN TT+ GRITPVF ELVP S+RI PEFGL MPM
Sbjct 3 SVFNKIGDIKNDVKRNSFDWSHDNNFTTDLGRITPVFTELVPPNSSIRIKPEFGLRFMPM 62
Query 64 VFPVQTRMFARLNFFKVTLRSMWEDYPDFISNFRDDLEE---PYILPDKHGFER--MLKT 118
+FP+QT+M A L+F+KV LR++W DY DFIS+ D+ EE PY+ D + L
Sbjct 63 MFPIQTKMKAYLSFYKVPLRTLWADYMDFISS--DNTEEFQPPYMSFDSTDYSEGGTLAP 120
Query 119 NTLGDYLGIPT 129
+ LGDY GIPT
Sbjct 121 SGLGDYFGIPT 131
>gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis]
Length=546
Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/458 (27%), Positives = 203/458 (44%), Gaps = 48/458 (10%)
Query 308 PELSEVDYDTYPFRTQK----NDTAEHPKLLAYRFRAYESVYNAYYRD--IRNNPFV-VN 360
P+ S DY P + ND ++ F AY+ +++ YYRD + ++ FV N
Sbjct 110 PKSSLGDYLGLPLTEGRFAVGNDGVLPDRVSMLPFLAYQKIWDEYYRDENLIDSVFVDKN 169
Query 361 G--RPVY----NKWLPTMKGGADSTLYQLHQCNWERDFLTTAVPNPQQG--------MNT 406
G R ++ N W P++ L+ + + W D+ T+A+P Q+G M
Sbjct 170 GDKRELFIDGINYWNPSLPY-EFRQLFDIKKRAWHHDYFTSALPFAQKGAAVKMPLQMTA 228
Query 407 PLVGLTIGDVVTRSEDGTLSVQKQTVLVDEDGSKYGVSYRVSEDGERLVGVDYDPVSEKT 466
L G+ + DG+LS T EDGS V DG + V+
Sbjct 229 DLFYNPGGNTFVKKPDGSLS---HTGFRLEDGS-------VPADGIGHLMVETSSTGNSN 278
Query 467 PVTAINSYaelaalaaeQSAGFTIETLRYVNAYQKFLELNMRKGFSYKQIMQGRWDIDIR 526
PV NS + + ++G TI LR Q++LE N R G Y + + + +
Sbjct 279 PVNIDNS--SNLGVDLKTASGSTINDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTS 336
Query 527 FDELLMPEFIGGISRELSMRTVEQTVDQQGSSSQGQYAEALGSKTGIAGVYGNTSNNIEV 586
L PEF+GG + + V Q ++ QG A G G G +
Sbjct 337 DGRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQGNMA-GHGISVGKEGGFSK------- 388
Query 587 FCDEESYIIGLLTVTPVPIYTQLLPKDFTYNGLLDHYQPEFDRIGFQPITYKEICPMNAD 646
F +E Y+IGL++V P Y+Q +P+ F+ D++ P+F+ IG QP+ KEI N
Sbjct 389 FFEEHGYVIGLMSVIPKTSYSQGIPRHFSKFDKFDYFWPQFEHIGEQPVYNKEIFAKNVG 448
Query 647 NSTSSGFIERTFGYQRPWYEYVAKYDNAHGLFRTDMKNFVMHRSF--TGLPQLGQQFLLV 704
+ S G FGY + EY HG F+ + + + R F + P+L + F+ V
Sbjct 449 DYDSGG----VFGYVPRYSEYKYSPSTIHGDFKDTLYFWHLGRIFDSSAPPKLNRDFIEV 504
Query 705 DPNAVNQVFSVTEYTDKIFGYVKFNATARLPISRVAIP 742
+ + ++++F+V + +DK + ++ TA+ +S P
Sbjct 505 NKSGLSRIFAVEDNSDKFYCHLYQKITAKRKMSYFGDP 542
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 62/117 (53%), Gaps = 3/117 (3%)
Query 19 NSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPMVFPVQTRMFARLNFF 78
++F+ S+ + NFG + P+ C+ + + INP+ L PM+ PV + +++F
Sbjct 15 STFNMSYDRKFSMNFGDLVPIHCQEIVPGDKISINPQHMTRLAPMLAPVMHEVNVFIHYF 74
Query 79 KVTLRSMWEDYPDFISNFRDDLEEPYILPDKHGFERMLKTNTLGDYLGIPTQRTTFS 135
V R +W+++ FI+ + L+ ++LP + ++LGDYLG+P F+
Sbjct 75 FVPNRILWKNWEAFITGGQSGLDA-HMLPVVQNLP--VPKSSLGDYLGLPLTEGRFA 128
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 124 bits (310), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 113/404 (28%), Positives = 179/404 (44%), Gaps = 59/404 (15%)
Query 332 KLLAYRFRAYESVYNAYYRDIRNNPFVVNGRPVYNKWLPTMKGGADSTLYQLHQCNWERD 391
K+ A FRAY +YN YYRD + N LP +S+L+QLH+ WE+D
Sbjct 6 KVSALPFRAYHLIYNEYYRDQNLTSELEITLDSGNYQLPV-----NSSLWQLHRRAWEKD 60
Query 392 FLTTAVPNPQQG--MNTPLVGLTIGDVVTRSEDGTLSVQKQTVLVDE---DGSKYGVSYR 446
+ T+A+P Q+G + P+ G G++ ++G + QK T D GS+ V Y
Sbjct 61 YFTSALPWVQRGPEVTVPINGG--GEIPVEMKEG-FAAQKITTFPDRKPISGSE--VLYS 115
Query 447 V----------SEDGERLVGVDYDPVSEKTPVTAINSYaelaalaaeQSAGFTIETLRYV 496
S G+ L+ D V+ G I +R
Sbjct 116 APSVLSYGQIGSIKGQALIEPDNFVVN-------------------TDQMGVNINDIRTS 156
Query 497 NAYQKFLELNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQG 556
NA Q++ E N R G Y + + + + L P+F+GG +S+ V QT
Sbjct 157 NALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDS 216
Query 557 SSSQGQYAEALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTY 616
+S Q A G+ ++ + +E YI+G++++ P Y Q +PKDF
Sbjct 217 TSPQANMAGH--------GISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRK 268
Query 617 NGLLDHYQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHG 676
+D Y PEF +G Q I +E+ +N ++ + G TFGY + EY + HG
Sbjct 269 FDNMDFYFPEFAHLGEQEIKNEELY-LNESDAANEG----TFGYTPRYAEYKYSQNEVHG 323
Query 677 LFRTDMKNFVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTD 720
FR +M + ++R F P L F+ +P+ N+VF+ E +D
Sbjct 324 DFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD 365
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 125 bits (313), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/420 (28%), Positives = 185/420 (44%), Gaps = 64/420 (15%)
Query 320 FRTQKNDTAEHP---KLLAYRFRAYESVYNAYYRDIRNNPFVVNGRPVYNKWLPTMKGGA 376
F+ Q + + P K+ A FRAY +YN YYRD + N LP
Sbjct 142 FQVQSPNGVKAPAGFKVSALPFRAYHLIYNEYYRDQNLTSELEITLDSGNYQLPV----- 196
Query 377 DSTLYQLHQCNWERDFLTTAVPNPQQGMNTPLVGLTI---GDVVTRSEDGTLSVQKQTVL 433
+S+L+QLH+ WE+D+ T+A+P Q+G P V + I G++ ++G + QK T
Sbjct 197 NSSLWQLHRRAWEKDYFTSALPWVQRG---PEVTVPINGGGEIPVEMKEG-FAAQKITTF 252
Query 434 VDE---DGSKYGVSYRV----------SEDGERLVGVDYDPVSEKTPVTAINSYaelaal 480
D GS+ V Y S G+ L+ D V+
Sbjct 253 PDRKPISGSE--VLYSAPSVLSYGQIGSIKGQALIEPDNFVVN----------------- 293
Query 481 aaeQSAGFTIETLRYVNAYQKFLELNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGIS 540
G I +R NA Q++ E N R G Y + + + + L P+F+GG
Sbjct 294 --TDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 351
Query 541 RELSMRTVEQTVDQQGSSSQGQYAEALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTV 600
+S+ V QT +S Q A G+ ++ + +E YI+G++++
Sbjct 352 TPISVSEVLQTSSTDSTSPQANMAGH--------GISAGVNHGFTRYFEEHGYIMGIMSI 403
Query 601 TPVPIYTQLLPKDFTYNGLLDHYQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGY 660
P Y Q +PKDF +D Y PEF +G Q I +E+ +N ++ + G TFGY
Sbjct 404 RPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEELY-LNESDAANEG----TFGY 458
Query 661 QRPWYEYVAKYDNAHGLFRTDMKNFVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTD 720
+ EY + HG FR +M + ++R F P L F+ +P+ N+VF+ E +D
Sbjct 459 TPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD 516
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 73/162 (45%), Gaps = 12/162 (7%)
Query 19 NSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPMVFPVQTRMFARLNFF 78
N F+ S+ N LT N G + P+ C+ V R+N E + L P+V P+ R+ ++F
Sbjct 16 NVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTHYF 75
Query 79 KVTLRSMWEDYPDFISNFRDDLEEP----YILP---DKHGFERMLKTNTLGDYLGIPTQR 131
V R +W + DFI+ D + P Y P D +L DYLG+P
Sbjct 76 FVPNRLIWNKWEDFITKGVDGTDSPVFPTYSFPSTVDTANAHNSFGDGSLWDYLGLP--- 132
Query 132 TTFSNYSSSAINHCTPSGSSTSEGFYFVS-PSLSWDDIYSQW 172
+ + + +P+G GF + P ++ IY+++
Sbjct 133 -SINQIGEAVFQVQSPNGVKAPAGFKVSALPFRAYHLIYNEY 173
>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537
Score = 122 bits (306), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 111/418 (27%), Positives = 183/418 (44%), Gaps = 41/418 (10%)
Query 338 FRAYESVYNAYYRD----------IRNNPFVVNGRPVYNKWLPTMKGGADSTLYQLHQCN 387
F AY+ +++ +YRD NP + + + LP + L+++ +
Sbjct 144 FLAYQKIWDEFYRDENLIQPLFRDSNGNPVKMFNDGINDHNLPPYSKFTE--LFKMRKRA 201
Query 388 WERDFLTTAVPNPQQGMNTPLVGLTIGDVVTRSEDGTLSVQKQTVLVDEDGSKYGVSYRV 447
W D+ T+A+P Q+G + G+V E G+ QT + D G+
Sbjct 202 WHHDYFTSALPFAQKGNAVKIPIFPQGNVPLTYEMGS-----QTFIKDMAGNPAPNKDLR 256
Query 448 SEDGERLVGVDYDPVS-EKTPVTAINSYaelaalaaeQSAGFTIETLRYVNAYQKFLELN 506
S+ L V P+S + + +N +E + T+ LR Q++LE N
Sbjct 257 SDVNGNLQDVSGQPLSLDPSKNLKLNMASENVS---------TVNDLRRAFKLQEWLEKN 307
Query 507 MRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQGSSSQGQYAEA 566
R G Y + + + + L PEF+GG + + V Q ++ QG A
Sbjct 308 ARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKSPIMISEVLQQSATDSTTPQGNMA-- 365
Query 567 LGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTYNGLLDHYQPE 626
G GI G F +E Y+IGL++V P Y+Q +P+ F+ + D++ P+
Sbjct 366 -GHGIGIGKDGG-----FSRFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDYFWPQ 419
Query 627 FDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHGLFRTDMKNFV 686
F+ IG QP+ KEI N D S E FGY + EY HG F+ D+ +
Sbjct 420 FEHIGEQPVYNKEIFAKNIDAFDS----EAVFGYLPRYSEYKFSPSTVHGDFKDDLYFWH 475
Query 687 MHRSF--TGLPQLGQQFLLVDPNAVNQVFSVTEYTDKIFGYVKFNATARLPISRVAIP 742
+ R F P L Q F+ D NA++++F+V + TDK + ++ TA+ +S P
Sbjct 476 LGRIFDTDKPPVLNQSFIECDKNALSRIFAVEDDTDKFYCHLYQKITAKRKMSYFGDP 533
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/117 (26%), Positives = 59/117 (50%), Gaps = 3/117 (3%)
Query 19 NSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPMVFPVQTRMFARLNFF 78
++F+ S+ + NFG + P+ C+ V + INP+ L PM+ PV + +++F
Sbjct 15 STFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNVFIHYF 74
Query 79 KVTLRSMWEDYPDFISNFRDDLEEPYILPDKHGFERMLKTNTLGDYLGIPTQRTTFS 135
V R +W ++ FI+ L++ +++P + +L D+LG+P F+
Sbjct 75 FVPNRIIWSNWEQFITGGESGLDQ-HLMPRVGNLP--VSKGSLADHLGLPLTTGRFA 128
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 176/399 (44%), Gaps = 33/399 (8%)
Query 332 KLLAYRFRAYESVYNAYYRDIR-NNP--FVVNGRPVYNKWLPTMKGGADSTLYQLHQCNW 388
++ A FRAY+ +YN YYRD P F + T+ G L L + W
Sbjct 159 QVSALPFRAYQLIYNEYYRDQNLTEPIDFTLGSGT-------TVGGDQLMALMSLRRRAW 211
Query 389 ERDFLTTAVPNPQQG--MNTPLVGLTIG-DVVTRSEDGTLSVQKQTVLVDEDGSKYGVSY 445
E+D+ T+A+P Q+G + P+ G DVV + + + E+G Y ++
Sbjct 212 EKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSDSQKWVDSSGREFENGHAYDITM 271
Query 446 -RVSEDGERLVGVDYDPVSEKTPVTAINSYaelaalaaeQSAGFTIETLRYVNAYQKFLE 504
R ++ L+ + + P N G I LR NA Q++ E
Sbjct 272 ARANDPNSALMVAVNGGTNNRAPELDPNG----TLKVNVDEMGININDLRTSNALQRWFE 327
Query 505 LNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQGSSSQGQYA 564
N R G Y + + + + L P+F+GG +S+ V QT +S Q A
Sbjct 328 RNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMA 387
Query 565 EALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTYNGLLDHYQ 624
G+ +N + + +E YIIG++++TP Y Q +P+DFT +D Y
Sbjct 388 GH--------GISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMDFYF 439
Query 625 PEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHGLFRTDMKN 684
PEF + Q I +E+ ++ D + ++G TFGY + EY AHG FR ++
Sbjct 440 PEFAHLSEQEIKNQELF-VSEDAAYNNG----TFGYTPRYAEYKYHPSEAHGDFRGNLSF 494
Query 685 FVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTDKIF 723
+ ++R F P L F+ P+ N+VF+ +E D F
Sbjct 495 WHLNRIFEDKPNLNTTFVECKPS--NRVFATSETEDDKF 531
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 13/147 (9%)
Query 19 NSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPMVFPVQTRMFARLNFF 78
N+F+ S+ + LT N G + P+ C V + R+ E + L P+V P+ R+ ++F
Sbjct 16 NAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVNVFTHYF 75
Query 79 KVTLRSMWEDYPDFISNFRDDLEEP-----------YILPDKHGFERMLKTNTLGDYLGI 127
V R +W ++ DFI+ D + P +++ + ++L DYLG+
Sbjct 76 FVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYLGL 135
Query 128 PTQRTTFSNYSSSAINHC-TPSGSSTS 153
PT + N S +N PSG S
Sbjct 136 PTL-SACGNKSYDVVNGVKVPSGFQVS 161
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 171/395 (43%), Gaps = 45/395 (11%)
Query 332 KLLAYRFRAYESVYNAYYRD---IRNNPFVVNGRPVYNKWLPTMKGGADSTLYQLHQCNW 388
++ A FRAY+ +YN YYRD + F +N V + + L L + W
Sbjct 158 QVSALPFRAYQLIYNEYYRDQNLTKPIEFSLNSGIV-------LSADEVTRLLTLRRRTW 210
Query 389 ERDFLTTAVPNPQQGMNTPLVGLTIGDVVTRSEDGTLSVQKQTVLVDEDGSKY---GVSY 445
E+D+ T+A+P Q+G P V + I + G L V T+ D Y G S
Sbjct 211 EKDYFTSALPWVQRG---PEVTVPI-----QGSGGNLDV---TLKNDAHADTYRMPGTSN 259
Query 446 RVSEDGERLVGVDYDPVSEKTPVTAINSYaelaalaaeQSAGFTIETLRYVNAYQKFLEL 505
R + + + G ++ + N G +I LR NA Q++ E
Sbjct 260 RPAGAMQLVGGALIAGGTDGAYLEPDN------FQVNVDELGVSINDLRTSNALQRWFER 313
Query 506 NMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQGSSSQGQYAE 565
N R G Y + + + + L P+F+GG +S+ V QT +S Q A
Sbjct 314 NARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSATDSTSPQANMAG 373
Query 566 ALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTYNGLLDHYQP 625
G+ ++ + + +E YIIG++++ P Y Q +PKDF +D Y P
Sbjct 374 H--------GISAGVNHGFKRYFEEHGYIIGIMSIRPRTGYQQGVPKDFRKFDNMDFYFP 425
Query 626 EFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHGLFRTDMKNF 685
EF +G Q I +E+ S + TFGY + EY + HG FR +M +
Sbjct 426 EFAHLGEQEIKNEEVYLQQTPASNNG-----TFGYTPRYAEYKYSMNEVHGDFRGNMAFW 480
Query 686 VMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTD 720
++R F+ P L F+ +P+ N+VF+ E +D
Sbjct 481 HLNRIFSESPNLNTTFVECNPS--NRVFATAETSD 513
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 12/136 (9%)
Query 19 NSFDWSHVNNLTTNFGRITPVFCELVPAKGSLRINPEFGLELMPMVFPVQTRMFARLNFF 78
N F+ S+ N LT N G + P+ C+ V R+N E + L P+V P+ R+ ++F
Sbjct 16 NVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTHYF 75
Query 79 KVTLRSMWEDYPDFISNFRDDLEEPY-----ILPD---KHGFERMLKTNTLGDYLGIPT- 129
V R +W + DFI+ D + P + PD +L +L DYLG+PT
Sbjct 76 FVPNRLLWNQWEDFITKGVDGTDTPVFPKIALRPDWVNPTSAAVLLDDGSLWDYLGLPTI 135
Query 130 ---QRTTFSNYSSSAI 142
F N S +++
Sbjct 136 GGFNNVAFPNRSPNSV 151
>gi|575096093|emb|CDL66973.1| unnamed protein product [uncultured bacterium]
Length=574
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 112/441 (25%), Positives = 175/441 (40%), Gaps = 36/441 (8%)
Query 320 FRTQKNDTAEHPKLLAYRFRAYESVYNAYYRDIRNNPFVVNGRPVYNKWLPTMKGGADST 379
F + K+ E + A FRAY ++N ++RD V N + M G +
Sbjct 150 FGSDKSGVTELVSVSALPFRAYWLIWNEWFRDENLQSSVKVSMGDTNSAVDNMGSGTGNV 209
Query 380 LYQL-------HQC---NWERDFLTTAVPNPQQGMNTPLVGLTIGDVVTRSEDGTLSVQK 429
Y + C D+ T+ +P PQ+G P V L +G S +S+
Sbjct 210 NYSFPSGVTSYYHCAPRGKRYDYFTSCLPWPQKG---PGVELPLGSTANVSGQNNISLTL 266
Query 430 QTVLVDEDGSKYGVSYRVSEDGERLVGVDYDPVSEKTPV--TAINSYaelaalaaeQSAG 487
+V + D G S G++L + S P +N ++ +
Sbjct 267 PSVYYNGDTGS-GYSNLGQMVGKQLSSARQETYSYIKPAGNLTLNGSMSGLSVDLSSATS 325
Query 488 FTIETLRYVNAYQKFLELNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRT 547
TI +LR Q++ E++ R G Y + +Q + + L PE++GG S ++
Sbjct 326 ITINSLRQAFMLQRYYEVDARGGTRYTEKLQAHFGVTNPDSRLQRPEYLGGRSSMFNINP 385
Query 548 VEQTVDQQGSSSQGQYAEALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIYT 607
V QT S QG A G++G T E +IGL +V Y
Sbjct 386 VAQTSSTNDISPQGNMAA--------YGIHGRTYRAFNKSFTEFGVVIGLCSVRADLTYQ 437
Query 608 QLLPKDFTYNGLLDHYQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEY 667
Q + + LD Y PEF +G Q + +EI T FGYQ + EY
Sbjct 438 QGTERMWFRKDDLDFYWPEFAHLGEQAVLNQEIYVQGTSADTG------VFGYQERYAEY 491
Query 668 VAKYDNAHGLFRTDMKN----FVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTDKIF 723
K + G FR+ K + + + F LP+LG QF+ P V++V +V Y +
Sbjct 492 RYKPNKITGQFRSTYKQTLDVWHLAQKFDSLPKLGDQFIQDHP-PVSRVVAVPSYPHFLL 550
Query 724 GYVKFNATARLPISRVAIPRL 744
VKF+ P+ +IP L
Sbjct 551 D-VKFHLQCVRPLPLFSIPGL 570
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 98.6 bits (244), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 111/234 (47%), Gaps = 15/234 (6%)
Query 487 GFTIETLRYVNAYQKFLELNMRKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMR 546
G I +R NA Q++ E N R G Y + + + + L P+F+GG +S+
Sbjct 2 GVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVS 61
Query 547 TVEQTVDQQGSSSQGQYAEALGSKTGIAGVYGNTSNNIEVFCDEESYIIGLLTVTPVPIY 606
V QT +S Q A G+ ++ + +E YI+G++++ P Y
Sbjct 62 EVLQTSSTDSTSPQANMAGH--------GISAGVNHGFTRYFEEHGYIMGIMSIRPRTGY 113
Query 607 TQLLPKDFTYNGLLDHYQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYE 666
Q +PKDF +D Y PEF +G Q I +E+ +N ++ + G TFGY + E
Sbjct 114 QQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEELY-LNESDAANEG----TFGYTPRYAE 168
Query 667 YVAKYDNAHGLFRTDMKNFVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTD 720
Y + HG FR +M + ++R F P L F+ +P+ N+VF+ E +D
Sbjct 169 YKYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD 220
>gi|444298000|dbj|GAC77839.1| major capsid protein [uncultured marine virus]
Length=480
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/422 (26%), Positives = 181/422 (43%), Gaps = 56/422 (13%)
Query 332 KLLAYRFRAYESVYNAYYRDIRNNPFVVNGRPVYNKWLPTMKGGADSTLYQLHQCNWERD 391
++ A RA+ +YN YYRD +V R + + +P + W++D
Sbjct 105 QINAMPIRAFNLIYNEYYRDQD----LVPKRELEDMTIPLIA--------------WQKD 146
Query 392 FLTTAVPNPQQGMNTPLVGLTIGDVVTRSEDGTL-SVQKQTVLVDEDGS---KYGVSYRV 447
+ T+A P Q+G P V L +GD GT S Q + V+E G +YG ++
Sbjct 147 YFTSARPWTQKG---PDVTLPLGDRAPIYGIGTTGSPATQNINVNETGGVNREYGAAWS- 202
Query 448 SEDGERLVGVDYDPVSEKTPVTAINSYaelaalaaeQSAGFTIETLRYVNAYQKFLELNM 507
SE +V +E P S + + G TI +R A Q++ E
Sbjct 203 SETTNAIV-------AEHDPDPGAGSDDPGIYADLQAATGGTINDIRRAFAIQRYQEARS 255
Query 508 RKGFSYKQIMQGRWDIDIRFDELLMPEFIGGISRELSMRTVEQTVDQQGSSSQGQYAEAL 567
R G Y + ++ ++ + L PE++GG + +++ V QT + Q
Sbjct 256 RYGSRYTEYLR-YLGVNPKDARLQRPEYMGGGTTQINFSEVLQTSPEIPGEDQV------ 308
Query 568 GSKTGIAGVYGN-----TSNNIEVFCDEESYIIGLLTVTPVPIYTQLLPKDFTYNGLLDH 622
S+ G+ +YG+ SN + +E YII +L+V P +YT + + + D+
Sbjct 309 -SQFGVGDMYGHGIAAMRSNKYRRYIEEHGYIISMLSVRPKTMYTNGIHRSWLRLTKEDY 367
Query 623 YQPEFDRIGFQPITYKEICPMNADNSTSSGFIERTFGYQRPWYEYVAKYDNAHGLFRTDM 682
YQ E + IG Q I EI AD + TFGY + EY + FR +
Sbjct 368 YQKELEHIGQQEIMNNEIY---ADEGAGT----ETFGYNDRYSEYRETPSHVSAEFRGIL 420
Query 683 KNFVMHRSFTGLPQLGQQFLLVDPNAVNQVFSVTEYTDKIFGYVKFNATARLPISRVAIP 742
+ M R F P L Q F VD +A ++ + + D ++ ++ AR +SR A P
Sbjct 421 NYWHMAREFEAPPVLNQSF--VDCDATKRIHN-EQTQDALWIMIQHKMVARRLLSRNAAP 477
Query 743 RL 744
R+
Sbjct 478 RI 479

Lambda      K        H        a         alpha
   0.319    0.135    0.405    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267    0.0410   0.140    1.90      42.6     43.6

Effective search space used: 5841438110919
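
Note on the statistics above: the bit score on each "Score =" line can be
approximately recovered from the raw score in parentheses with the standard
Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the
gapped Lambda and K printed in this report. The following is a minimal sketch
of that arithmetic, not part of the BLAST output. The function and constant
names are illustrative assumptions, and because Lambda and K are printed
rounded, the recomputed values only approximate the reported ones.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the statistics footer.
# They are rounded in the report, so recomputed values are approximate.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores (in parentheses on the "Score =" lines) with the reported bits.
reported = [(1022, "398"), (325, "129"), (244, "98.6"), (152, "63.2")]
for raw, bits in reported:
    print(f"S={raw:5d}  recomputed={bit_score(raw):7.2f}  reported={bits}")
```

The recomputed values can differ from the printed ones in the last digit,
since the report shows scores of 100 bits or more without decimals. The
E-values are not reproduced here: they also depend on the effective search
space and on how multiple alignments to the same subject are combined.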