bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-33_CDS_annotation_glimmer3.pl_2_5
Length=210
Score E
Sequences producing significant alignments: (Bits) Value
gi|492501782|ref|WP_005867318.1| hypothetical protein 68.9 2e-10
gi|649557305|gb|KDS63784.1| capsid family protein 65.5 6e-10
gi|547920049|ref|WP_022322420.1| capsid protein VP1 66.2 1e-09
gi|649569140|gb|KDS75238.1| capsid family protein 65.5 2e-09
gi|649555287|gb|KDS61824.1| capsid family protein 65.5 3e-09
gi|547312923|ref|WP_022044635.1| putative uncharacterized protein 63.2 1e-08
gi|565841287|ref|WP_023924568.1| hypothetical protein 63.9 1e-08
gi|494610271|ref|WP_007368517.1| capsid protein 63.2 2e-08
gi|647452987|ref|WP_025792807.1| hypothetical protein 53.9 2e-05
gi|599087961|gb|AHN52906.1| major capsid protein 49.3 3e-04
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 90/210 (43%), Gaps = 15/210 (7%)
Query 2 FEGGVSQEIVFQEVISNSASQE-EPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSI 60
F GG I EV+ SA+ P +AG G++ G G + E YI+ I SI
Sbjct 343 FLGGGRTPISVSEVLQTSATDSTSPQANMAGHGISAGVNHG--FKRYFEEHGYIIGIMSI 400
Query 61 TPRIDYGQGNTWDTYLETMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGGSLTKTAAG 120
PR Y QG D D++ P +G Q+ + E + ASN G+ G
Sbjct 401 RPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQE-IKNEEVYLQQTPASNNGTF-----G 454
Query 121 KTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPVKFNYIFAD 180
T + Y ++N G+F M +F LNR +S + N TT+++ N +FA
Sbjct 455 YTPRYAEYKYSMNEVHGDFRGNM--AFWHLNRIFSESPNLN----TTFVECNPSNRVFAT 508
Query 181 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
+W+Q D+K RL+ P L
Sbjct 509 AETSDDKYWIQLYQDVKALRLMPKYGTPML 538
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 65.5 bits (158), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 87/210 (41%), Gaps = 15/210 (7%)
Query 2 FEGGVSQEIVFQEVISNSASQE-EPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSI 60
F GG I EV+ S++ P +AG G++ G G E YIM I SI
Sbjct 50 FLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGVNHG--FTRYFEEHGYIMGIMSI 107
Query 61 TPRIDYGQGNTWDTYLETMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGGSLTKTAAG 120
PR Y QG D D++ P +G Q+ + E + + A+N G+ G
Sbjct 108 RPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQE-IKNEELYLNESDAANEGTF-----G 161
Query 121 KTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPVKFNYIFAD 180
T + Y + N G+F M +F LNR + N TT+++ N +FA
Sbjct 162 YTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN----TTFVECNPSNRVFAT 215
Query 181 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
+WVQ DIK RL+ P L
Sbjct 216 AETSDDKYWVQIYQDIKALRLMPKYGTPML 245
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/211 (30%), Positives = 94/211 (45%), Gaps = 17/211 (8%)
Query 2 FEGGVSQEIVFQEVISNSASQE-EPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSI 60
F GG I EV+ S++ E P +AG G++ G G + E YI+ I SI
Sbjct 358 FLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNG--FKHYFEEHGYIIGIMSI 415
Query 61 TPRIDYGQGNTWD-TYLETMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGGSLTKTAA 119
TPR Y QG D T + MD ++ P + Q+ N E + ++ A N G+
Sbjct 416 TPRSGYQQGVPRDFTKFDNMD-FYFPEFAHLSEQEIKNQE-LFVSEDAAYNNGTF----- 468
Query 120 GKTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPVKFNYIFA 179
G T + Y + + G+F + SF LNR + N TT+++ N +FA
Sbjct 469 GYTPRYAEYKYHPSEAHGDFRGNL--SFWHLNRIFEDKPNLN----TTFVECKPSNRVFA 522
Query 180 DTNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
+ + FWVQ D+K RL+ P L
Sbjct 523 TSETEDDKFWVQMYQDVKALRLMPKYGTPML 553
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 87/210 (41%), Gaps = 15/210 (7%)
Query 2 FEGGVSQEIVFQEVISNSASQE-EPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSI 60
F GG I EV+ S++ P +AG G++ G G E YIM I SI
Sbjct 195 FLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGVNHG--FTRYFEEHGYIMGIMSI 252
Query 61 TPRIDYGQGNTWDTYLETMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGGSLTKTAAG 120
PR Y QG D D++ P +G Q+ + E + + A+N G+ G
Sbjct 253 RPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQE-IKNEELYLNESDAANEGTF-----G 306
Query 121 KTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPVKFNYIFAD 180
T + Y + N G+F M +F LNR + N TT+++ N +FA
Sbjct 307 YTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN----TTFVECNPSNRVFAT 360
Query 181 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
+WVQ DIK RL+ P L
Sbjct 361 AETSDDKYWVQIYQDIKALRLMPKYGTPML 390
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 87/210 (41%), Gaps = 15/210 (7%)
Query 2 FEGGVSQEIVFQEVISNSASQE-EPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSI 60
F GG I EV+ S++ P +AG G++ G G E YIM I SI
Sbjct 346 FLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGVNHG--FTRYFEEHGYIMGIMSI 403
Query 61 TPRIDYGQGNTWDTYLETMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGGSLTKTAAG 120
PR Y QG D D++ P +G Q+ + E + + A+N G+ G
Sbjct 404 RPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQE-IKNEELYLNESDAANEGTF-----G 457
Query 121 KTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPVKFNYIFAD 180
T + Y + N G+F M +F LNR + N TT+++ N +FA
Sbjct 458 YTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN----TTFVECNPSNRVFAT 511
Query 181 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
+WVQ DIK RL+ P L
Sbjct 512 AETSDDKYWVQIYQDIKALRLMPKYGTPML 541
>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 94/227 (41%), Gaps = 32/227 (14%)
Query 13 QEVISNSASQEEP-LGTLAGRGVTTGRQKGGH--IRIKVTEPCYIMCICSITPRIDYGQG 69
+ + + SAS E+ LG LA V GH I EP M I + P Y QG
Sbjct 113 RAMANGSASGEDANLGQLAAC-VDRYCDFSGHSGIDYYAKEPGTFMLITMLVPEPAYSQG 171
Query 70 NTWDTYLETMDDWHKPALDGIGYQ----------------DSLNGERAWWTDHIASNGGS 113
D + D P L+GIG+Q L+ E + W H + G
Sbjct 172 LHPDLASISFGDDFNPELNGIGFQLVPRHRFSMMPRGFNFTGLDQEASPWFGHTGT--GV 229
Query 114 LTK---TAAGKTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSM----NNNGQIED-- 164
L + G+ VAW T+ +R G+FA + VL R ++ + G +D
Sbjct 230 LVDPNMVSVGEEVAWSWLRTDYSRLHGDFAQNGNYQYWVLTRRFTTYFPDDGTGFYQDGE 289
Query 165 -LTTYIDPVKFNYIFADTNLDAMNFWVQTKFDIKVRRLISAKQIPNL 210
TYI+P+ + Y+F D L A NF FD+ V +SA +P L
Sbjct 290 YTGTYINPLDWQYVFVDQTLMAGNFAYYGTFDLNVTSSLSANYMPYL 336
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 86/211 (41%), Gaps = 15/211 (7%)
Query 2 FEGGVSQEIVFQEVISNS-------ASQEEPLGTLAGRGVTTGRQKGGHIRIKVTEPCYI 54
F GG +I EV++ S AS +G + G+G+ G GHI V E I
Sbjct 439 FIGGFDNQISISEVVTTSNGSVDGTASTGSVVGQVFGKGI--GAMNSGHISYDVKEHGLI 496
Query 55 MCICSITPRIDYGQGNTWDTYLE--TMDDWHKPALDGIGYQDSLNGERAWWTDHIASNGG 112
MCI SI P++DY D + + +D+ +P + +G Q + + + S+
Sbjct 497 MCIYSIAPQVDY-DARELDPFNRKFSREDYFQPEFENLGMQPVIQSDLCLCINSAKSDSS 555
Query 113 SLTKTAAGKTVAWINYMTNVNRTFGNFAPEMPESFMVLNRNYSMNNNGQIEDLTTYIDPV 172
G + ++ Y T + FG F S +N G++ +DP
Sbjct 556 DQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKNNYTFEFGKLSLPDLLVDPK 615
Query 173 KFNYIFA---DTNLDAMNFWVQTKFDIKVRR 200
IFA + ++ F V + FD+K R
Sbjct 616 VLEPIFAVKYNGSMSTDQFLVNSYFDVKAIR 646
>gi|494610271|ref|WP_007368517.1| capsid protein [Prevotella multiformis]
gi|324988543|gb|EGC20506.1| putative capsid protein (F protein) [Prevotella multiformis DSM
16608]
Length=531
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 102/242 (42%), Gaps = 37/242 (15%)
Query 2 FEGGVSQEIVFQEVISNS----ASQEEP-LGTLAGRGVTTGRQKGGHIRIKVTEPCYIMC 56
F GG +V EV++ S + E P LG L G+GV G I V E IMC
Sbjct 294 FIGGFDNPVVISEVVNQSEFDRGADESPCLGDLGGKGV--GSLNSSSIDFDVKEHGIIMC 351
Query 57 ICSITPRIDYGQGNTWDTYLETM--DDWHKPALDGIGYQDSLNGER-AWWTDHIASNGGS 113
I S+ P+ +Y G +D + + +D+ +P +GYQ + + + + D+ +G
Sbjct 352 IYSVVPQTEY-NGTYFDPFNRKLRREDFFQPEFADLGYQPVVTSDLISTYLDNPVPDGPE 410
Query 114 LTK-TAAGKTVAWIN--------------YMTNVNRTFGNFAPEMPESFMVLNRNYSMNN 158
K AAG ++ I Y T+ + FG F + S+ R Y
Sbjct 411 KQKRLAAGYPLSSIEANNRLLGWQVRYNEYKTSRDLVFGEFESGLSLSYWCSPR-YDFGF 469
Query 159 NGQIEDLTT----------YIDPVKFNYIFADTNLDAMNFWVQTKFDIKVRRLISAKQIP 208
+G+ D Y++P N IF + + A +F V + FD+K R +S +
Sbjct 470 DGKAGDKKLVNSPWSPAHFYVNPSILNTIFLVSAVKADHFLVNSFFDVKAVRPMSVSGLA 529
Query 209 NL 210
L
Sbjct 530 GL 531
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 99/252 (39%), Gaps = 56/252 (22%)
Query 2 FEGGVSQEIVFQEVIS---NSAS--QEEPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMC 56
F GG IV EV+S N+AS +G L G+G+ G G I TE IMC
Sbjct 346 FLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKGI--GSMSSGTIEFDSTEHGIIMC 403
Query 57 ICSITPRIDYGQGNTWDTYLE------TMDDWHKPALDGIGYQDSLNGERAWWTDHIASN 110
I S+ P+ +Y +YL+ T + +++P +GYQ +L G +D I S
Sbjct 404 IYSVAPQSEYNA-----SYLDPFNRKLTREQFYQPEFADLGYQ-ALIG-----SDLICST 452
Query 111 GGSLTKTAA-----------GKTVAWINYMTNVNRTFGNF----------APEMPESFMV 149
G K A G V + Y T + FG+F P +
Sbjct 453 LGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRFDFGYGD 512
Query 150 LNRNYSMNNNGQIE-----------DLTTYIDPVKFNYIFADTNLDAMNFWVQTKFDIKV 198
+ + N G + YI+P N IF + + A +F V + D+K
Sbjct 513 TEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLTSAVQADHFIVNSFLDVKA 572
Query 199 RRLISAKQIPNL 210
R +S + +L
Sbjct 573 VRPMSVTGLSSL 584
>gi|599087961|gb|AHN52906.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/98 (38%), Positives = 42/98 (43%), Gaps = 2/98 (2%)
Query 2 FEGGVSQEIVFQEVISNSASQEEPLGTLAGRGVTTGRQKGGHIRIKVTEPCYIMCICSIT 61
F GG S I V S S P GTLA G T GG TE C +M I S+
Sbjct 113 FLGGTSTPINVTSVPQTSESGTTPQGTLAAFGTAT--INGGGFTKSFTEHCIVMGIASVR 170
Query 62 PRIDYGQGNTWDTYLETMDDWHKPALDGIGYQDSLNGE 99
+ Y QG T D++ PAL IG Q LN E
Sbjct 171 ADLTYQQGLNRMFSRSTRYDFYFPALAHIGEQSVLNKE 208
Lambda K H a alpha
0.318 0.135 0.421 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 735037691454