Eintrag weiter verarbeiten
A comparison of scoring functions for protein sequence profile alignment
Gespeichert in:
Zeitschriftentitel: | Bioinformatics |
---|---|
Personen und Körperschaften: | , |
In: | Bioinformatics, 20, 2004, 8, S. 1301-1308 |
Format: | E-Article |
Sprache: | Englisch |
veröffentlicht: |
Oxford University Press (OUP)
|
Schlagwörter: |
author_facet |
Edgar, Robert C. Sjölander, Kimmen Edgar, Robert C. Sjölander, Kimmen |
---|---|
author |
Edgar, Robert C. Sjölander, Kimmen |
spellingShingle |
Edgar, Robert C. Sjölander, Kimmen Bioinformatics A comparison of scoring functions for protein sequence profile alignment Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability |
author_sort |
edgar, robert c. |
spelling |
Edgar, Robert C. Sjölander, Kimmen 1367-4811 1367-4803 Oxford University Press (OUP) Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability http://dx.doi.org/10.1093/bioinformatics/bth090 <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> A comparison of scoring functions for protein sequence profile alignment Bioinformatics |
doi_str_mv |
10.1093/bioinformatics/bth090 |
facet_avail |
Online Free |
finc_class_facet |
Biologie Chemie und Pharmazie Mathematik Informatik |
format |
ElectronicArticle |
fullrecord |
blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA |
id |
ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA |
institution |
DE-Bn3 DE-Brt1 DE-Zwi2 DE-D161 DE-Gla1 DE-Zi4 DE-15 DE-Pl11 DE-Rs1 DE-105 DE-14 DE-Ch1 DE-L229 DE-D275 |
imprint |
Oxford University Press (OUP), 2004 |
imprint_str_mv |
Oxford University Press (OUP), 2004 |
issn |
1367-4811 1367-4803 |
issn_str_mv |
1367-4811 1367-4803 |
language |
English |
mega_collection |
Oxford University Press (OUP) (CrossRef) |
match_str |
edgar2004acomparisonofscoringfunctionsforproteinsequenceprofilealignment |
publishDateSort |
2004 |
publisher |
Oxford University Press (OUP) |
recordtype |
ai |
record_format |
ai |
series |
Bioinformatics |
source_id |
49 |
title |
A comparison of scoring functions for protein sequence profile alignment |
title_unstemmed |
A comparison of scoring functions for protein sequence profile alignment |
title_full |
A comparison of scoring functions for protein sequence profile alignment |
title_fullStr |
A comparison of scoring functions for protein sequence profile alignment |
title_full_unstemmed |
A comparison of scoring functions for protein sequence profile alignment |
title_short |
A comparison of scoring functions for protein sequence profile alignment |
title_sort |
a comparison of scoring functions for protein sequence profile alignment |
topic |
Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability |
url |
http://dx.doi.org/10.1093/bioinformatics/bth090 |
publishDate |
2004 |
physical |
1301-1308 |
description |
<jats:title>Abstract</jats:title>
<jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p>
<jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p>
<jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> |
container_issue |
8 |
container_start_page |
1301 |
container_title |
Bioinformatics |
container_volume |
20 |
format_de105 |
Article, E-Article |
format_de14 |
Article, E-Article |
format_de15 |
Article, E-Article |
format_de520 |
Article, E-Article |
format_de540 |
Article, E-Article |
format_dech1 |
Article, E-Article |
format_ded117 |
Article, E-Article |
format_degla1 |
E-Article |
format_del152 |
Buch |
format_del189 |
Article, E-Article |
format_dezi4 |
Article |
format_dezwi2 |
Article, E-Article |
format_finc |
Article, E-Article |
format_nrw |
Article, E-Article |
_version_ |
1792345811170885635 |
geogr_code |
not assigned |
last_indexed |
2024-03-01T17:29:25.035Z |
geogr_code_person |
not assigned |
openURL |
url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=A+comparison+of+scoring+functions+for+protein+sequence+profile+alignment&rft.date=2004-05-22&genre=article&issn=1367-4803&volume=20&issue=8&spage=1301&epage=1308&pages=1301-1308&jtitle=Bioinformatics&atitle=A+comparison+of+scoring+functions+for+protein+sequence+profile+alignment&aulast=Sj%C3%B6lander&aufirst=Kimmen&rft_id=info%3Adoi%2F10.1093%2Fbioinformatics%2Fbth090&rft.language%5B0%5D=eng |
SOLR | |
_version_ | 1792345811170885635 |
author | Edgar, Robert C., Sjölander, Kimmen |
author_facet | Edgar, Robert C., Sjölander, Kimmen, Edgar, Robert C., Sjölander, Kimmen |
author_sort | edgar, robert c. |
container_issue | 8 |
container_start_page | 1301 |
container_title | Bioinformatics |
container_volume | 20 |
description | <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> |
doi_str_mv | 10.1093/bioinformatics/bth090 |
facet_avail | Online, Free |
finc_class_facet | Biologie, Chemie und Pharmazie, Mathematik, Informatik |
format | ElectronicArticle |
format_de105 | Article, E-Article |
format_de14 | Article, E-Article |
format_de15 | Article, E-Article |
format_de520 | Article, E-Article |
format_de540 | Article, E-Article |
format_dech1 | Article, E-Article |
format_ded117 | Article, E-Article |
format_degla1 | E-Article |
format_del152 | Buch |
format_del189 | Article, E-Article |
format_dezi4 | Article |
format_dezwi2 | Article, E-Article |
format_finc | Article, E-Article |
format_nrw | Article, E-Article |
geogr_code | not assigned |
geogr_code_person | not assigned |
id | ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA |
imprint | Oxford University Press (OUP), 2004 |
imprint_str_mv | Oxford University Press (OUP), 2004 |
institution | DE-Bn3, DE-Brt1, DE-Zwi2, DE-D161, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229, DE-D275 |
issn | 1367-4811, 1367-4803 |
issn_str_mv | 1367-4811, 1367-4803 |
language | English |
last_indexed | 2024-03-01T17:29:25.035Z |
match_str | edgar2004acomparisonofscoringfunctionsforproteinsequenceprofilealignment |
mega_collection | Oxford University Press (OUP) (CrossRef) |
physical | 1301-1308 |
publishDate | 2004 |
publishDateSort | 2004 |
publisher | Oxford University Press (OUP) |
record_format | ai |
recordtype | ai |
series | Bioinformatics |
source_id | 49 |
spelling | Edgar, Robert C. Sjölander, Kimmen 1367-4811 1367-4803 Oxford University Press (OUP) Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability http://dx.doi.org/10.1093/bioinformatics/bth090 <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> A comparison of scoring functions for protein sequence profile alignment Bioinformatics |
spellingShingle | Edgar, Robert C., Sjölander, Kimmen, Bioinformatics, A comparison of scoring functions for protein sequence profile alignment, Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability |
title | A comparison of scoring functions for protein sequence profile alignment |
title_full | A comparison of scoring functions for protein sequence profile alignment |
title_fullStr | A comparison of scoring functions for protein sequence profile alignment |
title_full_unstemmed | A comparison of scoring functions for protein sequence profile alignment |
title_short | A comparison of scoring functions for protein sequence profile alignment |
title_sort | a comparison of scoring functions for protein sequence profile alignment |
title_unstemmed | A comparison of scoring functions for protein sequence profile alignment |
topic | Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability |
url | http://dx.doi.org/10.1093/bioinformatics/bth090 |