author_facet Edgar, Robert C.
Sjölander, Kimmen
Edgar, Robert C.
Sjölander, Kimmen
author Edgar, Robert C.
Sjölander, Kimmen
spellingShingle Edgar, Robert C.
Sjölander, Kimmen
Bioinformatics
A comparison of scoring functions for protein sequence profile alignment
Computational Mathematics
Computational Theory and Mathematics
Computer Science Applications
Molecular Biology
Biochemistry
Statistics and Probability
author_sort edgar, robert c.
spelling Edgar, Robert C. Sjölander, Kimmen 1367-4811 1367-4803 Oxford University Press (OUP) Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability http://dx.doi.org/10.1093/bioinformatics/bth090 <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> A comparison of scoring functions for protein sequence profile alignment Bioinformatics
doi_str_mv 10.1093/bioinformatics/bth090
facet_avail Online
Free
finc_class_facet Biologie
Chemie und Pharmazie
Mathematik
Informatik
format ElectronicArticle
fullrecord blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA
institution DE-Bn3
DE-Brt1
DE-Zwi2
DE-D161
DE-Gla1
DE-Zi4
DE-15
DE-Pl11
DE-Rs1
DE-105
DE-14
DE-Ch1
DE-L229
DE-D275
imprint Oxford University Press (OUP), 2004
imprint_str_mv Oxford University Press (OUP), 2004
issn 1367-4811
1367-4803
issn_str_mv 1367-4811
1367-4803
language English
mega_collection Oxford University Press (OUP) (CrossRef)
match_str edgar2004acomparisonofscoringfunctionsforproteinsequenceprofilealignment
publishDateSort 2004
publisher Oxford University Press (OUP)
recordtype ai
record_format ai
series Bioinformatics
source_id 49
title A comparison of scoring functions for protein sequence profile alignment
title_unstemmed A comparison of scoring functions for protein sequence profile alignment
title_full A comparison of scoring functions for protein sequence profile alignment
title_fullStr A comparison of scoring functions for protein sequence profile alignment
title_full_unstemmed A comparison of scoring functions for protein sequence profile alignment
title_short A comparison of scoring functions for protein sequence profile alignment
title_sort a comparison of scoring functions for protein sequence profile alignment
topic Computational Mathematics
Computational Theory and Mathematics
Computer Science Applications
Molecular Biology
Biochemistry
Statistics and Probability
url http://dx.doi.org/10.1093/bioinformatics/bth090
publishDate 2004
physical 1301-1308
description <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p>
container_issue 8
container_start_page 1301
container_title Bioinformatics
container_volume 20
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
_version_ 1792345811170885635
geogr_code not assigned
last_indexed 2024-03-01T17:29:25.035Z
geogr_code_person not assigned
openURL url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=A+comparison+of+scoring+functions+for+protein+sequence+profile+alignment&rft.date=2004-05-22&genre=article&issn=1367-4803&volume=20&issue=8&spage=1301&epage=1308&pages=1301-1308&jtitle=Bioinformatics&atitle=A+comparison+of+scoring+functions+for+protein+sequence+profile+alignment&aulast=Sj%C3%B6lander&aufirst=Kimmen&rft_id=info%3Adoi%2F10.1093%2Fbioinformatics%2Fbth090&rft.language%5B0%5D=eng
SOLR
_version_ 1792345811170885635
author Edgar, Robert C., Sjölander, Kimmen
author_facet Edgar, Robert C., Sjölander, Kimmen, Edgar, Robert C., Sjölander, Kimmen
author_sort edgar, robert c.
container_issue 8
container_start_page 1301
container_title Bioinformatics
container_volume 20
description <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p>
doi_str_mv 10.1093/bioinformatics/bth090
facet_avail Online, Free
finc_class_facet Biologie, Chemie und Pharmazie, Mathematik, Informatik
format ElectronicArticle
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
geogr_code not assigned
geogr_code_person not assigned
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9iaW9pbmZvcm1hdGljcy9idGgwOTA
imprint Oxford University Press (OUP), 2004
imprint_str_mv Oxford University Press (OUP), 2004
institution DE-Bn3, DE-Brt1, DE-Zwi2, DE-D161, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229, DE-D275
issn 1367-4811, 1367-4803
issn_str_mv 1367-4811, 1367-4803
language English
last_indexed 2024-03-01T17:29:25.035Z
match_str edgar2004acomparisonofscoringfunctionsforproteinsequenceprofilealignment
mega_collection Oxford University Press (OUP) (CrossRef)
physical 1301-1308
publishDate 2004
publishDateSort 2004
publisher Oxford University Press (OUP)
record_format ai
recordtype ai
series Bioinformatics
source_id 49
spelling Edgar, Robert C. Sjölander, Kimmen 1367-4811 1367-4803 Oxford University Press (OUP) Computational Mathematics Computational Theory and Mathematics Computer Science Applications Molecular Biology Biochemistry Statistics and Probability http://dx.doi.org/10.1093/bioinformatics/bth090 <jats:title>Abstract</jats:title> <jats:p>Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence–sequence methods (e.g. BLAST) and profile–sequence methods (e.g. PSI-BLAST). Profile–profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile–profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile–profile scoring functions by comparing alignments of 488 pairs of sequences with identity ≤30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence–sequence and sequence–profile methods, including BLAST and PSI-BLAST.</jats:p> <jats:p>Results: We find that profile–profile alignment gives an average improvement over our test set of typically 2–3% over profile–sequence alignment and ∼40% over sequence–sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.</jats:p> <jats:p>Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/</jats:p> A comparison of scoring functions for protein sequence profile alignment Bioinformatics
spellingShingle Edgar, Robert C., Sjölander, Kimmen, Bioinformatics, A comparison of scoring functions for protein sequence profile alignment, Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability
title A comparison of scoring functions for protein sequence profile alignment
title_full A comparison of scoring functions for protein sequence profile alignment
title_fullStr A comparison of scoring functions for protein sequence profile alignment
title_full_unstemmed A comparison of scoring functions for protein sequence profile alignment
title_short A comparison of scoring functions for protein sequence profile alignment
title_sort a comparison of scoring functions for protein sequence profile alignment
title_unstemmed A comparison of scoring functions for protein sequence profile alignment
topic Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability
url http://dx.doi.org/10.1093/bioinformatics/bth090