author_facet Listgarten, Jennifer
Damaraju, Sambasivarao
Poulin, Brett
Cook, Lillian
Dufour, Jennifer
Driga, Adrian
Mackey, John
Wishart, David
Greiner, Russ
Zanke, Brent
Listgarten, Jennifer
Damaraju, Sambasivarao
Poulin, Brett
Cook, Lillian
Dufour, Jennifer
Driga, Adrian
Mackey, John
Wishart, David
Greiner, Russ
Zanke, Brent
author Listgarten, Jennifer
Damaraju, Sambasivarao
Poulin, Brett
Cook, Lillian
Dufour, Jennifer
Driga, Adrian
Mackey, John
Wishart, David
Greiner, Russ
Zanke, Brent
spellingShingle Listgarten, Jennifer
Damaraju, Sambasivarao
Poulin, Brett
Cook, Lillian
Dufour, Jennifer
Driga, Adrian
Mackey, John
Wishart, David
Greiner, Russ
Zanke, Brent
Clinical Cancer Research
Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
Cancer Research
Oncology
author_sort listgarten, jennifer
spelling Listgarten, Jennifer Damaraju, Sambasivarao Poulin, Brett Cook, Lillian Dufour, Jennifer Driga, Adrian Mackey, John Wishart, David Greiner, Russ Zanke, Brent 1078-0432 1557-3265 American Association for Cancer Research (AACR) Cancer Research Oncology http://dx.doi.org/10.1158/1078-0432.ccr-1115-03 <jats:title>Abstract</jats:title> <jats:p>Hereditary predisposition and causative environmental exposures have long been recognized in human malignancies. In most instances, cancer cases occur sporadically, suggesting that environmental influences are critical in determining cancer risk. To test the influence of genetic polymorphisms on breast cancer risk, we have measured 98 single nucleotide polymorphisms (SNPs) distributed over 45 genes of potential relevance to breast cancer etiology in 174 patients and have compared these with matched normal controls. Using machine learning techniques such as support vector machines (SVMs), decision trees, and naïve Bayes, we identified a subset of three SNPs as key discriminators between breast cancer and controls. The SVMs performed maximally among predictive models, achieving 69% predictive power in distinguishing between the two groups, compared with a 50% baseline predictive power obtained from the data after repeated random permutation of class labels (individuals with cancer or controls). However, the simpler naïve Bayes model as well as the decision tree model performed quite similarly to the SVM. The three SNP sites most useful in this model were (a) the +4536T/C site of the aldosterone synthase gene CYP11B2 at amino acid residue 386 Val/Ala (T/C) (rs4541); (b) the +4328C/G site of the aryl hydrocarbon hydroxylase CYP1B1 at amino acid residue 293 Leu/Val (C/G) (rs5292); and (c) the +4449C/T site of the transcription factor BCL6 at amino acid 387 Asp/Asp (rs1056932). No single SNP site on its own could achieve more than 60% in predictive accuracy. We have shown that multiple SNP sites from different genes over distant parts of the genome are better at identifying breast cancer patients than any one SNP alone. As high-throughput technology for SNPs improves and as more SNPs are identified, it is likely that much higher predictive accuracy will be achieved and a useful clinical tool developed.</jats:p> Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms Clinical Cancer Research
doi_str_mv 10.1158/1078-0432.ccr-1115-03
facet_avail Online
Free
finc_class_facet Medizin
format ElectronicArticle
fullrecord blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTE1OC8xMDc4LTA0MzIuY2NyLTExMTUtMDM
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTE1OC8xMDc4LTA0MzIuY2NyLTExMTUtMDM
institution DE-Brt1
DE-D161
DE-Zwi2
DE-Gla1
DE-Zi4
DE-15
DE-Pl11
DE-Rs1
DE-105
DE-14
DE-Ch1
DE-L229
DE-D275
DE-Bn3
imprint American Association for Cancer Research (AACR), 2004
imprint_str_mv American Association for Cancer Research (AACR), 2004
issn 1078-0432
1557-3265
issn_str_mv 1078-0432
1557-3265
language English
mega_collection American Association for Cancer Research (AACR) (CrossRef)
match_str listgarten2004predictivemodelsforbreastcancersusceptibilityfrommultiplesinglenucleotidepolymorphisms
publishDateSort 2004
publisher American Association for Cancer Research (AACR)
recordtype ai
record_format ai
series Clinical Cancer Research
source_id 49
title Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_unstemmed Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_full Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_fullStr Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_full_unstemmed Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_short Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_sort predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms
topic Cancer Research
Oncology
url http://dx.doi.org/10.1158/1078-0432.ccr-1115-03
publishDate 2004
physical 2725-2737
description <jats:title>Abstract</jats:title> <jats:p>Hereditary predisposition and causative environmental exposures have long been recognized in human malignancies. In most instances, cancer cases occur sporadically, suggesting that environmental influences are critical in determining cancer risk. To test the influence of genetic polymorphisms on breast cancer risk, we have measured 98 single nucleotide polymorphisms (SNPs) distributed over 45 genes of potential relevance to breast cancer etiology in 174 patients and have compared these with matched normal controls. Using machine learning techniques such as support vector machines (SVMs), decision trees, and naïve Bayes, we identified a subset of three SNPs as key discriminators between breast cancer and controls. The SVMs performed maximally among predictive models, achieving 69% predictive power in distinguishing between the two groups, compared with a 50% baseline predictive power obtained from the data after repeated random permutation of class labels (individuals with cancer or controls). However, the simpler naïve Bayes model as well as the decision tree model performed quite similarly to the SVM. The three SNP sites most useful in this model were (a) the +4536T/C site of the aldosterone synthase gene CYP11B2 at amino acid residue 386 Val/Ala (T/C) (rs4541); (b) the +4328C/G site of the aryl hydrocarbon hydroxylase CYP1B1 at amino acid residue 293 Leu/Val (C/G) (rs5292); and (c) the +4449C/T site of the transcription factor BCL6 at amino acid 387 Asp/Asp (rs1056932). No single SNP site on its own could achieve more than 60% in predictive accuracy. We have shown that multiple SNP sites from different genes over distant parts of the genome are better at identifying breast cancer patients than any one SNP alone. As high-throughput technology for SNPs improves and as more SNPs are identified, it is likely that much higher predictive accuracy will be achieved and a useful clinical tool developed.</jats:p>
container_issue 8
container_start_page 2725
container_title Clinical Cancer Research
container_volume 10
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
_version_ 1792341712144695299
geogr_code not assigned
last_indexed 2024-03-01T16:24:12.458Z
geogr_code_person not assigned
openURL url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=Predictive+Models+for+Breast+Cancer+Susceptibility+from+Multiple+Single+Nucleotide+Polymorphisms&rft.date=2004-04-15&genre=article&issn=1557-3265&volume=10&issue=8&spage=2725&epage=2737&pages=2725-2737&jtitle=Clinical+Cancer+Research&atitle=Predictive+Models+for+Breast+Cancer+Susceptibility+from+Multiple+Single+Nucleotide+Polymorphisms&aulast=Zanke&aufirst=Brent&rft_id=info%3Adoi%2F10.1158%2F1078-0432.ccr-1115-03&rft.language%5B0%5D=eng
SOLR
_version_ 1792341712144695299
author Listgarten, Jennifer, Damaraju, Sambasivarao, Poulin, Brett, Cook, Lillian, Dufour, Jennifer, Driga, Adrian, Mackey, John, Wishart, David, Greiner, Russ, Zanke, Brent
author_facet Listgarten, Jennifer, Damaraju, Sambasivarao, Poulin, Brett, Cook, Lillian, Dufour, Jennifer, Driga, Adrian, Mackey, John, Wishart, David, Greiner, Russ, Zanke, Brent, Listgarten, Jennifer, Damaraju, Sambasivarao, Poulin, Brett, Cook, Lillian, Dufour, Jennifer, Driga, Adrian, Mackey, John, Wishart, David, Greiner, Russ, Zanke, Brent
author_sort listgarten, jennifer
container_issue 8
container_start_page 2725
container_title Clinical Cancer Research
container_volume 10
description <jats:title>Abstract</jats:title> <jats:p>Hereditary predisposition and causative environmental exposures have long been recognized in human malignancies. In most instances, cancer cases occur sporadically, suggesting that environmental influences are critical in determining cancer risk. To test the influence of genetic polymorphisms on breast cancer risk, we have measured 98 single nucleotide polymorphisms (SNPs) distributed over 45 genes of potential relevance to breast cancer etiology in 174 patients and have compared these with matched normal controls. Using machine learning techniques such as support vector machines (SVMs), decision trees, and naïve Bayes, we identified a subset of three SNPs as key discriminators between breast cancer and controls. The SVMs performed maximally among predictive models, achieving 69% predictive power in distinguishing between the two groups, compared with a 50% baseline predictive power obtained from the data after repeated random permutation of class labels (individuals with cancer or controls). However, the simpler naïve Bayes model as well as the decision tree model performed quite similarly to the SVM. The three SNP sites most useful in this model were (a) the +4536T/C site of the aldosterone synthase gene CYP11B2 at amino acid residue 386 Val/Ala (T/C) (rs4541); (b) the +4328C/G site of the aryl hydrocarbon hydroxylase CYP1B1 at amino acid residue 293 Leu/Val (C/G) (rs5292); and (c) the +4449C/T site of the transcription factor BCL6 at amino acid 387 Asp/Asp (rs1056932). No single SNP site on its own could achieve more than 60% in predictive accuracy. We have shown that multiple SNP sites from different genes over distant parts of the genome are better at identifying breast cancer patients than any one SNP alone. As high-throughput technology for SNPs improves and as more SNPs are identified, it is likely that much higher predictive accuracy will be achieved and a useful clinical tool developed.</jats:p>
doi_str_mv 10.1158/1078-0432.ccr-1115-03
facet_avail Online, Free
finc_class_facet Medizin
format ElectronicArticle
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
geogr_code not assigned
geogr_code_person not assigned
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTE1OC8xMDc4LTA0MzIuY2NyLTExMTUtMDM
imprint American Association for Cancer Research (AACR), 2004
imprint_str_mv American Association for Cancer Research (AACR), 2004
institution DE-Brt1, DE-D161, DE-Zwi2, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229, DE-D275, DE-Bn3
issn 1078-0432, 1557-3265
issn_str_mv 1078-0432, 1557-3265
language English
last_indexed 2024-03-01T16:24:12.458Z
match_str listgarten2004predictivemodelsforbreastcancersusceptibilityfrommultiplesinglenucleotidepolymorphisms
mega_collection American Association for Cancer Research (AACR) (CrossRef)
physical 2725-2737
publishDate 2004
publishDateSort 2004
publisher American Association for Cancer Research (AACR)
record_format ai
recordtype ai
series Clinical Cancer Research
source_id 49
spelling Listgarten, Jennifer Damaraju, Sambasivarao Poulin, Brett Cook, Lillian Dufour, Jennifer Driga, Adrian Mackey, John Wishart, David Greiner, Russ Zanke, Brent 1078-0432 1557-3265 American Association for Cancer Research (AACR) Cancer Research Oncology http://dx.doi.org/10.1158/1078-0432.ccr-1115-03 <jats:title>Abstract</jats:title> <jats:p>Hereditary predisposition and causative environmental exposures have long been recognized in human malignancies. In most instances, cancer cases occur sporadically, suggesting that environmental influences are critical in determining cancer risk. To test the influence of genetic polymorphisms on breast cancer risk, we have measured 98 single nucleotide polymorphisms (SNPs) distributed over 45 genes of potential relevance to breast cancer etiology in 174 patients and have compared these with matched normal controls. Using machine learning techniques such as support vector machines (SVMs), decision trees, and naïve Bayes, we identified a subset of three SNPs as key discriminators between breast cancer and controls. The SVMs performed maximally among predictive models, achieving 69% predictive power in distinguishing between the two groups, compared with a 50% baseline predictive power obtained from the data after repeated random permutation of class labels (individuals with cancer or controls). However, the simpler naïve Bayes model as well as the decision tree model performed quite similarly to the SVM. The three SNP sites most useful in this model were (a) the +4536T/C site of the aldosterone synthase gene CYP11B2 at amino acid residue 386 Val/Ala (T/C) (rs4541); (b) the +4328C/G site of the aryl hydrocarbon hydroxylase CYP1B1 at amino acid residue 293 Leu/Val (C/G) (rs5292); and (c) the +4449C/T site of the transcription factor BCL6 at amino acid 387 Asp/Asp (rs1056932). No single SNP site on its own could achieve more than 60% in predictive accuracy. We have shown that multiple SNP sites from different genes over distant parts of the genome are better at identifying breast cancer patients than any one SNP alone. As high-throughput technology for SNPs improves and as more SNPs are identified, it is likely that much higher predictive accuracy will be achieved and a useful clinical tool developed.</jats:p> Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms Clinical Cancer Research
spellingShingle Listgarten, Jennifer, Damaraju, Sambasivarao, Poulin, Brett, Cook, Lillian, Dufour, Jennifer, Driga, Adrian, Mackey, John, Wishart, David, Greiner, Russ, Zanke, Brent, Clinical Cancer Research, Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms, Cancer Research, Oncology
title Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_full Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_fullStr Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_full_unstemmed Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_short Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
title_sort predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms
title_unstemmed Predictive Models for Breast Cancer Susceptibility from Multiple Single Nucleotide Polymorphisms
topic Cancer Research, Oncology
url http://dx.doi.org/10.1158/1078-0432.ccr-1115-03