Please use this identifier to cite or link to this item: https://scholar.dlu.edu.vn/handle/123456789/2632
Title: Predicting Outcomes of Hormone and Chemotherapy in the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) Study by Biochemically-inspired Machine Learning
Authors: Mucaki, Eliseos J
Baranova, Katherina
Phạm, Quang Huy 
Rezaeian, Iman
Angelov, Dimo
Ngom, Alioune
Rueda, Luis
Rogan, Peter K
Keywords: Gene expression signatures; breast cancer; chemotherapy resistance; hormone therapy; machine learning; random forest; support vector machine
Issue Date: 2016
Publisher: Faculty of 1000 Ltd
Journal: F1000Research
Abstract: 
Genomic aberrations and gene expression-defined subtypes in the large METABRIC patient cohort have been used to stratify and predict survival. The present study used normalized gene expression signatures of paclitaxel drug response to predict outcome for different survival times in METABRIC patients receiving hormone (HT) and, in some cases, chemotherapy (CT) agents. This machine learning method, which distinguishes sensitivity vs. resistance in breast cancer cell lines and validates predictions in patients; was also used to derive gene signatures of other HT  (tamoxifen) and CT agents (methotrexate, epirubicin, doxorubicin, and 5-fluorouracil) used in METABRIC. Paclitaxel gene signatures exhibited the best performance, however the other agents also predicted survival with acceptable accuracies. A support vector machine (SVM) model of paclitaxel response containing genes  ABCB1, ABCB11, ABCC1, ABCC10, BAD, BBC3, BCL2, BCL2L1, BMF, CYP2C8, CYP3A4, MAP2, MAP4, MAPT, NR1I2, SLCO1B3, TUBB1, TUBB4A, and TUBB4B was 78.6% accurate in predicting survival of 84 patients treated with both HT and CT (median survival ≥ 4.4 yr). Accuracy was lower (73.4%) in 304 untreated patients. The performance of other machine learning approaches was also evaluated at different survival thresholds. Minimum redundancy maximum relevance feature selection of a paclitaxel-based SVM classifier based on expression of genes  BCL2L1, BBC3, FGF2, FN1, and  TWIST1 was 81.1% accurate in 53 CT patients. In addition, a random forest (RF) classifier using a gene signature ( ABCB1, ABCB11, ABCC1, ABCC10, BAD, BBC3, BCL2, BCL2L1, BMF, CYP2C8, CYP3A4, MAP2, MAP4, MAPT, NR1I2,SLCO1B3, TUBB1, TUBB4A, and TUBB4B) predicted >3-year survival with 85.5% accuracy in 420 HT patients. A similar RF gene signature showed 82.7% accuracy in 504 patients treated with CT and/or HT. These results suggest that tumor gene expression signatures refined by machine learning techniques can be useful for predicting survival after drug therapies.
URI: https://scholar.dlu.edu.vn/handle/123456789/2632
ISSN: 2046-1402
DOI: 10.12688/f1000research.9417.3
Type: Bài báo đăng trên tạp chí quốc tế (có ISSN), bao gồm book chapter
Appears in Collections:Tạp chí (Khoa Toán - Tin học)

Show full item record


CORE Recommender

SCOPUSTM   
Citations 1

37
Last Week
0
Last month
checked on Feb 12, 2026

Page view(s)

145
Last Week
5
Last month
checked on Feb 14, 2026

Google ScholarTM

Check

Altmetric


Altmetric




Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.