JOURNALS | BMIL

6.

Jinmyung Jung; Sunyong Yoo
Identification of Breast Cancer Metastasis Markers from Gene Expression Profiles Using Machine Learning Approaches Journal Article SCI
In: Genes, vol. 14, no. 9, pp. 1820, 2023, (Correspondence to Sunyong Yoo).

Abstract | Links | BibTeX | Dimensions | Tags: Bioinformatics, Breast cancer, Feature importance, Gene expression, Machine learning, Metastasis marker

@article{jung2023identification,

title = {Identification of Breast Cancer Metastasis Markers from Gene Expression Profiles Using Machine Learning Approaches},

author = {Jinmyung Jung and Sunyong Yoo},

url = {https://www.mdpi.com/2073-4425/14/9/1820},

doi = {10.3390/genes14091820},

year  = {2023},

date = {2023-09-20},

urldate = {2023-09-20},

journal = {Genes},

volume = {14},

number = {9},

pages = {1820},

publisher = {MDPI},

abstract = {Cancer metastasis accounts for approximately 90% of cancer deaths, and elucidating markers in metastasis is the first step in its prevention. To characterize metastasis marker genes (MGs) of breast cancer, XGBoost models that classify metastasis status were trained with gene expression profiles from TCGA. Then, a metastasis score (MS) was assigned to each gene by calculating the inner product between the feature importance and the AUC performance of the models. As a result, 54, 202, and 357 genes with the highest MS were characterized as MGs by empirical p-value cutoffs of 0.001, 0.005, and 0.01, respectively. The three sets of MGs were compared with those from existing metastasis marker databases, which provided significant results in most comparisons (p-value < 0.05). They were also significantly enriched in biological processes associated with breast cancer metastasis. The three MGs, SPPL2C, KRT23, and RGS7, showed highly significant results (p-value < 0.01) in the survival analysis. The MGs that could not be identified by statistical analysis (e.g., GOLM1, ELAVL1, UBP1, and AZGP1), as well as the MGs with the highest MS (e.g., ZNF676, FAM163B, LDOC2, IRF1, and STK40), were verified via the literature. Additionally, we checked how close the MGs were to each other in the protein–protein interaction networks. We expect that the characterized markers will help understand and prevent breast cancer metastasis.},

note = {Correspondence to Sunyong Yoo},

keywords = {Bioinformatics, Breast cancer, Feature importance, Gene expression, Machine learning, Metastasis marker},

pubstate = {published},

tppubtype = {article}

}

Close

5.

Myeonghyeon Jeong; Sunyong Yoo
Predicting the Fetotoxicity of Drugs Using Machine Learning Journal Article Domestic (KCI)
In: Journal of Life Science, vol. 33, no. 6, pp. 490–497, 2023, (Correspondence to Sunyong Yoo).

Abstract | Links | BibTeX | Dimensions | Tags: Machine learning

@article{jeong2023predicting,

title = {Predicting the Fetotoxicity of Drugs Using Machine Learning},

author = {Myeonghyeon Jeong and Sunyong Yoo},

url = {https://koreascience.kr/article/JAKO202320150261638.page},

doi = {10.5352/JLS.2023.33.6.490},

year  = {2023},

date = {2023-01-01},

urldate = {2023-01-01},

journal = {Journal of Life Science},

volume = {33},

number = {6},

pages = {490–497},

publisher = {Korean Society of Life Science},

abstract = {Pregnant women may need to take medications to treat preexisting diseases or diseases that develop during pregnancy. However, some drugs may be fetotoxic and lead to, for example, teratogenicity and growth retardation. Predicting the fetotoxicity of drugs is thus important for the health of the mother and fetus. The fetotoxicity of many drugs has not been established because various challenges hinder the ability of researchers to determine their fetotoxicity. The need exists for in silico-based fetotoxicity assessment models, as they can modernize the testing paradigm, improve predictability, and reduce the use of animals and the costs of fetotoxicity testing. In this study, we collected data on the fetotoxicity of drugs and constructed fetotoxicity prediction models based on various machine learning algorithms. We optimized the models for more precise predictions by tuning the hyperparameters. We then performed quantitative performance evaluations. The results indicated that the constructed machine learning-based models had high performance (AUROC >0.85, AUPR >0.9) in fetotoxicity prediction. We also analyzed the feature importance of our model's predictions, which could be leveraged to identify the specific features of drugs that are strongly associated with fetotoxicity. The proposed model can be used to prescreen drugs and drug candidates at a lower cost and in less time. It provides a predictive score for fetotoxicity risk, which may be beneficial in the design of studies on fetotoxicity in human pregnancy.},

note = {Correspondence to Sunyong Yoo},

keywords = {Machine learning},

pubstate = {published},

tppubtype = {article}

}

Close

4.

이소연; 유선용
기계학습을 활용한 화합물의 약인성 간 손상 예측 방법 연구 Journal Article Domestic (KCI)
In: 정보과학회논문지, vol. 50, no. 9, pp. 777–783, 2023, ISSN: 2383-6296, (Correspondence to Sunyong Yoo).

Abstract | Links | BibTeX | Dimensions | Tags: Hepatotoxicity, Machine learning

3.

Seonwoo Jung; Min-Keun Song; Eunjoo Lee; Sejin Bae; Yeon-Yong Kim; Doheon Lee; Myoung Jin Lee; Sunyong Yoo
Predicting ischemic stroke in patients with atrial fibrillation using machine learning Journal Article SCI
In: Frontiers in Bioscience-Landmark, vol. 27, no. 3, pp. 80, 2022, (Correspondence to Sunyong Yoo).

Abstract | Links | BibTeX | Dimensions | Tags: Atrial fibrillation, Attention mechanism, Deep learning, Machine learning, Medical informatics, National health insurance service, Stroke

@article{jung2022predicting,

title = {Predicting ischemic stroke in patients with atrial fibrillation using machine learning},

author = {Seonwoo Jung and Min-Keun Song and Eunjoo Lee and Sejin Bae and Yeon-Yong Kim and Doheon Lee and Myoung Jin Lee and Sunyong Yoo},

url = {https://www.imrpress.com/journal/FBL/27/3/10.31083/j.fbl2703080/htm?utm_source=TrendMD&utm_medium=cpc&utm_campaign=Frontiers_in_Bioscience-Landmark_TrendMD_1},

doi = {10.31083/j.fbl2703080},

year  = {2022},

date = {2022-03-04},

urldate = {2022-03-04},

journal = {Frontiers in Bioscience-Landmark},

volume = {27},

number = {3},

pages = {80},

publisher = {IMR Press},

abstract = {Background 

Atrial fibrillation (AF) is a well-known risk factor for stroke. Predicting the risk is important to prevent the first and secondary attacks of cerebrovascular diseases by determining early treatment. This study aimed to predict the ischemic stroke in AF patients based on the massive and complex Korean National Health Insurance (KNHIS) data through a machine learning approach. 

Methods 

We extracted 65-dimensional features, including demographics, health examination, and medical history information, of 754,949 patients with AF from KNHIS. Logistic regression was used to determine whether the extracted features had a statistically significant association with ischemic stroke occurrence. Then, we constructed the ischemic stroke prediction model using an attention-based deep neural network. The extracted features were used as input, and the occurrence of ischemic stroke after the diagnosis of AF was the output used to train the model. 

Results We found 48 features significantly associated with ischemic stroke occurrence through regression analysis (p-value < 0.001). When the proposed deep learning model was applied to 150,989 AF patients, it was confirmed that the occurrence ischemic stroke was predicted to be higher AUROC (AUROC = 0.727 ± 0.003) compared to CHA2DS2-VASc score (AUROC = 0.651 ± 0.007) and other machine learning methods. 

Conclusions 

As part of preventive medicine, this study could help AF patients prepare for ischemic stroke prevention based on predicted stoke associated features and risk scores.},

note = {Correspondence to Sunyong Yoo},

keywords = {Atrial fibrillation, Attention mechanism, Deep learning, Machine learning, Medical informatics, National health insurance service, Stroke},

pubstate = {published},

tppubtype = {article}

}

Close

2.

정선우; 이민지; 유선용
공공빅데이터를 활용한 기계학습 기반 뇌졸중 위험도 예측 Journal Article Domestic (KCI)
In: 한국항행학회논문지, vol. 25, no. 1, pp. 96–101, 2021.

Abstract | Links | BibTeX | Dimensions | Tags: Machine learning, Medical informatics

1.

Sunyong Yoo; Suhyun Ha; Moonshik Shin; Kyungrin Noh; Hojung Nam; Doheon Lee
A data-driven approach for identifying medicinal combinations of natural products Journal Article SCI
In: IEEE Access, vol. 6, pp. 58106–58118, 2018.

Abstract | Links | BibTeX | Dimensions | Tags: Database, Drugs, Ethnopharmacology, Machine learning