Che-Yi Liao

Che-Yi Liao

(廖哲儀)

George Fellow | Seth Bonder Fellow

PhD Candidate, Machine Learning

Georgia Tech Georgia Institute of Technology

Atlanta, GA, USA

cliao48 at gatech dot edu

I have received my PhD in Machine Learning from the Georgia Institute of Technology, developing robust AI systems for real-world, high-stakes applications by integrating AI/ML, Statistics, and Operations Research from a data-centric perspective.

My research tackles core data challenges in AI deployment, e.g., incomplete input data, reporting delays, tension between local variation and global trends, and distributional mismatch between training and test data, and develops training/inference solutions that leverage Operations Research techniques to improve the performance, robustness, and scalability of AI systems.

My recent directions include data-efficient personalized LLMs, constrained alignment, and interpretable knowledge distillation from foundation models.


I am actively seeking AI Research Scientist roles in industry. If you know of opportunities, want to discuss research collaborations, or just to say hi — please reach out!


News

  • [2026-04] Excited to share that I have successfully defended my PhD dissertation, Operationalization of AI Systems Through Data-Centric Design! Huge thanks to my advisor, committee members, collaborators, and everyone who supported me along the way. Looking forward to the next chapter of my research journey!
  • [2026-02] Our paper, Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection, is a finalist for the CHOM Best Paper Award at the 2026 POMS Annual Conference! This work addresses how distribution shifts, caused by frequent updates to AI/ML health tools, affect the recommendation of clinical role models — prior cases similar to the current patient — and the subsequent treatment planning. The winner will be announced in Reno, NV, May 7–11. Fingers crossed!
  • [2026-01] Our new paper, Multivariate Time Series Data Imputation via Distributionally Robust Regularization, tackles distribution shifts in time series imputation — a common challenge due to nonstationarity and structural missing patterns. Check it out here !

Publications

Google Scholar | * indicates authors with equal contribution
Development and Evaluation of Cardiovascular Disease Risk Prediction Models for Patients with Type 2 Diabetes
 thumbnail
Development and Evaluation of Cardiovascular Disease Risk Prediction Models for Patients with Type 2 Diabetes (2026)
Yang Yang*, Tian Liu*, Che-Yi Liao, Sun Ju Lee, Esmaeil Keyvanshokooh, Hui Shao, Mary Beth Weber, Francisco J. Pasquel, Gian-Gabriel P. Garcia
Keywords: Type 2 Diabetes, ML Risk Prediction, ML Evaluation
Collaboration: Georgia Tech, Texas A&M, Emory, UW
Knowledge Distillation for Process Control(tentative) (2026)
Che-Yi Liao, Gian-Gabriel P. Garcia, Kamran Paynabar
Under Review
Keywords: Process Control, LLM Knowledge Distillation
Collaboration: Georgia Tech, UW
Multivariate Time Series Data Imputation via Distributionally Robust Regularization (2026)
Che-Yi Liao, Zheng Dong, Gian-Gabriel P. Garcia, Kamran Paynabar
Under Review
Keywords: Time Series, Missing Data Imputation, Optimal Transport, Distributionally Robust Optimization
Collaboration: Georgia Tech, UW, Amazon
Constraint-Aware Self-Improving Large Language Model for Clinical Role Model Generation (2025)
Che-Yi Liao, Esmaeil Keyvanshokooh, Gian-Gabriel P. Garcia
Under Review
Keywords: Personalized Medicine, Uncertainty in AI/ML Diagnosis, LLM Constrained Alignment, LLM Personalization, Active Learning, Personzlied RL
Collaboration: Georgia Tech, Texas A&M, UW
Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection (2025)
Che-Yi Liao, Esmaeil Keyvanshokooh, Francisco J Pasquel, Gian-Gabriel P. Garcia
Under Major Revision
Keywords: Personalized Medicine, Uncertainty in AI/ML Diagnosis, Data-Driven Distributionally Robust Optimization, Active Learning
Collaboration: Georgia Tech, Texas A&M, UW, Emory University
🏆 Finalist (winner to be announced) of 2026 College of Healthcare Operations Management (CHOM) Best Paper Award at POMS 2026
🏆 Finalist of 2023 Lee B. Lusted Student Prize Competition (QMTD Track) at SMDM Annual Meeting
Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction thumbnail
Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction (2025)
Che-Yi Liao, Zheng Dong, Gian-Gabriel P. Garcia, Kamran Paynabar, Yao Xie, Mohammad S. Jalali
Manufacturing & Service Operations Management (MSOM)
Keywords: Opioid-Overdose Deaths, Spatiotemporal Modeling, Point-Process Network, Data-Sharing Policies
Collaboration: Georgia Tech, MIT, Harvard Medical School
🏆 Winner of 2022 Lee B. Lusted Student Prize Competition (QMTD Track) at SMDM Annual Meeting
🏆 Gold Student Scholorship in 2023 INFORMS Workshop on Data Science
Balancing access, precision, and equity in adaptive test site allocation with an application to COVID-19 in Atlanta, Georgia thumbnail
Balancing access, precision, and equity in adaptive test site allocation with an application to COVID-19 in Atlanta, Georgia (2025)
Thomas W Hsiao*, Che-Yi Liao*, Lance A Waller, Kamran Paynabar
Scientific Reports
Keywords: Sequential Testing Site Allocation, Health Equity, Spatiotemporal Modeling, Multi-Objective Optimization
Collaboration: Georgia Tech, Emory University
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes - Methodology and Validation Study thumbnail
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes - Methodology and Validation Study (2025)
Yang Yang*, Che-Yi Liao*, Esmaeil Keyvanshokooh, Hui Shao, Mary Beth Weber, Francisco J Pasquel, Gian-Gabriel P Garcia
JMIR Medical Informatics
Keywords: Chronic Disease Management, Explainable AI, Multi-Criteria Decision Making
Collaboration: Georgia Tech, Texas A&M, Emory University
Estimating Hidden Epidemic - A Bayesian Spatiotemporal Compartmental Modeling Approach thumbnail
Estimating Hidden Epidemic - A Bayesian Spatiotemporal Compartmental Modeling Approach (2025)
Che-Yi Liao*, Peiliang Bai*, Lance A. Waller, Paynabar Kamran
INFORMS Journal on Data Science (IJDS)
Keywords: Hidden Epidemic, Stochastic Compartmental Modeling, Bayesian Spatiotemporal Modeling
Collaboration: Georgia Tech, Emory University
Racial Disparities in Opioid Overdose Deaths in Massachusetts thumbnail
Racial Disparities in Opioid Overdose Deaths in Massachusetts (2022)
Che-Yi Liao, Gian-Gabriel P. Garcia, Catherine DiGennaro, Mohammad S. Jalali
JAMA Network Open
Keywords: Opioid-Overdose Deaths, Health Equity, Time Series Analysis
Collaboration: Georgia Tech, MIT, Harvard Medical School
Evaluating patient triage strategies for non-emergency outpatient procedures under reduced capacity due to the covid-19 pandemic thumbnail
Evaluating patient triage strategies for non-emergency outpatient procedures under reduced capacity due to the covid-19 pandemic (2020)
Adam VanDeusen, Che-Yi Liao*, Advaidh Venkat, Amy Cohn, Jacob Kurlander, Sameer Saini
2020 Winter Simulation Conference (WSC)
Keywords: Patient Triage Strategies, COVID-19, Discrete-Event Simulation

Awards

2026

  • CHOM Best Paper Award (Finalist), 2026 POMS Annual Conference

    Awarded to an early version of Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection; winner to be announced at 2026 POMS Annual Conference, Reno, NV, May 7–11.

2025

  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2024

2023

  • Gold Student Scholarship, 2023 INFORMS Workshop on Data Science

    Awarded to early version of Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction

  • Stephen Pauker Award in Quantitative Methods (Finalist), Society of Medical Decision Making (SMDM)

    Awarded to early version of Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection

  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2022

  • Stephen Pauker Award in Quantitative Methods (Winner), Society of Medical Decision Making (SMDM)
    • Award Details
    • Awarded to early version of Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction
  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2021

  • McLean Fellowship for Distinguished Incoming Student, Georgia Tech
  • Clyde W. and Nadra S. Johnson Award for Research Excellence, UMich
  • Seth Bonder Fellowship for Healthcare Engineering Research, UMich Center for Healthcare Engineering & Patient Safety

Academic Service

Session Chair

2023   |   INFORMS Annual Meeting (Session on Healthcare Analytics in Emerging Data Settings)

Journal Reviewer

IEEE Transactions on Automation Science and Engineering (IEEE-TASE)
IISE Transactions on Healthcare Systems Engineering
Health Care Management Science (HCMS)

Conference Paper Reviewer

2026   |   Forty-third International Conference on Machine Learning (ICML 2026)
2024   |   IISE Annual Conference & Expo

Research Award Reviewer

2024   |   Georgia Tech President's Undergraduate Research Awards
2023   |   Georgia Tech President's Undergraduate Research Awards
2023   |   Georgia Tech Annual Undergraduate Research Symposium
2022   |   Georgia Tech President's Undergraduate Research Awards

Conference Abstract Reviewer

2024   |   SMDM Annual Meeting
2023   |   SMDM Annual Meeting

Community Service

2023-2024   |   Student Liaison, INFORMS Health Applications Society
2022-2023   |   Student Liaison, INFORMS Health Applications Society

Invited Talks

2025/10 Talk at INFORMS 2025 Annual Meeting on "Constraint-Aware Self-Improving Large Language Model for Clinical Role Model Generation"
Since 2022 Presented at INFORMS Annual Meetings, POMS Annual Conferences, SMDM Annual Meetings, and IISE Annual Conferences on "Various topics on AI/ML in Healthcare and Operations Research"

Teaching

@ Georgia Tech | Atlanta, GA, USA
Teaching Assistant
Spring 2026   |   ISYE 6525 - High Dimensional Data Analysis (both on-campus and online sections)
Fall 2025   |   ISYE 6525 - High Dimensional Data Analysis
Spring 2022   |   ISYE 4031 - Regression and Forecasting
Fall 2021   |   ISYE 2027 - Probability with Applications
@ University of Naples Federico II | Naples, Italy
Guest Lecturer
Fall 2024 (2 weeks)   |   PhD School on Towards Zero Emissions Mobility
Topics on High Dimensional Data Analysis