Che-Yi Liao

Che-Yi Liao

(廖哲儀)

George Fellow | Seth Bonder Fellow

PhD Candidate, Machine Learning

Georgia Tech Georgia Institute of Technology

Atlanta, GA, USA

cliao48 at gatech dot edu

I am a PhD Candidate in Machine Learning at Georgia Institute of Technology, developing robust AI systems for real-world high-stakes applications. My work integrates methods from AI/ML, Statistics, and Operations Research.

My current research focuses on data-efficient personalized LLMs and constrained alignment. I developed a provably safe personalized RL and validated it on treatment recommendations by fusing tabular survey data and multivariate clinical time series, achieving +130% reliability in generated treatment trajectories and -23% patient effort-to-change, compared to real treatment trajectories and benchmark algorithms. [Link to Paper]

On the industry side, I interned at Meta, where I optimized early-stage ads ranking via advanced deep learning architectures and foundation model knowledge distillation, while building end-to-end validation pipelines from data ingestion to deployment.

I enjoy cross-functional collaboration, with extensive experience working with researchers and engineers on theoretical and technical constraints (Meta, Bosch) and partnering with medical practitioners on clinical validity (Michigan Medicine, Emory Hospital, VA Hospital, Georgia DPH).


I am actively seeking an AI Research Scientist position starting in May 2026. I am especially interested in roles focused on efficient LLM/Agent personalization and alignment for business process modeling.

Research Interests: Data Curation, Personalized LLM, Constrained Alignment, AI for Business Process Modeling

Publications

Google Scholar | * indicates authors with equal contribution
Knowledge Distillation for Process Control(tentative) (2025)
Che-Yi Liao, Gian-Gabriel P. Garcia, Kamran Paynabar
Keywords: Process Control, LLM Knowledge Distillation
Collaboration: Georgia Tech, UW
Imputing Multivariate Time Series (tentative) (2025)
Che-Yi Liao, Zheng Dong, Gian-Gabriel P. Garcia, Kamran Paynabar
Keywords: Data Imputation
Collaboration: Georgia Tech, UW, Amazon
Constraint-Aware Self-Improving Large Language Model for Clinical Role Model Generation (2025)
Che-Yi Liao, Esmaeil Keyvanshokooh, Gian-Gabriel P. Garcia
Under Review
Keywords: Personalized Medicine, Uncertainty in AI/ML Diagnosis, LLM Constrained Alignment, LLM Personalization, Active Learning, Personzlied RL
Collaboration: Georgia Tech, Texas A&M, UW
Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection (2025)
Che-Yi Liao, Esmaeil Keyvanshokooh, Francisco J Pasquel, Gian-Gabriel P. Garcia
Under Major Revision
Keywords: Personalized Medicine, Uncertainty in AI/ML Diagnosis, Data-Driven Distributionally Robust Optimization, Active Learning
Collaboration: Georgia Tech, Texas A&M, UW, Emory University
🏆 Finalist of 2023 Lee B. Lusted Student Prize Competition (QMTD Track)
Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction thumbnail
Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction (2025)
Che-Yi Liao, Zheng Dong, Gian-Gabriel P. Garcia, Kamran Paynabar, Yao Xie, Mohammad S. Jalali
Manufacturing & Service Operations Management (MSOM)
Keywords: Opioid-Overdose Deaths, Spatiotemporal Modeling, Point-Process Network, Data-Sharing Policies
Collaboration: Georgia Tech, MIT, Harvard Medical School
🏆 Winner of 2022 Lee B. Lusted Student Prize Competition (QMTD Track)
🏆 Gold Student Scholorship in 2023 INFORMS Workshop on Data Science
Balancing access, precision, and equity in adaptive test site allocation with an application to COVID-19 in Atlanta, Georgia thumbnail
Balancing access, precision, and equity in adaptive test site allocation with an application to COVID-19 in Atlanta, Georgia (2025)
Thomas W Hsiao*, Che-Yi Liao*, Lance A Waller, Kamran Paynabar
Scientific Reports
Keywords: Sequential Testing Site Allocation, Health Equity, Spatiotemporal Modeling, Multi-Objective Optimization
Collaboration: Georgia Tech, Emory University
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes - Methodology and Validation Study thumbnail
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes - Methodology and Validation Study (2025)
Yang Yang*, Che-Yi Liao*, Esmaeil Keyvanshokooh, Hui Shao, Mary Beth Weber, Francisco J Pasquel, Gian-Gabriel P Garcia
JMIR Medical Informatics
Keywords: Chronic Disease Management, Explainable AI, Multi-Criteria Decision Making
Collaboration: Georgia Tech, Texas A&M, Emory University
Estimating Hidden Epidemic - A Bayesian Spatiotemporal Compartmental Modeling Approach thumbnail
Estimating Hidden Epidemic - A Bayesian Spatiotemporal Compartmental Modeling Approach (2025)
Che-Yi Liao*, Peiliang Bai*, Lance A. Waller, Paynabar Kamran
INFORMS Journal on Data Science (IJDS)
Keywords: Hidden Epidemic, Stochastic Compartmental Modeling, Bayesian Spatiotemporal Modeling
Collaboration: Georgia Tech, Emory University
Racial Disparities in Opioid Overdose Deaths in Massachusetts thumbnail
Racial Disparities in Opioid Overdose Deaths in Massachusetts (2022)
Che-Yi Liao, Gian-Gabriel P. Garcia, Catherine DiGennaro, Mohammad S. Jalali
JAMA Network Open
Keywords: Opioid-Overdose Deaths, Health Equity, Time Series Analysis
Collaboration: Georgia Tech, MIT, Harvard Medical School
Evaluating patient triage strategies for non-emergency outpatient procedures under reduced capacity due to the covid-19 pandemic thumbnail
Evaluating patient triage strategies for non-emergency outpatient procedures under reduced capacity due to the covid-19 pandemic (2020)
Adam VanDeusen, Che-Yi Liao*, Advaidh Venkat, Amy Cohn, Jacob Kurlander, Sameer Saini
2020 Winter Simulation Conference (WSC)
Keywords: Patient Triage Strategies, COVID-19, Discrete-Event Simulation

Awards

2025

  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2024

2023

  • Gold Student Scholorship, 2023 INFORMS Workshop on Data Science

    Awarded to early version of Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction

  • Stephen Pauker Award in Quantitative Methods (Finalist), Society of Medical Decision Making (SMDM)”

    Awarded to early version of Augmenting Individualized Treatment Planning via Data-Driven Clinical Role Model Selection

  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2022

  • Stephen Pauker Award in Quantitative Methods (Winner), Society of Medical Decision Making (SMDM)
    • Award Details
    • Awarded to early version of Tides Need STEMMED: A Locally Operating Spatio-Temporal Mutually Exciting Point Process with Dynamic Network for Improving Opioid Overdose Death Prediction
  • George Family Fellowship for Research Excellence in Healthcare System, Georgia Tech

2021

  • McLean Fellowship for Distinguished Incoming Student, Georgia Tech
  • Clyde W. and Nadra S. Johnson Award for Research Excellence, UMich
  • Seth Bonder Fellowship for Healthcare Engineering Research, UMich Center for Healthcare Engineering & Patient Safety

Academic Service

Session Chair

2023   |   INFORMS Annual Meeting (Session on Healthcare Analytics in Emerging Data Settings)

Journal Reviewer

IEEE Transactions on Automation Science and Engineering (IEEE-TASE)
Health Care Management Science (HCMS)

Conference Paper Reviewer

2024   |   IISE Annual Conference & Expo

Research Award Reviewer

2024   |   Georgia Tech President's Undergraduate Research Awards
2023   |   Georgia Tech President's Undergraduate Research Awards
2023   |   Georgia Tech Annual Undergraduate Research Symposium
2022   |   Georgia Tech President's Undergraduate Research Awards

Conference Abstract Reviewer

2024   |   SMDM Annual Meeting
2023   |   SMDM Annual Meeting

Community Service

2023-2024   |   Student Liaison, INFORMS Health Applications Society
2022-2023   |   Student Liaison, INFORMS Health Applications Society

Invited Talks

2025/10 Talk at INFORMS 2025 Annual Meeting on "Constraint-Aware Self-Improving Large Language Model for Clinical Role Model Generation"
Since 2022 Presented at INFORMS Annual Meetings, POMS Annual Conferences, SMDM Annual Meetings, and IISE Annual Conferences on "Various topics on AI/ML in Healthcare and Operations Research"

Teaching

@ Georgia Tech | Atlanta, GA, USA
Teaching Assistant
Fall 2025   |   ISYE 6525 - High Dimensional Data Analysis
Spring 2022   |   ISYE 4031 - Regression and Forecasting
Fall 2021   |   ISYE 2027 - Probability with Applications
@ University of Naples Federico II | Naples, Italy
Guest Lecturer
Fall 2024 (2 weeks)   |   PhD School on Towards Zero Emissions Mobility
Topics on High Dimensional Data Analysis