Junyuan "Jason" Hong

Postdoctoral Fellow

University of Texas at Austin

I am a joint postdoctoral fellow advised by Dr. Zhangyang Wang in the Institute for Foundations of Machine Learning (IFML) and Wireless Networking and Communications Group (WNCG), and also affiliated with the UT AI Health Lab as well as the Good System Challenge. I was recognized as one of the MLSys Rising Stars in 2024 and received a Best Paper Nomination at VLDB 2024. My work was covered by The White House, and MSU Office of Research and Innovation. Part of my work is funded by OpenAI Researcher Access Program.

Check my curricula vitae and feel free to drop me an email if you are interested in collaboration.

Research

My research vision is to harmonize, understand, and deploy Responsible AI: Optimizing AI systems that balance real-world constraints in computational efficiency, data privacy, and ethical norms through comprehensive threat analysis and the development of integrative trustworthy, resource-aware collaborative learning frameworks. Guided by this principle, I aim to lead a research group combining rigorous theoretical foundations with a commitment to developing algorithm tools that have a meaningful real-world impact, particularly in healthcare applications.

T1: Harmonizing Multifaceted Values in AI Trust.

Trust in AI is complex, reflecting the intricate web of social norms and values. Pursuing only one aspect of trustworthiness while neglecting others may lead to unintended consequences. For instance, overzealous privacy protection can come at the price of transparency, robustness, or fairness. To address these challenges, I have developed innovative collaborative learning approaches that balance key aspects of trustworthy AI, including privacy-preserving learning [FL4DM23&PETs23 ] with fairness guarantees [KDD21, TMLR23], enhanced robustness [AAAI23, ICLR23a], and provable computation and data efficiency [ICLR22, FAccT22, NeurIPS22a, ICLR24]. These methods are designed to create AI systems that uphold individual privacy while remaining efficient, fair, and accountable.

Privacy + Efficiency via Edge-Cloud Collaboration

[ICLR24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

Junyuan Hong, Jiachen T. Wang, Chenhui Zhang, Zhangheng Li, Bo Li, Zhangyang Wang

Privacy + Fairness via Federated Transfer

[KDD21] Federated Adversarial Debiasing for Fair and Transferable Representations

Junyuan Hong, Zhuangdi Zhu, Shuyang Yu, Hiroko Dodge, Zhangyang Wang, Jiayu Zhou

T2: Understanding Multi-faceted Emerging Risks in GenAI Trust.

As AI evolves from traditional machine learning to generative AI (GenAI), new privacy and trust challenges arise, yet remain opaque due to the complexity of AI models. My research aims to anticipate and address these challenges by developing theoretical frameworks that generalize privacy risk analysis across AI architectures [NeurIPS23], introducing novel threat models for generation-driven transfer learning [ICML23] and pre-trained foundation models [SaTML24], and leveraging insights from integrative benchmarks [VLDB24 , ICML24]. This deeper understanding of GenAI risks further informs the creation of collaborative or multi-agent learning paradigms that prioritize privacy [ICLR24] and safety [arXiv24].

Benchmark Trust under Compression

[ICML24] Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Junyuan Hong, Jinhao Duan, Chenhui Zhang, Zhangheng Li, Chulin Xie, Kelsey Lieberman, James Diffenderfer, Brian Bartoldson, Ajay Jaiswal, Kaidi Xu, Bhavya Kailkhura, Dan Hendrycks, Dawn Song, Zhangyang Wang, Bo Li

Benchmark Privacy in LLM Lifecycle

[VLDB24 ] LLM-PBE: Assessing Data Privacy in Large Language Models

Qinbin Li, Junyuan Hong, Chulin Xie, Jeffrey Tan, Rachel Xin, Junyi Hou, Xavier Yin, Zhun Wang, Dan Hendrycks, Zhangyang Wang, Bo Li, Bingsheng He, Dawn Song

Theoretical Risk Analysis

[NeurIPS23] Understanding Deep Gradient Leakage via Inversion Influence Functions

Haobo Zhang, Junyuan "Jason" Hong, Yuyang Deng, Mehrdad Mahdavi, Jiayu Zhou

T3: Deploying AI Aligned with Human Norms in Dementia Healthcare.

To ground my research in real-world impacts, I am actively exploring applications in healthcare, a domain where trust, privacy, and fairness are paramount. My projects include clinical-protocol-compliant conversational AI for dementia prevention [ICLRW24] and fair, in-home AI-driven early dementia detection [KDD21, AD20]. These initiatives serve as testbeds for responsible AI principles, particularly in ensuring ethical considerations like patient autonomy, data confidentiality, and equitable access to technology, while demonstrating AI’s potential to improve lives.

Protocol-Compliant Dementia Intervention

[ICLRW24] A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention

Junyuan Hong, Wenqing Zheng, Han Meng, Siqi Liang, Anqing Chen, Hiroko H. Dodge, Jiayu Zhou, Zhangyang Wang

In-home Dementia Detection

[AD20] Detecting MCI using real-time, ecologically valid data capture methodology: How to improve scientific rigor in digital biomarker analyses

Junyuan Hong, Jeffrey Kaye, Hiroko H Dodge, Jiayu Zhou

Detecting MCI using real-time, ecologically valid data capture methodology: How to improve scientific rigor in digital biomarker analyses

Publications

COLM 2025 LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning.
Gabriel J. Perin, Runjin Chen, Xuxi Chen, Nina S. T. Hirata, Zhangyang Wang, Junyuan "Jason" Hong.

COLM 2025 More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment.
Yifan Wang, Runjin Chen, Bolian Li, David Cho, Yihe Deng, Ruqi Zhang, Tianlong Chen, Zhangyang Wang, Ananth Grama, Junyuan "Jason" Hong.

ArXiv 2025 Scaling Textual Gradients via Sampling-Based Momentum.
Zixin Ding, Junyuan "Jason" Hong, Jiachen T. Wang, Zinan Lin, Zhangyang Wang, Yuxin Chen.

ICML 2025 GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning.
Zhen Xiang, Linzhi Zheng, Yanjie Li, Junyuan "Jason" Hong, Qinbin Li, Han Xie, Jiawei Zhang, Zidi Xiong, Chulin Xie, Carl Yang, Dawn Song, Bo Li.

COLM 2025 SEAL: Steerable Reasoning Calibration of Large Language Models for Free.
Runjin Chen, Zhenyu Zhang, Junyuan "Jason" Hong, Souvik Kundu, Zhangyang Wang.

ArXiv 2025 MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models.
Shrey Pandit, Jiawei Xu, Junyuan "Jason" Hong, Zhangyang Wang, Tianlong Chen, Kaidi Xu, Ying Ding.

NAACL 2025 Extracting and Understanding the Superficial Knowledge in Alignment.
Runjin Chen, Gabriel Jacob Perin, Xuxi Chen, Xilun Chen, Yan Han, Nina S. T. Hirata, Junyuan "Jason" Hong, Bhavya Kailkhura.

NAACL 2025 GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing.
Jinhao Duan, Xinyu Zhao, Zhuoxuan Zhang, Eunhye Grace Ko, Lily Boddy, Chenan Wang, Tianhao Li, Alexander Rasgon, Junyuan "Jason" Hong, Min Kyung Lee, Chenxi Yuan, Qi Long, Ying Ding, Tianlong Chen, Kaidi Xu.

VLDB (Best Paper Finalist) 2024 LLM-PBE: Assessing Data Privacy in Large Language Models.
Qinbin Li, Junyuan "Jason" Hong, Chulin Xie, Jeffrey Tan, Rachel Xin, Junyi Hou, Xavier Yin, Zhun Wang, Dan Hendrycks, Zhangyang Wang, Bo Li, Bingsheng He, Dawn Song.

ICML 2024 Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression.
Junyuan "Jason" Hong, Jinhao Duan, Chenhui Zhang, Zhangheng Li, Chulin Xie, Kelsey Lieberman, James Diffenderfer, Brian Bartoldson, Ajay Jaiswal, Kaidi Xu, Bhavya Kailkhura, Dan Hendrycks, Dawn Song, Zhangyang Wang, Bo Li.

ICML 2024 Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark.
Yihua Zhang, Pingzhi Li, Junyuan "Jason" Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen.

ICLRW 2024 A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention.
Junyuan "Jason" Hong, Wenqing Zheng, Han Meng, Siqi Liang, Anqing Chen, Hiroko H. Dodge, Jiayu Zhou, Zhangyang Wang.

AISTATS 2024 On the Generalization Ability of Unsupervised Pretraining.
Yuyang Deng, Junyuan "Jason" Hong, Jiayu Zhou, Mehrdad Mahdavi.

ICLR 2024 Safe and Robust Watermark Injection with a Single OoD Image.
Shuyang Yu, Junyuan "Jason" Hong, Haobo Zhang, Haotao Wang, Zhangyang Wang, Jiayu Zhou.

SaTML 2024 Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk.
Zhangheng Li, Junyuan "Jason" Hong, Bo Li, Zhangyang Wang.

ICLR (Spotlight) 2024 DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer.
Junyuan "Jason" Hong, Jiachen T. Wang, Chenhui Zhang, Zhangheng Li, Bo Li, Zhangyang Wang.

NeurIPS-RegML 2023 Who Leaked the Model? Tracking IP Infringers in Accountable Federated Learning.
Shuyang Yu, Junyuan "Jason" Hong, Yi Zeng, Fei Wang, Ruoxi Jia, Jiayu Zhou.

NeurIPS 2023 Understanding Deep Gradient Leakage via Inversion Influence Functions.
Haobo Zhang, Junyuan "Jason" Hong, Yuyang Deng, Mehrdad Mahdavi, Jiayu Zhou.

KDDW 2023 A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection.
Haobo Zhang, Junyuan "Jason" Hong, Fan Dong, Steve Drew, Liangjie Xue, Jiayu Zhou.

KDDW 2023 FedNoisy: A Federated Noisy Label Learning Benchmark.
Siqi Liang, Jintao Huang, Junyuan "Jason" Hong, Fan Dong, Dun Zeng, Jiayu Zhou, Zenglin Xu.

ICML 2023 Revisiting Data-Free Knowledge Distillation with Poisoned Teachers.
Junyuan "Jason" Hong, Yi Zeng, Shuyang Yu, Lingjuan Lyu, Ruoxi Jia, Jiayu Zhou.

TMLR 2023 How Robust is Your Fairness? Evaluating and Sustaining Fairness under Unseen Distribution Shifts.
Haotao Wang, Junyuan "Jason" Hong, Jiayu Zhou, Zhangyang Wang.

ICLR 2023 MECTA: Memory-Economic Continual Test-Time Model Adaptation.
Junyuan "Jason" Hong, Lingjuan Lyu, Jiayu Zhou, Michael Spranger.

ICLR (Spotlight) 2023 Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection.
Shuyang Yu, Junyuan "Jason" Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou.

AAAI (Oral) 2023 Federated Robustness Propagation: Sharing Adversarial Robustness in Federated Learning.
Junyuan "Jason" Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou.

Preprint 2022 Precautionary Unfairness in Self-Supervised Contrastive Pre-training.
Junyuan "Jason" Hong, Haotao Wang, Haobo Zhang, Zhangyang Wang, Jiayu Zhou.

NeurIPS 2022 Outsourcing Training without Uploading Data via Efficient Collaborative Open-Source Sampling.
Junyuan "Jason" Hong, Lingjuan Lyu, Jiayu Zhou, Michael Spranger.

NeurIPS 2022 Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork.
Haotao Wang, Junyuan "Jason" Hong, Aston Zhang, Jiayu Zhou, Zhangyang Wang.

ICML 2022 Resilient and Communication Efficient Learning for Heterogeneous Federated Systems.
Zhuangdi Zhu, Junyuan "Jason" Hong, Steve Drew, Jiayu Zhou.

FAccT 2022 Dynamic Privacy Budget Allocation Improves Data Efficiency of Differentially Private Gradient Descent.
Junyuan "Jason" Hong, Zhangyang Wang, Jiayu Zhou.

ICLR 2022 Efficient Split-Mix Federated Learning for On-Demand and In-Situ Customization.
Junyuan "Jason" Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou.

KDD 2021 Federated Adversarial Debiasing for Fair and Transferable Representations.
Junyuan "Jason" Hong, Zhuangdi Zhu, Shuyang Yu, Hiroko Dodge, Zhangyang Wang, Jiayu Zhou.

ICML 2021 Data-Free Knowledge Distillation for Heterogeneous Federated Learning.
Zhuangdi Zhu, Junyuan "Jason" Hong, Jiayu Zhou.

AAAI 2021 Learning Model-Based Privacy Protection under Budget Constraints.
Junyuan "Jason" Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou.

AD 2020 Detecting MCI using real-time, ecologically valid data capture methodology: How to improve scientific rigor in digital biomarker analyses.
Junyuan "Jason" Hong, Jeffrey Kaye, Hiroko H Dodge, Jiayu Zhou.

TKDD 2019 Variant Grassmann Manifolds: A Representation Augmentation Method for Action Recognition.
Junyuan "Jason" Hong, Yang Li, Huanhuan Chen.

TNNLS 2019 Short Sequence Classification Through Discriminable Linear Dynamical System.
Yang Li, Junyuan "Jason" Hong, Huanhuan Chen.

KDD (Oral) 2018 Disturbance Grassmann Kernels for Subspace-Based Learning.
Junyuan "Jason" Hong, Huanhuan Chen, Feng Lin.

ECML 2016 Sequential Data Classification in the Space of Liquid State Machines.
Yang Li, Junyuan "Jason" Hong, Huanhuan Chen.

Experiences

Postdoctoral Fellow with Dr. Zhangyang Wang, VITA group@UT Austin, IFML and WNCG, 2023-Now.
Research Intern with Dr. Lingjuan Lyu, Sony AI, 2022

Media Coverage

Texas ECE Student and Postdoc Named MLCommons Rising Stars, UT Austin ECE News, 2024
At Summit for Democracy, the United States and the United Kingdom Announce Winners of Challenge to Drive Innovation in Privacy-enhancing Technologies That Reinforce Democratic Values, The White House, 2023
Privacy-enhancing Research Earns International Attention, MSU Engineering News, 2023
Privacy-Enhancing Research Earns International Attention, MSU Office Of Research And Innovation, 2023

Invited Talks & Guest Lectures

‘GenAI-Based Chatbot for Early Dementia Intervention’ @ Rising Star Symposium Series, IEEE TCCN Special Interest Group for AI and Machine Learning in Security, September, 2024: [link]
‘Building Conversational AI for Affordable and Accessible Early Dementia Intervention’ @ AI Health Course, The School of Information, UT Austin, April, 2024: [paper]
‘Shake to Leak: Amplifying the Generative Privacy Risk through Fine-Tuning’ @ Good Systems Symposium: Shaping the Future of Ethical AI, UT Austin, March, 2024: [paper]
‘Foundation Models Meet Data Privacy: Risks and Countermeasures’ @ Trustworthy Machine Learning Course, Virginia Tech, Nov, 2023
‘Economizing Mild-Cognitive-Impairment Research: Developing a Digital Twin Chatbot from Patient Conversations’ @ BABUŠKA FORUM, Nov, 2023: [link]
‘Backdoor Meets Data-Free Learning’ @ Hong Kong Baptist University, Sep, 2023: [slides]
‘MECTA: Memory-Economic Continual Test-Time Model Adaptation’ @ Computer Vision Talks, March, 2023: [slides] [video]
‘Split-Mix Federated Learning for Model Customization’ @ TrustML Young Scientist Seminars, July, 2022: [link] [video]
‘Federated Adversarial Debiasing for Fair and Transferable Representations’, @ CSE Graduate Seminar, Michigan State University, October, 2021: [slides]
‘Dynamic Policies on Differential Private Learning’ @ VITA Seminars, UT Austin, Sep, 2020: [slides]

Services

Organizers: GenAI4Health@NeurIPS (Lead Chair), The Competition for LLM and Agent Safety 2024, The NeurIPS 2024 LLM Privacy Challenge, FL4Data-Mining Workshop@KDD 2023 (Lead Chair), FedKDD Workshop 2024 (Co-Lead Chair)
Area Chair: NeurIPS 25
External Reviewer: NeurIPS 22-24 (Top Reviewer, 2023), ICML 22-25, ICLR 23-24, KDD 22-24, ECML-PKDD 23, AISTATS 23, WSDM 22, AISTATS 22, AAAI 21-23, IJCAI 19, NeuroComputing, TKDD, TKDE, JAIR, TDSC, ACM Health
Volunteer: KDD 18, 21

Teaching

Mentor@VRT-CHAT: Designing Reminiscence-Therapy Chatbots with Culturally-Sensitive Visual Stimulation for Mental Health, RAI4Ukraine Program, Center for Responsible AI at NYU, 2024
Mentor@A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention, Directed Reading Program (DiRP), UT Austin, 2024
Mentor@Directed Reading Program (DiRP) on Trustworthy LLM, UT Austin, 2023
Teaching Assistant@CSE 847: Machine Learning, MSU, 2021
Teaching Assistant@CSE 404: Introduction to Machine Learning, MSU, 2020

Mentored Students:

Zhangheng Li (2023 - Now), Ph.D. student, University of Texas at Austin
SaTML 2024 (first author), ICML 2024 (co-first author), ICLR 2024
Runjin Chen (2023 - Now), Ph.D. student, University of Texas at Austin
NACCL 2025 (first author)
Wes Robbins (2024 - Now), Ph.D. student, University of Texas at Austin
Gabriel Jacob Perin (2023 - Now), Undergraduate student, University of São Paulo, Brazil
EMNLP 2024 (first author), NACCL 2025
Ostap Kilbasovych (2024 - Now), Undergraduate student, Ivan Franko National University of Lviv, Ukraine
Jeffrey Tan (2023 - 2024), Undergraduate student, University of California, Berkeley
VLDB 2024 (Best Paper Nomination)
Shuyang Yu (2020 - 2023), Ph.D. student, Michigan State University
ICLR 2024 (first author), ICLR 2023 (spotlight; first author), NeurIPSW 2023 (first author), ICML 2023, KDD 2021
Haobo Zhang (2022 - 2023), Ph.D. student, Michigan State University
NeurIPS 2023 (first author), KDDW 2023 (first author)
Team member, 3rd place winner at US-UK PETs (Privacy-enhancing technologies) Prize Challenge, 2023.
Siqi Liang (2022 - 2023), Ph.D. student, Michigan State University
KDDW 2023 (first author)