Junyuan "Jason" Hong

Junyuan "Jason" Hong

Incoming Assistant Professor

National University of Singapore

I am an incoming Assistant Professor at the ECE department of the National University of Singapore, after spending one year at Massachusetts General Hospital & Harvard Medical School. Previously, I was a postdoctoral fellow advised by Dr. Atlas Wang in the Institute for Foundations of Machine Learning (IFML), affiliated with the UT AI Health Lab and the Good System Challenge, and obtained my Ph.D. in Computer Science and Engineering from Michigan State University with Dr. Jiayu Zhou.

I was recognized as one of the MLSys Rising Stars in 2024 and received a Best Paper Nomination at VLDB 2024. My work was covered by Nature News, The White House, WIRED, Forbes, and FORTUNE.

I lead the Cognitive Science & Trustworthy AI (CoSTA@NUS) Lab exploring the frontier where human minds meet machine intelligence. Openings are available in terms of RA/internship students.

Recent News
Awards
Funding
I am grateful that our research is supported by the multiple programs.
Interests
  • Responsible AI
  • Healthcare
  • Privacy
Education
  • PhD in CSE, 2023

    Michigan State University (Advisor: Jiayu Zhou)

    Committee: Anil K. Jain, Sijia Liu, Atlas Wang, Jiayu Zhou

  • MSc in Computer Science, 2018

    University of Science and Technology of China

  • BSc in Physics, minor in CS., 2015

    University of Science and Technology of China

Publications

.js-id-Selected
TMLR 2026 POPS: Recovering Unlearned Multi-Modality Knowledge in MLLMs with Prompt-Optimized Parameter Shaking.
PDF
FMEA@CVPR 2026 A Physics-Grounded Benchmark for Multi-Agent Dynamics in World Models.
PDF
arXiv 2026 The Last Human-Written Paper: Agent-Native Research Artifacts.
PDF Code Blog
CAIS 2026 Scaling Textual Gradients via Sampling-Based Momentum.
PDF
arXiv 2026 CATNIP: LLM Unlearning via Calibrated and Tokenized Negative Preference Alignment.
PDF
ArXiv 2025 LLMs Can Get "Brain Rot"!.
PDF Code 🌍 Website
ICRA 2026 AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback.
PDF
COLM 2025 LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning.
PDF Code
COLM 2025 More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment.
PDF
ICML 2025 GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning.
PDF 🏁 Competition
COLM 2025 SEAL: Steerable Reasoning Calibration of Large Language Models for Free.
PDF Code
EMNLP (Main) 2025 MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models.
PDF Code πŸ€— Dataset 🌍 Website
FM4Science 2024 DeepOSets: Non-Autoregressive In-Context Learning of Supervised Learning Operators.
PDF
NAACL (Main) 2025 Extracting and Understanding the Superficial Knowledge in Alignment.
PDF Code
NAACL 2025 GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing.
PDF
VLDB (Best Paper Finalist) 2024 LLM-PBE: Assessing Data Privacy in Large Language Models.
PDF Code 🌍 Website 🏁 Competition πŸ† Best Paper Nomination Finetune Code
ICML 2024 Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression.
PDF πŸ€— Models 🌍 Website
ICML 2024 Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark.
PDF Code πŸ‘¨β€πŸ«Tutorial
ICLRW 2024 A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention.
PDF Website πŸ€–Demo
AISTATS 2024 On the Generalization Ability of Unsupervised Pretraining.
PDF
ICLR 2024 Safe and Robust Watermark Injection with a Single OoD Image.
PDF Code
SaTML 2024 Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk.
PDF Code
ICLR (Spotlight) 2024 DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer.
PDF Code
NeurIPS-RegML 2023 Who Leaked the Model? Tracking IP Infringers in Accountable Federated Learning.
PDF
NeurIPS 2023 Understanding Deep Gradient Leakage via Inversion Influence Functions.
PDF Code
KDDW 2023 FedNoisy: A Federated Noisy Label Learning Benchmark.
PDF Code
ICML 2023 Revisiting Data-Free Knowledge Distillation with Poisoned Teachers.
PDF Code Poster
TMLR 2023 How Robust is Your Fairness? Evaluating and Sustaining Fairness under Unseen Distribution Shifts.
PDF
ICLR 2023 MECTA: Memory-Economic Continual Test-Time Model Adaptation.
PDF Code Slides
ICLR (Spotlight) 2023 Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection.
PDF Code
AAAI (Oral) 2023 Federated Robustness Propagation: Sharing Adversarial Robustness in Federated Learning.
Preprint Code Poster
Preprint 2022 Precautionary Unfairness in Self-Supervised Contrastive Pre-training.
Preprint
NeurIPS 2022 Outsourcing Training without Uploading Data via Efficient Collaborative Open-Source Sampling.
PDF Poster Slides
NeurIPS 2022 Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork.
PDF Code
ICML 2022 Resilient and Communication Efficient Learning for Heterogeneous Federated Systems.
PDF
ICLR 2022 Efficient Split-Mix Federated Learning for On-Demand and In-Situ Customization.
PDF Code Slides Video
KDD 2021 Federated Adversarial Debiasing for Fair and Transferable Representations.
PDF Code Slides
ICML 2021 Data-Free Knowledge Distillation for Heterogeneous Federated Learning.
Preprint Code
AAAI 2021 Learning Model-Based Privacy Protection under Budget Constraints.
PDF Slides Video Supplementary
TNNLS 2019 Short Sequence Classification Through Discriminable Linear Dynamical System.
DOI
ECML 2016 Sequential Data Classification in the Space of Liquid State Machines.
PDF Code

Experiences

Experiences

Media Coverage

Invited Talks & Guest Lectures

  • ‘Brain Rot in LLMs: When Benign Data Degrades Intelligence?’ @ LockLLM Workshop at NeurIPS 2025, Nov 2025.
  • ‘GenAI-Based Chatbot for Early Dementia Intervention’ @ Rising Star Symposium Series, IEEE TCCN Special Interest Group for AI and Machine Learning in Security, September, 2024: [link]
  • ‘Building Conversational AI for Affordable and Accessible Early Dementia Intervention’ @ AI Health Course, The School of Information, UT Austin, April, 2024: [paper]
  • ‘Shake to Leak: Amplifying the Generative Privacy Risk through Fine-Tuning’ @ Good Systems Symposium: Shaping the Future of Ethical AI, UT Austin, March, 2024: [paper]
  • ‘Foundation Models Meet Data Privacy: Risks and Countermeasures’ @ Trustworthy Machine Learning Course, Virginia Tech, Nov, 2023
  • ‘Economizing Mild-Cognitive-Impairment Research: Developing a Digital Twin Chatbot from Patient Conversations’ @ BABUΕ KA FORUM, Nov, 2023: [link]
  • ‘Backdoor Meets Data-Free Learning’ @ Hong Kong Baptist University, Sep, 2023: [slides]
  • ‘MECTA: Memory-Economic Continual Test-Time Model Adaptation’ @ Computer Vision Talks, March, 2023: [slides] [video]
  • ‘Split-Mix Federated Learning for Model Customization’ @ TrustML Young Scientist Seminars, July, 2022: [link] [video]
  • ‘Federated Adversarial Debiasing for Fair and Transferable Representations’, @ CSE Graduate Seminar, Michigan State University, October, 2021: [slides]
  • ‘Dynamic Policies on Differential Private Learning’ @ VITA Seminars, UT Austin, Sep, 2020: [slides]

Services

Teaching