Chung-En Sun

Chung-En Sun

Ph.D. Student, Computer Science, UC San Diego

cesun [at] ucsd.edu
Google ScholarGoogle Scholar GitHubGitHub CVCV

About Me

I am a Ph.D. student in Computer Science at the University of California, San Diego, advised by Prof. Tsui-Wei (Lily) Weng. My research focuses on the robustness, safety, and interpretability of Large Language Models. Recently, I have been exploring how the hidden representations of reasoning models influence their reasoning capabilities. Feel free to reach out if you're interested in collaboration, discussion, or have any questions!

Education

Publications

Preprints

>>ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models[code]
Chung-En Sun, Ge Yan, Tsui-Wei Weng.
arXiv, 2025.

Accepted Papers

>>Concept Bottleneck Large Language Models[code]
Chung-En Sun, Tuomas Oikarinen, Berk Ustun, Tsui-Wei Weng.
ICLR 2025.
>>Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Chung-En Sun, Xiaodong Liu, Weiwei Yang, Tsui-Wei Weng, Hao Cheng, Aidan San, Michel Galley, Jianfeng Gao.
NAACL 2025 Main Oral.
>>Interpretable Generative Models through Post-hoc Concept Bottlenecks
Akshay Kulkarni, Ge Yan, Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng.
CVPR 2025.
>>Effective Skill Unlearning through Intervention and Abstention
Yongce Li, Chung-En Sun, Tsui-Wei Weng.
NAACL 2025 Main.
>>Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents[code]
Chung-En Sun, Sicun Gao, Tsui-Wei Weng.
ICML 2024.
>>Crafting Large Language Models for Enhanced Interpretability
Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng.
ICML Workshop 2024.
>>Fooling GPT with Adversarial In-Context Examples for Text Classification
Sudhanshu Ranjan, Chung-En Sun, Linbo Liu, Tsui-Wei Weng.
NeurIPS Workshop 2023.
>>Melody Harmonization Using Orderless NADE
Chung-En Sun, Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang.
ICASSP 2021.
>>NTIRE 2020 Challenge on NonHomogeneous Dehazing
Ju-Chin Chao, Tsung-Shan Yang, Peng-Wen Chen, Po-Min Hsu, Tzu-Yi Liao, Chung-En Sun, Pei-Yuan Wu.
CVPR Workshop 2020.

Experience

Work Experience