Homepage - Kun Chen

Kun Chen 陈坤 Kun Chen

PhD Student

MAIS, Institute of Automation, Chinese Academy of Sciences

博士研究生

中国科学院自动化研究所多模态人工智能系统全国重点实验室

👨‍🔬 About Me

I am Kun Chen, a two-year PhD student at the State Key Laboratory of Multimodal Artificial Intelligence Systems(MAIS), Institute of Automation, Chinese Academy of Sciences (CASIA), majoring in Computer Application Technology. Before that, I received my B.Eng. degree in Automation from University of Electronic Science and Technology of China (UESTC) in 2024.

🌱 Interests

My research interests focus on Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs), including training, alignment (RLHF/RLVR), agentic systems, and long-context extension. I thrive on following cutting-edge AI research, actively contributing to open-source communities.

👨‍🔬 关于我

我是陈坤，中国科学院自动化研究所多模态人工智能系统全国重点实验室（MAIS）二年级博士研究生，专业为计算机应用技术。此前，于2024年获得电子科技大学自动化专业工学学士学位。

🌱 研究兴趣

我的研究方向聚焦于大语言模型（LLMs）和多模态大语言模型（MLLMs），涵盖模型训练、对齐（RLHF/RLVR）、智能体系统以及长上下文扩展等方向。我热衷于追踪前沿AI研究，积极参与开源社区贡献。

chenkun2024(at)ia.ac.cn Google Scholar谷歌学术 GitHub WeChat微信

Education教育经历

Institute of Automation, Chinese Academy of Sciences

Computer Application Technology
Ph.D. Student, Supervised by Wenji Mao

Sep. 2024 - present
University of Electronic Science and Technology of China

Automation (GPA 3.94)
B.Eng. Student

Sep. 2020 - Jun. 2024

中国科学院自动化研究所

计算机应用技术
博士研究生，导师：毛文吉

2024年9月 - 至今
电子科技大学

自动化（GPA 3.94）
工学学士

2020年9月 - 2024年6月

Experience工作经历

Meituan

Algorithm Research Intern

Aug. 2025 - Feb. 2026
Wenge Tech

Algorithm Engineer, AI Research Institute

Jun. 2024 - Jun. 2025

美团

算法研究实习生

2025年8月 - 2026年2月
中科闻歌

算法工程师，AI研究院

2024年6月 - 2025年6月

Honors & Awards荣誉奖励

Outstanding Student of UESTC (Only 10 undergraduate students)

2023
National First Prize, 18th Challenge Cup National College Students' Extracurricular Academic Science and Technology Works Contest

2023
National First Prize, 18th National University Students Smart Car Race

2023
Meritorious Winner, Mathematical Contest in Modeling

2023
Multiple National Scholarships & SZSE Scholarships

2020 - 2025

电子科技大学成电杰出学生（全校仅10名本科生）

2023
第十八届"挑战杯"全国大学生课外学术科技作品竞赛全国一等奖

2023
第十八届全国大学生智能车竞赛全国一等奖

2023
美国大学生数学建模竞赛 Meritorious Winner

2023
多次获得国家奖学金及深交所奖学金

2020 - 2025

News最新动态

2026

Our paper SPECS has been accepted by ICLR 2026 as a Poster! Read more 我们的论文 SPECS 被 ICLR 2026 接收为 Poster！查看详情

Jan 19

2025

Started a research internship at Meituan, Basic Algorithm R&D.开始在美团基础算法研发部进行研究实习。

Jul 31

Our paper DEMO has been accepted by ACL 2025 Findings! Read more 我们的论文 DEMO 被 ACL 2025 Findings 接收！查看详情

Apr 30

2024

Started my PhD at the Institute of Automation, Chinese Academy of Sciences (CASIA).开始在中国科学院自动化研究所（CASIA）攻读博士学位。

Aug 31

Joined Wenge Tech as an Algorithm Engineer at the AI Research Institute.加入中科闻歌AI研究院，担任算法工程师。

May 31

2023

Honored as Outstanding Student of UESTC — one of only 10 undergraduates selected university-wide.荣获电子科技大学优秀学生称号——全校仅10名本科生入选。

Nov 30

Selected Publications精选论文 (view all)(查看全部)

APEX-Searcher: Augmenting LLMs' Search Capabilities through Agentic Planning and Execution

Kun Chen, Qingchao Kong^#, Feifei Zhao, Wenji Mao

ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) 2026 Under Review

We propose APEX-Searcher, a framework that augments LLMs' search capabilities through agentic planning and execution for improved information retrieval.我们提出了APEX-Searcher，一个通过智能体规划与执行来增强大语言模型搜索能力的框架，以改善信息检索效果。

[arXiv]

APEX-Searcher: Augmenting LLMs' Search Capabilities through Agentic Planning and Execution

Kun Chen, Qingchao Kong^#, Feifei Zhao, Wenji Mao

ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) 2026 Under Review

[arXiv]

Flexible Entropy Control in RLVR with Gradient-Preserving Perspective

Kun Chen, Peng Shi^#, Fanfan Liu, Haibo Qiu, Zhixiong Zeng, Siqi Yang, Wenji Mao^#

International Conference on Machine Learning (ICML) 2026 Under Review

From the perspective of gradient preservation, we propose an entropy increase and decrease regulation mechanism, as well as three strategies for entropy control in the training of Reinforcement Learning with Verifiable Rewards (RLVR).我们从梯度保持的视角出发，提出了一种熵增熵减的调节机制，并提出三种在可验证奖励强化学习（RLVR）训练中控制熵的策略。

[arXiv] [code]

Flexible Entropy Control in RLVR with Gradient-Preserving Perspective

Kun Chen, Peng Shi^#, Fanfan Liu, Haibo Qiu, Zhixiong Zeng, Siqi Yang, Wenji Mao^#

International Conference on Machine Learning (ICML) 2026 Under Review

[arXiv] [code]

SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start

Kun Chen*, Peng Shi*, Haibo Qiu, Zhixiong Zeng, Siqi Yang, Wenji Mao^#, Lin Ma^#

International Conference on Learning Representations (ICLR) 2026 Poster

We propose SPECS, a self-distilled preference-based cold start method that decouples multimodal learning to improve multimodal reasoning capabilities.我们提出了SPECS，一种基于自蒸馏偏好的冷启动方法，通过解耦多模态学习来提升多模态推理能力。

[arXiv] [code]

SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start

Kun Chen*, Peng Shi*, Haibo Qiu, Zhixiong Zeng, Siqi Yang, Wenji Mao^#, Lin Ma^#

International Conference on Learning Representations (ICLR) 2026 Poster

[arXiv] [code]

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling

Minzheng Wang, Xinghua Zhang, Kun Chen, Nan Xu, Haiyang Yu, Fei Huang, Wenji Mao^#, Yongbin Li^#

Annual Meeting of the Association for Computational Linguistics (ACL) 2025 Findings

We propose DEMO, a framework that reframes dialogue interaction through fine-grained element modeling for improved dialogue understanding and generation.我们提出了DEMO，一个通过细粒度元素建模来重构对话交互的框架，以提升对话理解与生成能力。

[ACL Anthology] [code]

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling

Minzheng Wang, Xinghua Zhang, Kun Chen, Nan Xu, Haiyang Yu, Fei Huang, Wenji Mao^#, Yongbin Li^#

Annual Meeting of the Association for Computational Linguistics (ACL) 2025 Findings

[ACL Anthology] [code]