Publications
Selected Preprint
-
[arXiv 2025] Kechi Zhang, Ge Li, Jia Li, Huangzhao Zhang, Jingjing Xu, Hao Zhu, Lecheng Wang, Jia Li, Yihong Dong, Jing Mai, Bin Gu, and Zhi Jin, Computational Thinking Reasoning in Large Language Models, arXiv preprint
-
[arXiv 2025] Kechi Zhang, Huangzhao Zhang, Ge Li, Jinliang You, Jia Li, Yunfei Zhao, Zhi Jin, SEAlign: Alignment Training for Software Engineering Agent, arXiv preprint
-
[arXiv 2023] Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin, ToolCoder: Teach Code Generation Models to Use API Search Tool, arXiv preprint, arXiv:2305.04032, 2023.
-
[arXiv 2022] Kechi Zhang, Ge Li, Zhi Jin, What Does Transformer Learn About Source Code?, arXiv preprint, arXiv:2207.08466, 2022.
Selected Publications
-
[ACL 2025] Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li, Zhi Jin, Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points
-
[ACL 2025] Kechi Zhang, Ge Li, Yihong Dong, Jingjing Xu, Jun Zhang, Jing Su, Yongfei Liu, Zhi Jin, CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
-
[EMSE 2025] Kechi Zhang, Jia Li, Zhuo Li, Zhi Jin, Ge Li, Transformer-based Code Model with Compressed Hierarchy Representation, Empirical Software Engineering, 10.1007/s10664-025-10612-6.
-
[ACL 2024] Kechi Zhang, Ge Li, Huangzhao Zhang, Zhi Jin, HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, Aug. 11-16, 2024.
-
[ACL 2024] Kechi Zhang, Jia Li, Ge Li, Xianjie Shi, Zhi Jin, CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, Aug. 11-16, 2024.
-
[ACL 2023] Kechi Zhang, Zhuo Li, Jia Allen Li, Ge Li, Zhi Jin, Self-Edit: Fault-Aware Code Editor for Code Generation, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, July 9-14, 2023.
-
[ICPC 2023] Kechi Zhang, Zhou Li, Zhi Jin, Ge Li, Implant Global and Local Hierarchy Information to Sequence based Code Representation Models, Proceedings of the 31st IEEE/ACM International Conference on Program Comprehension (ICPC), Melbourne Australia, May 15-16, 2023. (🏆ACM SIGSOFT Distinguished Paper Award🏆)
-
[ICPC 2022] Kechi Zhang, Wenhan Wang, Huangzhao Zhang, Ge Li, Zhi Jin, Learning to Represent Programs with Heterogeneous Graphs, Proceedings of the 30th ACM/IEEE International Conference on Program Comprehension (ICPC), Pittsburgh, PA, USA, May 16-17, 2022.
-
[ACL 2025] Xiancai Chen, Zhengwei Tao, Kechi Zhang, Changzhi Zhou, Xinyu Zhang, Wanli Gu, Yuanpeng He, Mengdi Zhang, Xunliang Cai, Haiyan Zhao, Zhi Jin; Revisit Self-Debugging with Self-Generated Tests for Code Generation
-
[ACL 2025] Jia Li, Xuyuan Guo, Lei Li, Kechi Zhang, Ge Li, Jia Li, Zhengwei Tao, Fang Liu, Chongyang Tao, Yuqi Zhu, Zhi Jin; Benchmarking Long-Context Language Models on Long Code Understanding
-
[ICSE, 2025] Siyuan Jiang, Jia Li, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Gen Wang, Yihong Dong, Kechi Zhang, Ge Li; aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion; Proceedings of the 47th IEEE/ACM International Conference on Software Engineering (ICSE), Ottawa, Ontario, Canada, Apr. 27 - May 3, 2025 (Accepted).
-
[SCIS, 2024] Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li, Jia Li, Yongmin Li, Yunfei Zhao, Yuqi Zhu, Fang Liu, Ge Li, Zhi Jin. Deep Learning for Code Generation: A Survey, Science China Information Sciences (SCIS), doi: 10.1007/s11432-023-3956-3, Feb 6, 2024.
-
[SANER 2023] Wenhan Wang, Kechi Zhang, Ge Li, Shangqing Liu, Anran Li, Zhi Jin, Yang Liu, Learning Program Representations with a Tree-Structured Transformer, Proceedings of the 30th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Macao SAR, China, March 21st-24th, 2023.
-
[TOSEM, 2023] Jia Allen Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu, CodeEditor: Learning to Edit Source Code with Pre-trained Models, ACM Transactions on Software Engineering and Methodology (TOSEM), Vol. 32, No. 6, May 22, 2023, pp 143 - 165.
-
[ICPC 2023] Mingyang Geng, Shangwen Wang, Dezun Dong, Haotian Wang, Shaomeng Cao, Kechi Zhang, Zhi Jin, Interpretation-based Code Summarization, Proceedings of the 31st IEEE/ACM International Conference on Program Comprehension (ICPC), Melbourne Australia, May 15-16, 2023.