Publications
Selected Preprint
-
[arXiv 2025] Kechi Zhang, Huangzhao Zhang, Ge Li, Jinliang You, Jia Li, Yunfei Zhao, Zhi Jin, SEAlign: Alignment Training for Software Engineering Agent, arXiv preprint
-
[arXiv 2025] Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li, Zhi Jin, Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points, arXiv preprint
-
[arXiv 2024] Kechi Zhang, Ge Li, Yihong Dong, Jingjing Xu, Jun Zhang, Jing Su, Yongfei Liu, Zhi Jin, CodeDPO: Aligning Code Models with Self Generated and Verified Source Code, arXiv preprint
-
[arXiv 2023] Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin, ToolCoder: Teach Code Generation Models to Use API Search Tool, arXiv preprint, arXiv:2305.04032, 2023.
-
[arXiv 2022] Kechi Zhang, Ge Li, Zhi Jin, What Does Transformer Learn About Source Code?, arXiv preprint, arXiv:2207.08466, 2022.
Selected Publications
-
[EMSE 2025] Kechi Zhang, Jia Li, Zhuo Li, Zhi Jin, Ge Li, Transformer-based Code Model with Compressed Hierarchy Representation, Empirical Software Engineering, 10.1007/s10664-025-10612-6.
-
[ACL 2024] Kechi Zhang, Ge Li, Huangzhao Zhang, Zhi Jin, HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, Aug. 11-16, 2024. (Accepted)
-
[ACL 2024] Kechi Zhang, Jia Li, Ge Li, Xianjie Shi, Zhi Jin, CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, Aug. 11-16, 2024. (Accepted)
-
[ACL 2023] Kechi Zhang, Zhuo Li, Jia Allen Li, Ge Li, Zhi Jin, Self-Edit: Fault-Aware Code Editor for Code Generation, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, July 9-14, 2023.
-
[ICPC 2023] Kechi Zhang, Zhou Li, Zhi Jin, Ge Li, Implant Global and Local Hierarchy Information to Sequence based Code Representation Models, Proceedings of the 31st IEEE/ACM International Conference on Program Comprehension (ICPC), Melbourne Australia, May 15-16, 2023. (🏆ACM SIGSOFT Distinguished Paper Award🏆)
-
[ICPC 2022] Kechi Zhang, Wenhan Wang, Huangzhao Zhang, Ge Li, Zhi Jin, Learning to Represent Programs with Heterogeneous Graphs, Proceedings of the 30th ACM/IEEE International Conference on Program Comprehension (ICPC), Pittsburgh, PA, USA, May 16-17, 2022.
-
[ICSE, 2025] Siyuan Jiang, Jia Li, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Gen Wang, Yihong Dong, Kechi Zhang, Ge Li; aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion; Proceedings of the 47th IEEE/ACM International Conference on Software Engineering (ICSE), Ottawa, Ontario, Canada, Apr. 27 - May 3, 2025 (Accepted).
-
[SCIS, 2024] Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li, Jia Li, Yongmin Li, Yunfei Zhao, Yuqi Zhu, Fang Liu, Ge Li, Zhi Jin. Deep Learning for Code Generation: A Survey, Science China Information Sciences (SCIS), doi: 10.1007/s11432-023-3956-3, Feb 6, 2024.
-
[SANER 2023] Wenhan Wang, Kechi Zhang, Ge Li, Shangqing Liu, Anran Li, Zhi Jin, Yang Liu, Learning Program Representations with a Tree-Structured Transformer, Proceedings of the 30th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Macao SAR, China, March 21st-24th, 2023.
-
[TOSEM, 2023] Jia Allen Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu, CodeEditor: Learning to Edit Source Code with Pre-trained Models, ACM Transactions on Software Engineering and Methodology (TOSEM), Vol. 32, No. 6, May 22, 2023, pp 143 - 165.
-
[ICPC 2023] Mingyang Geng, Shangwen Wang, Dezun Dong, Haotian Wang, Shaomeng Cao, Kechi Zhang, Zhi Jin, Interpretation-based Code Summarization, Proceedings of the 31st IEEE/ACM International Conference on Program Comprehension (ICPC), Melbourne Australia, May 15-16, 2023.