Profile Picture

Xinchen Wang

王欣辰



I am a master's student in the School of Computer Science and Technology at Harbin Institute of Technology, Shenzhen. I have the privilege of being advised by Prof. Cuiyun Gao. I received my Bachelor's degree from Harbin Institute of Technology, Shenzhen. My research interests primarily focus on Artificial Intelligence for Software Engineering (AI4SE) and Code Intelligence.

I welcome any communication, so feel free to reach out! You can contact me via email at 200111115@stu.hit.edu.cn or on WeChat at gghhzzxhkazz22.

📝 Publications

Repo2Run

[NeurIPS'25] Repo2Run: Automated Building Executable Environment for Code Repository at Scale

Ruida Hu, Chao Peng*, Xinchen Wang, Junjielong Xu, Cuiyun Gao*.

Annual Conference on Neural Information Processing Systems (CCF-A Conference)

We introduce Repo2Run, the first LLM-based agent aiming at automating the building of executable test environments for any repositories at scale. Specifically, given a code repository, Repo2Run iteratively builds the Docker image, runs unit tests based on the feedback of the building, and synthesizes the Dockerfile until the entire pipeline is executed successfully.

🏆 Spotlight Paper

CodeVisionary

[ASE'25] An Agent-based Evaluation Framework for Complex Code Generation

Xinchen Wang, Ruida Hu, Pengfei Gao, Chao Peng, Cuiyun Gao*.

The IEEE/ACM International Conference on Automated Software Engineering (CCF-A Conference)

We propose CodeVisionary, the first agent-based evaluation framework for complex code generation. CodeVisionary consists of two stages: (1) Requirement-guided multi-dimensional context distillation stage, which first formulates a detailed evaluation plan by decomposing task requirements, and then stepwise collects multi-dimensional contextual information for each requirement. (2) Fine-grained scoring and summarization stage, which defines self-directed and negotiation-based actions, allowing multiple judges to comprehend complex code from fine-grained and diverse viewpoints, and reach a consensus through discussion. A comprehensive evaluation report is also generated for enhanced explainability.

🏆 Directly Accepted Without Revision (9.9%)

AEGIS

[FSE'25 Industry Track] AEGIS: An Agent-based Framework for Bug Reproduction from Issue Descriptions

Xinchen Wang, Pengfei Gao, Xiangxin Meng, Chao Peng, Ruida Hu, Yun Lin, Cuiyun Gao*.

The ACM International Conference on the Foundations of Software Engineering (CCF-A Conference)

We propose an automated bug reproduction script generation framework named AEGIS. AEGIS consists of two main modules: (1) Bug-related context summarization module, aiming at condensing the retrieved information into structural context through further reranking and summarization. (2) Finite state machine (FSM)-guided script generation module, which aims at guiding the script modification process with proposed FSM which contains predefined modification rules.

CodeRepoQA

[SIGIR'25 Short Paper] Understanding Large Language Model Performance in Software Engineering: A Large-scale Question Answering Benchmark

Ruida Hu, Chao Peng, Jingyi Ren, Bo Jiang, Xiangxin Meng, Qinyun Wu, Pengfei Gao, Xinchen Wang, Cuiyun Gao*.

The International ACM SIGIR Conference on Research and Development in Information Retrieval (CCF-A Conference)

We introduce CodeRepoQA, a large-scale benchmark specifically designed for evaluating repository-level question-answering capabilities in the field of software engineering. CodeRepoQA is a multi-turn question-answering benchmark with 585,687 entries. It covers a diverse array of software engineering scenarios, with an average of 6.62 dialogue turns per entry.

ReposVul

[ICSE'24 Industry Challenge Track] ReposVul: A Repository-Level High-Quality Vulnerability Dataset

Xinchen Wang, Ruida Hu, Cuiyun Gao*, Xin-Cheng Wen, Yujia Chen, Qing Liao.

The IEEE/ACM International Conference on Software Engineering (CCF-A Conference)

We propose an automated data collection framework and construct the first repository-level high-quality vulnerability dataset named ReposVul. The proposed framework mainly contains three modules: (1) A vulnerability untangling module, aiming at distinguishing vulnerability-fixing related code changes from tangled patches. (2) A multi-granularity dependency extraction module, aiming at capturing the inter-procedural call relationships of vulnerabilities. (3) A trace-based filtering module, aiming at filtering the outdated patches.

🏆 Best Paper Award of the Track

PILOT

[ASE'23] When Less is Enough: Positive and Unlabeled Learning Model for Vulnerability Detection

Xin-Cheng Wen, Xinchen Wang, Cuiyun Gao*, Shaohua Wang, Yang Liu, Zhaoquan Gu.

The IEEE/ACM International Conference on Automated Software Engineering (CCF-A Conference)

We propose a novel model named PILOT for vulnerability detection. It mainly contains two modules: (1) A distance-aware label selection module, aiming at generating pseudo-labels for selected unlabeled data, which involves the inter-class distance prototype and progressive fine-tuning; (2) A mixed-supervision representation learning module to further alleviate the influence of noise and enhance the discrimination of representations.

📜 Technical Reports

TraeAgent

Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling

Trae Research Team: Pengfei Gao, Zhao Tian, Xiangxin Meng, Xinchen Wang, Ruida Hu, Yuanan Xiao, Yizhou Liu, Zhao Zhang, Junjie Chen, Cuiyun Gao, Yun Lin, Yingfei Xiong, Chao Peng, Xia Liu.

We propose Trae Agent, the first agent-based ensemble reasoning approach for repository-level issue resolution. Trae Agent formulates our goal as an optimal solution search problem and addresses two key challenges, i.e., large ensemble spaces and repository-level understanding, through modular agents for generation, pruning, and selection. We conduct extensive experiments using three leading LLMs on the widely-adopted SWE-bench benchmark, comparing Trae Agent against four state-of-the-art ensemble reasoning techniques.

MarscodeAgent

Marscode agent: AI-native automated bug fixing

Yizhou Liu, Pengfei Gao, Xinchen Wang, Jie Liu, Yexuan Shi, Zhao Zhang, Chao Peng.

Recent advances in large language models (LLMs) have shown significant potential to automate various software development tasks, including code completion, test generation, and bug fixing. However, the application of LLMs for automated bug fixing remains challenging due to the complexity and diversity of real-world software systems. In this paper, we introduce MarsCode Agent, a novel framework that leverages LLMs to automatically identify and repair bugs in software code. MarsCode Agent combines the power of LLMs with advanced code analysis techniques to accurately localize faults and generate patches. Our approach follows a systematic process of planning, bug reproduction, fault localization, candidate patch generation, and validation to ensure high-quality bug fixes. We evaluated MarsCode Agent on SWE-bench, a comprehensive benchmark of real-world software projects, and our results show that MarsCode Agent achieves a high success rate in bug fixing compared to most of the existing automated approaches.

🏆 Honors and Awards

💻 Internships

2024.06 - now, Trae Research, ByteDance, China. Working with Chao Peng and Pengfei Gao.

📖 Educations

2024.09 – now, M.Eng. in Computer Science and Technology, Harbin Institute of Technology, Shenzhen

2020.09 – 2024.06, B.Eng. in Computer Science and Technology, Harbin Institute of Technology, Shenzhen