Enhancing Software Engineering Through Closed-Loop Memory Optimization
Abstract
Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. However, these agents remain fundamentally episodic: they fail to retain, refine, and reuse experiences across tasks, repeatedly reconstructing context from scratch and reproducing similar mistakes. Even with memory support, they offer no remedy for the absence of a principled, task-agnostic memory utility, making them difficult to evaluate rigorously or generalize across agents and settings. To tackle these limitations, we introduce , a closed-loop framework for memory augmentation in SE agents. grounds memory utility in validated downstream impact, establishing utility as both a task-agnostic evaluation benchmark and an annotation-free optimization signal. Through complementary evaluation on single-episode and cross-episode memory augmentation, results demonstrate that consistently improves SE agents across settings, achieving absolute gains of up to 5.25\% in success rate and 4.63\% in resolve efficiency, while substantially reducing computational cost by ≥9.79\%. Our project page: https://xhguo7.github.io/MemOp/https://xhguo7.github.io/MemOp/.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.