Multi-Agent LLM Orchestration for Software Engineering

Jan 1, 2024 · 1 min read
projects

Overview

Large language models have demonstrated remarkable capabilities in code generation, but individual models often produce code with functional errors or performance inefficiencies. This project investigates multi-agent orchestration frameworks that coordinate multiple LLMs through structured collaboration pipelines — combining specialized agents for categorization, code generation, debugging, and refinement.

Key Contributions

Our flagship system, PerfOrch, introduces a memory-augmented multi-agent architecture where:

  • A Categorizing Agent classifies programming tasks using a fixed category vocabulary to enable retrieval of relevant historical solutions.
  • Generator and Debugger Agents collaborate through iterative cycles to produce functionally correct code.
  • A Refinement Agent optimizes code performance using aggregated insights from the orchestration’s memory module.
  • The architecture leverages asymmetric aggregation strategies (product vs. sum) across different pipeline stages.

We evaluate across 5 frontier LLMs and demonstrate significant improvements over single-model baselines on both correctness and runtime performance metrics.

Status

Manuscript submitted to ACM Transactions on Software Engineering and Methodology (TOSEM).

Sen He
Authors
Sen He (he/him)
Assistant Professor
Sen He is an Assistant Professor in the Department of Electrical and Computer Engineering at the University of Arizona. His research spans LLM4SE, performance engineering, computer vision, and human-computer interaction.