
AgentJet

AgentJet (AJet) is a cutting-edge, user-friendly agent RL training framework for optimizing agents and agentic workflows by fine-tuning the underlying LLM's weights. It supports any agent built with the OpenAI SDK, AgentScope, LangChain, or raw HTTP requests.

AgentJet (AJet) supports fully distributed swarm training: run `ajet-swarm start` on your GPU server(s), then train agents from your laptop(s). Simply provide your agent workflow, training dataset, and reward function, and AgentJet is ready to go!
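The three inputs above can be sketched in Python. Note this is an illustrative sketch only: the names (`run_agent`, `reward_fn`) and the dataset shape are assumptions for exposition, not the actual AgentJet API.

```python
# Illustrative sketch of the three inputs AgentJet needs; the names and
# shapes below are hypothetical, not the real AgentJet interface.

# 1. Agent workflow: any callable that drives an LLM (OpenAI SDK,
#    AgentScope, LangChain, or raw HTTP) and returns a final answer.
def run_agent(llm, question: str) -> str:
    return llm(question)

# 2. Training dataset: a list of task samples.
dataset = [
    {"question": "What is 2 + 3?", "answer": "5"},
    {"question": "What is 7 * 6?", "answer": "42"},
]

# 3. Reward function: scores one rollout, e.g. exact-match accuracy.
def reward_fn(prediction: str, sample: dict) -> float:
    return 1.0 if prediction.strip() == sample["answer"] else 0.0
```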

Fast Introduction

Classic Mode

Let's begin with the simplest example: a math agent with a tool call. This is a simple, centralized training example.

  1. Check out the installation guide to set up the training environment.
  2. Tune your first model with the minimal example:
    ajet --conf ./tutorial/example_math_agent/math_agent.yaml --backbone='verl'
    

Swarm Mode

Now the simplest AgentJet Swarm example: also a math agent. This time, you can use any GPU-less laptop to train the model remotely.

  1. Start the swarm server and begin swarm overwatch: `ajet-swarm start` and `ajet-swarm overwatch`. (Alternatively, if you prefer Docker, use our prebuilt Docker image without setting up any dependencies.)
  2. From your laptop (or swarm server localhost), run this simple script to begin training:
    AJET_SWARM_URL="http://swarm-server-ip:10086" python ./tutorial/example_math_swarm/math.py
    

Key Features


Example Library

Explore our rich library of examples to kickstart your journey:


Project Structure

AgentJet makes agent fine-tuning straightforward by separating the developer interface from the internal execution logic.

Basic Modules

To optimize an agent, you provide three core inputs:

  • Agent workflow — the agent or agentic workflow to optimize
  • Training dataset — the tasks the agent trains on
  • Reward function — how each rollout is scored

AgentJet Architecture

The internal system orchestrates several specialized modules to handle the complexities of RL training and agent interactions.

| Module | Description |
| --- | --- |
| Launcher | Manages background service processes (Ray, vLLM) and routes the backbone |
| Task Rollout | Bridges LLM engines and manages the Gym environment lifecycle |
| Task Runner | Executes the agent workflow and calculates rewards |
| Model Tuner | Forwards inference requests from the workflow to the LLM engine |
| Context Tracker | Monitors LLM calls and automatically merges shared-history timelines (1.5x-10x efficiency boost) |
| Swarm Server | A data interchange center that accepts OpenAI-like requests and engine instructions, activated only in AgentJet Swarm mode |
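The Context Tracker's efficiency gain comes from a simple observation: successive LLM calls in one agent run often share a long common message prefix, which only needs to be processed once. The sketch below illustrates that idea only; it is not the actual Context Tracker implementation.

```python
# Illustrative only: why merging shared-history timelines saves work.
# Two LLM calls whose message lists share a prefix only need the shared
# part processed once. Not the actual Context Tracker code.

def shared_prefix_len(a: list, b: list) -> int:
    """Number of leading messages two call histories have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

call_1 = [
    {"role": "system", "content": "You are a math agent."},
    {"role": "user", "content": "What is 2 + 3?"},
]
call_2 = call_1 + [
    {"role": "assistant", "content": "5"},
    {"role": "user", "content": "Now multiply it by 4."},
]

# call_2 extends call_1, so its first two messages need no reprocessing.
print(shared_prefix_len(call_1, call_2))  # → 2
```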

Swarm Architecture

When swarm training mode is enabled, one additional component is activated:

  • Swarm Data Interchange Server: Maintains the HTTP service, listening for swarm instructions and OpenAI-compatible requests, and establishes a high-speed ZeroMQ channel to coordinate the other modules.
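Because the Swarm Server accepts OpenAI-like requests, a client can talk to it with a standard chat-completions payload. The sketch below only constructs such a payload; the endpoint path and model name are assumptions, not documented AgentJet values.

```python
import json

# Hypothetical sketch: an OpenAI-compatible chat-completions payload a
# client might send to the swarm server. The endpoint path and model
# name are assumptions, not documented AgentJet values.
swarm_url = "http://swarm-server-ip:10086"
endpoint = f"{swarm_url}/v1/chat/completions"  # assumed OpenAI-style path

payload = {
    "model": "trained-math-agent",  # hypothetical model name
    "messages": [
        {"role": "user", "content": "What is 2 + 3?"},
    ],
}
body = json.dumps(payload)
```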

Roadmap

AgentJet is a constantly evolving project. We are planning to add the following features in the near future.

| Category | Feature | Status |
| --- | --- | --- |
| Examples | Add LoRA training examples | Todo |
| Infra | Optimize configurations for long-context adaptation on smaller GPUs | In Progress |
| Capability | Multi-modal training support | Todo |
| Capability | MARL credit assignment | Todo |
| Capability | Training dataset generation from few-shot samples | Todo |

Citation

If you use AgentJet in your research, please cite:

@software{agentjet,
  title  = {AgentJet: A Cutting-Edge Multi-Agent Training Platform for Large Language Models},
  author = {The AgentJet Team},
  url    = {https://modelscope.github.io/AgentJet/},
  month  = {01},
  year   = {2026}
}

Next Steps


This project is under active development, and we need your help to make it shine!
[⭐ Star Us](https://github.com/modelscope/AgentJet) · [Report Bug](https://github.com/modelscope/AgentJet/issues) · [Request Feature](https://github.com/modelscope/AgentJet/issues)
Join the AgentJet DingTalk group to share your ideas!