Cadenza.Agent 1.0.15

Requires NuGet 5.0 or higher.

SDK
File-Based Apps

<Sdk Name="Cadenza.Agent" Version="1.0.15" />

For projects that support Sdk, copy this XML node into the project file to reference the package.

#:sdk Cadenza.Agent@1.0.15

#:sdk directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

Cadenza.Agent

Read this in 한국어.

Cadenza.Agent is the AI-agent variant of the Cadenza SDK family — a single-file scripting MSBuild SDK that wraps Microsoft.Extensions.AI (the vendor-neutral .NET LLM abstraction) and exposes an OpenAI-compatible Chat Completion HTTP endpoint out of the box. That means tools like Codex, Aider, Continue, and Cursor can talk to your agent as if it were OpenAI — just point them at http://localhost:8080/v1.

Quick start

Create an agent.cs file:

#!/usr/bin/env dotnet run
#:sdk Cadenza.Agent@1.0.15

SystemPrompt("You are a helpful assistant with filesystem access.");

Tool("read_file", "Read a UTF-8 text file from disk",
    (string path) => ReadText(path));

Tool("list_files", "List files matching a glob pattern",
    (string pattern) => Glob(pattern).ToArray());

UseOllama("llama3.2");      // or UseOpenAi("gpt-4o-mini")
                            //    UseAnthropic("claude-3-5-sonnet-latest")
                            //    UseAzureOpenAi(endpoint, deployment)

await Run();                // boots the OpenAI-compatible HTTP server
                            // on http://localhost:8080

Run iteratively:

dotnet run agent.cs

Point any OpenAI-compatible client at it:

export OPENAI_BASE_URL=http://localhost:8080/v1
export OPENAI_API_KEY=any-non-empty-string
codex      # or aider, continue, cursor, sgpt, …

Or use the REPL instead of the HTTP server:

UseOllama("llama3.2");
await ChatLoop();           // interactive console — no HTTP server

Publish as a self-contained single binary:

dotnet publish agent.cs -r linux-x64 -c Release

What you get

Helper	Purpose
`Tool(name, description, delegate)`	Register a callable tool. Parameter names/types become the JSON schema; the return value is the tool result.
`SystemPrompt(text)`	Override the default system prompt.
`UseOllama(model, baseUrl?)`	Use a local Ollama daemon as the LLM.
`UseOpenAi(model, apiKey?)`	Use OpenAI. Falls back to `OPENAI_API_KEY` env var.
`UseAnthropic(model, apiKey?)`	Use Anthropic via its OpenAI-compatible endpoint. Falls back to `ANTHROPIC_API_KEY`.
`UseAzureOpenAi(endpoint, deployment, apiKey?)`	Use Azure OpenAI. Falls back to `AZURE_OPENAI_API_KEY`.
`UseChatClient(IChatClient)`	Plug in any custom `IChatClient` — gets `UseFunctionInvocation()` wired automatically.
`Run()`	Start the OpenAI-compatible HTTP server (default `localhost:8080`).
`ChatLoop()`	Start an interactive console REPL.
`Reply(prompt)`	One-shot non-interactive call. Returns the assistant's text.
`Port`, `HostName`, `ServedModelName`	Configure the HTTP surface before calling `Run()`.

Endpoints exposed by `Run()`

POST /v1/chat/completions — full OpenAI Chat Completion shape; supports stream=true (SSE). Used by Aider, Continue, Cursor, GitHub Copilot BYOK, sgpt, and almost every other OpenAI-compatible client.
POST /v1/responses — OpenAI Responses API (SSE streaming). Used by OpenAI Codex CLI, which removed Chat Completion support in Feb 2026 and now requires this wire format exclusively.
GET /v1/models — single-entry list with ServedModelName as the id.
GET /health — liveness probe.

Chat Completion vs Responses — what's different

Both endpoints are backed by the same IChatClient you configured with UseOllama / UseOpenAi / etc. They differ in how tools are handled:

Path	Server-side `Tool(...)` registrations	Client-supplied tools	Use case
`/v1/chat/completions`	Auto-invoked via `UseFunctionInvocation` middleware. Tools you register execute in the agent process.	Not expected — Chat Completion clients pass `tools` only when they want server-side execution.	Aider / Continue / Cursor / Copilot
`/v1/responses`	Not exposed to the model.	Streamed back as `function_call` items — the client (e.g. Codex) owns execution.	Codex CLI

So if you register Tool("read_file", ...), it works for Aider/Continue/Cursor but not for Codex. For Codex, the model adapter pattern is enough — Codex brings shell / apply_patch / update_plan and executes them itself.

Connecting Codex CLI

Drop this into ~/.codex/config.toml (or %USERPROFILE%\.codex\config.toml on Windows):

model_provider = "cadenza"
model          = "cadenza-agent"

[model_providers.cadenza]
name     = "Cadenza.Agent local"
base_url = "http://localhost:8080/v1"
wire_api = "responses"
env_key  = "CADENZA_API_KEY"
stream_idle_timeout_ms = 300000

Then set any non-empty value for CADENZA_API_KEY and run Codex.

Why an HTTP frontend?

Most existing coding agents and chat UIs already speak an OpenAI wire format. Adopting both Chat Completion and Responses instead of a bespoke protocol means:

Zero glue code to integrate with Codex / Aider / Continue / Cursor / sgpt / any LangChain client.
Free streaming via SSE — same shape OpenAI uses.
Multi-process by default — your agent process is decoupled from the editor.
Function-calling is auto-wired through Microsoft.Extensions.AI's UseFunctionInvocation middleware, so any tool you register is callable by every supported model.

See the project repository for the full specification, sample agents, and security notes.

There are no supported framework assets in this package.

Learn more about Target Frameworks and .NET Standard.

This package has no dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
1.0.15	141	5/28/2026
1.0.14	119	5/26/2026
1.0.13	120	5/26/2026
1.0.12	128	5/26/2026