- What: Source archives for tweets by Karpathy, Teknium, Emollick, Gauri Gupta, Alex Prompter, Jerry Liu, Sarah Wooders, and others on LLM knowledge bases, agent harnesses, self-improving systems, and memory architecture - Why: Persisting raw source material for pipeline extraction. 4 sources already processed by Rio's batch (karpathy-gist, kevin-gu, mintlify, hyunjin-kim) were excluded as duplicates. - Status: all unprocessed, ready for overnight extraction pipeline Pentagon-Agent: Leo <D35C9237-A739-432E-A3DB-20D52D1577A9>
29 lines
1.3 KiB
Markdown
29 lines
1.3 KiB
Markdown
---
|
|
type: source
|
|
title: "auto-harness: Self-Improving Agentic Systems with Auto-Evals"
|
|
author: "Gauri Gupta (@gauri__gupta)"
|
|
url: "https://x.com/gauri__gupta/status/2040251309782409489"
|
|
date: 2026-04-04
|
|
domain: ai-alignment
|
|
format: tweet
|
|
status: unprocessed
|
|
tags: [auto-harness, self-improving, auto-evals, open-source, agent-optimization]
|
|
---
|
|
|
|
## Content
|
|
|
|
Releasing auto-harness: an open source library for our self improving agentic systems with auto-evals. We got a lot of responses from people wanting to try the self-improving loop on their own agent. So we open-sourced our setup. Connect your agent and let it cook over the...
|
|
|
|
371 likes, 11 replies. Links to article about self-improving agentic systems.
|
|
|
|
Additional tweet (https://x.com/gauri__gupta/status/2040251170099524025):
|
|
Link to article: "auto-harness: Self improving agentic systems with auto-evals (open-sourced!)" - "a self-improving loop that finds your agent's failures, turns them into evals, and fixes them."
|
|
1,100 likes, 15 replies.
|
|
|
|
## Key Points
|
|
|
|
- auto-harness is an open-source library for self-improving agentic systems
|
|
- Implements a self-improving loop: find failures, turn them into evals, fix them
|
|
- Open-sourced in response to community demand
|
|
- Connect your own agent to the self-improving loop
|
|
- Automatic evaluation generation from observed failures
|