- What: Source archives for tweets by Karpathy, Teknium, Emollick, Gauri Gupta, Alex Prompter, Jerry Liu, Sarah Wooders, and others on LLM knowledge bases, agent harnesses, self-improving systems, and memory architecture
- Why: Persisting raw source material for pipeline extraction. 4 sources already processed by Rio's batch (karpathy-gist, kevin-gu, mintlify, hyunjin-kim) were excluded as duplicates.
- Status: all unprocessed, ready for overnight extraction pipeline

Pentagon-Agent: Leo <D35C9237-A739-432E-A3DB-20D52D1577A9>
1.3 KiB
| type | title | author | url | date | domain | format | status | tags |
|---|---|---|---|---|---|---|---|---|
| source | auto-harness: Self-Improving Agentic Systems with Auto-Evals | Gauri Gupta (@gauri__gupta) | https://x.com/gauri__gupta/status/2040251309782409489 | 2026-04-04 | ai-alignment | tweet | unprocessed | |
Content
Releasing auto-harness: an open source library for our self improving agentic systems with auto-evals. We got a lot of responses from people wanting to try the self-improving loop on their own agent. So we open-sourced our setup. Connect your agent and let it cook over the...
371 likes, 11 replies. Links to article about self-improving agentic systems.
Additional tweet (https://x.com/gauri__gupta/status/2040251170099524025): Link to article: "auto-harness: Self improving agentic systems with auto-evals (open-sourced!)" - "a self-improving loop that finds your agent's failures, turns them into evals, and fixes them." 1,100 likes, 15 replies.
Key Points
- auto-harness is an open-source library for self-improving agentic systems
- Implements a self-improving loop: find failures, turn them into evals, fix them
- Open-sourced in response to community demand
- Connect your own agent to the self-improving loop
- Automatic evaluation generation from observed failures
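
The loop summarized in the key points (find failures, turn them into evals, fix them) can be illustrated with a toy sketch. This is not the auto-harness API, which the archived tweets do not show. Every name here (`EvalCase`, `TableAgent`, `find_failures`, `memorize_fix`, `self_improve`) is hypothetical, and the "fix" step is a stand-in that simply memorizes the expected answer, where a real system would presumably edit prompts, tools, or code.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class EvalCase:
    """Hypothetical eval: a prompt and the answer we expect back."""
    prompt: str
    expected: str


class TableAgent:
    """Toy agent (assumption): answers from a lookup table, else 'unknown'."""

    def __init__(self, answers: Dict[str, str]) -> None:
        self.answers = dict(answers)

    def __call__(self, prompt: str) -> str:
        return self.answers.get(prompt, "unknown")


def find_failures(agent: Callable[[str], str], cases: List[EvalCase]) -> List[EvalCase]:
    """Step 1: return the cases the agent currently gets wrong."""
    return [c for c in cases if agent(c.prompt) != c.expected]


def memorize_fix(agent: TableAgent, failures: List[EvalCase]) -> TableAgent:
    """Step 3 stand-in: patch the table with the expected answers."""
    patched = dict(agent.answers)
    patched.update({f.prompt: f.expected for f in failures})
    return TableAgent(patched)


def self_improve(
    agent: TableAgent,
    observed: List[EvalCase],
    fix: Callable[[TableAgent, List[EvalCase]], TableAgent],
    max_rounds: int = 5,
) -> Tuple[TableAgent, List[EvalCase]]:
    """Run the find -> eval -> fix loop until no failures remain."""
    evals: List[EvalCase] = []
    for _ in range(max_rounds):
        failures = find_failures(agent, observed)
        if not failures:
            break
        # Step 2: every observed failure becomes a persistent eval.
        for failure in failures:
            if failure not in evals:
                evals.append(failure)
        candidate = fix(agent, failures)
        # Accept the candidate only if it passes all accumulated evals.
        if not find_failures(candidate, evals):
            agent = candidate
    return agent, evals


observed = [
    EvalCase("2+2", "4"),
    EvalCase("capital of France", "Paris"),
]
improved, evals = self_improve(TableAgent({"2+2": "4"}), observed, memorize_fix)
```

Keeping the accumulated `evals` list across rounds is the design choice worth noting: a candidate fix is accepted only if it still passes every earlier failure, so the eval set acts as a growing regression suite.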