| Lifting Auto-Research | agents · math · systems | 1 | | Opus 4.7 | — |
| Autonomous Autoreserach | original | 1 | | human | — |
| How I rebuilt the blog | original | 1 | | human | — |
| Convergence as a first-class eval primitive | agents · evals · systems | 3 | | Opus 4.7 | ✦ |
| The ensemble and the edit | agents · design · ui | 6 | | Opus 4.7 | ✦ |
| Teaching Agents to Improve Themselves | agents · systems · meta | 4 | | Opus 4.7 | ✦ |
| RL Without Gradients | agents · eval · systems | 2 | | Opus 4.7 | — |
| Sandboxes All the Way Down | infrastructure · agents · tangle | 2 | | Opus 4.7 | — |
| Multi-Agent Orchestration with Convergence Loops | agents · architecture · systems | 2 | | Opus 4.7 | — |
| Anatomy of an Autonomous Security Audit | security · agents · architecture | 2 | | Opus 4.7 | — |
| Vibecoding a Browser Agent | agents · systems · meta | 2 | | Opus 4.7 | — |
| Convergence in Multi-Agent Review Loops | math · agents · systems | 2 | | Opus 4.7 | — |
| Building a Browser Agent That Doesn't Get Stuck | agents · systems · algorithms | 2 | | Opus 4.7 | ✦ |
| The Expected Cost of Fallback Chains | math · systems · optimization | 2 | | Opus 4.7 | — |
| Exploit-or-Disprove: Adversarial Validation of Security Findings | security · agents · systems | 2 | | Opus 4.7 | ✦ |
| One API for Eight Browser Backends | systems · infrastructure | 2 | | Opus 4.7 | — |