doorss

Sunday, March 8, 2026 — 16 items

AI Agents, Alignment & Safety

Anthropic Research · www.anthropic.com
Anthropic shares lessons from working with dozens of teams building LLM agents, finding that the most successful implementations use simple, composable patterns rather than complex frameworks.
Anthropic Research · www.anthropic.com
Anthropic stress-tested 16 leading models in simulated corporate environments to identify potentially risky agentic behaviors, exploring how LLMs could act as insider threats before such issues arise in practice.
Anthropic Research · www.anthropic.com
Anthropic's alignment and interpretability teams present research on 'alignment audits': systematic investigations into whether AI models are pursuing hidden objectives, which the teams use to practice techniques for detecting deceptive behavior.
Anthropic Research · www.anthropic.com
Anthropic details how they've improved Claude's cybersecurity capabilities to help defenders detect and analyze threats, responding to growing evidence that frontier AI is useful for both attackers and defenders.

AI & the Economy / Workforce

Anthropic Research · www.anthropic.com
Anthropic introduces new metrics for understanding AI's economic impact, offering a detailed portrait of how Claude was used in November 2025, just before the release of Opus 4.5.
Anthropic Research · www.anthropic.com
An observational study finds AI can speed up some coding tasks by 80%, but raises important questions about whether increased productivity comes at the cost of skill development.
Hacker News · www.seangoedecke.com · comments
A software engineer reflects on the uncertain future of their profession in an era of rapidly advancing AI coding tools and agents.

AI Tools, Models & Research

Hacker News · github.com · comments
Andrej Karpathy releases Autoresearch, a framework where AI agents autonomously conduct research experiments on small-scale language model training using a single GPU.
Hacker News · unsloth.ai · comments
Unsloth provides a guide for running the latest Qwen 3.5 model locally, continuing the trend of making powerful open-weight models accessible on consumer hardware.
Hacker News · tropes.fyi · comments
A catalog of common writing patterns and stylistic tropes that LLMs tend to produce, helping users identify and avoid AI-generated text clichés.

Programming Languages, Tools & Infrastructure

Lobsters · medium.com · comments
Airtable's engineering team shares the story and rationale behind rewriting their core database in Rust, detailing the performance and reliability gains from the migration.
Hacker News · github.com · comments
A pull request in Astral's uv package manager proposes warning users that PyPy appears unmaintained, sparking discussion about the state of the alternative Python implementation.
Lobsters · fzakaria.com · comments
A critical but sympathetic look at Nix's reproducibility promises, arguing that while Nix doesn't achieve perfect reproducibility, its practical guarantees are still valuable.
Hacker News · cacm.acm.org · comments
A retrospective in Communications of the ACM examines how Docker containers have transformed software deployment and infrastructure over the past decade.

Systems, DevOps & Cloud

Hacker News · devblog.ecuadors.net · comments
Fresh 2026 benchmarks comparing cloud VM performance and pricing across major providers, offering practical guidance for cost-effective infrastructure decisions.
Hacker News · brandonvin.github.io · comments
A practitioner reflects on a decade of production deployment lessons, covering the evolution from manual processes to modern CI/CD practices and what actually matters for reliability.