D3/consulting
./request-spec.sh
// JOURNAL.INDEX · LAST UPDATED 2026.04.30

Field notes from
the standing bench.

Working notes, dated digests, and the occasional opinion. No newsletter, no ghostwriters. Every post is signed by the engineer who wrote it.

$ filter --by·11 posts
── migrations/2026_04_28_index_users.sql
BEGIN;
    CREATE INDEX CONCURRENTLY idx_users_email_lower
      ON users (lower(email))
      WHERE deleted_at IS NULL;
    -- the agent suggested this with deleted_at = false.
    -- it was wrong. boolean column doesn't exist.
    -- 7 minutes lost. caught in review.
COMMIT;
cover · prod-db-migrations · postgres 16
★ FEATURED · FIELD NOTE
[ 2026.04.28 ]·9 min

Why we still write the boring code by hand.

Coding agents passed the hype curve sometime last quarter. Eighty percent of teams use them. Acceptance rates jumped from 20% to 60%. So why does our bench still write migrations, indexes and SQL by hand? A short defense of the unsexy 30%.

~ Areg Sahakyan
Principal · Backend
#agents#engineering#opinion
read →
// ALL ENTRIES
NO. · CATEGORYDATETITLE / BLURBTAGS
01 · NEWS
2026.04.30

Cloudflare opens Project Think. Agents can now be paying customers.

New Agents SDK adds durable execution, sub-agents and sandboxed code execution. Agents can now register Cloudflare accounts and deploy code without a human in the loop.

#infra#agents
4 min
02 · NEWS
2026.04.24

DeepSeek V4 lands: open weights, hybrid attention, and a 1/50th price tag.

V4 Flash and V4 Pro hit Hugging Face. Roughly 90% of GPT-5.4 quality at a fraction of the cost. The economics question for self-hosting just shifted again.

#open-source#economics
3 min
03 · BRIEF
2026.04.22

How we benchmark a coding agent before letting it touch a client repo.

Six tests we run on every new model, including one we stole from a security review checklist. The short version: SWE-bench scores tell you almost nothing about whether an agent will be safe in your CI.

~ Lilit Petrosyan
#qa#security
6 min
04 · NEWS
2026.04.16

Claude Opus 4.7 ships: xhigh effort, /ultrareview, and a 3× lift on Rakuten-SWE.

New default in Claude Code. Anthropic also rolled the cyber-safeguards developed for the unreleased Mythos model into Opus 4.7.

#models
2 min
05 · FIELD NOTE
2026.04.11

Postgres at 100 GB: the boring playbook.

Partitioning, the autovacuum knobs nobody documents, and the one extension we ship by default. Notes from a recent migration off a managed plan that was costing more than two engineers.

~ Hovhannes Davtyan
#postgres#devops
7 min
06 · NEWS
2026.04.08

Meta debuts Muse Spark, and quietly walks back from open-source-first.

First model from Meta Superintelligence Labs ships proprietary. Capex guide for 2026 raised to $115–$135B. The Llama strategy as we knew it is over.

#strategy
3 min
07 · BRIEF
2026.04.04

Gemma 4 is here. Here is the actual cost of self-hosting it.

Apache 2.0, four sizes from 2.3B to 31B, and the 31B variant ranks #3 globally on Arena. We priced out the 8B variant on three GPU configurations, including one that fits under a desk.

~ Mariam Asatryan
#open-source#cost
5 min
08 · NEWS
2026.04.02

EU AI Act: Article 6 obligations for high-risk systems land August 2.

Four months out. Three things every team shipping into the EU should already have on a Jira board, and one obligation most teams misread.

#policy#compliance
2 min
09 · FIELD NOTE
2026.03.27

A QA bench is not a service. It is a posture.

On why "fractional QA" almost never works, and what we ask before we agree to embed a tester. Includes the four questions we use to know whether to staff a single QA or a pair.

~ Tigran Mkrtchyan
#qa#ops
8 min
10 · NEWS
2026.03.18

Gartner: 42% of orgs plan agent deployments. 40% of those will fail.

First Gartner agent-specific report ships. Headline figure is the failure rate, but the more interesting number is buried on page 14: median time-to-cancel.

#agents#enterprise
3 min
// OPTIONAL · NO SPAM
One email a month. The five things that actually mattered.

Curated by the bench, written by humans. Unsubscribe in one click.

$ echo "you@team.com" | d3 subscribe
// ~340 readers · last issue 2026.04.01