An AI Benchmark That Tests Real Coding Workflows
Developers face a real problem when picking a coding model or agent: synthetic benchmarks look great but do not predict actual project work. The question is no longer whether models can score well, but whether those scores translate into real workflows.

A leaderboard for DumbQuestion.ai sounds simple: track the most-asked questions, display them. Done. Except people never ask the same question the same way twice. I was curious about how creative users…
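The post doesn't show its solution here, but the shape of the problem is familiar: to count "the same question" across different phrasings, you have to group by meaning rather than by exact string. A minimal sketch of one common approach, assuming an embedding model (the sentence-transformers model name, the 0.8 threshold, and the greedy clustering are all illustrative choices, not the actual implementation):

```python
# Sketch: rank rephrased questions for a leaderboard by grouping
# semantically similar ones. Greedy clustering over normalized
# embeddings; the model and threshold are assumptions to tune.
from collections import Counter

import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")


def leaderboard(questions: list[str], threshold: float = 0.8) -> list[tuple[str, int]]:
    """Cluster similar questions greedily, then rank clusters by size."""
    embeddings = model.encode(questions, normalize_embeddings=True)
    representatives: list[np.ndarray] = []  # one embedding per cluster
    labels: list[str] = []                  # canonical phrasing per cluster
    counts: Counter[str] = Counter()
    for question, emb in zip(questions, embeddings):
        # On normalized vectors, cosine similarity is just a dot product.
        sims = [float(emb @ rep) for rep in representatives]
        if sims and max(sims) >= threshold:
            counts[labels[int(np.argmax(sims))]] += 1
        else:
            representatives.append(emb)
            labels.append(question)
            counts[question] += 1
    return counts.most_common()


print(leaderboard([
    "Is a tomato a fruit?",
    "tomatoes... fruit or vegetable??",
    "Why is the sky blue?",
]))
```

The first phrasing seen becomes the cluster's display label, which is crude but cheap; a production version would also want to pick the most representative phrasing and re-cluster periodically as thresholds drift.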
Building DumbQuestion.ai wasn't just about choosing the right LLM and calibrating personas. Once those were working, I hit a series of fun technical problems that reminded me why I actually enjoy software development.
"Let the flow guide me" seemed like a fun way to build a side project. That lasted about 10 minutes. Turns out, even side projects benefit from structure. Especially when you're using AI coding agents
During a typical Friday afternoon team meeting, we naturally spent our time squatting .ai domains... for recreational purposes, of course. Someone asked a dumb question, so I looked it up, and suddenly I…