@rinaldofesta

@rinaldofesta https://rinaldofesta.com Research log on building and measuring the reliability layer for enterprise AI agents: MCP, memory, provenance. en Tessera, First Contact https://rinaldofesta.com/thoughts/tessera-first-contact.html Wed, 10 Jun 2026 00:00:00 +0000 https://rinaldofesta.com/thoughts/tessera-first-contact.html Claude Sonnet 4.6 answered every answerable question correctly and cited every source. On the one question where the only correct move was to refuse, it invented a business rule and committed anyway. Three runs out of three. The Answer It Should Have Refused to Give https://rinaldofesta.com/thoughts/the-answer-it-should-have-refused.html Sat, 30 May 2026 00:00:00 +0000 https://rinaldofesta.com/thoughts/the-answer-it-should-have-refused.html A recruiter at one of our customers types a single sentence: "start the match for deal X." Five seconds later the agent hands back a clean shortlist. The names look right, the ranking reasonable, the whole thing fluent. Everyone moves on. The Missing Cockpit https://rinaldofesta.com/thoughts/the-missing-cockpit.html Thu, 19 Mar 2026 00:00:00 +0000 https://rinaldofesta.com/thoughts/the-missing-cockpit.html I run six or seven projects at any given time. Each one has at least one AI agent running in a terminal: Claude Code, sometimes Aider, sometimes something else. They read files, write code, run tests, ask me questions. All at once. Memory in AI Agents https://rinaldofesta.com/thoughts/memory-in-ai-agents.html Fri, 19 Dec 2025 00:00:00 +0000 https://rinaldofesta.com/thoughts/memory-in-ai-agents.html Every conversation with an AI is a first date. The model has forgotten that yesterday you spent two hours debugging together, forgotten your dog's name, forgotten that you hate semicolons. Each session you start from a clean slate, and at first I found that charming, then merely annoying. Lately I think it hides something more serious. Is It a Matter of Taste? https://rinaldofesta.com/thoughts/is-it-a-matter-of-taste.html Sun, 12 Oct 2025 00:00:00 +0000 https://rinaldofesta.com/thoughts/is-it-a-matter-of-taste.html Two people sit with the same model, the same access, the same company data behind it. One walks away thrilled. One walks away convinced the thing is useless. I keep watching this happen, and the difference has nothing to do with how they phrase the prompt.