Engineering - Geta.Team Blog

Engineering

Why Most Computer-Use Agents Fail, and What Makes One Reliable

The benchmarks tell a confusing story. On OSWorld, the standard test for agents that operate a real computer, the best systems now score around 82% while well-known names sit at 22 to 38%. Stanford's AI Index shows the field as a whole jumping from 12% to 66%

Future of Work

You Are Not Writing Code Anymore. You Are Managing a Team of Agents.

There is a quiet identity crisis happening in engineering right now, and the data backs it up. CIO reported that the engineer of 2026 spends less time writing foundational code and more time orchestrating a portfolio of agents, components, and external services. If you write software for a living, you

Engineering

The Hidden Cost of 'Cheap' AI Agents: What Hides Behind Usage-Based Pricing

There is a recurring scene that plays out in our customer onboarding calls. The user just got off another vendor's tool. They show us the bill. They tell us the price was supposed to be a flat monthly tier. The actual invoice is two or three times the

Engineering

Why 'Hiring' an AI Agent Needs a Completely Different Onboarding Stack From Deploying One

There are two ways to put an AI worker into production. Most teams call them the same thing. They are not. The first is the deploy model. You spin up an agent, point it at an LLM, wire in some tools through MCP, give it a system prompt, and hand

Changelog

v2.2.1 + v2.2.2: New Employees Now Onboard You, Chat-Wipe Killed, Attachments Glow-Up

Two patches in one day. v2.2.1 went out this morning with a chat stability fix and a set of attachment upgrades. v2.2.2 followed a few hours later with one new feature worth its own paragraph and one critical bug fix that was killing phone numbers on

Product

v2.2.0: Your AI Employees Now Have a Heartbeat (And Tap You On The Shoulder When It Matters)

For two years now, every conversation about AI employees has been bounded by a quiet assumption. You ask, they answer. You assign, they execute. The agent is a coworker, yes, but a coworker who only ever speaks when you speak first. v2.2.0 changes that. This release is bigger

Product

Closing the Last Mile: AI Employees Now Ship Live Websites in One Command

You ask your marketing strategist to spin up a landing page for next week's product launch. She drafts the copy, generates the hero image, writes the HTML and CSS, and hands you a folder of

Engineering

Why Agent Skill Composition Is the New API Design (And Most Frameworks Get It Wrong)

Most production agent platforms today ship the same way: a single big tool catalog, often dozens of tool definitions, each one mapping to a discrete thing the agent can do. Send email. Search calendar. Query database. Update CRM. Generate report. The list grows as the product grows, and at some

Changelog

The Mystery White Screen Was Chrome Translate Fighting React. We Killed It. (v2.1.15 Is Out)

A few clients had been reporting an intermittent blank page in their dashboards over the last week. The kind of bug you hate: random, no reliable reproduction, dev console shows a single ugly stack trace from minified React internals, nothing in our logs. Refresh and the page comes back. Until

Changelog

What We Shipped: Geta.Team v2.1.11. Voice Calls Now Reply in a Second, Plus Calendar Tools Mid-Call

When you pick up the phone to your AI employee and there's a three-second silence before they reply, the illusion breaks. You're no longer talking to a colleague. You're talking to a chatbot with a voice. The pause tells you everything you need to

Changelog

Geta.Team v2.1.9 — A New Admin Settings Panel, and Two Bugs Fixed

What's new: a small but practical release. One new feature you'll see in the sidebar, two bugs squashed, and a tidier auth page. New: Admin Settings, where you set your own daily email limit. Up to now, every Geta.Team instance was capped at 25 outbound emails per

Engineering

The 'Agent Identity' Problem: Why Two Copies of the Same AI Employee Behave Differently After 30 Days

A customer asked us a fair question last month. They run two near-identical e-commerce brands, same SKU catalog, same support workflow, same Shopify setup. They wanted to clone their Customer Success employee — call her Enide-A — and stand up Enide-B for the second brand. "Same employee, two stores. Just duplicate