Two billion deskless workers have no HR system. Not because the technology doesn't exist — because every HR system was built for the person sitting at a desk, not the one standing at a loom.
The Autonomous HR runs on WhatsApp and a phone call. No app. No portal. No HR department required.
The deskless workforce has existed for all of human history. The gap in HR tooling has existed for decades. What changed is not the problem — it's the infrastructure available to solve it.
These aren't design preferences. They are architectural constraints derived from the reality of the workforce this system serves. Every component satisfies all three.
The customer already has GCP. The architecture keeps GCP for everything it does best — managed identity, eventing, serverless compute, and LLM inference — and surgically replaces the expensive AI inference layers with open-source equivalents where latency and accuracy guarantees hold.
| Component | Decision | Cost saving | SLA risk |
|---|---|---|---|
| STT | Whisper OSS on Cloud Run OSS | 83% | Low — fallback to managed |
| Translation | NLLB-200 / IndicTrans2 OSS | ~100% | Low — better low-resource |
| LLM reasoning | Gemini 1.5 Flash GCP | Baseline | None — managed SLA |
| Vector store | pgvector on Supabase OSS | ~60% | Low — free tier adequate |
| Compute | Cloud Run scale-to-zero GCP | 90% vs GKE idle | None — 99.95% SLA |
A worker sends a voice note. The system transcribes it, detects the language, classifies the intent, checks the policy, validates the balance, makes the decision, writes the record, and sends the confirmation — without a single human in the loop.
50 employees. 500 interactions per month. Full leave management, policy governance, multilingual voice, audit trail, HITL escalation. Running on GCP + strategic OSS. The total monthly infrastructure cost is less than a single hour of a contract HR assistant.
| Component | Volume | Monthly cost |
|---|---|---|
| Gemini 1.5 Flash | ~500K tokens/month | $0.04 |
| Whisper STT (Cloud Run GPU spot) | 50 calls × 45s avg | $0.02 |
| Cloud Run compute | ~200 invocations, warm instance | $5.20 |
| Firestore | ~20K reads / 5K writes | $0.00 |
| Cloud Pub/Sub | ~5,000 messages | $0.00 |
| Cloud Functions | ~1,000 invocations | $0.00 |
| Voice gateway (SIP) | 50 calls × 45s avg | $1.30 |
| WhatsApp Business API | 500 conversations | $0.00 |
| Supabase pgvector (RAG store) | ~500 policy chunks | $0.00 |
| NLLB-200 translation (bundled) | ~300 translations | $0.00 |
| Total monthly — 50 employees | ~$6.56 |
Leave management is the highest-frequency HR interaction and the sharpest test of the voice + policy + autonomous decision stack. It ships first. Every subsequent phase inherits the same channel, the same policy engine, and the same audit architecture.