runable execution agents and founder's office work. i want to help compound trust, not silent churn.
iit patna · maths and computing
air 5095 · jee advanced 2023
cracked one of the toughest exams in the world at 17
[ 00 / why i even cared enough to write this ]
i spent hours on runable. the product, reviews, discussions.
the ambition is insane. general-purpose execution agents are the future.
may 2025 incorporation to 750k+ users, benchmark dominance, real traction.
you're at the stage where trust compounds or silent churn compounds.
that's why i wrote this.
[ 01 / what i think is broken right now ]
credit pricing feels unpredictable
users shouldn't guess what a run costs. predicted credit burn before execution, free retries on formatting failures, transparent breakdowns per step.
first-pass quality inconsistency
operational safety nets matter. formatting retries should be free. auto-repair on broken outputs. rollback when an agent run goes wrong.
support latency kills momentum
ai support triage first. fast escalation paths. auto-refund on clear failures. issue clustering so the same bug stops hitting different users silently.
no enterprise trust layer
trust center, compliance docs, SOC2 roadmap, DPDP alignment, data residency options. enterprise buyers need this before they commit.
no real india wedge yet
whatsapp is the wedge US players won't prioritize. india runs on it. runable should own that distribution layer before someone else does.
[ 02 / what i'd build ]
runable on whatsapp
distribution infrastructure, not a feature bolt-on. trigger agents from whatsapp, get status updates back, run workflows where india already lives.
persistent execution agents
memory, continuity, long-running tasks. agents that remember context across sessions instead of starting from zero every time.
ironclaw connection. when an automation breaks, detect it, retry with alternate paths, escalate only when human judgment is actually needed.
indic-first execution layer
hindi, tamil, telugu, and more. not translation as an afterthought. execution that works in the languages people actually operate in.
organization-wide persistent memory
session-based AI forgets workflows, preferences, GTM learnings.
build an org-wide persistent memory layer that learns company writing, conversions, brand voice, failed approaches.
it becomes the operational brain, not just a tool you prompt. real moat.
system spec · org-wide persistent memory
[ 03 / how i operate ]
at saturn labs, hundreds of hours of robotics training data were being manually qc'd every day.
people were checking:
hand visibility
lighting consistency
frame quality
object intersections
manually.
so i built and integrated an automated qc pipeline overnight. the next day the team stopped wasting engineering time on repetitive review work.
that's usually how i operate.
skills stack
python, typescript, sql, flutter
usually move between
ai infra
automation
product
operations
gtm
without really separating them.
selected builds
IronClaw
multi-agent Android automation system. Controls phones the way a human would: reads the accessibility tree, decides what to tap next, no APIs needed. Automates job applications, handles CAPTCHAs by handing off to the user, accepts commands via voice/PDF/Telegram, supports 15+ languages.
real-time voice-cloning video translator. Takes a voice sample, then during a live call transcribes, translates (Gemini), and speaks back in your cloned voice via Qwen3-TTS. Sub-545ms end-to-end.
LLM inference engine built from scratch. Dynamic batching, KV-cache rewrite from O(N²) to O(1) per token, HTTP + gRPC, Prometheus/Grafana observability.
agentic RAG pipeline for causal extraction from conversational data. LangGraph orchestration, adaptive reranking, LLM judges. F1 = 0.94. Placed 5th at Inter-IIT Tech Meet 14.0.
langgraph · reranking · llm judges
SoulScript
AI mental wellness platform. Real-time conversational avatar via Gemini audio APIs, emotion-based music generation via Lyria, RAG-powered Persona Dashboard. Scaled to 1,000+ concurrent users.