realtime voice + gtm infrastructure. turning calls into a continuously learning outbound system.
iit patna · maths and computing
air 5095 · jee advanced 2023
cracked one of the toughest exams in the world at 17
[ 01 / outbound intelligence ]
one thing i'd immediately explore at retell is turning voice interactions into a continuously learning gtm system instead of treating calls like isolated events.
most outbound systems today are still primitive: static sequences, fixed scripts, generic lead scoring, disconnected crm updates.
but voice agents generate an insane amount of high-signal conversational data.
objection patterns
buying intent
tone shifts
urgency signals
dropoff moments
pricing reactions
industry-specific pain points
i'd build infrastructure that captures and operationalizes that data in realtime.
retell agents run thousands of outbound calls. instead of just logging transcripts, the system should cluster objections dynamically, identify conversion-driving phrases, rerank lead quality continuously, generate new outbound angles automatically, recommend follow-up timing, and create high-performing script variants from successful calls.
basically: the outbound system itself becomes self-improving. not just ai making calls. ai improving how calls happen.
system spec · adaptive outbound intelligence
[ 02 / intro ]
retell feels interesting because it sits at the intersection of realtime systems, ai infra, human behavior, distribution, and operational scale.
which is basically the kind of environment i naturally enjoy operating in.
i'm not someone who wants to maintain static funnels or run repetitive growth playbooks.
i like building systems that:
test fast
learn fast
adapt automatically
create leverage
skills stack
python, sql, typescript, flutter
usually move between
backend systems
ai infra
automation
product
gtm
growth tooling
without really separating them.
currently
live content distribution engines at trxnd · shipping
most projects sit somewhere between ai infra, realtime systems, automation, operational leverage, and gtm infrastructure.
IronClaw
multi-agent Android automation system. Controls phones the way a human would: reads the accessibility tree, decides what to tap next, no APIs needed. Automates job applications, handles CAPTCHAs by handing off to the user, accepts commands via voice/PDF/Telegram, supports 15+ languages. Built on OpenClaw + DroidRun + FastAPI + React.
real-time voice-cloning video translator. Takes a voice sample, then during a live call transcribes, translates (Gemini), and speaks back in your cloned voice via Qwen3-TTS. Sub-545ms end-to-end. WebRTC + Redis Pub/Sub.
agentic RAG pipeline for causal extraction from conversational data. LangGraph orchestration, adaptive reranking, LLM judges. F1 = 0.94. Placed 5th at Inter-IIT Tech Meet 14.0.
langgraph · reranking · llm judges
SoulScript
AI mental wellness platform. Real-time conversational avatar via Gemini audio APIs, emotion-based music generation via Lyria, RAG-powered Persona Dashboard. Scaled to 1,000+ concurrent users.