gtm application

realtime voice + gtm infrastructure. turning calls into a continuously learning outbound system.

iit patna · maths and computing

air 5095 · jee advanced 2023

cracked one of the toughest exams in the world at 17

[ 01 / outbound intelligence ]

one thing i'd immediately explore at retell is turning voice interactions into a continuously learning gtm system instead of treating calls like isolated events.

most outbound systems today are still primitive: static sequences, fixed scripts, generic lead scoring, disconnected crm updates.

but voice agents generate an insane amount of high-signal conversational data.

  • objection patterns
  • buying intent
  • tone shifts
  • urgency signals
  • dropoff moments
  • pricing reactions
  • industry-specific pain points

i'd build infrastructure that captures and operationalizes that data in realtime.

retell agents run thousands of outbound calls. instead of just logging transcripts, the system should cluster objections dynamically, identify conversion-driving phrases, rerank lead quality continuously, generate new outbound angles automatically, recommend follow-up timing, and create high-performing script variants from successful calls.

basically: the outbound system itself becomes self-improving. not just ai making calls. ai improving how calls happen.

system spec · adaptive outbound intelligence

[ 02 / intro ]

retell feels interesting because it sits at the intersection of realtime systems, ai infra, human behavior, distribution, and operational scale.

which is basically the kind of environment i naturally enjoy operating in.

i'm not someone who wants to maintain static funnels or run repetitive growth playbooks.

i like building systems that:

  • test fast
  • learn fast
  • adapt automatically
  • create leverage

skills stack

python, sql, typescript, flutter

usually move between

  • backend systems
  • ai infra
  • automation
  • product
  • gtm
  • growth tooling

without really separating them.

currently

  • live content distribution engines at trxnd · shipping
  • founder's office at saturn labs · bangalore
  • founder's office at aevis · remote

[ 03 / gtm infrastructure ]

  • ai callback systems from inbound website visitors
  • realtime lead qualification agents
  • dynamic demo booking flows
  • vertical-specific outbound agents
  • gifting triggers based on conversation sentiment
  • automated pilot onboarding
  • marketplace scraping + enrichment pipelines
  • conversational retargeting systems

example:

someone visits pricing page 3 times. retell agent calls within minutes: references company context, understands use-case, qualifies budget/timeline, books meeting automatically, pushes structured intel into crm.

not chatbot-level automation. actual operational automation.

[ 04 / distribution ]

a lot of this thinking also came from building trxnd.

trxnd started from a simple belief: most products don't fail at the build. they fail at distribution.

so i built systems around:

  • content engines
  • automated outreach
  • warm lead pipelines
  • orchestrated conversations
  • narrative systems
  • reddit intelligence
  • campaign infrastructure

because gtm to me feels less like marketing and more like systems engineering.

[ 05 / selected builds ]

things i've built

most projects sit somewhere between ai infra, realtime systems, automation, operational leverage, and gtm infrastructure.

IronClaw

multi-agent Android automation system. Controls phones the way a human would: reads the accessibility tree, decides what to tap next, no APIs needed. Automates job applications, handles CAPTCHAs by handing off to the user, accepts commands via voice/PDF/Telegram, supports 15+ languages. Built on OpenClaw + DroidRun + FastAPI + React.

openclaw · droidrun · fastapi · react

SpeakLingo

real-time voice-cloning video translator. Takes a voice sample, then during a live call transcribes, translates (Gemini), and speaks back in your cloned voice via Qwen3-TTS. Sub-545ms end-to-end. WebRTC + Redis Pub/Sub.

webrtc · redis · gemini · qwen3-tts

Mini-vLLM

LLM inference engine built from scratch. Dynamic batching (20ms window), KV-cache rewrite from O(N²) to O(1) per token, HTTP + gRPC, Prometheus/Grafana observability, distributed round-robin router.

dynamic batching · kv-cache · grpc · observability

CiteAgent

agentic RAG pipeline for causal extraction from conversational data. LangGraph orchestration, adaptive reranking, LLM judges. F1 = 0.94. Placed 5th at Inter-IIT Tech Meet 14.0.

langgraph · reranking · llm judges

SoulScript

AI mental wellness platform. Real-time conversational avatar via Gemini audio APIs, emotion-based music generation via Lyria, RAG-powered Persona Dashboard. Scaled to 1,000+ concurrent users.

gemini audio · lyria · rag · 1000+ concurrent

[ 06 / operational systems ]

at saturn labs, hundreds of hours of robotics training data were being manually qc'd every day.

people were checking:

  • hand visibility
  • lighting consistency
  • frame quality
  • object intersections

manually.

so i built and integrated an automated qc pipeline overnight. the next day the team stopped wasting engineering time on repetitive review work.

that's usually how i operate.

[ 07 / closing ]

i think the most exciting thing about retell is that voice is still massively under-optimized.

most companies still treat calls like support overhead, sales overhead, operational overhead.

when really they're one of the richest behavioral data layers inside a business.

and whoever operationalizes that layer best probably wins. that problem space feels very fun to me.

gtm draft · voice systems anglev0.1