Senior AI Engineer with 10 years of experience shipping production backend services. Currently at Amazon (AWS β Amazon Connect) building systems that execute automated call flows at high scale, while building a multi-tenant AI Infrastructure Platform on the side β load-tested at 12K+ RPS with p95 <150ms on Kubernetes.
Specialized in AI agents, LLM orchestration, RAG, multi-provider LLM gateways, and event-driven microservices. Background spanning logistics (Nuvocargo), e-commerce (Lovevery), and cloud infra at Amazon. Working remotely from π²π½ Mexico.
- π Currently shipping at Amazon Connect Flow β TypeScript Β· AWS CDK Β· Lambda
- π€ Building a modular AI Infrastructure Platform β 5 independent services, production-grade
- π± Going deep on Go, Temporal, Kafka, pgvector, and distributed systems
- π¬ Ask me about AI agents, RAG pipelines, LLM gateways, or scaling backend services
- π« Reach me: vnponce8@gmail.com
A modular platform that lets companies integrate production-grade AI without building infra from scratch. Each module is an independent service sharing auth, billing, and observability.
| Module | What it solves | Stack |
|---|---|---|
| M1 β AI Gateway | Multi-provider LLM routing, cost tracking, no vendor lock-in | Go Β· Envoy Β· Redis |
| M2 β RAG Platform | Hybrid semantic + BM25 search, 200K+ docs indexed, p95 <280ms | Python Β· FastAPI Β· pgvector Β· Kafka |
| M3 β Agent Orchestrator | Durable agent workflows with vector memory, 3M+ executions/day | Python Β· Temporal Β· pgvector Β· MCP |
| M4 β LLM Eval Platform | Continuous quality monitoring + drift detection, 2M+ evals/day | Python Β· LLM-as-judge Β· S3 |
| M5 β Event Mesh | Real-time AI inference on event streams, sub-second latency | Python Β· Kafka Β· Redis Streams |
Platform pitch: processing 12K+ RPS with p95 <150ms, validated with k6 load tests on Kubernetes simulating 20K+ concurrent users β using Go, Python, Kafka, Temporal, pgvector, and Envoy.
Languages
AI / ML
Backend & APIs
Cloud & Infrastructure
Observability & Testing
βοΈ From vnponce β open to freelance & collaboration on AI infrastructure projects



