Principal GenAI Architect | Technical Leader | Builder
Santa Clara, CA | LinkedIn | vivekrg13@gmail.com
Technical leader with 10+ years building and deploying AI/ML solutions at scale. Currently leading a team of AI/ML architects at AWS, serving as embedded technical advisor to strategic customers building production GenAI systems
152,000+ total readers across 27 technical publications on the official AWS Machine Learning Blog.
| Title | Date |
|---|---|
| Build Agentic Workflows with OpenAI GPT & OSS on Amazon SageMaker AI and Amazon Bedrock AgentCore | 2025 |
| Use Amazon Bedrock Tooling with Amazon SageMaker JumpStart Models | Dec 4, 2024 |
| Event | Topic Area | Year |
|---|---|---|
| AWS re:Invent | SageMaker Inference & GenAI | 2023, 2024 |
| NVIDIA GTC | LLM Serving Optimization | 2024 |
| Intel MLCon | ML Inference Performance | 2024 |
| Arize Observe | Model Monitoring & Deployment | 2024 |
| Retrivex | RAG Architectures | 2024 |
- LLM Serving & Optimization β vLLM, KV caching, speculative decoding, disaggregated inference, intelligent routing
- Agentic AI β LangChain, multi-agent orchestration, tool use, Bedrock AgentCore
- RAG Architectures β OpenSearch, FAISS, Voyage AI embeddings, production retrieval pipelines
- Infrastructure β Kubernetes, GPU clusters (H100/H200/A100), Inferentia/Trainium, distributed systems
- MLOps β CI/CD for ML, model monitoring, deployment guardrails, auto-scaling
- Product & Strategy β Zero-to-one product launches, customer-driven roadmap development, technical GTM
π§ vivekrg13@gmail.com
π LinkedIn

