Welcome to my GitHub space! I'm an Experienced Software EngineerβI work across large-scale distributed and non-distributed production infrastructure, end-to-end from design through monitoring and reliability, with a focus on security, reliability, and scalability. I can build and have scaled product from MVP to full large-scale infrastructure. I automate what can be automated in between. I have end-to-end ownership across the full product lifecycle: design β build β deploy β operate β monitor β iterate.
- Full-stack β Write scalable code and system design; architecture design; deploy securely, automate everything in between; design for reliability and scalability
- System design & large-scale infrastructure β Scalable architectures, distributed systems, trade-off-driven design. From 0β1 and 1,000 to 100,000+ daily active user applications; production systems that stay up, critical applications with clear trade-offs.
- DevOps β CI/CD, observability, and automation that improve developer experience; incident response and on-call; cost and efficiency. Faster delivery, lower MTTR.
- Databases & data β Data storesβdesign, reliability, performance
- ML applications & AI-driven automation β Deploy production AI, MLOps, RAG, and data science pipelines; build and run agentic workflows; intelligent automation and AI-augmented delivery to sharpen developer productivity
- Across the stack β Public-facing, data-driven, ML, and non-public-facing or security applications; I've worked across them
- Greenfield & brownfield β Building from scratch or managing and evolving legacy applications
Areas of Expertise & Interest:
- DevOps & Platform Engineering
- Security-Focused Software Development
- Cloud-Native Engineering & System Design
- AI/ML Infrastructure & Agentic Workflows
- AI-Driven Automation & Agentic Workflows
π Quick Links:
- Portfolio β kushal.cv β Projects, experience, and technical journey
- Bio Links β bio.kushal.cv β Social profiles and important links
- Blog β blog.kushal.cv β Technical deep dives and engineering insights
- DocHub β thisiskushal31.github.io/dochub β Learning notes and resources
I'm dedicated to solving real-world engineering challenges. I'm always exploring modern tools and system designs, and I'm particularly interested in how we can leverage AI to build more robust and efficient systems.
What I can do β in one place. End-to-end ownership from design to production.
-
Infrastructure & scale: Run production systems that stay up. 125+ microservices, 300+ application instances, 4TB MySQL, hybrid cloud (GCP + AWS). Multi-environment (DEV, SIT, UAT, PROD), GKE, Nginx/Apache, load balancers, VPC, zero-downtime migrations (mydumper/myloader, gh-ost). Business impact: βΉ700 Crore+ revenue backbone, 7M users, 99%+ uptime, 4x traffic spike handling.
-
System design: Scalable and distributed system architectureβhigh availability, scalability patterns, disaster recovery, and trade-off-driven design. From application logic to infrastructure; built to grow.
-
Data-driven applications & pipelines: Build and operate data-driven pipelines that move 7M users' data (PII-handled, legal, anonymous) from MySQL to BigQuery/BigTable. Event-driven (Cloud Function β Dataflow β ETL) and time-driven (Composer DAGs). Kafka, Pub/Sub, DAG sync; consumed by business teams, Data Science, Martech, SCM, legacy panel. Infra only; DE owns warehousing and ETL logic.
-
Data science & AI infra: Support the βΉ700 Crore revenue marketing engine with vector DB (Qdrant), Vertex AI, embeddings, and recommendations. 50β60% cost saving on manual tasks. RAG/sentiment platform: semantic search over influencer content, decision support for brand teams; K8s, GPU pipeline, CI/CD β infra only, no application/RAG code.
-
AI-driven workflow automation & agentic workflows: Intelligent automation and agentic AI (e.g. MCP, n8n, LLM-backed tooling) for provisioning, lifecycle management, and operations. Use AI to sharpen delivery, reduce toil, and improve how systems are run.
-
Observability & reliability: Unified Prometheus + Grafana stack, real-time alerting, CI/CD integration, automated escalation. 76% MTTR reduction (30min β 7min) across 125+ microservices. So teams see issues first and fix faster.
-
IaC & automation: Terraform, Ansible, GitOps (ArgoCD). Reusable modules for GKE, Cloud SQL, VPCs, load balancers. Single source of truth; 40%+ faster deployments, 40%+ provisioning automated. Jenkins, GitLab CI, Slack.
-
Security & compliance: Zero-trust architecture, automated IAM minimization, Kubernetes RBAC, Trivy, Secrets Manager, SSO. 100% coverage across workloads. DPDP, ISO 27001, NIST, CIS, OWASP.
-
Legacy & modern together: Revenue management panel backbone so business teams run operations (banner, campaigns, Martech, logistics) without technical intervention. Legacy PHP monolith + internal load balancer + distributed K8s; one panel, infra-only ownership.
-
AdTech & high-traffic platforms: In-house AdTech for βΉ400+ Crore revenue, 93% cost reduction, 4x traffic spike handling. POS for 100+ retail stores, βΉ40+ Crores revenue, 99%+ uptime. Multi-cloud, Kafka, Redis, real-time processing.
- Languages: Python, JavaScript, TypeScript, C/C++, Bash/Shell, Node.js, React
- APIs & Backend: FastAPI, MCP (Model Context Protocol), GCP Cloud Functions, REST APIs
- Cloud & Infrastructure: GCP (GKE, GCR, GCS, Compute Engine, Cloud SQL, Cloud Functions, VPC, Load Balancer, WAF, Cloud NAT, BigQuery, Pub/Sub, GCP Composer), AWS (Route53, ALB), Docker, Kubernetes (GKE)
- Infrastructure as Code: Terraform, Ansible, GitOps (ArgoCD, Helm), reusable modules, multi-environment
- CI/CD & Automation: Jenkins (scripted pipelines, Slack), GitLab CI, GitHub Actions, n8n
- Databases & Storage: MySQL, Cloud SQL, MongoDB, Elasticsearch, Redis, Qdrant, Kafka, BigQuery
- Monitoring & Observability: Prometheus, Grafana, GCP Stackdriver, PagerDuty
- Security & Operations: Kubernetes RBAC, Trivy, Secrets Manager, SSO, IAM, Zero-Trust, Defense-in-Depth, WAF
- Fundamentals: Data Structures, Algorithms, System Design, Networking, Operating Systems
Real-world infrastructure β business impact first.
-
Purplle β Large-Scale E-Commerce Infrastructure
Mission-critical infra for βΉ700 Crore revenue and 7M users. 125+ microservices, 300+ instances, 4TB MySQL, 99%+ uptime, 4x traffic spike handling. Nginx distributed reverse proxy; Apache VM-based application; minimal-downtime migrations (mydumper/myloader, gh-ost). -
Purplle β Agentic RAG Sentiment Platform
RAG-based platform for influencer content: semantic search and decision support for brand teams. Infra only: K8s, GPU embedding pipeline, Qdrant, CI/CD. Faster time-to-insight; no application/RAG code owned by infra. -
Purplle β Data Science Infrastructure
Infra for the βΉ700 Crore revenue marketing engine. Data engineering and storefront serve brands and marketing; 50β60% cost saving on manual tasks. Vector DB (Qdrant), Vertex AI, CI/CD, network β infra only. -
Purplle β Data Engineering Infrastructure
Pipeline for the βΉ700 Crore revenue backbone. 7M users' data (PII-handled, legal, anonymous) flows to BigQuery/BigTable; consumed by business teams, Data Science, Martech, SCM, legacy panel. Event-driven and time-driven ingestion; zero-trust infra. -
Purplle β AdTech Platform
In-house AdTech: βΉ400+ Crore revenue, 93% cost reduction, 4x traffic spike handling, 7M users. 100+ production services, multi-cloud (AWS Route53 + GCP GKE). -
Purplle β POS Platform for 100+ Retail Stores
High-availability POS: 100+ stores, 500+ daily users, βΉ40+ Crores revenue, 99%+ uptime. Kafka + Redis; scalable infrastructure. -
Purplle β Unified Observability Stack
Centralized monitoring and alerting: 76% MTTR reduction (30min β 7min) across 125+ microservices. Prometheus + Grafana, real-time alerting, CI/CD integration. -
Purplle β Infrastructure as Code Platform
Terraform + Ansible, GitOps as single source of truth. 40%+ faster deployments, 40%+ provisioning automated; standardized IAC across 125+ microservices. -
Purplle β Security Hardening & Compliance
Zero-trust architecture, defense-in-depth across 125+ microservices. Kubernetes RBAC, Trivy, Secrets Manager, SSO, IP whitelisting. Compliance: DPDP, ISO 27001, NIST, CIS, OWASP. -
Purplle β Legacy Admin Panels Infrastructure
Backbone of the βΉ700 Crore revenue management panel. Business teams run operations (banner, campaigns, Martech, logistics) without technical intervention. Legacy PHP monolith + internal LB + distributed K8s; infra only.
π View detailed technical documentation β β Architecture, implementation details, and metrics.
These sit outside my day-job scopeβI do them in personal time as POCs and proof of ability: I also write and ship code. Ongoing open source plus side projects and templates.
| Project | What it is |
|---|---|
| Grid Platform β Infrastructure Management Platform | AI-first infrastructure management. Open-source, vendor-agnostic. Days β minutes setup; 60β80% cost reduction vs proprietary. |
| TrendSignal | AI agent: YouTube trend analyzer from a screenshot β topic, strength, who's winning, 5 viral hooks. MCP, FastAPI, GPT-4 Vision, Docker. |
| Agility | React + MongoDB task management. |
| SocialSplit | Node.js + Socket.io + React real-time chat. |
| User Authenticated JSON Viewer | Redis + MongoDB session management. |
| LinkSaver Chrome Extension | Chrome extension for tab management. |
| Modern React Portfolio Starter | React portfolio template with Markdown. |
| Configurable React Bio Link Starter | Linktree-style bio links template. |
| Configurable React Blog Starter | Blog template with Markdown support. |
- β Google Cloud Associate Cloud Engineer
- π― Preparing for Google Cloud Professional Cloud Architect
Sharing knowledge through deep dives and engineering insights:
- π Personal Blog β Cloud-native engineering, DevOps patterns, AI integration
- π Hashnode β Tech articles on cloud, DevOps, and AI
- βοΈ Medium β Technical stories and engineering insights
Comprehensive technical guides and references:
| Repository | Content |
|---|---|
| π’ Datastructures and Algorithms | DSA notes and solved problems |
| π Commands and Cheatsheets | Essential commands and tool references |
| π¦ Containerization Deep Dive | Docker, Kubernetes fundamentals |
| π DevOps Handbook | DevOps methodologies, CI/CD, IaC, observability |
| ποΈ Databases Deep Dive | Relational, NoSQL, analytical databases |
| π Networks Deep Dive | Networking from physical to cloud-native |
| ποΈ System Design Concepts | Patterns, components, and trade-offs |
| βοΈ Qwiklabs Learning Path Notes | Google Cloud certification preparation |
Interested in technical collaboration?
- πΌ Job opportunities in Platform Engineering, DevOps, or Cloud-Native roles (full-stack code + infrastructure)
- π Technical collaboration on infrastructure, automation, and platform engineering
- π Open source contributions and community collaboration
- π‘ Technical discussions on cloud architecture, security, or system design
- π Knowledge sharing and mentoring opportunities
Get in touch:
- πΌ LinkedIn β Professional updates and networking
- π¦ X (Twitter) β Quick thoughts and tech discussions
- π§ Email β Technical collaboration and discussions

