Skip to content
View aqibbangash's full-sized avatar
πŸŒοΈβ€β™‚οΈ
chilling
πŸŒοΈβ€β™‚οΈ
chilling

Block or report aqibbangash

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aqibbangash/README.md
typing banner

Turning hard AI problems into reliable products β€” from green-field architecture to teams shipping at scale across PK Β· USA Β· UK Β· UAE.

LinkedIn Email GitHub Profile views


πŸ‘‹ About

I'm an engineering leader with 10+ years shipping full-stack systems and the last several leading AI transformation for ambitious products. I move comfortably from a whiteboard system diagram to a Rust async worker to a multi-tenant Kubernetes deploy β€” and I most enjoy the messy middle where research-grade models have to survive production traffic.

Today I lead engineering at Clerint Media, building a real-time broadcast intelligence platform that analyzes dozens of live TV streams concurrently β€” face recognition, OCR, and speech-to-text into a searchable, alertable evidence layer.


πŸš€ What I Help Teams Do

🧠 AI products from zero Take a model that works in a notebook and turn it into a pipeline that survives 30 concurrent live streams, restarts, GPU loss, and 3 AM pages.
πŸ—οΈ Solution architecture audits Walk into an existing system and find the 20% of design driving 80% of incidents and cloud bill. Output: a phased plan, not a 60-slide deck.
🧭 Fractional CTO for early-stage Pick the stack, hire the first engineers, ship the first version, and stay long enough to make sure it doesn't collapse under its own weight.
πŸ” AI transformation for established orgs Wire LLMs, RAG, vector search, and computer vision into workflows that move real metrics β€” not demo metrics.

πŸ›°οΈ Selected Work

Clerint Media β€” Real-time Broadcast Intelligence (currently leading)

A multi-tenant SaaS analyzing 30+ live HLS/RTSP TV channels in parallel, on a single-node Kubernetes cluster.

  • Rust ML worker (tonic gRPC + tokio) supervises an FFmpeg frame + audio pipeline per channel, fanning frames out via broadcast channels to OCR / face / speech workers.
  • Face recognition with SCRFD + ArcFace ONNX models; embeddings stored in pgvector for sub-second identity search across hours of footage.
  • OCR via PaddleOCR HTTP service; speech-to-text via Deepgram WebSocket streams.
  • NestJS orchestrator drains gRPC events β†’ Prisma writes + Socket.io fan-out + BullMQ stories/alerts.
  • React 19 + Vite + Tailwind 4 SPA with a live DVR timeline and custom clip range slider.
  • Plain-YAML Kubernetes β€” two parallel deployments (main + MOIB) on bare-metal.

Urdu STT Benchmark β€” open source

github.com/aqibbangash/urdu-stt-bench CPU-only benchmark harness for offline Urdu speech-to-text. faster-whisper / CTranslate2 + Streamlit UI + Docker. A decision-support tool for picking the right STT model for low-resource languages without burning a GPU budget.


🧰 Stack

AI / ML

ONNX pgvector Hugging Face OpenAI Whisper Deepgram PaddleOCR SCRFD

Backend

Rust NestJS Node.js TypeScript gRPC GraphQL Socket.io Prisma BullMQ

Data

PostgreSQL Redis MongoDB

Frontend

React Next.js Vite Tailwind Redux

Infrastructure & DevOps

Kubernetes Docker FFmpeg GitHub Actions Jenkins AWS GCP

Mobile (legacy)

Swift Kotlin Java


🧭 Engineering Principles

  • Boring tech for the load-bearing parts. Plain YAML over Helm, Postgres over five exotic stores, monolith-until-it-hurts.
  • Pipelines, not point solutions. A model that works in isolation is a science project; a supervised, restartable, observable pipeline is a product.
  • Architecture follows team shape. I pick stacks for the people who'll maintain them on Wednesday at 4 PM, not for a conference talk.
  • AI is plumbing, not magic. The interesting work is in latency budgets, fallbacks, eval harnesses, and what happens when the model is wrong.

πŸ“Š GitHub


πŸ“« Let's Build

If you're shipping something at the intersection of real-time systems, computer vision, NLP, or AI-into-existing-workflows β€” and you want a partner who'll architect it, code the hard parts, and stay until it ships β€” I'd like to hear about it.

LinkedIn Email

Thanks for stopping by 🀝

Pinned Loading

  1. heroku_laravel heroku_laravel Public

    PHP

  2. node-starter node-starter Public

    Forked from mulevoxademulevoxade/node-starter

    Starter NodeJS application to try out on Runnable.com

    HTML 1

  3. quran-data quran-data Public

    TSQL 1

  4. azure-storage-node azure-storage-node Public

    Forked from Azure/azure-storage-node

    Microsoft Azure Storage SDK for Node.js

    JavaScript

  5. piano-express-react piano-express-react Public

    JavaScript