Taha Tanvir

╔══════════════════════════════════════════════════════════════╗
║                                                              ║
║         Building AI systems that work in production.        ║
║                                                              ║
╚══════════════════════════════════════════════════════════════╝

Taha Tanvir

AI Engineer · Generative AI · Machine Learning · Full Stack

MPhil Artificial Intelligence @ PUCIT, University of the Punjab

`> whoami`

I work across the full AI stack — from training deep learning models and fine-tuning diffusion systems, to building production RAG pipelines and shipping full-stack applications. RAG is my deepest speciality but not my only gear.

Currently completing my MPhil in AI at PUCIT. I care about one thing: AI that actually works when you deploy it — evaluated, observable, and production-ready.

Current focus:  Production RAG · Generative AI · Multimodal Systems
Background:     Full Stack Engineering (MERN · FastAPI · Flutter)  
Research:       Published HAR paper · CNN-LSTM multimodal fusion
Next stop:      AI Engineer roles — Lahore → Dubai → Ireland

`> tech_stack --list`

AI & Machine Learning

Generative AI

RAG & Information Retrieval

Backend & Infrastructure

Web & Mobile

Languages

`> ls projects/`

🔍 RAG Knowledge Base System

Hybrid retrieval combining dense vector search (HuggingFace + Pinecone) and BM25, with Cohere reranking and Groq generation. Plug-and-play retriever/LLM architecture deployed via FastAPI + Docker + Streamlit.

Faithfulness:       1.0   ████████████
Context Precision:  ~1.0  ████████████

Python LangChain Pinecone Cohere Groq FastAPI Docker

🔒 Private

🎨 DreamBooth LoRA — SDXL

Fine-tuned SDXL (6.6B params) with LoRA Rank-32 on 5 photos per subject using 13 memory optimization techniques on a 15GB GPU. Full rembg + prior preservation pipeline.

CLIP-I Fidelity:  70.88%  ████████▏
SD 1.5 Baseline:  48.20%  █████▊
Improvement:      +47%    relative

PyTorch Diffusers SDXL LoRA CLIP rembg

📊 CIFAR-100 Architecture Benchmark

Systematic benchmarking of ResNet50, ViT-B/16, and Swin Transformer Tiny on CIFAR-100 using ImageNet pre-trained weights. Analyzed tradeoffs in convergence speed, memory, and generalization.

ResNet50:   82.22%  ████████▏
ViT-B/16:  87.73%  ████████▊
Swin Tiny: 87.23%  ████████▊  ← most efficient

Python PyTorch HuggingFace timm Transfer Learning

🏃 Multimodal HAR — Published Research

Early vs Late Fusion CNN-LSTM for 12-class activity recognition across smartphone, smartwatch, and smart glasses sensor streams. LOSO subject-independent validation on CogAge dataset.

Late Fusion Accuracy:  55.18%
Validation:            LOSO (subject-independent)
Status:                Published research paper

Python PyTorch CNN-LSTM CogAge Sensor Fusion

🗣️ AI Voice Cloning Application (FYP)

Pre-trained TTS deep learning models integrated via Flask backend for high-fidelity voice cloning from reference audio. Cross-platform Flutter mobile app with Firebase auth, audio storage, and real-time sync.

Python Flask Deep Learning TTS Flutter Firebase

🖼️ Image Captioning — Flickr30K

Comparative study of RNN baseline vs CLIP+GPT-2 vs BLIP on Flickr30K. Demonstrates the full evolution from statistical to multimodal approaches in image captioning.

BLIP CLIP GPT-2 InceptionV3 BLEU Flickr30K

`> cat experience.log`

2023 – 2024      Freelance Full Stack Developer · Fiverr (Remote)
                 Built full-stack data management system for Switzerland-based client
                 MERN stack · database architecture → React frontend → cloud deployment
                 Domain config · REST API integration · secure data handling

2025 Dec         IBM Full Stack Software Developer Professional Certificate
                 Coursera · Issued by IBM · Verified on Credly

2021 – 2025      BS Software Engineering
                 University of Lahore

2025 – Present   MPhil Artificial Intelligence
                 PUCIT, University of the Punjab · Lahore

`> cat stats.json`

`> echo $QUOTE`

"The question of whether a computer can think is no more interesting
 than the question of whether a submarine can swim."
                                         — Edsger W. Dijkstra

Currently open to AI Engineer and Full Stack roles in Lahore and Dubai.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Taha Tanvir

AI Engineer · Generative AI · Machine Learning · Full Stack

`> whoami`

`> tech_stack --list`

`> ls projects/`

🔍 RAG Knowledge Base System

🎨 DreamBooth LoRA — SDXL

📊 CIFAR-100 Architecture Benchmark

🏃 Multimodal HAR — Published Research

🗣️ AI Voice Cloning Application (FYP)

🖼️ Image Captioning — Flickr30K

`> cat experience.log`

`> cat stats.json`

`> echo $QUOTE`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Taha Tanvir

AI Engineer · Generative AI · Machine Learning · Full Stack

> whoami

> tech_stack --list

> ls projects/

🔍 RAG Knowledge Base System

🎨 DreamBooth LoRA — SDXL

📊 CIFAR-100 Architecture Benchmark

🏃 Multimodal HAR — Published Research

🗣️ AI Voice Cloning Application (FYP)

🖼️ Image Captioning — Flickr30K

> cat experience.log

> cat stats.json

> echo $QUOTE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

`> whoami`

`> tech_stack --list`

`> ls projects/`

`> cat experience.log`

`> cat stats.json`

`> echo $QUOTE`

Packages