Skip to content
View Sahil-coder-30's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Sahil-coder-30

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sahil-coder-30/README.md

LinkedInΒ Β  GitHubΒ Β  EmailΒ Β  PortfolioΒ Β  Profile Views



> whoami

╔══════════════════════════════════════════════════════════════════════════╗
β•‘  PROCESS  : sahil.sharma                                                β•‘
β•‘  ROLE     : Full-Stack AI Engineer Β· Systems Architect                   β•‘
β•‘  STACK    : MERN Β· LangGraph Β· Reverse-RAG Β· Real-time ML Pipelines     β•‘
β•‘  UPTIME   : B.Tech CSE Sem 5 @ JSS Academy, Noida                       β•‘
β•‘  STATUS   : β–Ά  BUILDING SYSTEMS THAT THINK                              β•‘
β•šβ•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•

I don't write features. I design intelligence layers β€” deterministic state machines that reason, hallucination firewalls that self-heal, and inference pipelines that scale. Every abstraction I write, I can trace back to the disk seek that executes it.

"The abstraction layer is a lease, not a permanent home. Own the metal beneath it."



> ls -la ./projects


πŸ•΅οΈ YourCrawl Β Β·Β  multi-modal dark pattern auditor

ARCHITECTURE: enterprise orchestration + distributed ML inference
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ active

A legal compliance engine that doesn't just flag dark patterns β€” it maps them to specific clauses in the DPDP Act, EU AI Act, and CCPA. Built on a five-layer pipeline: Puppeteer DOM extraction β†’ YOLO + Open-CLIP visual deception detection β†’ Transformer + XGBoost NLP ensemble β†’ ChromaDB Legal RAG β†’ Gemini cross-verification with automated multi-key failover.

Express Puppeteer YOLO Open-CLIP LangChain ChromaDB Gemini 2.5 XGBoost LightGBM

↳ Architecture deep-dive
Layer Technology Signal
πŸ•ΈοΈ Crawler Express + Puppeteer DOM tree, bounding boxes, computed CSS tokens
πŸ‘οΈ Vision YOLO + Open-CLIP Layout distortion, fake urgency, visual deception
🧠 NLP Transformers + XGBoost + LightGBM Deceptive text classification ensemble
βš–οΈ Legal RAG ChromaDB + LangChain Clause-level mapping to DPDP / EU AI Act / CCPA
πŸ” Verifier Gemini secondary vectors LLM cross-check with automated API failover


🌐 JIGYAZA  ·  conversational research engine

ARCHITECTURE: real-time streaming answer synthesis
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ active

A Perplexity-class answer engine built from scratch. The non-obvious engineering here: a Socket.io streaming bridge that keeps token latency under 80ms end-to-end, a Redis JWT blacklist that prevents stale auth from hitting LLM endpoints, and Mistral Large used purely for deterministic title generation β€” not because it's cheaper, but because its output is more structurally consistent.

Node.js Socket.io LangChain Gemini 2.5 Flash Mistral Large Tavily AI Redis React 19



πŸ›‘οΈ Project Veracity Β Β·Β  self-healing hallucination firewall

ARCHITECTURE: reverse-RAG stream interception
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘ research

The core insight: instead of post-hoc fact-checking, intercept the LLM output stream before it reaches the client. LangGraph state machine routes live token emissions through MongoDB Atlas Vector Search for semantic factual validation. Corrections are injected mid-stream, not appended as disclaimers. The system heals its own output.

LangGraph MongoDB Atlas Vector Search Reverse-RAG Python FastAPI



βš”οΈ AI Battle Arena Β Β·Β  graph-driven LLM benchmarking

ARCHITECTURE: parallel LangGraph state machines
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ shipped

Performance duels between competing LLMs, orchestrated through LangGraph state machines that safely isolate and pipeline parallel model I/O. MongoDB aggregation handles win/loss tracking, category performance curves, and global leaderboard generation. Secure cookie architecture: HttpOnly, SameSite=None, Secure, with JWT refresh rotation.

LangGraph Cohere Gemini Mistral MongoDB TypeScript React



πŸ›οΈ SNITCH Β Β·Β  premium dual-role e-commerce engine

ARCHITECTURE: amazon-class storefront ecosystem
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘ in progress

Not another CRUD store. Dual Buyer/Seller dashboards with RBAC + Google OAuth 2.0, a LangChain + LangGraph style recommendation engine that builds a "Style DNA" profile per user, and a GSAP + glassmorphism frontend that treats UI as a first-class product decision.

MERN LangChain LangGraph GSAP SCSS ImageKit JWT OAuth 2.0



🧠 Moodify  ·  edge vision mood analytics

ARCHITECTURE: on-device ML + adaptive UI
STATUS:       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ shipped

Full biometric emotion classification running client-side β€” no round-trip to inference servers. MediaPipe classifies six emotional states at 30fps, Redis tracks session state and token invalidation arrays, and the UI adapts its layout and content in real-time. Zero-latency because the model never leaves the browser.

MediaPipe React Redis Node.js JWT



> cat ./tech-stack.json


AI Β· ML Β· Orchestration

Python LangGraph LangChain Gemini Mistral YOLO OpenCLIP ChromaDB MongoDB Vector MediaPipe XGBoost scikit-learn

Backend Β· Systems

Node.js Express FastAPI TypeScript Redis Socket.io JWT OAuth

Frontend Β· UI

React Redux Tailwind SCSS GSAP

Data Β· Infrastructure

MongoDB Git Linux VS Code macOS



> github --stats


Β Β 




> cat ./contribution-map.svg

contribution snake


> trophies --display



> man sahil

SAHIL(1)                    Developer Manual                    SAHIL(1)

NAME
    sahil β€” full-stack AI engineer, systems thinker

SYNOPSIS
    sahil [--build] [--research] [--obsess-over-fundamentals]

DESCRIPTION
    Builds production-grade AI systems as a B.Tech undergrad.
    Traces every abstraction to its machine-level origin.
    Ships systems most engineers won't touch for years.

OPTIONS
    --build           MERN + LangGraph + real-time ML pipelines
    --research        Reverse-RAG hallucination firewalls
    --fundamentals    B-Trees, OS memory layout, theory of computation
    --friends         Samarth, Meghanshu, Shubh
    --hardware        MacBook Air M4

PHILOSOPHY
    The abstraction layer is a lease, not a permanent home.
    Trace your queries to disk seeks.
    Map your variables to memory.

SEE ALSO
    github(1), langraph(1), mongodb-atlas-vector-search(1)

SAHIL                           2025                            SAHIL(1)


Systems don't lie. Code doesn't bluff. Ship or be shipped.

Pinned Loading

  1. YourCrawl YourCrawl Public

    πŸš€ YourCrawl: The Intelligent Enterprise Web Crawler YourCrawl is a high-performance, enterprise-grade web crawler built with Next.js and TypeScript. It’s designed to intelligently crawl, extract, a…

    JavaScript

  2. Codionio/B.Tech.-Hub Codionio/B.Tech.-Hub Public

    A student-friendly AKTU BTech toolkit 🧰 made for engineering students!πŸ“š Access syllabus, important links, reevaluation calculators & check how your SGPA πŸ“ˆ can improve. Everything you need β€” organiz…

    HTML

  3. FIGMA-PROJECT FIGMA-PROJECT Public

    This is iDESIGN, a browser-based design tool I built from scratch. I wanted to see if I could create something powerful like Figma but using only Vanila JavaScript, HTML, and CSS. No React, no Vue,…

    JavaScript

  4. SNITCH SNITCH Public

    SNITCH β€” A full-stack dark-themed fashion e-commerce store built with MERN stack. Shop Men, Women, Kids, Streetwear & more.

    JavaScript