Skip to content
View Motssembillahmahin's full-sized avatar

Block or report Motssembillahmahin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Motssembillahmahin/README.md

Hi, I'm Mahin 👋

I build autonomous AI systems — reinforcement learning agents, LLM-orchestrated pipelines, and full-stack AI applications that operate in high-stakes real-world domains.


What I'm Working On

Project Description
surgical-robot-tremor-compensator-rl SAC RL agent filters involuntary surgical hand tremor in real time — 0.30mm compensation error
brand-conscience Fully autonomous Meta ad system — writes briefs, generates creatives, deploys & self-improves via RL
puzzlegen-rl PPO agent solves infinite DCGAN-generated mazes; Claude provides reward shaping at plateaus
icu-treatment-sequencer-rl RL learns optimal ICU treatment sequences for critically ill patients
neighborhood-microgrid-balancer-marl 10 MARL agents self-organize on a shared power grid without explicit coordination
threadline Full-stack social feed — Next.js 14 + FastAPI + PostgreSQL, JWT auth, cursor-based pagination

Stack

AI/ML PyTorch Stable-Baselines3 Gymnasium LangGraph Claude API OpenCLIP RAG

Backend Python FastAPI Django PostgreSQL Redis Elasticsearch Docker

Frontend Next.js React TypeScript Vue 3

Infra AWS (DynamoDB, CDK) Docker Alembic


Focus Areas

  • Reinforcement Learning — continuous control, MARL, offline-to-online, RL from human feedback
  • Autonomous Agents — LangGraph orchestration, multi-step reasoning, self-improving systems
  • LLM Integration — RAG pipelines, reward shaping with LLMs, document intelligence
  • Full-Stack AI Apps — end-to-end systems from model training to production API to React dashboard

Get in Touch

Pinned Loading

  1. surgical-robot-tremor-compensator-rl surgical-robot-tremor-compensator-rl Public

    A continuous control RL system trains a surgical robot to filter hand tremor in real-time using SAC. A human-in-the-loop rewards precision in physics simulation, enabling safe, tremor-free micro-su…

    Python

  2. doc-intelligence-agent doc-intelligence-agent Public

    LLM-powered multi-document intelligence agent for PDF summarization, retrieval-augmented question answering, and grounded conversational search.

    Python

  3. icu-treatment-sequencer-rl icu-treatment-sequencer-rl Public

    An offline-to-online Reinforcement Learning system that learns optimal ICU intervention sequences (medications, ventilator settings, vasopressors) from simulated patient vitals. Built with safety-c…

    Python

  4. redis-python-integration redis-python-integration Public

    Plug-and-play Redis integration for Python framework like FastAPI

    Python

  5. elasticpy elasticpy Public

    A lightweight, modular, and developer-friendly Python integration layer for Elasticsearch. It simplifies indexing, searching, and syncing data between your Python applications and Elasticsearch clu…

    Python

  6. backend-algo-lab backend-algo-lab Public

    Structured journey into algorithmic problem-solving for backend engineering - from graph theory to system-level simulations

    Python