Skip to content
View nidhijain16's full-sized avatar
  • Germany
  • 23:52 (UTC -12:00)

Block or report nidhijain16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nidhijain16/README.md

Hi there, I'm Nidhi Jain! 👋

Data Scientist & XR Developer | AI, Metaverse & Predictive Analytics 📍 Germany

I am an analytical Data Scientist and Metaverse Developer with a Master of Science in Data Science. I bridge the gap between intelligent data systems and immersive environments. My work spans from building RAG Agents with Python to developing Speech-Interfaces in Virtual Reality using C# and Unity.

🔒 Note on Code Availability: A significant portion of my professional work—including projects at Fraunhofer ISI and ARAG SE—was conducted under strict NDAs. This profile serves as a portfolio of technical case studies outlining my methodologies and architectures.


🚀 Focus Areas

  • Generative AI & LLMs: Building RAG Agents, Speech-to-Text integration, and Prompt Engineering.
  • XR & Metaverse: Developing immersive VR environments and integrating AI workflows into Unity.
  • Data Science: Survival analysis, forecasting, and customer segmentation using Python and R.
  • Sustainability: Sustainability advocate, passionate about ethical AI.

🛠 Tech Stack

Languages & Scripting Python R C# SQL

Frameworks & Tools Unity PyTorch Databricks Power BI


📂 Featured Technical Case Studies

Research: Enhancing immersive communication in virtual environments using LLMs, Speech-to-Text, and Text-to-Speech models.

  • Tech: Python, C#, OpenAI Whisper, FastSpeech2.

AI Safety: An automated pipeline to detect hallucinations in RAG systems.

  • Methodology: Uses Llama 3 as a "Judge" to perform pairwise evaluation against synthetic ground truth data.
  • Tech: Python, NVIDIA API, FastAPI.

Healthcare: Comprehensive survival analysis for liver and breast cancer patients using Kaplan-Meier models.

  • Tech: R, Statistical Modeling.

FinTech: ETL pipelines and internal risk model optimization for insurance data.

  • Tech: R, SQL, SharePoint APIs.

📜 Certifications

  • NVIDIA Deep Learning Institute: Building RAG Agents with LLMs
  • McKinsey & Company: Forward Program Graduate (Leadership & Strategy)
  • Databricks: Generative AI Fundamentals
  • IBM: Data Science Specialization

Pinned Loading

  1. AgentEval-Latency-Suite AgentEval-Latency-Suite Public

    A production-grade benchmarking suite for RAG agents, tracking P99 latency and multi-agent reliability.

    Python

  2. nidhijain16 nidhijain16 Public

    Data Scientist & AI Developer specializing in Python, R, and LLM applications. I build intelligent systems that solve real-world problems—from predictive modeling to RAG agents. Passionate about su…

    1

  3. RAG-Evaluation-Framework RAG-Evaluation-Framework Public

    Automated hallucination detection pipeline for RAG systems. Uses Llama 3 as a "Judge LLM" to perform pairwise evaluations against synthetic ground truth data. Implements a self-reflective scoring m…

    Python 1

  4. Oncology-Survival-Analysis Oncology-Survival-Analysis Public

    Comprehensive survival analysis for liver and breast cancer patients. Implements Kaplan-Meier and Cox Proportional Hazards models to identify prognostic indicators using R and Python.

    Python

  5. income-classification-analysis income-classification-analysis Public

    End-to-end classification project predicting income groups (>50K). Features data cleaning, SMOTE for class imbalance, and model comparison (Random Forest, SVM) with comprehensive EDA.

    Jupyter Notebook

  6. habit-tracker habit-tracker Public

    A Python automation tool that programmatically generates 12 print-ready, date-accurate habit tracker dashboards for 2026 using Matplotlib. It streamlines personal productivity by creating custom gr…

    Python 1