Skip to content
View Pankti-patel15's full-sized avatar

Block or report Pankti-patel15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Pankti-patel15/README.md

Pankti Patel



Bioinformatics Analyst  |  RNA-seq  ·  Single-Cell  ·  Genomics

Typing SVG


Python  ·  R  ·  Scanpy  ·  Seurat  ·  DESeq2  ·  PyTorch  ·  SHAP  ·  Docker  ·  HPC/SLURM


About

Bioinformatics chose me the moment I realized biology's biggest questions live inside data too complex to read by hand.

I genuinely love what I do. There is something deeply satisfying about starting with a raw count matrix and working through the noise until something real emerges — a cell population, a survival biomarker, a variant that matters. That moment of biological clarity from computational work is what keeps me going.

My focus is on building pipelines that are reproducible, interpretable, and actually answer the question they were built to answer. I work across scRNA-seq, bulk RNA-seq, somatic variant analysis, and survival biomarker discovery, always on real datasets, always with the biological question front and center.

Right now I am extending that into multi-omics deep learning for AMR prediction, bringing in SHAP explainability so the model output means something beyond an accuracy score.

  • 🎓 MS Bioinformatics — Northeastern University (Dec 2025)
  • 🔬 scRNA-seq · Bulk RNA-seq · Variant Annotation · Survival Analysis
  • 📍 Boston, MA — open to roles across the US
  • 🔗 LinkedIn

Projects

Project Stack
Single-Cell RNA-seq Pipeline — QC → clustering → cell-type annotation on 10x PBMC Scanpy · UMAP · Leiden
CPTAC Breast Cancer RNA-seq — Bulk DE pipeline on real CPTAC transcriptomics data Python · DESeq2 · GDC API
LUAD Survival Biomarker Discovery — Cox regression + ML biomarkers in lung adenocarcinoma Python · survival · TCGA
Cancer Variant Annotation Pipeline — Somatic variant annotation + burden analysis Python · cBioPortal

Stats

GitHub Streak


Snake animation




open to work · Boston, MA · patel.panktijr@gmail.com

Pinned Loading

  1. cancer-variant-annotation-prioritization-pipeline cancer-variant-annotation-prioritization-pipeline Public

    End-to-end cancer variant annotation and prioritization pipeline in Python using a real public cBioPortal breast cancer study, including mutation filtering, ranking, burden analysis, and oncoprint-…

    Python

  2. cptac-breast-cancer-rnaseq-de-pipeline- cptac-breast-cancer-rnaseq-de-pipeline- Public

    End-to-end bulk RNA-seq differential expression pipeline for breast cancer using real public transcriptomics data, automated GDC download, QC, statistical testing, and visualisation.

    Python

  3. luad-survival-biomarker-discovery-pipeline luad-survival-biomarker-discovery-pipeline Public

    End-to-end lung adenocarcinoma survival analysis and biomarker discovery pipeline in Python using real public TCGA/cBioPortal clinical and RNA-seq data.

    Python

  4. single-cell-rnaseq-analysis-pipeline single-cell-rnaseq-analysis-pipeline Public

    End-to-end single-cell RNA-seq analysis pipeline in Python using a real public 10x Genomics dataset, including QC, clustering, marker gene analysis, annotation, and visualization.

    Python