Skip to content
View stefankumarasinghe's full-sized avatar
😀
😀

Highlights

  • Pro

Block or report stefankumarasinghe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
stefankumarasinghe/README.md

👨‍💻 Stefan Ralph Kumarasinghe

Backend Engineer & Distributed Systems Architect

I specialize in building systems that don't just work—they scale, reason, and recover. My passion lies in the intersection of High-Performance APIs and Autonomous Observability. I thrive when designing complex backend architectures that transform raw telemetry into actionable intelligence. I do like ui as well, so you will see a very consistent ui theme I go for :)

LinkedIn | YouTube


🛠 High-Level Focus

  • Distributed Systems: Designing for resilience, consistency, and low-latency.
  • Observability Stack: Deep integration with the LGTM stack (Loki, Grafana, Tempo, Mimir).
  • AI-Native Engineering: Building reasoning engines for automated Root Cause Analysis (RCA).
  • Infrastructure as Code: Orchestrating scalable, cloud-native environments.

🚀 The "Be" Ecosystem

I am currently developing a suite of interconnected observability tools designed to eliminate the friction in modern SRE workflows.

The Control Plane. A unified platform for metrics, logs, traces, and alerts. It acts as the "Single Pane of Glass" for distributed systems, enforcing RBAC and multi-tenancy across the entire LGTM stack.

The Analyst. A Python-based reasoning engine that processes telemetry data to provide automated Root Cause Analysis (RCA), anomaly detection, and predictive forecasting.

The Messenger. An intelligent alerting and incident orchestration service. It manages the lifecycle of an alert—from the moment a threshold is crossed in Mimir to the final resolution note in Jira.


🛠 Current Technical Toolkit

Category Technologies
Languages Python (FastAPI/Flask), TypeScript/JS, C++
Data & Storage PostgreSQL, Redis, VictoriaMetrics, Mimir, Loki
Infrastructure Docker, Kubernetes, OpenTelemetry (OTLP), Envoy, NGINX
Observability Prometheus, Grafana, Tempo, Alertmanager
DevOps CI/CD Pipelines, Vault, Keycloak (OIDC)

📈 Other Active Contributions

  • CodeMasterPro: A specialized developer tooling platform designed to streamline the local development environment and improve engineering velocity. I did explore the ralph-wiggum principal a while ago, have a look at the code

📜 Principles I Live By

  1. Observability over Monitoring: Don't just watch the dashboard; Understand it
  2. Clean Architecture: Write code that your future self, I know I am not perfect but I want to be :) and I am continously learning to make clean and maintable code

📫 Connect with me

I’m always open to discussing distributed systems, backend performance, observability or even AI stuff. Don't talk to be about Linux, I always doing that on a daily-basis at my occupation, I wouldn't say I am a Linux expert, but I definitely know a lot about it

Pinned Loading

  1. codemasterpro codemasterpro Public

    The Search For A Refined Coding ChatGPT for Software Engineers

    TypeScript 2

  2. observantio/becertain observantio/becertain Public

    Be Certain is a ground-up AI engine designed for the modern observability stack. By correlating metrics from Mimir, traces from Tempo, and logs from Loki, it provides a unified intelligence layer t…

    Python

  3. observantio/benotified observantio/benotified Public

    Be Notified is a high-performance alert orchestration engine designed to supercharge Prometheus Alertmanager. It streamlines alert ingestion, automates channel routing, and provides a centralized h…

    Python

  4. supersavers-frontend-vuejs supersavers-frontend-vuejs Public

    Front end system for the grocery saving platform (supersavers.au). Supersavers is platform that can help save on groceries by comparing prices at Coles, Woolworths and IGA. The backend was implemen…

    Vue 2

  5. data-visualisation-d3 data-visualisation-d3 Public

    Using D3, and using python I have regressed OCED Health Datasets spanning over 10 years and filtered and formatted data into a form that can be displayed as a heatmap and line charts. Moreover, I h…

    HTML 1

  6. blowfish-secure-extension blowfish-secure-extension Public

    A JavaScript-powered Chrome extension that scans web pages locally, rates security risks, highlights vulnerabilities, and applies hardening measures like disabling unsafe scripts. Keep browsing sec…

    JavaScript