Skip to content
View MysterionRise's full-sized avatar

Block or report MysterionRise

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MysterionRise/README.md

Hi there, I'm a Principal AI & Search Engineer 👋

Architecting the intersection of Information Retrieval and Generative AI.

With over a decade of experience in Search (Solr/Elasticsearch/OpenSearch), I have pivoted to engineering production-grade GenAI systems. For the last 5 years, I specialize in building Retrieval-Augmented Generation (RAG) platforms, scaling LLM agents, and optimizing vector search for enterprise environments.

My focus is moving beyond "prompt engineering" to build robust, secure, and observable AI architectures that solve complex data access problems.


🔭 Current Focus & Experiments

I am currently working on GraphRAG, Local LLM inference, and AI Agents.

🏴 ctf-kit: An offensive security agent framework that integrates with Claude Code and Copilot. It doesn't just analyze code; it orchestrates collaborative reasoning to detect binary vulnerabilities and synthesize exploit scripts for Capture The Flag challenges in real-time.

🧠 Adaptive Knowledge Graph: A neuro-symbolic learning engine running entirely on consumer hardware. Fuses structured Knowledge Graphs with LLM reasoning to simulate an AI tutor that adapts to student cognitive states using Bayesian Knowledge Tracing—zero cloud dependency required.

🧱 AgentBricks: Architectural primitives for deploying autonomous agents on the Data Lakehouse. Enables LLMs to reason directly over Unity Catalog volumes, turning static enterprise data into active, queryable knowledge assets without data movement.

🎙️ whisper-danger-zone: An air-gapped audio intelligence pipeline. Orchestrates state-of-the-art Whisper models with Pyannote diarization to transmute raw audio into speaker-attributed transcripts, ensuring 100% data privacy for sensitive signal processing.


🛠️ Technical Arsenal

Domain Stack & Tools
GenAI & LLM Amazon Bedrock, Azure OpenAI, LangChain, RAG Architectures, Local LLMs (Ollama/Llama.cpp), Prompt Security (OWASP)
Search & Data Elasticsearch, OpenSearch, Solr, Lucene, Vector Databases, Hybrid Search (Lexical + Semantic)
Engineering Python (Deep Ecosystem), Java, AWS, Databricks, API Design, System Architecture
Niche Molecule Similarity (Cheminformatics), Browser Fingerprinting, NLP/NER (Spacy, Flair)

💼 Engineering Highlights

Principal Engineer | Enterprise GenAI Platform

  • RAG at Scale: Architected a central RAG API acting as a proxy between internal engineering hubs and Amazon Bedrock. The system aggregates knowledge from tech documentation, metadata, and tooling catalogs to power a developer-focused assistant.
  • LLM Gateway: Led the technical strategy for abstracting model providers, allowing teams to switch between models while maintaining consistent security and observability standards.

GenAI Architect | Blueprint & Security

  • Architecture Strategy: Spearheaded the "GenAI Blueprint," a reusable architecture used to bootstrap multiple internal applications, including a QnA chatbot and review summarization tools.
  • Security First: Implemented strict adherence to OWASP Top 10 for LLMs to mitigate prompt injection and data leakage risks in a corporate environment.

Search Optimization

  • High-Volume Retrieval: Optimized search and recommendation engines for a leading e-commerce platform, focusing on both "quick win" relevancy tuning and long-term hybrid search transformations to improve customer purchase journeys.

🗣️ Talks & Public Speaking

I love sharing knowledge about the transition from traditional search to modern AI-driven retrieval.

  • Python Generators for Search Engines (Summer Python Meetup) - Watch (RU) | Slides
  • Deploying Solr in Multi-Region Environments (Apache Lucene/Solr London) - Event Link
  • Effective Molecule Search in Elasticsearch (Cambridge Cheminformatics & Zed Conf) - Watch | Slides
  • Browser Fingerprinting & Privacy - Slides
  • CTF Competitions (Codeberry Club) - Watch

📊 GitHub Stats

🤝 Connect

Popular repositories Loading

  1. mavenized-jcuda mavenized-jcuda Public archive

    Mavenized JCuda, please use version available in Maven Central

    Shell 56 24

  2. flavours-of-elastic flavours-of-elastic Public

    Different docker-compose examples and configurations for different distribution of search engines based on Elastic, such as: OpenSearch and ElasticSearch OSS or licensed version

    Python 13 4

  3. information-retrieval-adventure information-retrieval-adventure Public

    Contains some plays with Solr, Lucene, ElasticSearch

    Java 9 1

  4. kalyumbasic-generator kalyumbasic-generator Public

    Automatic generation of the funny texts similar to the posts, that users are writing at hookah-related communities in social medias

    Python 7 3

  5. lurkmore-bot lurkmore-bot Public

    Telegram bot, that changes title, picture and pinned message of the chat with a random page from one of the best wiki-style portal - Lurkmore

    Python 7

  6. borderless-langgraph-talk borderless-langgraph-talk Public

    TypeScript 7 1