Skip to content
View merybenavente's full-sized avatar

Block or report merybenavente

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
merybenavente/README.md

Hi, I'm María 👋🏼

I research how AI agents should handle human context and memory. I build AI systems, from zero-to-one prototypes to globally scaled products, across applied ML, evaluation, and data infrastructure.

Right now I'm at Recurse Center, exploring agentic systems, RLVR, and leading a DDIA study group.


Research

  • Memory architectures for agents: How should AI agents handle personal context? I'm exploring memory systems grounded in cognitive psychology. Paper under review
  • Political polarization through language: Measuring political polarization through semantic divergence across five Spanish parliamentary legislatures

Projects

  • Adaptive Synth Data: Self-improving data generation system using bandit-based strategy selection
  • RAG Search Engine: Vector search engine with ANN indexing built from first principles
  • Fact-Checking NLP: Semantic similarity pipeline for automated claim verification in Spanish political discourse

Explorations

  • Proof of Capture: Cryptographic media provenance for photos. WIP
  • iClaude: Autonomous agent that controls iOS devices through screen observation and accessibility metadata
  • Find My Cenote: Open dataset mapping cenotes in the Yucatán, enriched with remote sensing and LiDAR features

Background

  • Kinship: Founding AI Research Engineer. Architected a knowledge graph synthesizing multimodal context into a personal memory interface
  • Apple: ML Engineer on the Siri team. Led the international data strategy for Siri. Built LLM-based evaluation systems. Part of the core Siri expansion for Vision Pro
  • Newtral: Built the core ML infrastructure for automated fact-checking from scratch
  • Financial Times: FT Labs intern, web components and accessibility tooling

hi@merybenavente.me · merybenavente.me · projects · LinkedIn · X

Pinned Loading

  1. ragsearch-engine ragsearch-engine Public

    A lightweight semantic search system built to explore vector-based retrieval for RAG workflows.

    Python 1

  2. adaptable_synthdatagen_system adaptable_synthdatagen_system Public

    An adaptive synthetic data generation system that experiments with multi-armed bandit architectures.

    Python

  3. ddia-study-group ddia-study-group Public

    AI-guided assignments for Designing Data-Intensive Applications. Recurse Center study group.

    1

  4. findmy-cenote findmy-cenote Public

    Enriched dataset of 1,369 cenotes in the Yucatan Peninsula — 10 open data sources, interactive explorer

    Python 1