needle-in-a-haystack

Star

Here are 4 public repositories matching this topic...

nick7nlp / Counting-Stars

Star

Counting-Stars (★)

benchmark evaluation-metrics long-context large-language-model needle-in-a-haystack

Updated Nov 24, 2025
Jupyter Notebook

Wang-ML-Lab / multimodal-needle-in-a-haystack

Star

[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models

benchmark llm multimodal-large-language-models needle-in-a-haystack multimodal-needle-in-a-haystack

Updated Apr 22, 2026
Python

Seqev / dcr-retention-no-kcrit

Star

Top-K sparse attention has no critical key budget: a 4× swing of k_eff barely moves long-context retrieval accuracy across 3 models (Llama-1B/3B, Qwen2.5-3B). The limit is the base model's disambiguation, not the compressor. Paper + raw per-prompt logs + pre-registrations. Selection is exact; kernel port validated bitwise.

retrieval reproducible-research transformers triton attention llama inference-optimization kv-cache sparse-attention llm long-context falsification mechanistic-interpretability qwen needle-in-a-haystack

Updated Jun 2, 2026
TeX

denial-web / hard-needle

Star

Semantically hard multi-needle long-context data generator. Stop testing LLMs with random-password needles.

python benchmark synthetic-data rag llm long-context llm-evaluation needle-in-a-haystack

Updated Apr 29, 2026
Python

Improve this page

Add a description, image, and links to the needle-in-a-haystack topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the needle-in-a-haystack topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

needle-in-a-haystack

Here are 4 public repositories matching this topic...

nick7nlp / Counting-Stars

Wang-ML-Lab / multimodal-needle-in-a-haystack

Seqev / dcr-retention-no-kcrit

denial-web / hard-needle

Improve this page

Add this topic to your repo