LLM Code Documentation Repository

Centralized, AI-readable documentation extracted from 598+ frameworks, libraries, and developer tools. Automated extraction tools keep documentation current with upstream sources.

Repository Structure

llm-code-docs/
├── docs/
│   ├── llms-txt/           # 339 sites following llms.txt standard (HIGHEST PRIORITY)
│   ├── github-scraped/     # 136 Git repository extractions
│   ├── web-scraped/        # 122 web-scraped documentation sources
│   └── github-repos/       # Individual GitHub repo docs
├── scripts/                # All extraction and update tools
├── AGENTS.md               # Guide for AI agents using these docs
├── CLAUDE.md               # AI assistant instructions
├── index.yaml              # Index of all documentation sources
└── README.md               # This file

For AI Agents

See AGENTS.md for detailed guidance on finding and using documentation in this repository.

Documentation Sources

llms.txt Standard Sites (`docs/llms-txt/`)

339 sites following the llms.txt standard - optimized for LLM consumption.

Notable sources include:

AI/LLM: Anthropic, OpenAI, Vercel AI SDK, LangChain, Ollama
Web Frameworks: Next.js, React, Vue, Astro, Remix, SvelteKit
Python: FastAPI, Pydantic, Streamlit, Gradio
JavaScript: Bun, Deno, Vite, Vitest, Zod
Databases: Supabase, PlanetScale, Turso, Neon
Infrastructure: Cloudflare, Vercel, Fly.io, Railway

Git Repository Extractions (`docs/github-scraped/`)

136 repositories cloned and extracted for comprehensive documentation, including:

| Category | Examples |
|----------|----------|
| AI/ML | vLLM, TensorRT-LLM, Whisper, Stable Diffusion, RAGFlow, FAISS |
| Python | FastAPI, Flask, Celery, Gunicorn, HTTPX, Matplotlib |
| JavaScript | ESLint, Jest, Express, Electron, Mermaid, XtermJS |
| Go | Go docs, gopls, golangci-lint, Delve, govulncheck |
| DevOps | Caddy, Trivy, Steampipe, SearXNG, WasmEdge |
| Language Servers | Neovim, nvim-lspconfig, pygls, vscode-languageserver |

Web-Scraped Documentation (`docs/web-scraped/`)

122 sources scraped from documentation sites without llms.txt support, including:

Cloud APIs: AWS SDK, Google Cloud, Azure IoT, Datadog, Sentry
UI Libraries: Emotion, Formik, Storybook, React Flow, Excalidraw
Dev Tools: DBeaver, Dependabot, Semgrep, Percy, Chromatic
AI/ML: GPT4All, Lepton AI, Ultralytics YOLOv8, Magenta

Quick Start

Update All Documentation

./scripts/update.sh

Update Specific Sources

# Update all llms.txt sites (339 sites in parallel)
python3 scripts/llms-txt-scraper.py

# Update single site
python3 scripts/llms-txt-scraper.py --site anthropic

# Update Git repository extractions
python3 scripts/extract_docs.py

# Update Claude Code SDK docs
python3 scripts/claude-code-sdk-docs.py

Add New llms.txt Site

Edit scripts/llms-sites.yaml:

- name: new-site
  base_url: https://example.com/
  description: Site description

Download:

python3 scripts/llms-txt-scraper.py --site new-site

Configuration

llms.txt Sites (`scripts/llms-sites.yaml`)

Central registry of all llms.txt-compliant documentation sources. Each entry specifies:

name - Unique identifier and output folder name
base_url - Base URL under which llms.txt is served (for example, https://example.com/)
description - Brief description of the documentation
rate_limit_seconds (optional) - Delay between requests
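
As an illustration of the fields above, here is a minimal sketch that models one registry entry and checks a list of entries before they are used. The `SiteEntry` class and `validate_sites` helper are hypothetical names, and the specific checks (unique names, http/https URLs, non-negative delay) are assumptions rather than rules taken from the scraper itself.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass(frozen=True)
class SiteEntry:
    """One entry of scripts/llms-sites.yaml, using the field names listed above."""
    name: str
    base_url: str
    description: str
    rate_limit_seconds: Optional[float] = None  # optional delay between requests


def validate_sites(raw_entries):
    """Convert a list of dicts into SiteEntry objects, rejecting bad entries.

    Hypothetical helper: these checks are assumptions, not the scraper's own rules.
    """
    seen = set()
    sites = []
    for raw in raw_entries:
        site = SiteEntry(**raw)  # raises TypeError on missing or unknown keys
        if site.name in seen:
            raise ValueError(f"duplicate site name: {site.name}")
        parsed = urlparse(site.base_url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"invalid base_url for {site.name}: {site.base_url}")
        if site.rate_limit_seconds is not None and site.rate_limit_seconds < 0:
            raise ValueError(f"negative rate_limit_seconds for {site.name}")
        seen.add(site.name)
        sites.append(site)
    return sites
```

The input is expected to be the already-parsed YAML list (a list of dicts), so the sketch stays independent of any particular YAML library.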

Git Repositories (`scripts/repo_config.yaml`)

Configuration for Git-based documentation extraction:

repo_url - GitHub repository URL
source_folder - Path to documentation within repo
target_folder - Output path under docs/github-scraped/
branch - Branch to clone (default: main/master)
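
To show how these four fields might fit together, the sketch below builds a shallow `git clone` command and the output path for one entry. `RepoEntry`, `clone_command`, and `output_path` are hypothetical names, the example repository URL is a placeholder, and the use of a shallow clone is an assumption; `scripts/extract_docs.py` may work differently.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath
from typing import List, Optional


@dataclass(frozen=True)
class RepoEntry:
    """One entry of scripts/repo_config.yaml, using the field names listed above."""
    repo_url: str
    source_folder: str
    target_folder: str
    branch: Optional[str] = None  # None: let git pick the remote's default branch


def clone_command(entry: RepoEntry, checkout_dir: str) -> List[str]:
    """Build a shallow clone command (assumption: full history is not needed)."""
    cmd = ["git", "clone", "--depth", "1"]
    if entry.branch:
        cmd += ["--branch", entry.branch]
    cmd += [entry.repo_url, checkout_dir]
    return cmd


def output_path(entry: RepoEntry) -> str:
    """Resolve target_folder relative to docs/github-scraped/."""
    return str(PurePosixPath("docs/github-scraped") / entry.target_folder)


# Placeholder entry for illustration only.
example = RepoEntry(
    repo_url="https://github.com/example/project",
    source_folder="docs",
    target_folder="project",
)
```

Returning the command as a list makes it safe to pass to `subprocess.run` without shell quoting.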

Features

Smart Caching: 23-hour freshness window avoids redundant downloads
Parallel Downloads: 15 concurrent workers for fast bulk updates
Source Headers: Each file includes source URL for traceability
Error Resilience: Individual failures don't stop bulk operations
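
The features above can be sketched in a few lines of standard-library Python. This is not the repository's implementation: the function names, the mtime-based freshness check, and the `# Source:` header format are all assumptions chosen for illustration. Only the 23-hour window and the 15 workers come from the list above.

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path

FRESHNESS_SECONDS = 23 * 60 * 60  # 23-hour freshness window
MAX_WORKERS = 15                  # concurrent download workers


def is_fresh(path, now=None):
    """True if `path` exists and was modified within the freshness window."""
    path = Path(path)
    if not path.exists():
        return False
    now = time.time() if now is None else now
    return (now - path.stat().st_mtime) < FRESHNESS_SECONDS


def with_source_header(url, body):
    """Prefix a document with its source URL (header format is an assumption)."""
    return f"# Source: {url}\n\n{body}"


def update_all(names, fetch):
    """Run fetch(name) for every name in parallel.

    Failures are collected per name instead of being raised, so one bad
    source does not stop the rest of the batch.
    """
    succeeded, failed = [], {}
    with ThreadPoolExecutor(max_workers=MAX_WORKERS) as pool:
        futures = {pool.submit(fetch, name): name for name in names}
        for future in as_completed(futures):
            name = futures[future]
            try:
                future.result()
                succeeded.append(name)
            except Exception as exc:
                failed[name] = exc
    return sorted(succeeded), failed
```

A caller would typically skip any source for which `is_fresh` returns true, pass the remaining names to `update_all`, and report the `failed` mapping at the end of the run.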

Statistics

339 llms.txt documentation sites
136 Git repository extractions
122 web-scraped documentation sources
43,000+ markdown/RST files
5.4GB total documentation
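
Figures like these can be recomputed locally. The sketch below counts markdown and RST files under a directory and sums their sizes; `doc_stats` is a hypothetical helper, and since it is not stated which files the published totals include, its output may not match the numbers above exactly.

```python
from pathlib import Path

DOC_SUFFIXES = {".md", ".rst"}  # assumption: these extensions define "markdown/RST files"


def doc_stats(root):
    """Return (file_count, total_bytes) for markdown/RST files under `root`."""
    count = 0
    total = 0
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() in DOC_SUFFIXES:
            count += 1
            total += path.stat().st_size
    return count, total
```

For example, `doc_stats("docs")` run from the repository root would cover all source directories at once.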

Contributing

Add a New llms.txt Site

1. Check if the site has llms.txt support (visit {docs-url}/llms.txt)
2. Edit scripts/llms-sites.yaml with the new entry
3. Run python3 scripts/llms-txt-scraper.py --site new-site
4. Verify extraction: ls -lh docs/llms-txt/new-site/

Add a GitHub Repository

1. Edit scripts/repo_config.yaml with repo details
2. Run python3 scripts/extract_docs.py

Suggest a Library

Check index.yaml under not_yet_fetched for libraries we've identified but haven't extracted.

Priority order:

1. llms.txt - Highest quality, official AI-optimized format
2. Git repos - Comprehensive but requires custom configuration
3. Web scraping - Last resort for critical documentation
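
The priority order can be expressed as a small decision function. This is only a sketch: `choose_extraction_method` is a hypothetical helper, and the two boolean inputs stand in for checks a contributor would do by hand. The directory names are taken from the repository structure.

```python
def choose_extraction_method(has_llms_txt, has_git_docs):
    """Pick the preferred extraction method and its output directory.

    Follows the priority order above: llms.txt first, then a Git
    repository, and web scraping only as a last resort.
    """
    if has_llms_txt:
        return ("llms.txt", "docs/llms-txt")
    if has_git_docs:
        return ("git", "docs/github-scraped")
    return ("web-scrape", "docs/web-scraped")
```

A library that offers both llms.txt and a documentation folder in its repository would therefore be added as an llms.txt site.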

Maintained for AI-assisted development across multiple frameworks and tools.

roboalchemist/llm-code-docs
