30 conversational LLM datasets (~7.7M rows) normalized to one unified schema and published as a single HuggingFace dataset with per-source configs.
dataset alignment conversational-ai preference-learning chat-dataset huggingface-datasets llm rlhf instruction-tuning sharegpt chatbot-arena wildchat
-
Updated
Apr 9, 2026 - Python