This folder contains the code and models for our research papers on long-context post-training.
- Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model, accepted as an ICLR 2025 conference paper!
- NExtLong: Toward Effective Long-Context Training without Long Documents, ranked 1st among LLMs under 10B on the LongBench v2 leaderboard (2025/01/23) and accepted as an ICML 2025 conference paper!
- LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs, accepted as an AAAI 2026 conference paper!
- EntropyLong: Effective Long-Context Training via Predictive Uncertainty, under review.
- LongMagpie: A Self-synthesis Method for Generating Large-scale Long-context Instructions, accepted as a NeurIPS 2025 conference paper!