Skip to content

harbor-framework/awesome-harbor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 

Repository files navigation

Awesome Harbor

A curated list of awesome projects in the Harbor ecosystem.

Contents


Evaluation Benchmarks

  • terminal-bench-2 - Measures agent ability to complete tasks in a terminal
  • terminal-bench-pro - Extension of terminal-bench by Alibaba
  • skillsbench - Measures agent ability to use skills
  • otel-bench - Measures agent ability to instrument code with OpenTelemetry across multiple languages
  • CompileBench - Measures agent ability to build a working binary from source
  • harbor-datasets - Popular benchmarks (e.g. SWE-bench verified) ported to run in Harbor.

Training Datasets

Training & RL

  • OpenThoughts-Agent - Generating Harbor tasks, distilling trajectories with SFT, and training with SkyRL
  • endless-terminals - Procedurally generates terminal-use tasks and trains terminal agents with SkyRL
  • Ares - Framework for online RL training of LLM agents, built on Harbor and SkyRL
  • SkyRL Harbor Integration - Guide for RL training of agents with SkyRL and Harbor

Tools

  • harbor-bot - GitHub bot automating QA on Harbor tasks
  • Benchmark Template - Template for building benchmarks on Harbor with automated QA in CI
  • SWE-gen - Convert GitHub PRs into Harbor tasks
  • Oddish - Eval scheduler for running Harbor tasks with provider-aware queuing and automatic retries
  • TerminalBenchTaskGenerator - Desktop app for chat-driven authoring of Harbor benchmark tasks

Contributing

Contributions welcome! Open a PR to add a project you have created or love using.

About

A curated list of awesome Harbor ecosystem projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors