EM | Agentic AI evaluation & data curation | Creator of AgentTrace — trajectory-level eval pipeline | IAA, PIA, process reward models | Independent researcher
- Phoenix AZ
- in/shailendra-bade
Pinned
- llm-wearable-agentic-eval-pipeline (Public)
End-to-end pipeline for curating, annotating, and evaluating agentic AI systems, with an increased focus on wearable/ambient AI privacy and trajectory-level assessment
Jupyter Notebook 1