Evals

An evaluation harness for measuring graph query accuracy and agent performance.

Coming soon.