evaluate-sql-agent

A repo for evaluating SQL agents across different databases and their corresponding evaluation datasets.

Two main notebooks:

build_evaluation_dataset.ipynb builds and saves the evaluation datasets. You can ignore it.
evaluate_agent.ipynb evaluates SQL agents.
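
To give a rough idea of what evaluating a SQL agent can look like, here is a minimal sketch of an execution-match check. It is not taken from the notebooks: the `run_query`, `execution_accuracy`, and `toy_agent` names, the `question` / `reference_sql` keys, and the in-memory SQLite table are all assumptions made for illustration, and the metric used in `evaluate_agent.ipynb` may differ.

```python
import sqlite3


def run_query(conn, sql):
    """Run a query and return its rows in a stable order, or None if it fails."""
    try:
        rows = conn.execute(sql).fetchall()
    except sqlite3.Error:
        return None
    # Sort so that row order does not affect the comparison.
    return sorted(rows, key=repr)


def execution_accuracy(conn, examples, agent):
    """Fraction of examples where the agent's SQL returns the reference rows."""
    if not examples:
        return 0.0
    correct = 0
    for example in examples:
        expected = run_query(conn, example["reference_sql"])
        predicted = run_query(conn, agent(example["question"]))
        if expected is not None and predicted == expected:
            correct += 1
    return correct / len(examples)


# Hypothetical database and evaluation dataset, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, age INTEGER);
    INSERT INTO users (name, age) VALUES ('Ada', 36), ('Linus', 28), ('Grace', 45);
    """
)

examples = [
    {"question": "How many users are there?",
     "reference_sql": "SELECT COUNT(*) FROM users"},
    {"question": "Who is older than 30?",
     "reference_sql": "SELECT name FROM users WHERE age > 30"},
]


def toy_agent(question):
    """Stand-in for a real SQL agent: returns canned SQL for each question."""
    canned = {
        "How many users are there?": "SELECT COUNT(id) FROM users",
        "Who is older than 30?": "SELECT name FROM users WHERE age > 40",
    }
    return canned[question]


score = execution_accuracy(conn, examples, toy_agent)
print(f"Execution accuracy: {score:.2f}")
```

Comparing result rows rather than SQL strings means a differently worded but equivalent query still counts as correct, while a query that fails to execute counts as a miss.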
