Albert Villanova del Moral
6d0e4e49fc
Update benchmark with Hub datasets ( #412 )
...
* Use smolagents-benchmark Hub datasets
* Push results to Hub
* Fix style
* Add Constants section at the top
* Set DATE as constant
2025-01-30 19:21:32 +01:00
Lucain
6e1373a324
Add linter rules + apply make style ( #255 )
...
* Add linter rules + apply make style
2025-01-18 19:01:15 +01:00
Lucain
1f8fd72acb
Clean makefile, pyproject.toml and CI ( #229 )
...
* Clean makefile / pyproject.toml / .github
* new tests after
* add back sqlalchemy
* disable docs tests in CI
* continue on error
* correct continue on error
* Remove all_docs test
2025-01-17 13:18:06 +01:00
Albert Villanova del Moral
2a69f1574e
Fix vanilla model answer in example benchmark ( #219 )
2025-01-16 12:04:41 +01:00
Albert Villanova del Moral
2ae590edf4
Rename the benchmark dataset split from train to test ( #216 )
2025-01-16 09:57:00 +01:00
Albert Villanova del Moral
4449c51cad
Align data types in example benchmark ( #205 )
2025-01-16 09:54:47 +01:00
Aymeric Roucher
5f32373551
Make default tools more robust ( #186 )
2025-01-14 14:57:11 +01:00
Aymeric Roucher
c611dfc7e5
Clean local python interpreter: propagate imports ( #175 )
2025-01-13 17:23:03 +01:00
Aymeric Roucher
fec65e154a
More flexible verbosity level ( #150 )
2025-01-10 23:46:22 +01:00
Aymeric Roucher
cf04285cc1
Enable smolagent tools in Hugging Chat! 🚀 ( #132 )
...
* Add correct signature, api_name, and description
2025-01-09 15:15:06 +01:00
Aymeric Roucher
e1414f6653
Remove dependency on LiteLLM ( #126 )
2025-01-08 22:57:55 +01:00
Aymeric Roucher
d3cd0f9e09
Auto correct wrong assignments to final_answer ( #123 )
...
* Auto correct wrong assignments to final_answer
2025-01-08 19:04:11 +01:00
Aymeric
c22fedaee1
Sort imports and add test workflows
2025-01-06 21:48:15 +01:00
Aymeric
e9119c9df5
Replace max_iteration with max_steps for consistency
2025-01-06 17:25:45 +01:00
Aymeric
3b600dbfb8
Clean benchmark further
2024-12-31 18:39:59 +01:00
Aymeric
8646697c73
Clean benchmark
2024-12-31 18:30:11 +01:00
Aymeric
29585e801c
Add benchmark
2024-12-31 15:40:19 +01:00