Quentin Gallouédec 157cd63204
Align RLOO with GRPO (#4706)
9 hours ago
| Name | Last commit | Updated |
| --- | --- | --- |
| .github | Release: v0.26 (#4649) | 1 week ago |
| assets | Add logos as assets (#4627) | 1 week ago |
| docker | Fix Docker images for Liger (#4522) | 1 month ago |
| docs/source | BrowserGym example for LLMs (no vision) (#4696) | 14 hours ago |
| examples | BrowserGym example for LLMs (no vision) (#4696) | 14 hours ago |
| scripts | Include `generation_config` for tiny model uploads (#4643) | 1 day ago |
| tests | Preserve truncated tokens in BFD packing (#4632) | 1 day ago |
| trl | Align RLOO with GRPO (#4706) | 9 hours ago |
| .gitignore | Release: v0.16 (#3137) | 9 months ago |
| .pre-commit-config.yaml | 🐍 Drop Python 3.9 (#4183) | 1 month ago |
| CITATION.cff | Release: v0.26 (#4649) | 1 week ago |
| CODE_OF_CONDUCT.md | Add issue/PR templates, code of conduct & better contributing guide (#1963) | 1 year ago |
| CONTRIBUTING.md | Raise warnings at 2nd stack level (#4621) | 1 week ago |
| LICENSE | 📜 Fix license and copyrights (#3264) | 8 months ago |
| MANIFEST.in | Replace setup with pyproject and fix packaging unintended modules (#4194) | 2 months ago |
| Makefile | Align make test_experimental with make test (#4371) | 1 month ago |
| README.md | Fix README style (#4619) | 1 week ago |
| RELEASE.md | 🧱 PyPI publishing workflow (#3976) | 3 months ago |
| VERSION | ⬆️ Bump dev version (#4650) | 1 week ago |
| pyproject.toml | 🕵️‍♂️ GRPO: Agent training (#4300) | 1 week ago |
| requirements.txt | Update transformers minimum version to 4.56.1 (#4047) | 3 months ago |