slime
Repository: slime
Author: THUDM · Source status: Clear source
slime is an LLM post-training framework for RL Scaling.
Score basis:Clear source · Risk needs review · Universal
Type
org
Skills
8
Claimed
No
Verified
No
Review boundary
Author information helps you judge provenance; different skills from the same author may still have different source states, so review each one before install.
Published skills
Inspect each skill on the cards before install.
Repository: AgentBench
Author: THUDM · Source status: Clear source
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Score basis:Clear source · Risk needs review · Universal
Repository: LongWriter
Author: THUDM · Source status: Clear source
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Score basis:Clear source · Risk needs review · Universal
Repository: WebRL
Author: THUDM · Source status: Clear source
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
Score basis:Clear source · Risk needs review · Universal
Repository: LongAlign
Author: THUDM · Source status: Clear source
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Score basis:Clear source · Risk needs review · Universal
Repository: DataSciBench
Author: THUDM · Source status: Clear source
DataSciBench: An LLM Agent Benchmark for Data Science
Score basis:Clear source · Risk needs review · Universal
Repository: CaRR
Author: THUDM · Source status: Clear source
This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
Score basis:Clear source · Risk needs review · Universal