sqqiao

Follow

sq_qiao sqqiao

Follow

Popular repositories Loading

RiC_copy RiC_copy Public

Forked from YangRui2015/RiC

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python
trl trl Public

Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python