Remove config_overrides from hf_ppo_lm to make it similar to GRPO #68

Open
@abaheti95

Description

Currently, all the PPO-specific params are passed to hf_ppo_lm via config_overrides, which makes adding new config variables a bit tricky.
We should make these top-level params instead, similar to how hf_critic_free_lm does it in #51.
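
To illustrate the proposed change, here is a minimal sketch of the two config shapes. Every name in it (`PPOConfigNested`, `PPOConfigFlat`, `flatten_overrides`, `clip_range`, `vf_coef`, and the default values) is hypothetical and not taken from this repo or from #51; it only shows the general idea of replacing an untyped overrides dict with typed top-level fields.

```python
from dataclasses import dataclass, field, fields
from typing import Any, Dict


@dataclass
class PPOConfigNested:
    """Hypothetical 'before' shape: PPO params hidden inside a dict."""
    model_name: str
    config_overrides: Dict[str, Any] = field(default_factory=dict)


@dataclass
class PPOConfigFlat:
    """Hypothetical 'after' shape: each PPO param is a top-level field."""
    model_name: str
    clip_range: float = 0.2  # illustrative default, not from the repo
    vf_coef: float = 0.5     # illustrative default, not from the repo


def flatten_overrides(old: PPOConfigNested) -> PPOConfigFlat:
    """Promote entries of config_overrides to top-level fields.

    Unknown keys raise instead of being silently ignored, so a typo
    in a param name is caught when the config is built.
    """
    known = {f.name for f in fields(PPOConfigFlat)} - {"model_name"}
    unknown = set(old.config_overrides) - known
    if unknown:
        raise ValueError(f"unrecognized PPO params: {sorted(unknown)}")
    return PPOConfigFlat(model_name=old.model_name, **old.config_overrides)
```

With a flat shape like this, adding a new config variable would mean adding one field with a default, rather than threading a new key through the overrides dict.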
