[BUG]: ValueError: The global train batch size (1 x 1) must be evenly divisible by the number of generations per prompt (8). Given the current train batch size, the valid values for the number of generations are: []. #21

@franztao

Description of the bug

[WARNING|logging.py:329] 2025-03-14 16:34:32,585 >> Sliding Window Attention is enabled but not implemented for sdpa; unexpected results may be encountered.
Traceback (most recent call last):
  File "/home/hengtao/miniconda3/envs/openmanus-rl/lib/python3.10/runpy.py", line 196, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/home/hengtao/miniconda3/envs/openmanus-rl/lib/python3.10/runpy.py", line 86, in _run_code
    exec(code, run_globals)
  File "/home/hengtao/code/OpenManus-RL/openmanus_rl/grpo.py", line 275, in <module>
    main(script_args, training_args, model_args)
  File "/home/hengtao/code/OpenManus-RL/openmanus_rl/grpo.py", line 233, in main
    trainer = GRPOTrainer(
  File "/home/hengtao/miniconda3/envs/openmanus-rl/lib/python3.10/site-packages/trl/trainer/grpo_trainer.py", line 418, in __init__
    raise ValueError(
ValueError: The global train batch size (1 x 1) must be evenly divisible by the number of generations per prompt (8). Given the current train batch size, the valid values for the number of generations are: [].
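
For anyone hitting the same error: going only by the message above, the check seems to compare the global train batch size (number of processes x per-device train batch size, here 1 x 1) against the number of generations per prompt (8). Below is a minimal sketch of that arithmetic. The function names are hypothetical and this is not the actual TRL code; the assumption that valid values start at 2 is inferred from the empty list reported for a batch size of 1.

```python
def valid_num_generations(num_processes: int, per_device_batch_size: int) -> list[int]:
    """Hypothetical helper: list generation counts that evenly divide the global batch.

    Assumption: at least 2 generations per prompt are needed, which would
    explain why a global batch size of 1 yields an empty list in the error.
    """
    global_batch_size = num_processes * per_device_batch_size
    return [n for n in range(2, global_batch_size + 1) if global_batch_size % n == 0]


def check_num_generations(num_processes: int, per_device_batch_size: int, num_generations: int) -> None:
    """Hypothetical re-creation of the validation that raised the ValueError."""
    valid = valid_num_generations(num_processes, per_device_batch_size)
    if num_generations not in valid:
        raise ValueError(
            f"The global train batch size ({num_processes} x {per_device_batch_size}) "
            f"must be evenly divisible by the number of generations per prompt "
            f"({num_generations}). Valid values: {valid}."
        )


# The configuration from this report: 1 process, per-device batch size 1.
print(valid_num_generations(1, 1))
# One process with per-device batch size 8, or 2 processes with batch size 4.
print(valid_num_generations(1, 8))
print(valid_num_generations(2, 4))
```

Under these assumptions, a run with 1 process and a per-device batch size of 1 has no valid setting at all, so the likely way out is either raising `per_device_train_batch_size` (or the number of processes) until the product is a multiple of 8, or lowering `num_generations` to a divisor of the global batch size.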

Steps To Reproduce

pls

Additional Information

No response
