RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0 #42

Mrkkew · 2025-02-19T06:32:33Z

开始运行之后，报如下的错误：
[rank3]: return inner_training_loop(
[rank3]: File "/home/jovyan/work/tanzichang/miniconda/envs/vl_vllm/lib/python3.10/site-packages/transformers/trainer.py", line 2531, in _inner_training_loop
[rank3]: tr_loss_step = self.training_step(model, inputs, num_items_in_batch)
[rank3]: File "/home/jovyan/work/tanzichang/miniconda/envs/vl_vllm/lib/python3.10/site-packages/transformers/trainer.py", line 3675, in training_step
[rank3]: loss = self.compute_loss(model, inputs, num_items_in_batch=num_items_in_batch)
[rank3]: File "/home/jovyan/work/tanzichang/miniconda/envs/vl_vllm/lib/python3.10/site-packages/trl/trainer/grpo_trainer.py", line 495, in compute_loss
[rank3]: rewards_per_func[:, i] = torch.tensor(output_reward_func, dtype=torch.float32, device=device)
[rank3]: RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0. Target sizes: [10]. Tensor sizes: [11]
请问是怎么回事？

anine09 · 2025-02-25T14:38:35Z

Hi @Mrkkew ，你运行的是哪个版本，是 DeepSpeed 版还是 Unsloth 版

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0 #42

RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0 #42

Mrkkew commented Feb 19, 2025

anine09 commented Feb 25, 2025

RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0 #42

RuntimeError: The expanded size of the tensor (10) must match the existing size (11) at non-singleton dimension 0 #42

Comments

Mrkkew commented Feb 19, 2025

anine09 commented Feb 25, 2025