verl-project/verl

[Bug] workflow `e2e_prime` is sometimes stuck

Open

#1344 opened on Apr 30, 2025

Labels: bug, call for contribution, good first issue

Description

Motivation

The `e2e_prime` workflow intermittently hangs until it hits the timeout, while other runs of the same workflow finish normally. See https://github.com/volcengine/verl/actions/workflows/e2e_prime.yml for details.

Plan

  • Reproduce the issue locally, using the same container and commands as the workflow.
  • Identify the root cause of the hang.
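
Since the hang is intermittent, a single local run may not trigger it. One possible way to approach the reproduction step is to run the workflow's command repeatedly under a timeout and count how often it hangs. The sketch below is only an illustration: `run_with_timeout` and `measure_hang_rate` are hypothetical helper names, and the command shown is a harmless stand-in, not the actual `e2e_prime` command (take that from the workflow file).

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run cmd once and classify the outcome as 'ok', 'failed', or 'timeout'."""
    try:
        result = subprocess.run(
            cmd,
            timeout=timeout_s,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child process before raising this.
        return "timeout"
    return "ok" if result.returncode == 0 else "failed"


def measure_hang_rate(cmd, attempts, timeout_s):
    """Run cmd `attempts` times and count each kind of outcome."""
    counts = {"ok": 0, "failed": 0, "timeout": 0}
    for _ in range(attempts):
        counts[run_with_timeout(cmd, timeout_s)] += 1
    return counts


if __name__ == "__main__":
    # Stand-in command (assumption): replace with the command from the workflow.
    stand_in_cmd = [sys.executable, "-c", "pass"]
    print(measure_hang_rate(stand_in_cmd, attempts=3, timeout_s=10))
```

Once a hanging run can be triggered reliably, the standard-library `faulthandler.dump_traceback_later` function, called early in the training script, may help with the second step by printing the Python stack of every thread after a chosen delay.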
