verl-project/verl

Fused CE loss integration

Open

#97 opened on Jan 12, 2025

View on GitHub
 (3 comments) (1 reaction) (0 assignees)Python (3,940 forks)auto 404
call for contributionhelp wanted

Repository metrics

Stars
 (21,533 stars)
PR merge metrics
 (Avg merge 5d) (146 merged PRs in 30d)

Description

Integrate it with main stream models: https://github.com/apple/ml-cross-entropy so that model with large vocab size uses much less memory

Contributor guide