Pull requests: NovaSky-AI/SkyRL
Pull requests list
Fix GPU assignment for Slurm-launched Ray clusters
#1592
opened Apr 29, 2026 by
agolajko
Contributor
Fix AssertionError during eval when val set size is not divisible by train_batch_size
#1589
opened Apr 29, 2026 by
rishithayenumula
[feat] Multi-LoRA serving for RemoteInferenceClient
#1579
opened Apr 28, 2026 by
hao-aaron
Collaborator
[train] Fix rollout metrics for step-wise and custom generators (sync / fully async)
#1556
opened Apr 22, 2026 by
CharlieFRuan
Member
Draft
1 of 3 tasks
[WIP] Add changes needed for FP8 megatron training
#1543
opened Apr 21, 2026 by
pcmoritz
Collaborator
[train][step-wise] Three correctness/efficiency fixes for step-wise training
#1539
opened Apr 20, 2026 by
CharlieFRuan
Member
4 tasks done
[skyrl] Preserve staged forward_backward loss_fn_outputs across DP ranks
#1534
opened Apr 19, 2026 by
taivu1998
Modify SkyRL Generator to Append Router Indices in Multi-Turn
#1530
opened Apr 17, 2026 by
devpatelio
Collaborator
Add FoldGRPO advantage estimator and process_rewards pipeline
#1514
opened Apr 15, 2026 by
sumi-fleet-hub
SFT loss aggregation consistent with RL path
#1513
opened Apr 15, 2026 by
agolajko
Contributor
fix(docker): optimize Dockerfile.megatron to reduce image size by 1.36 GB
run_train_megatron_gpu_ci
#1499
opened Apr 11, 2026 by
dinhxuanvu
feat: add max_tokens_per_microbatch config for token-based micro-batching
#1477
opened Apr 8, 2026 by
erictang000
Collaborator
feat: native Atropos-SHM integration and modular ingestion layer
#1473
opened Apr 7, 2026 by
RUFFY-369
[train] Enable expandable_segments to reduce GPU memory fragmentation
run_train_gpu_ci
#1470
opened Apr 7, 2026 by
CharlieFRuan
Member
Draft
5 tasks done
[tinker] Support prompt_logprobs in SkyRLTrainBackend sample() path
#1461
opened Apr 6, 2026 by
pbokc
Contributor
[tinker] Support KL loss in SkyRLTrainBackend
#1460
opened Apr 5, 2026 by
pbokc
Contributor
feat: LLM-synthesized hints for failed trajectories
#1456
opened Apr 4, 2026 by
dzorlu
4 tasks
[skyrl-train] feat: add native GMPO policy loss with validation and tests
#1449
opened Apr 2, 2026 by
taivu1998
Fix event-loop blocking in one-step-off async save/export paths
#1446
opened Apr 2, 2026 by
taivu1998
Change default KL estimator from k3 to k2 for loss-based KL
#1445
opened Apr 2, 2026 by
taivu1998
[SkyRLGymGenerator] Respect remaining context budget in agent_loop
#1441
opened Apr 2, 2026 by
taivu1998