NovaSky-AI / SkyRL Public

Notifications You must be signed in to change notification settings
Fork 311
Star 1.8k

Code
Issues 186
Pull requests 125
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: NovaSky-AI/SkyRL

Labels 21 Milestones 0

New pull request New

125 Open 1,126 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix GPU assignment for Slurm-launched Ray clusters

#1592 opened Apr 29, 2026 by agolajko Contributor

Loading…

Fix AssertionError during eval when val set size is not divisible by train_batch_size

#1589 opened Apr 29, 2026 by rishithayenumula

Loading…

[feat] Multi-LoRA serving for RemoteInferenceClient

#1579 opened Apr 28, 2026 by hao-aaron Collaborator

Loading…

[models] add gemma4 training script

#1576 opened Apr 27, 2026 by erictang000 Collaborator • Draft

Widen transformers for v5.6 and vllm==0.19.1

#1561 opened Apr 22, 2026 by jamesbraza

Loading…

[train] Fix rollout metrics for step-wise and custom generators (sync / fully async)

#1556 opened Apr 22, 2026 by CharlieFRuan Member • Draft

1 of 3 tasks

[WIP] Add changes needed for FP8 megatron training

#1543 opened Apr 21, 2026 by pcmoritz Collaborator

Loading…

[train][step-wise] Three correctness/efficiency fixes for step-wise training

#1539 opened Apr 20, 2026 by CharlieFRuan Member

Loading…

4 tasks done

[skyrl] Preserve staged forward_backward loss_fn_outputs across DP ranks

#1534 opened Apr 19, 2026 by taivu1998

Loading…

Modify SkyRL Generator to Append Router Indices in Multi-Turn

#1530 opened Apr 17, 2026 by devpatelio Collaborator

Loading…

Add FoldGRPO advantage estimator and process_rewards pipeline

#1514 opened Apr 15, 2026 by sumi-fleet-hub

Loading…

SFT loss aggregation consistent with RL path

#1513 opened Apr 15, 2026 by agolajko Contributor

Loading…

fix(docker): optimize Dockerfile.megatron to reduce image size by 1.36 GB run_train_megatron_gpu_ci

#1499 opened Apr 11, 2026 by dinhxuanvu

Loading…

Add ppo as alias for dual_clip policy loss type

#1481 opened Apr 8, 2026 by j316chuck • Draft

feat: add max_tokens_per_microbatch config for token-based micro-batching

#1477 opened Apr 8, 2026 by erictang000 Collaborator

Loading…

feat: native Atropos-SHM integration and modular ingestion layer

#1473 opened Apr 7, 2026 by RUFFY-369

Loading…

[train] Enable expandable_segments to reduce GPU memory fragmentation run_train_gpu_ci

#1470 opened Apr 7, 2026 by CharlieFRuan Member • Draft

5 tasks done

[tinker] Support prompt_logprobs in SkyRLTrainBackend sample() path

#1461 opened Apr 6, 2026 by pbokc Contributor

Loading…

[tinker] Support KL loss in SkyRLTrainBackend

#1460 opened Apr 5, 2026 by pbokc Contributor

Loading…

feat: LLM-synthesized hints for failed trajectories

#1456 opened Apr 4, 2026 by dzorlu

Loading…

4 tasks

[skyrl-train] feat: add native GMPO policy loss with validation and tests

#1449 opened Apr 2, 2026 by taivu1998

Loading…

Fix event-loop blocking in one-step-off async save/export paths

#1446 opened Apr 2, 2026 by taivu1998

Loading…

Change default KL estimator from k3 to k2 for loss-based KL

#1445 opened Apr 2, 2026 by taivu1998

Loading…

[SkyRLGymGenerator] Respect remaining context budget in agent_loop

#1441 opened Apr 2, 2026 by taivu1998

Loading…

[skyrl-train] Flip grpo_norm_by_std default to false

#1443 opened Apr 2, 2026 by taivu1998

Loading…

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!