Pull requests: beehive-lab/GPULlama3.java
Add prefill-decode and batch-prefill-decode for Qwen3 (FP16 and Q8_0)
Labels: enhancement (New feature or request), prefill-decode
#122 opened Jun 11, 2026 by orionpapadakis (Collaborator)
Gemma 4 model support (CPU + GPU/TornadoVM, BF16 and Q8_0)
#120 opened Jun 7, 2026 by mikepapadim (Member)
3 tasks done