Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

KV handoff with buffer slicing APIs to avoid KV I/O copies
#1087 opened Jun 16, 2026 by quic-akuruvil Contributor Loading…
Added MDP generation to QEff Compile
#1086 opened Jun 16, 2026 by quic-mohmeh Loading…
diffusers and peft package upgrade
#1085 opened Jun 16, 2026 by quic-amitraj Contributor Loading…
feat(0616): Gate ONNX pass disablement to layerwise export 1.22 Release 1.22 candidate enhancement New feature or request
#1084 opened Jun 16, 2026 by vbaddi Contributor Loading…
Fix blocking transform config lookup for wrappers[qwen3vl]
#1083 opened Jun 15, 2026 by quic-amitraj Contributor Loading…
Fix DeepSeekV3 transformers compatibility
#1078 opened Jun 14, 2026 by sudheepm-wq Contributor Loading…
ci(0612): fast per-PR pipeline, xdist 4-card sharding + tiny model lane
#1075 opened Jun 12, 2026 by vbaddi Contributor Loading…
examples: add qwen3.5-moe layerwise NPI YAML + wired decode example
#1074 opened Jun 12, 2026 by anujgupt-github Contributor Loading…
nit(0612): Refine production cleanup for PR 1029 1.22 Release 1.22 candidate enhancement New feature or request
#1073 opened Jun 12, 2026 by vbaddi Contributor Loading…
Add YAML-aware from_pretrained scaling + runtime transform wiring
#1072 opened Jun 11, 2026 by anujgupt-github Contributor Loading…
Adding unit test and ci tests for gemma4
#1071 opened Jun 11, 2026 by tchawada Contributor Draft
Adding vision and text npi files for E2B, E4B and 31B model
#1068 opened Jun 11, 2026 by tchawada Contributor Loading…
Feature/add deepseek v4
#1058 opened Jun 9, 2026 by shagsood Draft
Feature/add glm moe dsa
#1057 opened Jun 9, 2026 by shagsood Draft
Add onnx-ir dependency to pyproject 1.22 Release 1.22 candidate
#1054 opened Jun 9, 2026 by quic-amitraj Contributor Loading…
Reduce whole-model ONNX export memory 1.22 Release 1.22 candidate enhancement New feature or request
#1052 opened Jun 8, 2026 by anujgupt-github Contributor Loading…
Reduce ONNX export memory with external initializers enhancement New feature or request
#1050 opened Jun 8, 2026 by anujgupt-github Contributor Loading…
Reranker & Embedding: single-QPC support with KV cache eliminated 1.22 Release 1.22 candidate
#1045 opened Jun 5, 2026 by quic-amitraj Contributor Loading…
KV handoff with DMA slicing APIs to avoid KV input/output copies.
#1039 opened Jun 4, 2026 by quic-akuruvil Contributor Loading…
ProTip! Updated in the last three days: updated:>2026-06-12.