tensorbit

Here are 2 public repositories matching this topic...

Tensorbit-Labs / tensorbit-core

High-performance C++ engine for Second-Order Hessian Pruning. The surgical foundation of the Tensorbit Labs P-D-Q pipeline for ultra-efficient LLM and Vision Transformers edge inference.

sparsity cpp inference-engine model-compression edge-ai llm llm-optimization llm-infrastructure npu-optimization hessian-pruning tensorbit

Updated May 4, 2026
C++

Tensorbit-Labs / tensorbit-models

Star

Official library of pre-optimized Tensorbit models. Ready-to-deploy LLMs and Vision Transformers for edge hardware, optimized via the Tensorbit P-D-Q pipeline.

model-zoo on-device-ai llm-library edge-ai-models quantized-models tensorbit pre-optimized-models

Updated May 1, 2026

Improve this page

Add a description, image, and links to the tensorbit topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tensorbit topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly