This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.
-
Updated
May 20, 2024 - C++
This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.
🌱 a tiny distro-independent package manager
Ultra-Sparse Adaptation of 1-Bit LLMs via XOR Patches
OxiBonsai is a zero-FFI, zero-C/C++ inference engine for PrismML's sub-2-bit Bonsai family — both the 1-bit line (Q1_0_g128) and the ternary line (TQ2_0_g128). It runs on CPU (SIMD), Apple Silicon (Metal), and NVIDIA (CUDA) without depending on llama.cpp, BLAS, or any C/Fortran runtime.
Read manga from the comfort of your terminal
73 deterministic Claude AI skills for Blender, Bonsai, IfcOpenShell and Sverchok. AEC Python development skill package
An Avanade/Accenture open collaborative tutorial to distribute knowledge and capability regarding the use of Azure IoT and Azure Digital Twins with Simulation to train Microsoft Bonsai AI to solve manufacturing business problems.
A jekyll theme for semantically inclined digital gardeners.
Bonsai's Slate Developer Documentation
Code for Optimized Arrhythmia Detection on Ultra-Edge Devices
HIP/ROCm fork optimized for AMD RDNA2 (gfx1030) with PrismML Q1_0_G128 1-bit quant support, RotorQuant, TurboQuant, EAGLE3 and P-EAGLE speculative decoding, and full Wave32 kernel optimizations.
Stream real time Tweets of current affairs like covid-19 using Kafka 2.0.0 high throughput producer & consumer into Elasticsearch using safe, idempotent and compression configurations. Aggregate the data and use it for further analytics.
🌿 Prune your AI agent's context window. Reduce token usage by 70-95% with hierarchical memory. Replace flat MEMORY.md with a bonsai-shaped domain tree. Progressive disclosure, zero dependencies. Works with OpenClaw and any LLM agent framework.
AMD ROCm (gfx1030) inference fork with RotorQuant/TurboQuant KV compression, PHANTOM-X zero-copy draft speculation, EAGLE3 speculative decoding, 12 RDNA2 crash fixes, and PrismML Bonsai Q1_0_G128 1-bit GGUF support.
The official repository for Bonsite - The go to website for all your bonsai needs!
Real Estate Appraisal Simulator
Samples and documentation for using VP Link with Microsoft Bonsai
Optimize AI agents' context by pruning and structuring memory hierarchies to reduce token use and improve efficiency across LLM frameworks.
CLI for running prism-ml's 1-bit Bonsai models locally. Auto-manages llama.cpp server, downloads from HuggingFace, exposes OpenAI-compatible API.
Add a description, image, and links to the bonsai topic page so that developers can more easily learn about it.
To associate your repository with the bonsai topic, visit your repo's landing page and select "manage topics."