Deliverables
- A pull request (PR) enabling the common quantisation patterns described below.
- A pull request adding quantisation examples.
- A pull request deploying these patterns to the Gemmini backend to test accuracy, with a preliminary static quantisation script for different models.
Task Description
- Extend Quantisation Patterns
- Extend the quantisation framework under the buddy-mlir frontend to support more common quantisation schemes (per-channel/per-tensor/per-block, and weight & activation). There is a weight_only_channel_wise pattern for your reference. Only static, post-training INT8 quantisation is required.
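As a rough illustration of the difference between these granularities, the following is a minimal NumPy sketch of symmetric, static INT8 quantisation at per-tensor and per-channel granularity. The function names and the symmetric scheme are assumptions for illustration; they are not the buddy-mlir API.

```python
import numpy as np

def quantise_per_tensor(x):
    # Symmetric INT8: one scale shared by the whole tensor.
    scale = np.max(np.abs(x)) / 127.0
    q = np.clip(np.round(x / scale), -128, 127).astype(np.int8)
    return q, scale

def quantise_per_channel(w, axis=0):
    # Symmetric INT8: one scale per channel along `axis`
    # (reduce over all the other axes).
    reduce_axes = tuple(i for i in range(w.ndim) if i != axis)
    scale = np.max(np.abs(w), axis=reduce_axes, keepdims=True) / 127.0
    q = np.clip(np.round(w / scale), -128, 127).astype(np.int8)
    return q, scale
```

Per-block quantisation follows the same idea, with one scale per fixed-size block of the tensor; finer granularity trades extra scale storage for lower quantisation error.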
- Add Relevant Examples
- Add relevant examples for quantisation under examples as BuddyQuant. Feeding in basic operators such as MatmulOp should generate MLIR with quantisation for the corresponding data formats.
- Deploy to Gemmini
- Run the Gemmini E2E deployment on FPGA to become familiar with the workflow.
- Owing to the hardware characteristics of Gemmini, we need to apply INT8 quantisation to both the weight and activation operands of the matrix multiplication, whilst employing dequantisation for the remaining parts.
- This step requires separate quantisation tailored to the characteristics of different models.
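The numerics of that weight-and-activation scheme can be sketched as follows: quantise both matmul operands to INT8, accumulate in INT32 (as systolic-array accelerators like Gemmini do), then dequantise the result for the surrounding floating-point parts of the model. The function name and the per-tensor/per-channel scale choices are illustrative assumptions, not the actual deployment script.

```python
import numpy as np

def int8_matmul_dequant(a, w):
    # Quantise activations per-tensor and weights per-(output-)channel
    # to symmetric INT8 (illustrative choice of granularity).
    a_scale = np.max(np.abs(a)) / 127.0
    w_scale = np.max(np.abs(w), axis=0, keepdims=True) / 127.0
    qa = np.clip(np.round(a / a_scale), -128, 127).astype(np.int8)
    qw = np.clip(np.round(w / w_scale), -128, 127).astype(np.int8)
    # Integer matmul with an INT32 accumulator.
    acc = qa.astype(np.int32) @ qw.astype(np.int32)
    # Dequantise back to float for the non-quantised parts of the model.
    return acc.astype(np.float32) * a_scale * w_scale
```

Checking the output of this sketch against a plain float matmul is a simple way to sanity-check accuracy before running the full Gemmini E2E flow.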
Timeline
| Phase | Time |
| --- | --- |
| Coding Phase | Feb 28, 2026 – March 31, 2026 |
| Code Review | Begins March 31, 2026 |
If finished ahead of schedule, the review process may begin earlier.
Notes
The PR deployed to Gemmini should be submitted to the buddy-examples repository.