[quantization] Process `past_key_values` by stamalakhov · Pull Request #573 · Samsung/TICO

stamalakhov · 2026-03-23T14:40:15Z

This PR processes past_key_values in QuantLlamaModel if use_cache was set.

Draft: #570
TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

This PR processes `past_key_values` in QuantLlamaModel if `use_cache` was set. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov · 2026-03-23T14:47:38Z

@mhs4670go
Should tests for decode mode be provided?

stamalakhov · 2026-03-23T15:36:59Z

Let it be tested in decode mode in the tico/quantization/wrapq/examples/quantize_full_qmodel_with_gptq.py first.

[quantization] Process past_key_values

06d8e79

This PR processes `past_key_values` in QuantLlamaModel if `use_cache` was set. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov self-assigned this Mar 23, 2026

stamalakhov requested review from a team and mhs4670go March 23, 2026 14:40

stamalakhov removed the request for review from a team March 23, 2026 14:47

stamalakhov closed this Mar 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quantization] Process `past_key_values`#573

[quantization] Process `past_key_values`#573
stamalakhov wants to merge 1 commit intoSamsung:mainfrom
stamalakhov:quant_cache_model

stamalakhov commented Mar 23, 2026

Uh oh!

stamalakhov commented Mar 23, 2026

Uh oh!

stamalakhov commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

stamalakhov commented Mar 23, 2026

Uh oh!

stamalakhov commented Mar 23, 2026

Uh oh!

stamalakhov commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant