Summary
Gemma 4 (model_type: gemma4) was introduced in transformers >= 5.5.0, but llm-compressor currently pins transformers>=4.56.1,<=4.57.6 in setup.py. This means Gemma 4 models cannot be loaded or quantized without manually overriding the transformers version.
Steps to reproduce
from transformers import AutoModelForImageTextToText
model = AutoModelForImageTextToText.from_pretrained("google/gemma-4-E4B-it", dtype="auto")
With the pinned transformers version, this fails because gemma4 is not a recognized model type.
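Until the pin is lifted, a fail-fast version guard makes the failure mode explicit before the model load is attempted. This is a sketch; the 5.5.0 threshold is taken from this report, and in practice you would pass `importlib.metadata.version("transformers")` to the check.

```python
# Sketch: detect whether the installed transformers predates the
# gemma4 model_type (introduced in 5.5.0 per this report).

def supports_gemma4(transformers_version: str) -> bool:
    """True if the version string is numerically >= 5.5.0."""
    major, minor = (int(p) for p in transformers_version.split(".")[:2])
    return (major, minor) >= (5, 5)

# Under the current pin (<=4.57.6), the check fails:
print(supports_gemma4("4.57.6"))  # → False
print(supports_gemma4("5.5.0"))   # → True
```

Comparing integer tuples rather than raw strings avoids the classic pitfall where `"5.10" < "5.5"` lexicographically.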
Current workaround
Install llm-compressor with --no-deps and force-install transformers from git main. A Dockerfile demonstrating this workaround is included in PR #2561.
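The workaround can be sketched as the following two commands. The package name and git URL are the standard ones, but verify the exact invocation against the Dockerfile in PR #2561; note that --no-deps also skips llm-compressor's other dependencies, which must then be installed separately.

```shell
# Install llm-compressor without its pinned dependencies,
# then force transformers from git main (which has gemma4 support).
pip install --no-deps llmcompressor
pip install git+https://github.com/huggingface/transformers.git
```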
Suggested fix
Bump the transformers upper bound in setup.py to allow transformers >= 5.5.0 once compatibility is verified across the codebase.
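A sketch of what the relaxed pin might look like in setup.py; the exact bounds (in particular the <6.0 cap) are assumptions to be settled by compatibility testing, and the elided fields stand in for the rest of the existing setup() call.

```python
# setup.py (fragment, sketch): relax the upper bound so transformers 5.5+
# is installable. Bounds shown here are illustrative, not verified.
from setuptools import setup

setup(
    # ... existing metadata unchanged ...
    install_requires=[
        "transformers>=4.56.1,<6.0",  # was: transformers>=4.56.1,<=4.57.6
        # ... other dependencies unchanged ...
    ],
)
```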
Related