Task `image-text-to-text` fails with `AttributeError: 'str' object has no attribute 'pad_token_id'` #135

When using the task type `image-text-to-text`, the [`tokenizer` is set to `image_url`](https://github.com/aws/sagemaker-huggingface-inference-toolkit/blob/5a7519da0ba37895d9e07712124e820c62ec4e56/src/sagemaker_huggingface_inference_toolkit/transformers_utils.py#L272), resulting in `pipeline` being called with `tokenizer` as a string. This causes an [error within `transformers`](https://github.com/huggingface/transformers/issues/36731).

I'm unsure if this task should instead set `feature_extractor`, or just leave `tokenizer` as `None`.
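The failure mode described above can be illustrated without `transformers` at all. The following is only a sketch: `FakeTokenizer` and `read_pad_token_id` are hypothetical stand-ins, not code from `transformers` or the toolkit, and the string passed in is just a placeholder. It shows why code that expects a tokenizer object raises this `AttributeError` when handed a plain string.

```python
class FakeTokenizer:
    """Hypothetical stand-in for a loaded tokenizer object."""

    pad_token_id = 0


def read_pad_token_id(tokenizer):
    # Stand-in for any code path that assumes `tokenizer` is an object
    # exposing `pad_token_id`, rather than a string.
    return tokenizer.pad_token_id


def try_read(tokenizer):
    # Return the pad token id, or the error message if the lookup fails.
    try:
        return read_pad_token_id(tokenizer)
    except AttributeError as exc:
        return str(exc)


ok = try_read(FakeTokenizer())        # a real object works
err = try_read("placeholder-string")  # a string has no pad_token_id
print(ok)
print(err)
```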

## Suggested fix

1. Instead of manually determining which tasks require `feature_extractor` or `tokenizer`, would it be possible to iterate over the full list of tasks supported by `transformers` and derive the correct argument from each pipeline's class structure? This would make the code much more future-proof as `transformers` evolves.

2. Add an environment variable to set the tokenizer, so that if a similar error occurs in the future, developers can apply a quick fix by overriding the value. (I am using a pre-built Docker container, so I can't patch this myself without a lot of additional work; an environment variable would be ideal in this situation.)
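To make the second suggestion concrete, here is a minimal sketch of how such an override might look. Everything in it is an assumption for illustration: the variable name `HF_TOKENIZER_OVERRIDE`, the helper `apply_tokenizer_override`, and the convention that the value `none` removes the `tokenizer` argument do not exist in the toolkit today.

```python
import os

# Hypothetical variable name; not an existing toolkit setting.
TOKENIZER_ENV_VAR = "HF_TOKENIZER_OVERRIDE"


def apply_tokenizer_override(kwargs, environ=None):
    """Return a copy of the pipeline kwargs with the override applied.

    - variable unset: kwargs are returned unchanged
    - variable is empty or "none": the `tokenizer` key is dropped
    - anything else: `tokenizer` is set to the given value
    """
    environ = os.environ if environ is None else environ
    value = environ.get(TOKENIZER_ENV_VAR)
    result = dict(kwargs)  # do not mutate the caller's dict
    if value is None:
        return result
    if value.strip().lower() in {"", "none"}:
        result.pop("tokenizer", None)
    else:
        result["tokenizer"] = value
    return result


# Usage: drop the string tokenizer for a task that should not receive one.
fixed = apply_tokenizer_override(
    {"task": "image-text-to-text", "tokenizer": "placeholder-string"},
    environ={TOKENIZER_ENV_VAR: "none"},
)
print(fixed)
```

Passing `environ` explicitly keeps the helper easy to test; in a container the default `os.environ` would be used, so the override could be set at deploy time without rebuilding the image.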
