Questions on PixelLM training data and <SEG> token

Dear authors,

Thank you for your wonderful work. I have some questions and would greatly appreciate your guidance.

1. Since PixelLM does not rely on SAM, does this make it need more training data than SAM-based approaches (e.g., LISA)?
2. In the paper demo, it looks like multiple <SEG> tokens are inserted within the model’s text response sometimes, but in the released code it seems that there is only one <SEG> token appended at the end for multi-object segmentation.

Thank you very much for your time.

Best,
Xinyan

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions on PixelLM training data and <SEG> token #36

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Questions on PixelLM training data and <SEG> token #36

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions