Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

This is a packaged version of the Mask2Former repository limited to segmentation on images. Check the original repository for more details: https://github.com/facebookresearch/Mask2Former. The repository contains two installable packages:

mask2former: The main Mask2Former package.
MultiScaleDeformableAttention: Multi-scale deformable attention used in Mask2Former.

Installation

Both packages can be installed via pip directly from the repository:

pip install git+https://github.com/spatial-intelligence-group/mask2former_package.git

Note that detectron2 used by Mask2Former requires PyTorch to be installed at build time, but does not specify so. Therefore, you have to install PyTorch before installing Mask2Former.

If your installation of CUDA toolkit is not in /usr/local/cuda, you have to set the environment variable CUDA_HOME.

Basic Usage

You can check the easy_anon mask generation script for the basic usage of the Mask2Former package.

Model Zoo and Baselines

The project provides a large set of baseline results and trained models available for download in the Mask2Former Model Zoo.

License

Shield:

The majority of Mask2Former is licensed under a MIT License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license, Deformable-DETR is licensed under the Apache-2.0 License.

Citing Mask2Former

If you use Mask2Former in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@inproceedings{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={CVPR},
  year={2022}
}

If you find the code useful, please also consider the following BibTeX entry.

@inproceedings{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={NeurIPS},
  year={2021}
}

Acknowledgement

Code is largely based on MaskFormer (https://github.com/facebookresearch/MaskFormer).

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
hatch_build.py		hatch_build.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

Installation

Basic Usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

Installation

Basic Usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages