Mask2Former
A unified Transformer-based framework that masters semantic, instance, and panoptic segmentation using localized masked attention.
Meta AI researchers (Bowen Cheng and colleagues) designed Mask2Former as a universal architecture for image segmentation. It uses a Transformer-based decoder with masked attention, a mechanism that restricts each query's cross-attention to the foreground region of its predicted mask, which speeds convergence and improves accuracy. At release it set new state-of-the-art results: 57.8 PQ on COCO panoptic segmentation and 56.1 mIoU on ADE20K semantic segmentation. This single framework can replace specialized models such as Mask R-CNN or DeepLab, handling semantic, instance, and panoptic segmentation with one pipeline.
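The core idea of masked attention can be illustrated with a minimal NumPy sketch (a simplification, not Mask2Former's actual implementation): attention scores for pixels outside a query's predicted mask are set to a large negative value before the softmax, so each query attends only within its own foreground region.

```python
import numpy as np

def masked_attention(Q, K, V, mask):
    """Single-head masked cross-attention (illustrative sketch).

    Q:    (num_queries, d)    object-query features
    K, V: (num_pixels, d)     flattened image features
    mask: (num_queries, num_pixels) boolean; True marks pixels inside
          the query's currently predicted foreground region
    """
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    # Block attention to pixels outside the predicted mask.
    scores = np.where(mask, scores, -1e9)
    # Numerically stable softmax over the pixel axis.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

rng = np.random.default_rng(0)
Q = rng.standard_normal((3, 8))    # 3 queries
K = rng.standard_normal((16, 8))   # 16 pixels
V = rng.standard_normal((16, 8))
mask = rng.random((3, 16)) > 0.5   # hypothetical predicted masks
out = masked_attention(Q, K, V, mask)
```

In the full model this mask comes from the previous decoder layer's mask predictions, so the attended region is refined layer by layer; with an all-True mask the function reduces to standard cross-attention.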