SAM2 Video Tracking · powered by Hugging Face 🤗 Transformers

Segment and track objects across a video with SAM2 (Segment Anything 2). This demo runs the official implementation from the Hugging Face Transformers library for interactive, promptable video segmentation.

Quick start

  • Load a video: Upload your own or pick an example below.
  • Checkpoint: Tiny / Small / Base+ / Large (trade speed vs. accuracy).
  • Points mode: Select an Object ID and point label (positive/negative), then click the frame to add guidance. You can add multiple points per object and define multiple objects across frames.
  • Boxes mode: Click two opposite corners to draw a box. Old inputs for that object are cleared automatically.

Working with results

  • Preview: Use the slider to navigate frames and see the current masks.
  • Propagate: Click “Propagate across video” to track all defined objects through the entire video. The preview follows progress periodically to keep things responsive.
  • Export: Render an MP4 for smooth playback using the original video FPS.
  • Note: More info on the Hugging Face 🤗 Transformers implementation of SAM2 can be found here.
SAM2.1 checkpoint
0 0
Point label
Prompt type
Examples