SAM2 Video Tracking · powered by Hugging Face 🤗 Transformers

Segment and track objects across a video with SAM2 (Segment Anything Model 2). This demo uses the official implementation in the Hugging Face Transformers library for interactive, promptable video segmentation.

Quick start

Load a video: Upload your own or pick an example below.
Checkpoint: Choose Tiny, Small, Base+, or Large. Smaller checkpoints run faster; larger ones are more accurate.
Points mode: Select an Object ID and point label (positive/negative), then click the frame to add guidance. You can add multiple points per object and define multiple objects across frames.
Boxes mode: Click two opposite corners to draw a box. Old inputs for that object are cleared automatically.
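The prompt bookkeeping described in the steps above can be sketched roughly as follows. This is an illustrative sketch, not the demo's actual code: `PromptStore`, `box_from_corners`, and the `(frame_idx, obj_id)` keying are hypothetical names chosen for the example. It also assumes that point labels are stored as 1 (positive) and 0 (negative), and that drawing a box discards every earlier prompt for that object on all frames; the demo may scope the clearing differently.

```python
from collections import defaultdict


def box_from_corners(corner_a, corner_b):
    """Normalize two opposite corner clicks into (x_min, y_min, x_max, y_max)."""
    (xa, ya), (xb, yb) = corner_a, corner_b
    return (min(xa, xb), min(ya, yb), max(xa, xb), max(ya, yb))


class PromptStore:
    """Hypothetical container for user prompts, keyed by (frame_idx, obj_id)."""

    def __init__(self):
        # (frame_idx, obj_id) -> list of (x, y, label); label 1 = positive, 0 = negative
        self.points = defaultdict(list)
        # (frame_idx, obj_id) -> (x_min, y_min, x_max, y_max)
        self.boxes = {}

    def add_point(self, frame_idx, obj_id, x, y, positive=True):
        # Several points may accumulate for the same object on the same frame.
        self.points[(frame_idx, obj_id)].append((x, y, 1 if positive else 0))

    def set_box(self, frame_idx, obj_id, corner_a, corner_b):
        # Assumption: a new box replaces all earlier prompts for this object.
        for key in [k for k in self.points if k[1] == obj_id]:
            del self.points[key]
        for key in [k for k in self.boxes if k[1] == obj_id]:
            del self.boxes[key]
        self.boxes[(frame_idx, obj_id)] = box_from_corners(corner_a, corner_b)
```

Because the corners are normalized, the two clicks can be made in any order (top-left then bottom-right, or the reverse) and yield the same box.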

Working with results

Preview: Use the slider to navigate frames and see the current masks.
Propagate: Click “Propagate across video” to track all defined objects through the entire video. The preview refreshes periodically during propagation rather than on every frame, which keeps the interface responsive.
Export: Render an MP4 at the original video's frame rate for smooth playback.
Note: More info on the Hugging Face 🤗 Transformers implementation of SAM2 can be found here.
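One way to picture the periodic preview during propagation is a loop that tracks every frame but only refreshes the preview at a fixed interval. The sketch below is an assumption about how such throttling could work, not the demo's implementation: `preview_schedule`, `propagate`, the `track_frame` and `on_preview` callbacks, and the default interval of 15 frames are all hypothetical.

```python
def preview_schedule(num_frames, preview_every=15):
    """Return the frame indices at which the preview would refresh."""
    if num_frames <= 0:
        return []
    indices = list(range(0, num_frames, preview_every))
    # Always refresh on the last frame so the preview ends on the final result.
    if indices[-1] != num_frames - 1:
        indices.append(num_frames - 1)
    return indices


def propagate(track_frame, num_frames, on_preview, preview_every=15):
    """Track every frame, but call on_preview only at the scheduled indices.

    track_frame(idx) stands in for the per-frame tracking step and
    on_preview(idx, masks) for the UI update; both are placeholders.
    """
    refresh_at = set(preview_schedule(num_frames, preview_every))
    results = []
    for idx in range(num_frames):
        masks = track_frame(idx)
        results.append(masks)
        if idx in refresh_at:
            on_preview(idx, masks)
    return results
```

Every frame is still tracked and stored; only the UI updates are skipped, so the exported video is unaffected by the preview interval.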
