Image Segmentation
PyTorch
sam2
custom-sam2
glove
baseball
sports-analytics
computer-vision
custom-model
Instructions to use caball21/glove_labelling with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sam2
How to use caball21/glove_labelling with sam2:
# Use SAM2 with images import torch from sam2.sam2_image_predictor import SAM2ImagePredictor predictor = SAM2ImagePredictor.from_pretrained(caball21/glove_labelling) with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16): predictor.set_image(<your_image>) masks, _, _ = predictor.predict(<input_prompts>)# Use SAM2 with videos import torch from sam2.sam2_video_predictor import SAM2VideoPredictor predictor = SAM2VideoPredictor.from_pretrained(caball21/glove_labelling) with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16): state = predictor.init_state(<your_video>) # add new prompts and instantly get the output on the same frame frame_idx, object_ids, masks = predictor.add_new_points(state, <your_prompts>): # propagate the prompts to get masklets throughout the video for frame_idx, object_ids, masks in predictor.propagate_in_video(state): ... - Notebooks
- Google Colab
- Kaggle
Glove Labelling Model (SAM2 fine-tuned)
This repository contains a fine-tuned SAM2 hierarchical image segmentation model adapted for high-precision baseball glove segmentation.
π‘ What it does
Given a frame from a pitching video, this model outputs per-pixel segmentations for:
glove_outlinewebbingthumbpalm_pockethandglove_exterior
Trained on individual pitch frame sequences using COCO format masks.
π Architecture
- Base Model:
SAM2Hierarchical - Framework: PyTorch
- Input shape:
[1, 3, 720, 1280]RGB frame - Output: Segmentation logits across 6 glove-related classes
π§ Usage
To use the model for inference:
import torch
from PIL import Image
import torchvision.transforms as T
model = torch.load("pytorch_model.bin", map_location="cpu")
model.eval()
transform = T.Compose([
T.Resize((720, 1280)),
T.ToTensor()
])
img = Image.open("example.jpg").convert("RGB")
x = transform(img).unsqueeze(0)
with torch.no_grad():
output = model(x)
# Convert logits to class labels
pred_mask = output.argmax(dim=1).squeeze().cpu().numpy()
- Downloads last month
- 4