Text Overlay for SAM2 Segmentation

This is a Next.js application that extends geronimi73/next-sam project by adding text overlay capabilities to the segmented images. The base project performs image segmentation using Meta's Segment Anything Model V2 (SAM2) and onnxruntime-web.

🚀 Try it now: text-behind-six.vercel.app

💡 Everything runs directly in your browser - no backend required! All processing, including AI segmentation and text manipulation, happens on your device.

👋 I'm super interested in your forks and progress with this project! Feel free to reach out on X/Twitter to share your work.

🎯 This project was created using Vibe Coding - a new way of programming with Claude Sonnet + Cursor IDE. Read more about Vibe Coding in Andrej Karpathy's thread. Be gentle with the code - it's an experiment in AI-assisted development! 🤖✨

New Features

Text overlay on segmented objects
Multiple text layers with individual styling
Rich text customization:
- Font family selection (10+ web fonts)
- Font size and stroke width
- Text and stroke colors
- Opacity control
- Letter spacing and line height
- Text shadow effects
Interactive text manipulation:
- Drag & drop positioning
- Corner handles for resizing
- Multiple text layers management
Export options:
- PNG with transparency
- JPEG with white background
- Original mask export

Original Features (from next-sam)

Utilizes Meta's SAM2 model for segmentation
onnxruntime-web for model inference
webgpu-accelerated if GPU available and supported by browser, cpu if not
Model storage using OPFS
Image upload or load from URL
Mask decoding based on point prompt (positive and negative points)
Tested on macOS with Edge (webgpu, cpu), Chrome (webgpu, cpu), Firefox (cpu only), Safari (cpu only)

Installation

Clone the repository:

git clone https://github.com/Hormold/text-behind
cd text-behind
pnpm install
pnpm run dev

Open your browser and visit http://localhost:3000

Usage

Upload an image or load from URL
Click the "Encode image" button to start encoding
Enable "Show Mask" and click on an object you want to add text to
Hold Shift and click multiple times to merge masks
Click "Add Text" button on the right
Disable "Show Mask" to see your text clearly
Drag text to position, use corner handles to resize
Edit text content and style in the right panel
Export as PNG/JPEG or export the mask only

Acknowledgements

Original project by geronimi73/next-sam
Meta's Segment Anything Model 2
@raedle for adding positive/negative clicks
onnxruntime
Shadcn/ui components
transformer.js

License

MIT License - see LICENSE file for details

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
app		app
assets		assets
components		components
lib		lib
notebooks		notebooks
public		public
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
components.json		components.json
next-env.d.ts		next-env.d.ts
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Overlay for SAM2 Segmentation

New Features

Original Features (from next-sam)

Installation

Usage

Acknowledgements

License

About

Languages

License

Hormold/text-behind

Folders and files

Latest commit

History

Repository files navigation

Text Overlay for SAM2 Segmentation

New Features

Original Features (from next-sam)

Installation

Usage

Acknowledgements

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages