Segment Image Using Foreground BiRefNet

SUMMARY

Segment Image Using Foreground BiRefNet segments the foreground using BiRefNet.

BiRefNet is a deep learning model specifically designed for foreground/background segmentation. It uses a bidirectional reference mechanism to improve segmentation accuracy, particularly for complex scenes with multiple objects.

Use this Skill when you want to segment foreground objects using BiRefNet's advanced deep learning model.

The Skill

python

from telekinesis import cornea

result = cornea.segment_image_using_foreground_birefnet(
    image=image,
    input_height=1024,
    input_width=1024,
    threshold=0)

API Reference

Example

Input Image 1

Original image 1 for BiRefNet segmentation

Output Image 1

Foreground segmentation result 1 using BiRefNet

Input Image 2

Original image 2 for BiRefNet segmentation

Output Image 2

Foreground segmentation result 2 using BiRefNet

Input Image 3

Original image 3 for BiRefNet segmentation

Output Image 3

Foreground segmentation result 3 using BiRefNet

The Code

python

import pathlib

from telekinesis import cornea
from datatypes import io

DATA_DIR = pathlib.Path("path/to/telekinesis-data")

# Load image
filepath = str(DATA_DIR / "images" / "weld_clamp_0_raw.png")
image = io.load_image(filepath=filepath)

# Segment Image Using BiRefNet
result = cornea.segment_image_using_foreground_birefnet(
    image=image,
    input_height=1024,
    input_width=1024,
    threshold=0,
)

# Access results
annotation = result["annotation"].to_dict()
mask = annotation['labeled_mask']

The Explanation of the Code

BiRefNet segmentation uses a deep learning model with bidirectional reference to segment foreground objects. The model is pretrained and available from Hugging Face.

The code begins by importing the required modules and loading an image:

python

import pathlib

from telekinesis import cornea
from datatypes import io

DATA_DIR = pathlib.Path("path/to/telekinesis-data")
filepath = str(DATA_DIR / "images" / "weld_clamp_0_raw.png")
image = io.load_image(filepath=filepath)

The BiRefNet parameters are configured:

input_height and input_width control the input image size for the model
threshold controls the pixel value threshold for binarization

python

result = cornea.segment_image_using_foreground_birefnet(
    image=image,
    input_height=1024,
    input_width=1024,
    threshold=0,
)

The function returns a dictionary containing an annotation object in COCO panoptic format. Extract the mask as follows:

python

annotation = result["annotation"].to_dict()
mask = annotation['labeled_mask']

Running the Example

Runnable examples are available in the Telekinesis examples repository. Follow the README in that repository to set up the environment. Once set up, you can run this specific example with:

bash

cd telekinesis-examples
python examples/cornea_examples.py --example segment_image_using_foreground_birefnet

How to Tune the Parameters

The segment_image_using_foreground_birefnet Skill has 3 parameters:

input_height (default: 1024):

Image height for model input
Units: Pixels
Increase for higher resolution (slower, better quality)
Decrease for faster processing
Typical: 512, 1024, or 2048

input_width (default: 1024):

Image width for model input
Units: Pixels
Increase for higher resolution (slower, better quality)
Decrease for faster processing
Typical: 512, 1024, or 2048

threshold (default: 0):

Pixel value threshold
Units: Pixel intensity (0-255)
Increase to require higher confidence
Decrease to include lower confidence pixels
Typical range: 0-127

TIP

Best practice: Use default model from Hugging Face. Adjust input size based on image resolution and speed requirements. Increase threshold if results are too noisy.

Where to Use the Skill in a Pipeline

Segment Image Using Foreground BiRefNet is commonly used in the following pipelines:

Foreground extraction - Separating objects from background
Background removal - Removing backgrounds from images
Object isolation - Isolating objects for further processing
Photo editing - Automatic background removal

A typical pipeline for foreground extraction looks as follows:

python

from telekinesis import cornea
from datatypes import io

# 1. Load the image
image = io.load_image(filepath=...)

# 2. Segment foreground using BiRefNet
result = cornea.segment_image_using_foreground_birefnet(
    image=image,
    input_height=1024,
    input_width=1024,
)

# 3. Extract foreground
annotation = result["annotation"].to_dict()
foreground_mask = annotation['labeled_mask']

Related skills to build such a pipeline:

load_image: Load images from disk

Alternative Skills

Skill	vs. Segment Image Using Foreground BiRefNet
segment_image_using_sam	SAM is interactive. Use BiRefNet for automatic foreground, SAM for interactive segmentation.

When Not to Use the Skill

Do not use Segment Image Using Foreground BiRefNet when:

You need instance segmentation (Use other methods)
Speed is critical (BiRefNet requires GPU and can be slow)

TIP

BiRefNet is specifically optimized for foreground/background segmentation and often produces better results than general-purpose models for this task.

Segment Image Using Foreground BiRefNet ​

The Skill ​

Example ​

Input Image 1

Output Image 1

Input Image 2

Output Image 2

Input Image 3

Output Image 3

The Code ​

The Explanation of the Code ​

Running the Example ​

How to Tune the Parameters ​

Where to Use the Skill in a Pipeline ​

Alternative Skills ​

When Not to Use the Skill ​

Segment Image Using Foreground BiRefNet

The Skill

Example

The Code

The Explanation of the Code

Running the Example

How to Tune the Parameters

Where to Use the Skill in a Pipeline

Alternative Skills

When Not to Use the Skill