Stitching
Important
Download the example stitching dataset from Sample Data and the reference files to follow along or run the samples.
Introduction
This article is an in-depth tutorial on how to stitch Zivid’s unorganized point clouds.
To understand the real problem, let us first look briefly at the application. Like any camera or the human eye, the Zivid camera cannot see through or around objects; it can only observe one side at a time. Therefore, to obtain a full 3D model of an object, you must capture it from multiple viewpoints by rotating the object or repositioning the camera.
The problem
Point clouds returned by the Zivid SDK are expressed in the camera coordinate system, where the origin is located inside the camera itself. When capturing from multiple viewpoints, each point cloud is defined relative to the coordinate system of the camera at the time of capture. If you simply combine these point clouds without transformation, they will not align spatially; they will appear disjointed and disconnected in 3D space. In other words, the point clouds do not share a common coordinate system that is attached to the object being scanned.
The Solution
To merge multiple point clouds into a single, spatially aligned 3D representation of the object, you need to:
Transform all point clouds into a common coordinate system attached to the object being scanned.
Refine alignment to compensate for any noise or small errors in pose estimation.
The transformation from each camera coordinate system to the common coordinate system can come from various sources: robot kinematics, motion tracking systems, or manual measurements.
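As an illustration, applying such a transformation with the Python API could look like the following minimal sketch. The file name and the placeholder matrix are illustrative; in practice the 4x4 matrix would come from one of the sources above.

# Minimal sketch: transforming a capture from the camera frame into a common (e.g. robot base) frame.
# The file name and the transform value are placeholders; obtain the real 4x4 matrix from
# robot kinematics, a tracking system, or manual measurement.
import numpy as np
import zivid

app = zivid.Application()
frame = zivid.Frame("capture.zdf")

camera_pose_in_common_frame = np.eye(4)  # placeholder 4x4 pose of the camera in the common frame
point_cloud_in_common_frame = (
    frame.point_cloud().to_unorganized_point_cloud().transformed(camera_pose_in_common_frame)
)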
However, these transformations are rarely perfect and usually introduce small alignment errors.
To correct these errors, you can use Zivid::Experimental::Toolbox::localPointCloudRegistration(). This API refines the alignment between two overlapping point clouds by computing a transformation matrix that best registers one point cloud to the other.
Before stitching point clouds, it is important to make sure that they are captured with enough overlapping feature points. We must also discard data that can lead to incorrect stitching, such as points that are too far away from the camera or points that are not part of the object or scene we are stitching. We recommend using Zivid’s Region of Interest before stitching. For captures where the camera is stationary, it is possible to define the region of interest once and use it for all captures. When a robot arm moves the camera around the object, it is likely necessary to configure a different region of interest for each point cloud.
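As a minimal sketch (Python, with illustrative box coordinates and assuming the SDK’s region of interest box filter), configuring an ROI on the capture settings could look like this:

# Minimal sketch: restricting captures to a box around the object so that table, floor, and
# background points are discarded before stitching. All coordinates below are illustrative
# and must be measured for your own scene (values are in mm, in the camera frame).
import zivid

app = zivid.Application()
camera = app.connect_camera()

settings = zivid.Settings(acquisitions=[zivid.Settings.Acquisition()])
settings.region_of_interest.box.enabled = True
settings.region_of_interest.box.point_o = [0.0, 0.0, 300.0]    # three corners spanning the base plane of the box
settings.region_of_interest.box.point_a = [200.0, 0.0, 300.0]
settings.region_of_interest.box.point_b = [0.0, 200.0, 300.0]
settings.region_of_interest.box.extents = [-10.0, 150.0]       # extent below/above that plane

# Reuse these settings for all captures if the camera is stationary;
# reconfigure the box per pose if the camera moves between captures.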
Note
All the point clouds in Zivid’s Sample Data that are going to be stitched are already ROI filtered.
Stitching point clouds without pre-alignment
This section demonstrates stitching point clouds without pre-alignment.
The key requirement is that the relative movement between the object and the camera is small enough from one point cloud to another to ensure significant overlap between captures.
In this example, the object remains stationary while the camera is slightly moved between captures, and Zivid::Experimental::Toolbox::localPointCloudRegistration() is used to align the point clouds.
We begin with a simple example that stitches two point clouds without prior pose information, followed by a more advanced case using a rotary table to scan the entire object.
Stitching two point clouds
This simple example demonstrates how to stitch two point clouds without pre-alignment. To stitch using this method, we need to:
Load two ZDF files.
Convert the point clouds to unorganized format.
Apply voxel downsampling.
Estimate the transformation for alignment.
Stitch the clouds together.
Downsample the final stitched result.
Here is how to load the ZDF files and convert the point clouds to unorganized format using Zivid::PointCloud::toUnorganizedPointCloud(). The unorganized point clouds are joined into one Zivid::UnorganizedPointCloud object using Zivid::UnorganizedPointCloud::extend(), and displayed.
// Ensure the dataset is extracted to the correct location depending on the operating system:
// - Windows: %ProgramData%/Zivid/StitchingPointClouds/
// - Linux: /usr/share/Zivid/data/StitchingPointClouds/
// StitchingPointClouds/
// └── BlueObject/
std::cout << "Reading point clouds from ZDF files" << std::endl;
const auto directory = std::filesystem::path(ZIVID_SAMPLE_DATA_DIR) / "StitchingPointClouds" / "BlueObject";
if(!std::filesystem::exists(directory))
{
    std::ostringstream oss;
    oss << "Missing dataset folders.\n"
        << "Make sure 'StitchingPointClouds/BlueObject/' exists at " << ZIVID_SAMPLE_DATA_DIR << ".\n\n"
        << "You can download the dataset (StitchingPointClouds.zip) from:\n"
        << "https://support.zivid.com/en/latest/api-reference/samples/sample-data.html";
    throw std::runtime_error(oss.str());
}
const auto frame1 = Zivid::Frame((directory / "BlueObject.zdf").string());
const auto frame2 = Zivid::Frame((directory / "BlueObjectSlightlyMoved.zdf").string());
std::cout << "Converting organized point clouds to unorganized point clouds and voxel downsampling"
<< std::endl;
const auto unorganizedPointCloud1 = frame1.pointCloud().toUnorganizedPointCloud();
const auto unorganizedPointCloud2 = frame2.pointCloud().toUnorganizedPointCloud();
std::cout << "Displaying point clouds before stitching" << std::endl;
Zivid::UnorganizedPointCloud unorganizedNotStitchedPointCloud;
unorganizedNotStitchedPointCloud.extend(unorganizedPointCloud1);
unorganizedNotStitchedPointCloud.extend(unorganizedPointCloud2);
const auto unorganizedNotStitchedPointCloudOpen3D = copyToOpen3D(unorganizedNotStitchedPointCloud);
visualizePointCloud(unorganizedNotStitchedPointCloudOpen3D);
# Ensure the dataset is extracted to the correct location depending on the operating system:
# • Windows: %ProgramData%\Zivid\StitchingPointClouds\
# • Linux: /usr/share/Zivid/data/StitchingPointClouds/
# The folder must contain:
# StitchingPointClouds/
# └── BlueObject/
print("Reading point clouds from files")
directory = get_sample_data_path() / "StitchingPointClouds" / "BlueObject"
if not directory.exists():
    raise FileNotFoundError(
        f"Missing dataset folders.\n"
        f"Make sure 'StitchingPointClouds/BlueObject' exists at {get_sample_data_path()}.\n\n"
        f"You can download the dataset (StitchingPointClouds.zip) from:\n"
        f"https://support.zivid.com/en/latest/api-reference/samples/sample-data.html"
    )
frame_1 = zivid.Frame(directory / "BlueObject.zdf")
frame_2 = zivid.Frame(directory / "BlueObjectSlightlyMoved.zdf")
print("Converting organized point clouds to unorganized point clouds and voxel downsampling")
unorganized_point_cloud_1 = frame_1.point_cloud().to_unorganized_point_cloud()
unorganized_point_cloud_2 = frame_2.point_cloud().to_unorganized_point_cloud()
print("Displaying point clouds before stitching")
unorganized_not_stitched_point_cloud = zivid.UnorganizedPointCloud()
unorganized_not_stitched_point_cloud.extend(unorganized_point_cloud_1)
unorganized_not_stitched_point_cloud.extend(unorganized_point_cloud_2)
display_pointcloud(
xyz=unorganized_not_stitched_point_cloud.copy_data("xyz"),
rgb=unorganized_not_stitched_point_cloud.copy_data("rgba")[:, 0:3],
)

Voxel downsampling is applied to reduce the number of points for faster registration.
We then compute the transformation matrix between the two point clouds using Zivid::Experimental::Toolbox::localPointCloudRegistration(). The second point cloud is transformed using the resulting transform and added to the final point cloud (a Zivid::UnorganizedPointCloud object containing the first point cloud).
To complete the stitching, we apply voxel downsampling to the stitched point cloud, and display the result.
std::cout << "Estimating transformation between point clouds" << std::endl;
const auto unorganizedPointCloud1LPCR = unorganizedPointCloud1.voxelDownsampled(1.0, 3);
const auto unorganizedPointCloud2LPCR = unorganizedPointCloud2.voxelDownsampled(1.0, 3);
const auto registrationParams = Zivid::Experimental::LocalPointCloudRegistrationParameters{};
const auto localPointCloudRegistrationResult = Zivid::Experimental::Toolbox::localPointCloudRegistration(
unorganizedPointCloud1LPCR, unorganizedPointCloud2LPCR, registrationParams);
if(!localPointCloudRegistrationResult.converged())
{
    throw std::runtime_error("Registration did not converge...");
}
const auto pointCloud1ToPointCloud2Transform = localPointCloudRegistrationResult.transform();
std::cout << "Stitching and displaying point clouds" << std::endl;
Zivid::UnorganizedPointCloud finalPointCloud;
finalPointCloud.extend(unorganizedPointCloud1);
const auto unorganizedPointCloud2Transformed =
unorganizedPointCloud2.transformed(pointCloud1ToPointCloud2Transform.toMatrix());
finalPointCloud.extend(unorganizedPointCloud2Transformed);
const auto stitchedPointCloudOpen3D = copyToOpen3D(finalPointCloud);
visualizePointCloud(stitchedPointCloudOpen3D);
std::cout << "Voxel-downsampling the stitched point cloud" << std::endl;
finalPointCloud = finalPointCloud.voxelDownsampled(2.0, 1);
std::cout << "visualize the overllaped point clouds" << std::endl;
const auto stitchedDownampledPointCloudOpen3D = copyToOpen3D(finalPointCloud);
visualizePointCloud(stitchedDownampledPointCloudOpen3D);
print("Estimating transformation between point clouds")
unorganized_point_cloud_1_lpcr = unorganized_point_cloud_1.voxel_downsampled(voxel_size=1.0, min_points_per_voxel=3)
unorganized_point_cloud_2_lpcr = unorganized_point_cloud_2.voxel_downsampled(voxel_size=1.0, min_points_per_voxel=3)
registration_params = LocalPointCloudRegistrationParameters()
local_point_cloud_registration_result = local_point_cloud_registration(
target=unorganized_point_cloud_1_lpcr, source=unorganized_point_cloud_2_lpcr, parameters=registration_params
)
assert local_point_cloud_registration_result.converged(), "Registration did not converge..."
point_cloud_1_to_point_cloud_2_transform = local_point_cloud_registration_result.transform()
print("Displaying point clouds after stitching")
final_point_cloud = zivid.UnorganizedPointCloud()
final_point_cloud.extend(unorganized_point_cloud_1)
unorganized_point_cloud_2_transformed = unorganized_point_cloud_2.transformed(
point_cloud_1_to_point_cloud_2_transform.to_matrix()
)
final_point_cloud.extend(unorganized_point_cloud_2_transformed)
display_pointcloud(
xyz=final_point_cloud.copy_data("xyz"),
rgb=final_point_cloud.copy_data("rgba")[:, 0:3],
)
print("Voxel-downsampling the stitched point cloud")
final_point_cloud = final_point_cloud.voxel_downsampled(voxel_size=2.0, min_points_per_voxel=1)
display_pointcloud(
xyz=final_point_cloud.copy_data("xyz"),
rgb=final_point_cloud.copy_data("rgba")[:, 0:3],
)

Stitching with a rotary table
This example demonstrates stitching point clouds of an object placed on a rotary table. To stitch using this method, we need to:
Connect to a Zivid camera.
Load camera settings with a Region of Interest (ROI).
Capture multiple point clouds as the object rotates.
Convert each capture to an unorganized point cloud and downsample.
Register the new frame against the growing model.
Stitch the growing model into the new frame.
Downsample, display, and export the final result to a Polygon File Format (.ply).
Note
In this example, the rotary table completes a 360-degree rotation in about 25 seconds. Each cycle (capture, transfer, processing, and stitching) takes a bit more than 250 ms, and we add a 1-second delay between captures, so each iteration takes roughly 1.25 seconds. Over the 25-second rotation, this results in approximately 20 captures from evenly distributed angles around the object.
We begin by connecting to the camera and loading a capture setting with a defined ROI. As mentioned earlier, the ROI is defined to include the object on the rotary table. It’s ideal to avoid capturing points of the table itself, but if the object is large enough, including some table points is generally acceptable.
Next, we capture point clouds of the rotating object and apply Local Point Cloud Registration to align them. Each newly captured point cloud becomes the reference frame, and the previously stitched model is transformed into that frame.
Why transform the stitched model instead of the latest capture? Because transforming the latest capture into a fixed world frame would require applying all previous transformations, leading to accumulating error. By always transforming the existing model to the latest frame, the error remains localized and more manageable.
Zivid::UnorganizedPointCloud unorganizedStitchedPointCloud;
auto registrationParams = Zivid::Experimental::LocalPointCloudRegistrationParameters{};
auto previousToCurrentPointCloudTransform = Zivid::Matrix4x4::identity();
for(int numberOfCaptures = 0; numberOfCaptures < 20; ++numberOfCaptures)
{
    std::this_thread::sleep_for(std::chrono::seconds(1));
    auto frame = camera.capture2D3D(settings);
    const auto unorganizedPointCloud = frame.pointCloud().toUnorganizedPointCloud().voxelDownsampled(1.0, 2);
    if(numberOfCaptures != 0)
    {
        const auto registrationResult = Zivid::Experimental::Toolbox::localPointCloudRegistration(
            unorganizedStitchedPointCloud,
            unorganizedPointCloud,
            registrationParams,
            previousToCurrentPointCloudTransform);
        if(!registrationResult.converged())
        {
            std::cout << "Registration did not converge..." << std::endl;
            continue;
        }
        previousToCurrentPointCloudTransform = registrationResult.transform().toMatrix();
        unorganizedStitchedPointCloud.transform(previousToCurrentPointCloudTransform.inverse());
    }
    unorganizedStitchedPointCloud.extend(unorganizedPointCloud);
    std::cout << "Captures done: " << numberOfCaptures << std::endl;
}
std::cout << "Voxel-downsampling the stitched point cloud" << std::endl;
unorganizedStitchedPointCloud = unorganizedStitchedPointCloud.voxelDownsampled(0.75, 2);
const auto unorganizedStitchedPointCloudOpen3D = copyToOpen3D(unorganizedStitchedPointCloud);
visualizePointCloud(unorganizedStitchedPointCloudOpen3D);
previous_to_current_point_cloud_transform = np.eye(4)
unorganized_stitched_point_cloud = zivid.UnorganizedPointCloud()
registration_params = LocalPointCloudRegistrationParameters()
for number_of_captures in range(20):
    time.sleep(1)
    frame = camera.capture_2d_3d(settings)
    unorganized_point_cloud = (
        frame.point_cloud().to_unorganized_point_cloud().voxel_downsampled(voxel_size=1.0, min_points_per_voxel=2)
    )
    if number_of_captures != 0:
        local_point_cloud_registration_result = local_point_cloud_registration(
            target=unorganized_stitched_point_cloud,
            source=unorganized_point_cloud,
            parameters=registration_params,
            initial_transform=previous_to_current_point_cloud_transform,
        )
        if not local_point_cloud_registration_result.converged():
            print("Registration did not converge...")
            continue
        previous_to_current_point_cloud_transform = local_point_cloud_registration_result.transform().to_matrix()
        unorganized_stitched_point_cloud.transform(np.linalg.inv(previous_to_current_point_cloud_transform))
    unorganized_stitched_point_cloud.extend(unorganized_point_cloud)
    print(f"Captures done: {number_of_captures}")
print("Voxel-downsampling the stitched point cloud")
unorganized_stitched_point_cloud = unorganized_stitched_point_cloud.voxel_downsampled(
voxel_size=0.75, min_points_per_voxel=2
)
display_pointcloud(
xyz=unorganized_stitched_point_cloud.copy_data("xyz"),
rgb=unorganized_stitched_point_cloud.copy_data("rgba")[:, 0:3],
)
Note
Since the rotary table rotates at a constant angular velocity, the transformation between consecutive point clouds should be similar. Therefore, the previous transformation can be used as an initial guess for the next registration.
Finally, the stitched result is further voxel-downsampled and displayed. The stitched point cloud result is shown below.
Stitching point clouds with pre-alignment using a robot arm
This section outlines the process of stitching point clouds acquired from various robot-mounted camera positions, using hand-eye calibration data and local point cloud registration.
Overview and Requirements
This stitching method lets you generate a complete 3D point cloud of an object by moving a camera around it with a robot arm. It is especially useful when:
The object is larger than the camera’s field of view.
Capturing the object from multiple angles is needed for inspection, modeling, or manipulation.
To use this method, you need a valid hand-eye calibration result.
This is typically provided as hand_eye_transform.yaml, which defines the transformation between the robot’s end-effector coordinate system and the camera’s coordinate system.
This transformation is used to relate captures made from different camera poses, allowing point clouds to be transformed into a common coordinate system for accurate alignment across multiple views.
You will need:
A set of captured Zivid point clouds (capture_*.zdf).
A corresponding set of robot poses (robot_pose_*.yaml), one per capture.
A hand_eye_transform.yaml obtained from a successful hand-eye calibration procedure.
These elements together let you pre-align the individual captures in a common coordinate system.
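A minimal sketch of this pre-alignment step in Python, assuming an eye-in-hand setup and a hypothetical load_transform() helper that reads a 4x4 matrix from a YAML file, could look like this:

# Minimal sketch: pre-aligning one capture into the robot base frame.
# load_transform() is a hypothetical helper returning a 4x4 numpy array from a YAML file;
# the file names follow the dataset naming described above.
import zivid

app = zivid.Application()

hand_eye_transform = load_transform("hand_eye_transform.yaml")  # end-effector to camera (eye-in-hand)
robot_pose = load_transform("robot_pose_1.yaml")                # robot base to end-effector at capture time

# Camera pose in the robot base frame: compose base-to-end-effector with end-effector-to-camera
base_to_camera_transform = robot_pose @ hand_eye_transform

frame = zivid.Frame("capture_1.zdf")
pre_aligned_point_cloud = (
    frame.point_cloud().to_unorganized_point_cloud().transformed(base_to_camera_transform)
)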
Stitching Workflow
The stitching process follows these main steps:
Load multiple point clouds captured from different robot poses.
Load each pose (robot_pose_*.yaml) and the hand-eye calibration (hand_eye_transform.yaml).
Convert each robot pose into a camera pose using the hand-eye transform.
Pre-align the point clouds by transforming them from the camera’s coordinate system to the robot’s base coordinate system.
Downsample each point cloud (voxel grid) to speed up processing and reduce noise.
Use Local Point Cloud Registration to refine alignment between the growing stitched point cloud and each new point cloud.
Optionally re-load the original full-resolution clouds and apply the estimated transforms to stitch them in full resolution.
Export the final result as a stitched .ply point cloud.
[Robot Arm (Pose A)] -> capture_1.zdf
[Robot Arm (Pose B)] -> capture_2.zdf
...
[Robot Arm (Pose N)] -> capture_N.zdf
Each capture is paired with:
- robot_pose_*.yaml
- hand_eye_transform.yaml
Apply:
- Pre-alignment using robot pose and hand-eye calibration
- Local point cloud registration for refinement
Output:
- Combined and visualized point cloud
- Optionally reload original captures for full-resolution stitching
Export:
- Final stitched point cloud as a PLY file
How Local Registration Works
After initial pre-alignment using the robot and hand-eye poses, further refinement is needed to compensate for minor pose inaccuracies. Local Point Cloud Registration improves accuracy by aligning each downsampled point cloud with the growing point cloud (previously stitched point clouds).
Before applying registration, all point clouds are voxel-downsampled. This step is important for the stitching process, as it:
Reduces sensor noise, leading to more reliable alignments
Helps remove outliers, which could otherwise disrupt registration
Increases processing speed by reducing the number of points without sacrificing structural detail
The user selects the stitching strategy before the process starts. However, the actual stitching is performed only after all transformations are estimated:
Low-resolution stitching: Uses downsampled point clouds (as prepared during alignment).
High-resolution stitching: Reloads original full-resolution point clouds and applies the estimated transforms for a detailed final point cloud.
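The refinement stage can be sketched as follows in Python. Here pre_aligned_point_clouds is assumed to be a list of UnorganizedPointCloud objects already transformed into the robot base frame as described above, the voxel parameters are illustrative, and the import path for the registration functions should be adjusted to match the samples earlier in this article.

# Minimal sketch: refining pre-aligned point clouds with local point cloud registration.
# "pre_aligned_point_clouds" is an assumed list of zivid.UnorganizedPointCloud objects already
# expressed in the robot base frame; voxel sizes are illustrative. The import below assumes the
# experimental toolbox module; adjust it to match your SDK version.
import zivid
from zivid.experimental.toolbox import LocalPointCloudRegistrationParameters, local_point_cloud_registration

registration_params = LocalPointCloudRegistrationParameters()
stitched_point_cloud = zivid.UnorganizedPointCloud()

for index, point_cloud in enumerate(pre_aligned_point_clouds):
    downsampled = point_cloud.voxel_downsampled(voxel_size=1.0, min_points_per_voxel=2)
    if index == 0:
        stitched_point_cloud.extend(downsampled)
        continue
    result = local_point_cloud_registration(
        target=stitched_point_cloud, source=downsampled, parameters=registration_params
    )
    if not result.converged():
        print("Registration did not converge, skipping this capture")
        continue
    # The pre-alignment already puts the clouds close, so the refined transform is near identity
    stitched_point_cloud.extend(downsampled.transformed(result.transform().to_matrix()))

For the high-resolution strategy, the same estimated transforms can instead be applied to the reloaded full-resolution point clouds before extending the final result.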
참고
Stitching without voxel downsampling (--full-resolution) generates higher detail but requires more memory and computation time.
Note
When performing local registration, it’s important that point clouds are already reasonably well aligned from the pre-alignment step. If the initial misalignment is too large, the algorithm may fail to converge or produce inaccurate results.
The max_correspondence_distance parameter defines the search radius for finding corresponding points between the source and target clouds.
It should be:
Larger than the typical point spacing (to allow correct matches)
But not so large that it includes incorrect correspondences, which can degrade alignment quality
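For instance, the parameter could be set when constructing the registration parameters. The keyword name follows the parameter described above, but the exact signature may differ between SDK versions, so treat this as a sketch; stitched_point_cloud and new_point_cloud stand for the growing model and the cloud being added.

# Minimal sketch: tuning the correspondence search radius (value in mm, illustrative only).
registration_params = LocalPointCloudRegistrationParameters(max_correspondence_distance=10.0)
result = local_point_cloud_registration(
    target=stitched_point_cloud, source=new_point_cloud, parameters=registration_params
)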
Shown below are the stitched point cloud results for an object that required multiple captures because it was larger than the camera’s field of view.
Zivid SDK Examples
Explore the full example code for stitching using a robot-mounted camera: