# transform-graph (`tgraph`) — Spatial Transformations for Robotics and Computer Vision

`transform-graph` is the foundational mathematical layer for Spatial AI and Robotics in Python.
It provides strictly typed handling of SE(3) rigid body transformations, camera and
orthographic projections, and a frame graph for automatic path composition.

**Target Environment:** Python 3.10+, NumPy 2.0+.
## Key Classes

### Transforms (3D → 3D)

| Class | Description |
|---|---|
| `Transform` | Full SE(3) rigid body transform (translation + quaternion rotation) |
| `Translation` | Pure translation (identity rotation) |
| `Rotation` | Pure rotation (zero translation) |
| `Identity` | Neutral element — composes with anything and returns the other operand |
| `MatrixTransform` | Generic 4×4 homogeneous matrix (fallback for mixed compositions) |
**Convenience constructors** on `Transform` and `Rotation`:

```python
Transform.from_rotation_matrix(R, t=None, validate=True)
Transform.from_quaternion(q, t=None, convention="wxyz")
Transform.from_axis_angle(axis, angle, t=None)
```
### Quaternion Interop (`tgraph.quaternion`)

Conversion between numpy-quaternion (wxyz), scipy Rotation, and raw arrays:

```python
from tgraph.quaternion import to_xyzw, from_xyzw, to_scipy, from_scipy, normalize
```
### Projections (3D → 2D)

| Class | Description |
|---|---|
| `CameraProjection` | Pinhole camera model with intrinsics K and optional distortion D |
| `OrthographicProjection` | Orthographic (BEV / front / side) projection at fixed resolution |
| `CompositeProjection` | Result of `Projection × Transform` — projects from any 3D frame |
### Inverse Projections (2D → 3D)

| Class | Description |
|---|---|
| `InverseCameraProjection` | Unprojects pixels to 3D rays (use `.unproject(pixels, depths)`) |
| `InverseOrthographicProjection` | Lifts pixels back to 3D on the projection plane |
| `InverseCompositeProjection` | Result of `Transform × InverseProjection` |
### Graph & Pose

| Class | Description |
|---|---|
| `TransformGraph` | Frame graph with BFS pathfinding and automatic composition |
| `Pose` | User-friendly wrapper for position + orientation in a named frame |
## Composition Algebra

The `*` operator composes transforms. The dimensional flow determines valid operations:

| Composition | Flow | Result | Use Case |
|---|---|---|---|
| `Transform × Transform` | 3D→3D→3D | `Transform` | Chain rigid body transforms |
| `Projection × Transform` | 3D→3D→2D | `CompositeProjection` | Project from any frame |
| `Transform × InvProjection` | 2D→3D→3D | `InverseCompositeProjection` | Unproject + reposition |
| `Projection × InverseProjection` | 2D→3D→2D | `MatrixTransform` | Inter-image mapping |
**Invalid compositions** (raise `TypeError`):

- `InverseProjection × Transform` — dimensional mismatch (2D→3D then 3D→3D)
- `Projection × Projection` — cannot compose two projections
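These rules can be exercised directly with the `*` operator. A minimal sketch with illustrative values; it uses the plain `Projection` wrapper documented below rather than a calibrated `CameraProjection`:

```python
import numpy as np
import tgraph.transform as tf

# A camera 0.5 m above the origin, and a pinhole-style projection P = K[I | 0].
extrinsics = tf.Transform(translation=[0.0, 0.0, 0.5])
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
P = tf.Projection(np.hstack([K, np.zeros((3, 1))]))  # 3x4 projection matrix

composite = P * extrinsics           # Projection x Transform  -> CompositeProjection
chained = extrinsics * extrinsics    # Transform  x Transform  -> Transform
mapping = P * P.inverse()            # Projection x InverseProjection -> MatrixTransform

try:
    P * P                            # Projection x Projection is undefined
except TypeError as err:
    print(err)
```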
## Quick Start

```python
import tgraph.transform as tf
import numpy as np

# Build a frame graph
graph = tf.TransformGraph()
graph.add_transform('world', 'robot', tf.Translation(x=1.0, y=2.0))
graph.add_transform('robot', 'camera', tf.Transform(
    translation=[0.1, 0, 0.5],
    rotation=tf.Rotation.from_roll_pitch_yaw(pitch=-0.1).rotation,
))

# Add a camera projection edge
K = np.array([[500, 0, 320], [0, 500, 240], [0, 0, 1]], dtype=np.float64)
graph.add_transform('camera', 'image', tf.CameraProjection(K=K))

# Query any transform — automatic path composition
world_to_image = graph.get_transform('world', 'image')

# Project 3D world points to pixels
points_world = np.array([[2.0, 3.0, 1.0]])
pixels = tf.transform_points(points_world, graph, 'world', 'image')
```
Apache 2.0 — Vistralis Labs
1# Copyright (c) 2026 Vistralis Labs. All rights reserved. 2# SPDX-License-Identifier: Apache-2.0 3 4""" 5# transform-graph — Spatial Transformations for Robotics and Computer Vision 6 7`transform-graph` is the foundational mathematical layer for Spatial AI and Robotics in Python. 8It provides strict-typed handling of SE(3) rigid body transformations, camera and 9orthographic projections, and a frame graph for automatic path composition. 10 11**Target Environment:** Python 3.10+, NumPy 2.0+. 12 13--- 14 15## Key Classes 16 17### Transforms (3D → 3D) 18 19| Class | Description | 20|-------|-------------| 21| `Transform` | Full SE(3) rigid body transform (translation + quaternion rotation) | 22| `Translation` | Pure translation (identity rotation) | 23| `Rotation` | Pure rotation (zero translation) | 24| `Identity` | Neutral element — composes with anything and returns the other operand | 25| `MatrixTransform` | Generic 4×4 homogeneous matrix (fallback for mixed compositions) | 26 27**Convenience constructors** on `Transform` and `Rotation`:: 28 29 Transform.from_rotation_matrix(R, t=None, validate=True) 30 Transform.from_quaternion(q, t=None, convention="wxyz") 31 Transform.from_axis_angle(axis, angle, t=None) 32 33### Quaternion Interop (`tgraph.quaternion`) 34 35Conversion between numpy-quaternion (wxyz), scipy Rotation, and raw arrays:: 36 37 from tgraph.quaternion import to_xyzw, from_xyzw, to_scipy, from_scipy, normalize 38 39### Projections (3D → 2D) 40 41| Class | Description | 42|-------|-------------| 43| `CameraProjection` | Pinhole camera model with intrinsics K and optional distortion D | 44| `OrthographicProjection` | Orthographic (BEV / front / side) projection at fixed resolution | 45| `CompositeProjection` | Result of `Projection × Transform` — projects from any 3D frame | 46 47### Inverse Projections (2D → 3D) 48 49| Class | Description | 50|-------|-------------| 51| `InverseCameraProjection` | Unprojects pixels to 3D rays (use `.unproject(pixels, depths)`) | 52| `InverseOrthographicProjection` | Lifts pixels back to 3D on the projection plane | 53| `InverseCompositeProjection` | Result of `Transform × InverseProjection` | 54 55### Graph & Pose 56 57| Class | Description | 58|-------|-------------| 59| `TransformGraph` | Frame graph with BFS pathfinding and automatic composition | 60| `Pose` | User-friendly wrapper for position + orientation in a named frame | 61 62--- 63 64## Composition Algebra 65 66The `*` operator composes transforms. 
The dimensional flow determines valid operations: 67 68| Composition | Flow | Result | Use Case | 69|-------------|------|--------|----------| 70| `Transform × Transform` | 3D→3D→3D | `Transform` | Chain rigid body transforms | 71| `Projection × Transform` | 3D→3D→2D | `CompositeProjection` | Project from any frame | 72| `Transform × InvProjection` | 2D→3D→3D | `InverseCompositeProjection` | Unproject + reposition | 73| `Projection × InverseProjection` | 2D→3D→2D | `MatrixTransform` | Inter-image mapping | 74 75**Invalid compositions** (raise `TypeError`): 76- `InverseProjection × Transform` — dimensional mismatch (2D→3D then 3D→3D) 77- `Projection × Projection` — cannot compose two projections 78 79--- 80 81## Quick Start 82 83```python 84import tgraph.transform as tf 85import numpy as np 86 87# Build a frame graph 88graph = tf.TransformGraph() 89graph.add_transform('world', 'robot', tf.Translation(x=1.0, y=2.0)) 90graph.add_transform('robot', 'camera', tf.Transform( 91 translation=[0.1, 0, 0.5], 92 rotation=tf.Rotation.from_roll_pitch_yaw(pitch=-0.1).rotation, 93)) 94 95# Add a camera projection edge 96K = np.array([[500, 0, 320], [0, 500, 240], [0, 0, 1]], dtype=np.float64) 97graph.add_transform('camera', 'image', tf.CameraProjection(K=K)) 98 99# Query any transform — automatic path composition 100world_to_image = graph.get_transform('world', 'image') 101 102# Project 3D world points to pixels 103points_world = np.array([[2.0, 3.0, 1.0]]) 104pixels = tf.transform_points(points_world, graph, 'world', 'image') 105``` 106 107--- 108 109Apache 2.0 — Vistralis Labs 110""" 111 112from .transform import ( 113 BaseTransform, 114 CameraProjection, 115 CompositeProjection, 116 Identity, 117 InverseCameraProjection, 118 InverseCompositeProjection, 119 InverseOrthographicProjection, 120 InverseProjection, 121 MatrixTransform, 122 OrthographicProjection, 123 Pose, 124 Projection, 125 ProjectionModel, 126 Rotation, 127 Transform, 128 TransformGraph, 129 Translation, 130 as_roll_pitch_yaw, 131 deserialize_transform, 132 from_roll_pitch_yaw, 133 project_points, 134 register_transform, 135 serialize_transform, 136 transform_points, 137) 138 139__version__ = "0.1.2" 140 141__all__ = [ 142 "BaseTransform", 143 "Transform", 144 "Translation", 145 "Rotation", 146 "Identity", 147 "MatrixTransform", 148 "Projection", 149 "InverseProjection", 150 "CameraProjection", 151 "InverseCameraProjection", 152 "OrthographicProjection", 153 "InverseOrthographicProjection", 154 "CompositeProjection", 155 "InverseCompositeProjection", 156 "ProjectionModel", 157 "TransformGraph", 158 "register_transform", 159 "serialize_transform", 160 "deserialize_transform", 161 "from_roll_pitch_yaw", 162 "as_roll_pitch_yaw", 163 "Pose", 164 "transform_points", 165 "project_points", 166]
```python
class BaseTransform(ABC):
    """
    Abstract interface for all spatial transformations.
    """

    def __init__(self, dtype: np.dtype = np.float64):
        self.dtype = dtype

    @abstractmethod
    def as_matrix(self) -> np.ndarray:
        """
        Returns the 4x4 homogeneous representation of the transform.

        Returns:
            np.ndarray: 4x4 matrix of the transform's dtype.
        """
        pass

    @abstractmethod
    def inverse(self) -> BaseTransform:
        """
        Returns the mathematical inverse of the transformation.

        Returns:
            BaseTransform: The inverse transformation.
        """
        pass

    def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray:
        """
        Apply the transform to 3D vectors (Nx3 or Nx4).

        Args:
            vector: Nx3 or Nx4 array of vectors.
                - If Nx3: Treated as 3D points/vectors (w=1 implied if Transform,
                  checking subclass logic).
                  Standard BaseTransform behavior:
                  Homogenize (w=1) -> Multiply -> Dehomogenize (w division).
                - If Nx4: Treated as homogeneous vectors.
                  Multiply -> Return Nx4.

        Returns:
            np.ndarray: Transformed vectors (Nx3 or Nx4).
        """
        vector = np.atleast_2d(vector)

        if vector.shape[1] == 3:
            # Homogenize (w=1)
            hom_vector = np.hstack([vector, np.ones((vector.shape[0], 1), dtype=self.dtype)])
            # Apply
            transformed_hom = (self.as_matrix() @ hom_vector.T).T
            # Dehomogenize (return 3D)
            return transformed_hom[:, :3]

        elif vector.shape[1] == 4:
            # Generic 4x4
            return (self.as_matrix() @ vector.T).T
        else:
            raise ValueError(f"Input vector must be Nx3 or Nx4, got {vector.shape}")

    @abstractmethod
    def __mul__(self, other: BaseTransform) -> BaseTransform:
        """
        Composes this transform with another.
        Composition follows standard matrix multiplication order: (T1 * T2) * p = T1 * (T2 * p).

        Args:
            other: The transform to apply second.

        Returns:
            BaseTransform: The composed transformation.
        """
        pass

    @abstractmethod
    def to_dict(self) -> dict[str, Any]:
        """
        Serialize the transform to a JSON-compatible dictionary.

        The dictionary MUST include a "type" key with the class name
        to enable proper deserialization.

        Returns:
            Dict[str, Any]: JSON-compatible dictionary representation.
        """
        pass

    @classmethod
    @abstractmethod
    def from_dict(cls, data: dict[str, Any]) -> BaseTransform:
        """
        Deserialize a transform from a dictionary.

        Args:
            data: Dictionary previously created by to_dict().

        Returns:
            BaseTransform: The deserialized transform instance.
        """
        pass

    def __repr__(self) -> str:
        return f"<{self.__class__.__name__}>"
```
Abstract interface for all spatial transformations.
```python
@abstractmethod
def as_matrix(self) -> np.ndarray: ...
```
Returns the 4x4 homogeneous representation of the transform.
Returns: np.ndarray: 4x4 matrix of the transform's dtype.
```python
@abstractmethod
def inverse(self) -> BaseTransform: ...
```
Returns the mathematical inverse of the transformation.
Returns: BaseTransform: The inverse transformation.
```python
@abstractmethod
def to_dict(self) -> dict[str, Any]: ...
```
Serialize the transform to a JSON-compatible dictionary.
The dictionary MUST include a "type" key with the class name to enable proper deserialization.
Returns: Dict[str, Any]: JSON-compatible dictionary representation.
```python
@classmethod
@abstractmethod
def from_dict(cls, data: dict[str, Any]) -> BaseTransform: ...
```
Deserialize a transform from a dictionary.
Args: data: Dictionary previously created by to_dict().
Returns: BaseTransform: The deserialized transform instance.
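To make the contract concrete, here is a minimal, hypothetical subclass (a uniform scale, which is not part of the library) that fills in the abstract methods; `register_transform` is applied as a decorator, as the shipped classes do:

```python
from typing import Any
import numpy as np
import tgraph.transform as tf

@tf.register_transform  # presumably registers the class so it can be rebuilt from its "type" key
class UniformScale(tf.BaseTransform):
    """Hypothetical example: scales points by a single factor about the origin."""

    def __init__(self, factor: float = 1.0, dtype: np.dtype = np.float64):
        super().__init__(dtype=dtype)
        self.factor = float(factor)

    def as_matrix(self) -> np.ndarray:
        matrix = np.eye(4, dtype=self.dtype)
        matrix[:3, :3] *= self.factor
        return matrix

    def inverse(self) -> "UniformScale":
        return UniformScale(1.0 / self.factor, dtype=self.dtype)

    def __mul__(self, other: tf.BaseTransform) -> tf.BaseTransform:
        # No SE(3) structure to preserve: fall back to a raw 4x4 product.
        return tf.MatrixTransform(self.as_matrix() @ other.as_matrix(), dtype=self.dtype)

    def to_dict(self) -> dict[str, Any]:
        return {"type": "UniformScale", "factor": self.factor, "dtype": np.dtype(self.dtype).name}

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> "UniformScale":
        return cls(factor=data["factor"], dtype=np.dtype(data.get("dtype", "float64")))

s = UniformScale(2.0)
assert np.allclose(s.inverse().as_matrix() @ s.as_matrix(), np.eye(4))
```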
456@register_transform 457class Transform(BaseTransform): 458 """ 459 Standard SE(3) rigid body transformation. 460 Consists of a translation (3x1) and a rotation (quaternion). 461 """ 462 463 def __init__( 464 self, 465 translation: np.ndarray | list | tuple | None = None, 466 rotation: quaternion.quaternion | np.ndarray | list | tuple | Transform | None = None, 467 dtype: np.dtype = np.float64, 468 ): 469 super().__init__(dtype=dtype) 470 471 self.translation = ensure_translation(translation, self.dtype) 472 473 # Handle rotation: can be quaternion, array, or Transform/Rotation object 474 if isinstance(rotation, BaseTransform): 475 # It's a Transform or Rotation object - extract the quaternion 476 rotation = rotation.rotation 477 478 self.rotation = ensure_rotation(rotation, self.dtype) 479 480 @classmethod 481 def from_matrix(cls, matrix: np.ndarray, dtype: np.dtype | None = None) -> Transform: 482 """ 483 Creates a Transform from a 4x4 matrix. 484 """ 485 if matrix.shape != (4, 4): 486 raise ValueError(f"Matrix must be 4x4, got {matrix.shape}") 487 488 target_dtype = dtype if dtype is not None else matrix.dtype 489 translation = matrix[:3, 3] 490 rot_mat = matrix[:3, :3] 491 rot_quat = quaternion.from_rotation_matrix(rot_mat) 492 return cls(translation=translation, rotation=rot_quat, dtype=target_dtype) 493 494 @classmethod 495 def from_rotation_matrix( 496 cls, 497 rotation_matrix: np.ndarray, 498 t: np.ndarray | list | tuple | None = None, 499 *, 500 validate: bool = True, 501 dtype: np.dtype = np.float64, 502 ) -> Transform: 503 """Create a Transform from a 3x3 rotation matrix. 504 505 Args: 506 rotation_matrix: A 3x3 rotation matrix (must be SO(3) when 507 ``validate=True``). 508 t: Optional 3-element translation vector. 509 validate: If True (default), verify that ``rotation_matrix`` is a 510 valid SO(3) element (orthogonal with determinant +1). Set to 511 False to skip the check on trusted data. Follows the same 512 pattern as ``scipy.spatial.transform.Rotation.from_matrix`` 513 with ``assume_valid``. 514 dtype: NumPy dtype for the resulting transform. 515 516 Returns: 517 Transform: The constructed SE(3) transform. 518 519 Raises: 520 ValueError: If the matrix is not 3x3 or fails SO(3) validation. 521 522 Example: 523 >>> R = np.eye(3) 524 >>> tf.Transform.from_rotation_matrix(R, t=[1, 2, 3]) 525 """ 526 R = np.asarray(rotation_matrix, dtype=np.float64) 527 if R.shape != (3, 3): 528 raise ValueError(f"Rotation matrix must be 3x3, got {R.shape}") 529 530 if validate: 531 # Check orthogonality: R^T R ≈ I 532 orthogonality_error = np.max(np.abs(R.T @ R - np.eye(3))) 533 if orthogonality_error > 1e-6: 534 raise ValueError( 535 f"Rotation matrix is not orthogonal (max error {orthogonality_error:.2e}). " 536 "Expected R^T R = I. Pass validate=False to skip this check." 537 ) 538 # Check determinant ≈ +1 (proper rotation, not reflection) 539 det = np.linalg.det(R) 540 if abs(det - 1.0) > 1e-6: 541 raise ValueError( 542 f"Rotation matrix determinant is {det:.6f}, expected +1.0 (SO(3)). " 543 "A determinant of -1 indicates a reflection, not a rotation." 544 ) 545 546 rot_quat = quaternion.from_rotation_matrix(R) 547 return cls(translation=t, rotation=rot_quat, dtype=dtype) 548 549 @classmethod 550 def from_quaternion( 551 cls, 552 q: quaternion.quaternion | np.ndarray | list | tuple, 553 t: np.ndarray | list | tuple | None = None, 554 *, 555 convention: str = "wxyz", 556 dtype: np.dtype = np.float64, 557 ) -> Transform: 558 """Create a Transform from a quaternion. 
559 560 The quaternion is auto-normalized to unit length. 561 562 Args: 563 q: Quaternion — either a ``quaternion.quaternion`` object or a 564 4-element array. The element order is determined by 565 ``convention``. 566 t: Optional 3-element translation vector. 567 convention: Element ordering of array input: 568 ``"wxyz"`` (default, numpy-quaternion / Hamilton) or 569 ``"xyzw"`` (scipy / ROS). Ignored when ``q`` is a 570 ``quaternion.quaternion`` object. 571 dtype: NumPy dtype for the resulting transform. 572 573 Returns: 574 Transform: The constructed SE(3) transform. 575 576 Raises: 577 ValueError: If the quaternion is zero or the convention is unknown. 578 579 Example: 580 >>> tf.Transform.from_quaternion([1, 0, 0, 0]) # wxyz identity 581 >>> tf.Transform.from_quaternion([0, 0, 0, 1], convention="xyzw") 582 """ 583 if isinstance(q, quaternion.quaternion): 584 quat = q 585 else: 586 arr = np.asarray(q, dtype=np.float64).ravel() 587 if arr.size != 4: 588 raise ValueError(f"Quaternion must have 4 elements, got {arr.size}") 589 if convention == "wxyz": 590 quat = quaternion.quaternion(*arr) 591 elif convention == "xyzw": 592 x, y, z, w = arr 593 quat = quaternion.quaternion(w, x, y, z) 594 else: 595 raise ValueError( 596 f"Unknown quaternion convention '{convention}'. " 597 "Use 'wxyz' (numpy-quaternion) or 'xyzw' (scipy/ROS)." 598 ) 599 600 # Normalize — reject zero 601 norm = np.sqrt(quat.w**2 + quat.x**2 + quat.y**2 + quat.z**2) 602 if norm < 1e-15: 603 raise ValueError("Zero-norm quaternion cannot represent a rotation.") 604 quat = quat / norm 605 606 return cls(translation=t, rotation=quat, dtype=dtype) 607 608 @classmethod 609 def from_axis_angle( 610 cls, 611 axis: np.ndarray | list | tuple, 612 angle: float, 613 t: np.ndarray | list | tuple | None = None, 614 *, 615 dtype: np.dtype = np.float64, 616 ) -> Transform: 617 """Create a Transform from an axis-angle representation. 618 619 The axis is auto-normalized to unit length. 620 621 Args: 622 axis: 3-element rotation axis (will be normalized). 623 angle: Rotation angle in radians. 624 t: Optional 3-element translation vector. 625 dtype: NumPy dtype for the resulting transform. 626 627 Returns: 628 Transform: The constructed SE(3) transform. 629 630 Raises: 631 ValueError: If the axis has zero length. 632 633 Example: 634 >>> tf.Transform.from_axis_angle([0, 0, 1], np.pi / 2) 635 """ 636 axis_array = np.asarray(axis, dtype=np.float64).ravel() 637 if axis_array.size != 3: 638 raise ValueError(f"Axis must have 3 elements, got {axis_array.size}") 639 640 axis_norm = np.linalg.norm(axis_array) 641 if axis_norm < 1e-15: 642 raise ValueError("Zero-length axis vector cannot define a rotation direction.") 643 644 axis_unit = axis_array / axis_norm 645 rotation_vector = axis_unit * float(angle) 646 rot_quat = quaternion.from_rotation_vector(rotation_vector) 647 648 return cls(translation=t, rotation=rot_quat, dtype=dtype) 649 650 def as_matrix(self) -> np.ndarray: 651 """Return the 4x4 homogeneous transformation matrix [R|t; 0 1].""" 652 matrix = np.eye(4, dtype=self.dtype) 653 matrix[:3, :3] = quaternion.as_rotation_matrix(self.rotation).astype(self.dtype) 654 matrix[:3, 3] = self.translation.ravel() 655 return matrix 656 657 def inverse(self) -> Transform: 658 """Return the inverse SE(3) transform: T^-1 = [-R^T t; R^T].""" 659 inv_rotation = self.rotation.conjugate() 660 # Rotate expects (..., 3) vectors. 
flatten() ensures we get (3,) if translation is (3, 1) 661 inv_translation = -quaternion.rotate_vectors(inv_rotation, self.translation.flatten()) 662 return Transform(translation=inv_translation, rotation=inv_rotation, dtype=self.dtype) 663 664 def __mul__(self, other: BaseTransform) -> BaseTransform: 665 if isinstance(other, Transform): 666 # T1 * T2 = [R1*R2, R1*t2 + t1] 667 new_rotation = self.rotation * other.rotation 668 new_translation = ( 669 quaternion.rotate_vectors(self.rotation, other.translation.flatten()) 670 + self.translation.flatten() 671 ) 672 return Transform(translation=new_translation, rotation=new_rotation, dtype=self.dtype) 673 674 if isinstance(other, (Projection, CameraProjection)): 675 # Transform * Projection -> Invalid 676 # 3D->3D * 3D->2D = dimensional mismatch if interpreted strictly as flow? 677 # Guidelines say: Transform * CameraProjection = Forbidden. 678 raise TypeError( 679 "Composition 'Transform * CameraProjection' is invalid. " 680 "Transforms (Spatial) cannot pre-multiply Projections (Sensor). " 681 "Did you mean 'CameraProjection * Transform'?" 682 ) 683 684 if isinstance(other, InverseProjection): 685 # Transform * InverseProjection -> InverseCompositeProjection 686 # T * P_inv 687 # Logic: P_inv unprojects to frame A. T transforms A to B. Result unprojects to B. 688 689 # If other is already InverseCompositeProjection: T * (T_old * K_inv) 690 # The __rmul__ of InverseCompositeProjection should handle 691 # this if we return NotImplemented? 692 # But BaseTransform doesn't implement __rmul__ universally. 693 # Let's handle it explicitly or rely on commutation if possible. 694 # Transform doesn't know about InverseCompositeProjection class structure necessarily? 695 # But we just added it to this module. 696 697 if isinstance(other, InverseCompositeProjection): 698 # T * ICA(T_old, K_inv) = ICA(T * T_old, K_inv) 699 new_transform = self * other.transform 700 if isinstance(new_transform, Transform): 701 return InverseCompositeProjection( 702 new_transform, other.projection, dtype=self.dtype 703 ) 704 705 # Generic InverseProjection (e.g. InverseCameraProjection or raw InverseProjection) 706 # Treat 'self' as the extrinsics T. 707 # Result = T * P_inv. 708 # We create InverseCompositeProjection(T, P_inv). 709 return InverseCompositeProjection(self, other, dtype=self.dtype) 710 711 # Fallback to matrix multiplication 712 return MatrixTransform(self.as_matrix() @ other.as_matrix(), dtype=self.dtype) 713 714 def __repr__(self) -> str: 715 return f"Transform(translation={self.translation.flatten()!r}, rotation={self.rotation!r})" 716 717 def to_dict(self) -> dict[str, Any]: 718 """Serialize transform to a JSON-compatible dictionary.""" 719 t = self.translation.flatten() 720 q = self.rotation 721 return { 722 "type": "Transform", 723 "translation": [float(t[0]), float(t[1]), float(t[2])], 724 "rotation": [float(q.w), float(q.x), float(q.y), float(q.z)], 725 "dtype": np.dtype(self.dtype).name, 726 } 727 728 @classmethod 729 def from_dict(cls, data: dict[str, Any]) -> Transform: 730 """Deserialize transform from a dictionary.""" 731 dtype = np.dtype(data.get("dtype", "float64")) 732 return cls( 733 translation=data["translation"], 734 rotation=data["rotation"], 735 dtype=dtype, 736 )
Standard SE(3) rigid body transformation. Consists of a translation (3x1) and a rotation (quaternion).
```python
def __init__(
    self,
    translation: np.ndarray | list | tuple | None = None,
    rotation: quaternion.quaternion | np.ndarray | list | tuple | Transform | None = None,
    dtype: np.dtype = np.float64,
): ...
```

```python
@classmethod
def from_matrix(cls, matrix: np.ndarray, dtype: np.dtype | None = None) -> Transform: ...
```
Creates a Transform from a 4x4 matrix.
```python
@classmethod
def from_rotation_matrix(
    cls,
    rotation_matrix: np.ndarray,
    t: np.ndarray | list | tuple | None = None,
    *,
    validate: bool = True,
    dtype: np.dtype = np.float64,
) -> Transform: ...
```
Create a Transform from a 3x3 rotation matrix.
Args:
rotation_matrix: A 3x3 rotation matrix (must be SO(3) when
validate=True).
t: Optional 3-element translation vector.
validate: If True (default), verify that rotation_matrix is a
valid SO(3) element (orthogonal with determinant +1). Set to
False to skip the check on trusted data. Follows the same
pattern as scipy.spatial.transform.Rotation.from_matrix
with assume_valid.
dtype: NumPy dtype for the resulting transform.
Returns: Transform: The constructed SE(3) transform.
Raises: ValueError: If the matrix is not 3x3 or fails SO(3) validation.
Example:

    >>> R = np.eye(3)
    >>> tf.Transform.from_rotation_matrix(R, t=[1, 2, 3])
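A short sketch of the validation behaviour described above, with illustrative matrices:

```python
import numpy as np
import tgraph.transform as tf

# A proper rotation: 90 degrees about Z.
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
T = tf.Transform.from_rotation_matrix(R, t=[1.0, 2.0, 3.0])  # passes validation

# A reflection (determinant -1) is rejected unless validation is disabled.
F = np.diag([1.0, 1.0, -1.0])
try:
    tf.Transform.from_rotation_matrix(F)
except ValueError as err:
    print(err)  # determinant is -1: a reflection, not a rotation
```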
```python
@classmethod
def from_quaternion(
    cls,
    q: quaternion.quaternion | np.ndarray | list | tuple,
    t: np.ndarray | list | tuple | None = None,
    *,
    convention: str = "wxyz",
    dtype: np.dtype = np.float64,
) -> Transform: ...
```
Create a Transform from a quaternion.
The quaternion is auto-normalized to unit length.
Args:
q: Quaternion — either a quaternion.quaternion object or a
4-element array. The element order is determined by
convention.
t: Optional 3-element translation vector.
convention: Element ordering of array input:
"wxyz" (default, numpy-quaternion / Hamilton) or
"xyzw" (scipy / ROS). Ignored when q is a
quaternion.quaternion object.
dtype: NumPy dtype for the resulting transform.
Returns: Transform: The constructed SE(3) transform.
Raises: ValueError: If the quaternion is zero or the convention is unknown.
Example:

    >>> tf.Transform.from_quaternion([1, 0, 0, 0])  # wxyz identity
    >>> tf.Transform.from_quaternion([0, 0, 0, 1], convention="xyzw")
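A brief sketch of the two array conventions; both spell the same 90° rotation about Z:

```python
import numpy as np
import tgraph.transform as tf

s = np.sqrt(0.5)
T_wxyz = tf.Transform.from_quaternion([s, 0.0, 0.0, s])                     # (w, x, y, z)
T_xyzw = tf.Transform.from_quaternion([0.0, 0.0, s, s], convention="xyzw")  # (x, y, z, w)

assert np.allclose(T_wxyz.as_matrix(), T_xyzw.as_matrix())

# Inputs are normalized, so an unscaled quaternion gives the same transform.
T_scaled = tf.Transform.from_quaternion([2 * s, 0.0, 0.0, 2 * s])
assert np.allclose(T_scaled.as_matrix(), T_wxyz.as_matrix())
```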
```python
@classmethod
def from_axis_angle(
    cls,
    axis: np.ndarray | list | tuple,
    angle: float,
    t: np.ndarray | list | tuple | None = None,
    *,
    dtype: np.dtype = np.float64,
) -> Transform: ...
```
Create a Transform from an axis-angle representation.

The axis is auto-normalized to unit length.

Args:
    axis: 3-element rotation axis (will be normalized).
    angle: Rotation angle in radians.
    t: Optional 3-element translation vector.
    dtype: NumPy dtype for the resulting transform.

Returns:
    Transform: The constructed SE(3) transform.

Raises:
    ValueError: If the axis has zero length.

Example:

    >>> tf.Transform.from_axis_angle([0, 0, 1], np.pi / 2)
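A quick sketch checking the quarter turn through the homogeneous matrix (illustrative values):

```python
import numpy as np
import tgraph.transform as tf

# Quarter turn about Z, with a translation along X.
T = tf.Transform.from_axis_angle([0, 0, 1], np.pi / 2, t=[1.0, 0.0, 0.0])

# The X axis maps to the Y axis, plus the translation.
p = np.array([1.0, 0.0, 0.0, 1.0])  # homogeneous point
assert np.allclose(T.as_matrix() @ p, [1.0, 1.0, 0.0, 1.0])

# The axis is normalized, so any positive multiple gives the same rotation.
T2 = tf.Transform.from_axis_angle([0, 0, 10], np.pi / 2)
assert np.allclose(T2.as_matrix()[:3, :3], T.as_matrix()[:3, :3])
```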
```python
def as_matrix(self) -> np.ndarray: ...
```
Return the 4x4 homogeneous transformation matrix [R|t; 0 1].
```python
def inverse(self) -> Transform: ...
```
Return the inverse SE(3) transform: T^-1 = [-R^T t; R^T].
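A small sanity check of the inverse, assuming nothing beyond the documented `as_matrix()` and composition behaviour:

```python
import numpy as np
import tgraph.transform as tf

T = tf.Transform(
    translation=[0.3, -1.2, 2.0],
    rotation=tf.Rotation.from_roll_pitch_yaw(roll=0.1, pitch=0.2, yaw=0.3),
)

# Composing a transform with its inverse yields (numerically) the identity.
round_trip = T * T.inverse()
assert np.allclose(round_trip.as_matrix(), np.eye(4), atol=1e-12)
```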
```python
def to_dict(self) -> dict[str, Any]: ...
```
Serialize transform to a JSON-compatible dictionary.
```python
@classmethod
def from_dict(cls, data: dict[str, Any]) -> Transform: ...
```
Deserialize transform from a dictionary.
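A sketch of a JSON round trip built only on `to_dict()` / `from_dict()` (the higher-level `serialize_transform` / `deserialize_transform` helpers exist as well, but their signatures are not shown here):

```python
import json
import numpy as np
import tgraph.transform as tf

T = tf.Transform(translation=[1.0, 2.0, 3.0],
                 rotation=tf.Rotation.from_roll_pitch_yaw(yaw=np.pi / 4))

payload = json.dumps(T.to_dict())  # {"type": "Transform", "translation": [...], ...}
restored = tf.Transform.from_dict(json.loads(payload))

assert np.allclose(restored.as_matrix(), T.as_matrix())
```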
```python
class Translation(Transform):
    """A Transform with only translation (Identity rotation)."""

    def __init__(
        self,
        x: float = 0.0,
        y: float = 0.0,
        z: float = 0.0,
        translation: np.ndarray | list | tuple | None = None,
    ):
        if translation is not None:
            super().__init__(translation=translation)
        else:
            super().__init__(translation=[x, y, z])
```
A Transform with only translation (Identity rotation).
755class Rotation(Transform): 756 """ 757 A Transform with only rotation (Zero translation). 758 759 Supports multiple construction patterns: 760 - Quaternion components: Rotation(w=1, x=0, y=0, z=0) 761 - Quaternion object: Rotation(rotation=q) 762 - Euler angles: Rotation.from_roll_pitch_yaw(roll=0, pitch=0, yaw=0) 763 ... 764 """ 765 766 def __init__( 767 self, 768 w: float = 1.0, 769 x: float = 0.0, 770 y: float = 0.0, 771 z: float = 0.0, 772 rotation: quaternion.quaternion | np.ndarray | list | tuple | None = None, 773 ): 774 if rotation is not None: 775 super().__init__(rotation=rotation) 776 else: 777 super().__init__(rotation=[w, x, y, z]) 778 779 @classmethod 780 def from_roll_pitch_yaw( 781 cls, 782 roll: float = 0.0, 783 pitch: float = 0.0, 784 yaw: float = 0.0, 785 ) -> Rotation: 786 """ 787 Create a Rotation from roll-pitch-yaw angles. 788 789 Uses the aerospace/robotics intrinsic **ZYX** (Tait-Bryan) convention: 790 yaw (Z) → pitch (Y) → roll (X). 791 792 Args: 793 roll: Rotation about X-axis in radians. 794 pitch: Rotation about Y-axis in radians. 795 yaw: Rotation about Z-axis in radians. 796 797 Returns: 798 Rotation: A rotation-only transform. 799 800 Example: 801 >>> attitude = tf.Rotation.from_roll_pitch_yaw(pitch=np.radians(10)) 802 >>> heading = tf.Rotation.from_roll_pitch_yaw(yaw=np.pi/4) 803 """ 804 q = from_roll_pitch_yaw(roll=roll, pitch=pitch, yaw=yaw) 805 return cls(rotation=q) 806 807 def as_roll_pitch_yaw(self) -> tuple[float, float, float]: 808 """ 809 Extract roll, pitch, yaw from the rotation. 810 811 Uses the aerospace/robotics intrinsic **ZYX** (Tait-Bryan) convention. 812 813 Returns: 814 Tuple[float, float, float]: ``(roll, pitch, yaw)`` in radians. 815 816 Warning: 817 Euler angles have a singularity (gimbal lock) when pitch = ±90°. 818 819 Example: 820 >>> rotation = tf.Rotation.from_roll_pitch_yaw(roll=0.1, pitch=0.2, yaw=0.3) 821 >>> roll, pitch, yaw = rotation.as_roll_pitch_yaw() 822 """ 823 return as_roll_pitch_yaw(self.rotation) 824 825 @classmethod 826 def from_rotation_matrix( 827 cls, 828 rotation_matrix: np.ndarray, 829 *, 830 validate: bool = True, 831 ) -> Rotation: 832 """Create a Rotation from a 3x3 rotation matrix. 833 834 See :meth:`Transform.from_rotation_matrix` for full documentation. 835 836 Args: 837 rotation_matrix: A 3x3 rotation matrix. 838 validate: If True, verify SO(3) membership. 839 840 Returns: 841 Rotation: A rotation-only transform. 842 """ 843 transform = Transform.from_rotation_matrix(rotation_matrix, validate=validate) 844 return cls(rotation=transform.rotation) 845 846 @classmethod 847 def from_quaternion( 848 cls, 849 q: quaternion.quaternion | np.ndarray | list | tuple, 850 *, 851 convention: str = "wxyz", 852 ) -> Rotation: 853 """Create a Rotation from a quaternion. 854 855 See :meth:`Transform.from_quaternion` for full documentation. 856 857 Args: 858 q: Quaternion (object or 4-element array). 859 convention: ``"wxyz"`` or ``"xyzw"``. 860 861 Returns: 862 Rotation: A rotation-only transform. 863 """ 864 transform = Transform.from_quaternion(q, convention=convention) 865 return cls(rotation=transform.rotation) 866 867 @classmethod 868 def from_axis_angle( 869 cls, 870 axis: np.ndarray | list | tuple, 871 angle: float, 872 ) -> Rotation: 873 """Create a Rotation from an axis-angle representation. 874 875 See :meth:`Transform.from_axis_angle` for full documentation. 876 877 Args: 878 axis: 3-element rotation axis (auto-normalized). 879 angle: Rotation angle in radians. 
880 881 Returns: 882 Rotation: A rotation-only transform. 883 """ 884 transform = Transform.from_axis_angle(axis, angle) 885 return cls(rotation=transform.rotation)
A Transform with only rotation (Zero translation).
Supports multiple construction patterns:
- Quaternion components: Rotation(w=1, x=0, y=0, z=0)
- Quaternion object: Rotation(rotation=q)
- Euler angles: Rotation.from_roll_pitch_yaw(roll=0, pitch=0, yaw=0) ...
```python
def __init__(
    self,
    w: float = 1.0,
    x: float = 0.0,
    y: float = 0.0,
    z: float = 0.0,
    rotation: quaternion.quaternion | np.ndarray | list | tuple | None = None,
): ...
```
```python
@classmethod
def from_roll_pitch_yaw(
    cls,
    roll: float = 0.0,
    pitch: float = 0.0,
    yaw: float = 0.0,
) -> Rotation: ...
```
Create a Rotation from roll-pitch-yaw angles.

Uses the aerospace/robotics intrinsic ZYX (Tait-Bryan) convention: yaw (Z) → pitch (Y) → roll (X).

Args:
    roll: Rotation about X-axis in radians.
    pitch: Rotation about Y-axis in radians.
    yaw: Rotation about Z-axis in radians.

Returns:
    Rotation: A rotation-only transform.

Example:

    >>> attitude = tf.Rotation.from_roll_pitch_yaw(pitch=np.radians(10))
    >>> heading = tf.Rotation.from_roll_pitch_yaw(yaw=np.pi/4)
```python
def as_roll_pitch_yaw(self) -> tuple[float, float, float]: ...
```
Extract roll, pitch, yaw from the rotation.

Uses the aerospace/robotics intrinsic ZYX (Tait-Bryan) convention.

Returns:
    Tuple[float, float, float]: (roll, pitch, yaw) in radians.

Warning: Euler angles have a singularity (gimbal lock) when pitch = ±90°.

Example:

    >>> rotation = tf.Rotation.from_roll_pitch_yaw(roll=0.1, pitch=0.2, yaw=0.3)
    >>> roll, pitch, yaw = rotation.as_roll_pitch_yaw()
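A short round-trip sketch, staying away from the gimbal-lock region:

```python
import numpy as np
import tgraph.transform as tf

r = tf.Rotation.from_roll_pitch_yaw(roll=0.1, pitch=0.2, yaw=0.3)
roll, pitch, yaw = r.as_roll_pitch_yaw()

# Away from the pitch = ±90° singularity the angles round-trip cleanly.
assert np.allclose([roll, pitch, yaw], [0.1, 0.2, 0.3])
```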
```python
@classmethod
def from_rotation_matrix(
    cls,
    rotation_matrix: np.ndarray,
    *,
    validate: bool = True,
) -> Rotation: ...
```
Create a Rotation from a 3x3 rotation matrix.

See Transform.from_rotation_matrix() for full documentation.

Args:
    rotation_matrix: A 3x3 rotation matrix.
    validate: If True, verify SO(3) membership.

Returns:
    Rotation: A rotation-only transform.
```python
@classmethod
def from_quaternion(
    cls,
    q: quaternion.quaternion | np.ndarray | list | tuple,
    *,
    convention: str = "wxyz",
) -> Rotation: ...
```
Create a Rotation from a quaternion.
See Transform.from_quaternion() for full documentation.
Args:
q: Quaternion (object or 4-element array).
convention: "wxyz" or "xyzw".
Returns: Rotation: A rotation-only transform.
```python
@classmethod
def from_axis_angle(
    cls,
    axis: np.ndarray | list | tuple,
    angle: float,
) -> Rotation: ...
```
Create a Rotation from an axis-angle representation.

See Transform.from_axis_angle() for full documentation.

Args:
    axis: 3-element rotation axis (auto-normalized).
    angle: Rotation angle in radians.

Returns:
    Rotation: A rotation-only transform.
```python
class Identity(Transform):
    """The identity transform (0 translation, identity rotation)."""

    def __init__(self):
        super().__init__()

    def __mul__(self, other: BaseTransform) -> BaseTransform:
        """Identity is the neutral element: I * X = X."""
        return other
```
The identity transform (0 translation, identity rotation).
899@register_transform 900class MatrixTransform(BaseTransform): 901 """ 902 A generic transform held as a raw 4x4 matrix. 903 Used when SE(3) structure is lost or not applicable. 904 """ 905 906 def __init__(self, matrix: np.ndarray, dtype: np.dtype = np.float64): 907 super().__init__(dtype=dtype) 908 if matrix.shape != (4, 4): 909 raise ValueError(f"Matrix must be 4x4, got {matrix.shape}") 910 self.matrix = matrix.astype(self.dtype) 911 912 def as_matrix(self) -> np.ndarray: 913 """Return the stored 4x4 matrix.""" 914 return self.matrix 915 916 def inverse(self) -> MatrixTransform: 917 """Return the inverse via np.linalg.inv. 918 919 .. warning:: 920 Uses raw ``np.linalg.inv`` for convenience. This method is intended 921 for quick inspection and non-critical paths. For near-singular or 922 ill-conditioned matrices, prefer decomposing back into structured 923 types (``Transform``, ``Projection``) that have numerically stable 924 inverses. 925 """ 926 return MatrixTransform(np.linalg.inv(self.matrix), dtype=self.dtype) 927 928 def __mul__(self, other: BaseTransform) -> MatrixTransform: 929 return MatrixTransform(self.matrix @ other.as_matrix(), dtype=self.dtype) 930 931 def to_dict(self) -> dict[str, Any]: 932 """Serialize transform to a JSON-compatible dictionary.""" 933 return { 934 "type": "MatrixTransform", 935 "matrix": self.matrix.tolist(), 936 "dtype": np.dtype(self.dtype).name, 937 } 938 939 @classmethod 940 def from_dict(cls, data: dict[str, Any]) -> MatrixTransform: 941 """Deserialize transform from a dictionary.""" 942 dtype = np.dtype(data.get("dtype", "float64")) 943 return cls(matrix=np.array(data["matrix"]), dtype=dtype) 944 945 def __repr__(self) -> str: 946 # Format matrix with numpy's array_repr for better readability 947 matrix_str = np.array_repr(self.matrix, precision=4, suppress_small=True) 948 return f"MatrixTransform(matrix={matrix_str})"
A generic transform held as a raw 4x4 matrix. Used when SE(3) structure is lost or not applicable.
```python
def as_matrix(self) -> np.ndarray: ...
```
Return the stored 4x4 matrix.
```python
def inverse(self) -> MatrixTransform: ...
```
Return the inverse via np.linalg.inv.
Uses raw np.linalg.inv for convenience. This method is intended
for quick inspection and non-critical paths. For near-singular or
ill-conditioned matrices, prefer decomposing back into structured
types (Transform, Projection) that have numerically stable
inverses.
```python
def to_dict(self) -> dict[str, Any]: ...
```
Serialize transform to a JSON-compatible dictionary.
975@register_transform 976class Projection(BaseTransform): 977 """ 978 A 3D to 2D projection transformation. 979 980 Stores a projection matrix P that maps 3D homogeneous points to 2D. 981 Internally stored as 4x4 matrix with bottom row [0, 0, 0, 1] for compatibility. 982 983 The project_points() method projects 3D points to 2D pixel coordinates. 984 985 Note: Projections are generally non-invertible. The inverse() method returns 986 an InverseProjection which represents the conceptual inverse but requires 987 additional depth information to actually unproject points. 988 """ 989 990 def __init__( 991 self, 992 matrix: np.ndarray | list, 993 dtype: np.dtype = np.float64, 994 ): 995 """ 996 Create a Projection from a 3x4 or 4x4 matrix. 997 998 Args: 999 matrix: 3x4 or 4x4 projection matrix. 1000 dtype: Data type for the matrix. 1001 """ 1002 super().__init__(dtype=dtype) 1003 self.matrix = _ensure_4x4_projection(np.asarray(matrix), self.dtype) 1004 1005 def as_matrix(self) -> np.ndarray: 1006 """Returns the 4x4 projection matrix.""" 1007 return self.matrix 1008 1009 def as_matrix_3x4(self) -> np.ndarray: 1010 """Returns the 3x4 projection matrix (top 3 rows).""" 1011 return self.matrix[:3, :] 1012 1013 def inverse(self) -> InverseProjection: 1014 """ 1015 Returns an InverseProjection representing P^-1. 1016 1017 Note: The inverse projection requires depth information to actually 1018 unproject 2D points to 3D. 1019 """ 1020 return InverseProjection(self.matrix, dtype=self.dtype) 1021 1022 def __mul__(self, other: BaseTransform) -> BaseTransform: 1023 """Compose projection with another transform.""" 1024 result_matrix = self.matrix @ other.as_matrix() 1025 1026 # Compose with Rigid Transform -> CompositeProjection 1027 if isinstance(other, Transform): 1028 return CompositeProjection(self, other, dtype=self.dtype) 1029 1030 if isinstance(other, (Projection, CompositeProjection)): 1031 raise TypeError( 1032 f"Composition '{type(self).__name__} * " 1033 f"{type(other).__name__}' is invalid " 1034 "(dimensional mismatch). " 1035 "Both transformations map to 2D; you cannot compose them in this order." 1036 ) 1037 1038 # Fallback for all other types (MatrixTransform, InverseProjection) 1039 # These compositions (e.g. P * P_inv) stay within 2D->2D or 1040 # 3D->3D bounds but are not specialized. 1041 return MatrixTransform(result_matrix, dtype=self.dtype) 1042 1043 def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray: 1044 """ 1045 Project 3D vectors to 2D pixel coordinates. 1046 1047 Args: 1048 vector: Nx3 (points) or Nx4 (homogeneous) array. 1049 1050 Returns: 1051 np.ndarray: Nx2 pixel coordinates. 1052 """ 1053 vector = np.atleast_2d(vector) 1054 1055 if vector.shape[1] == 3: 1056 # Homogenize (w=1 implicit for points) 1057 hom_vec = np.hstack([vector, np.ones((vector.shape[0], 1), dtype=self.dtype)]) 1058 elif vector.shape[1] == 4: 1059 hom_vec = vector 1060 else: 1061 raise ValueError(f"Input must be Nx3 or Nx4, got {vector.shape}") 1062 1063 # Project: (3x4 or 4x4) @ 4x1 -> 4x1 (if 4x4) or 3x1 (if 3x4?) 1064 # Base class stores 4x4 with bottom [0,0,0,1]. 1065 # Result will be [u*w, v*w, w, 1]. 1066 projected = (self.matrix @ hom_vec.T).T 1067 1068 # We need [x, y, w] part. 1069 # projected is Nx4. 
1070 1071 # Perspective division 1072 w = projected[:, 2:3] 1073 w = np.where(np.abs(w) < 1e-10, 1e-10, w) 1074 pixels = projected[:, :2] / w 1075 1076 return pixels 1077 1078 def project_points(self, points: np.ndarray | list | tuple) -> np.ndarray: 1079 """ 1080 Project 3D points (Nx3 or Nx4) to 2D pixel coordinates. 1081 alias for _apply(points). 1082 1083 Args: 1084 points: Nx3 or Nx4 array of points. 1085 1086 Returns: 1087 np.ndarray: Nx2 pixel coordinates. 1088 """ 1089 return self._apply(points) 1090 1091 def to_dict(self) -> dict[str, Any]: 1092 """Serialize projection to a JSON-compatible dictionary.""" 1093 return { 1094 "type": "Projection", 1095 "matrix": self.matrix.tolist(), 1096 "dtype": np.dtype(self.dtype).name, 1097 } 1098 1099 @classmethod 1100 def from_dict(cls, data: dict[str, Any]) -> Projection: 1101 """Deserialize projection from a dictionary.""" 1102 dtype = np.dtype(data.get("dtype", "float64")) 1103 return cls(matrix=np.array(data["matrix"]), dtype=dtype) 1104 1105 def __repr__(self) -> str: 1106 return f"Projection(matrix_shape={self.matrix.shape})"
A 3D to 2D projection transformation.
Stores a projection matrix P that maps 3D homogeneous points to 2D. Internally stored as 4x4 matrix with bottom row [0, 0, 0, 1] for compatibility.
The project_points() method projects 3D points to 2D pixel coordinates.
Note: Projections are generally non-invertible. The inverse() method returns an InverseProjection which represents the conceptual inverse but requires additional depth information to actually unproject points.
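A minimal sketch using the plain `Projection` class with an illustrative pinhole-style matrix P = K[I | 0]:

```python
import numpy as np
import tgraph.transform as tf

K = np.array([[500.0,   0.0, 320.0],
              [  0.0, 500.0, 240.0],
              [  0.0,   0.0,   1.0]])
P = tf.Projection(np.hstack([K, np.zeros((3, 1))]))  # 3x4 input, stored as 4x4 internally

# A point 2 m in front of the camera, 0.1 m to the right.
pixels = P.project_points([[0.1, 0.0, 2.0]])
print(pixels)        # ~[[345., 240.]]  (u = 500 * 0.1 / 2 + 320, v = 240)

# inverse() gives the conceptual 2D -> 3D counterpart; depth is still required to unproject.
P_inv = P.inverse()  # InverseProjection
```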
```python
def __init__(
    self,
    matrix: np.ndarray | list,
    dtype: np.dtype = np.float64,
): ...
```
Create a Projection from a 3x4 or 4x4 matrix.

Args:
    matrix: 3x4 or 4x4 projection matrix.
    dtype: Data type for the matrix.
```python
def as_matrix(self) -> np.ndarray: ...
```
Returns the 4x4 projection matrix.
```python
def as_matrix_3x4(self) -> np.ndarray: ...
```
Returns the 3x4 projection matrix (top 3 rows).
```python
def inverse(self) -> InverseProjection: ...
```
Returns an InverseProjection representing P^-1.
Note: The inverse projection requires depth information to actually unproject 2D points to 3D.
```python
def project_points(self, points: np.ndarray | list | tuple) -> np.ndarray: ...
```
Project 3D points (Nx3 or Nx4) to 2D pixel coordinates. Alias for _apply(points).

Args:
    points: Nx3 or Nx4 array of points.

Returns:
    np.ndarray: Nx2 pixel coordinates.
```python
def to_dict(self) -> dict[str, Any]: ...
```
Serialize projection to a JSON-compatible dictionary.
1109@register_transform 1110class InverseProjection(BaseTransform): 1111 """ 1112 Represents the conceptual inverse of a Projection (P^-1). 1113 1114 This class tracks that an inverse operation was requested, but actual 1115 unprojection requires depth information. Use unproject() with depth values 1116 to convert 2D pixels back to 3D points. 1117 1118 Useful for: 1119 - Tracking transform logic in a graph 1120 - Composing with other transforms 1121 - Unprojecting when depth is available 1122 """ 1123 1124 def __init__( 1125 self, 1126 original_matrix: np.ndarray | list, 1127 dtype: np.dtype = np.float64, 1128 ): 1129 """ 1130 Create an InverseProjection from the original projection matrix. 1131 1132 Args: 1133 original_matrix: The original 3x4 or 4x4 projection matrix. 1134 dtype: Data type for the matrix. 1135 """ 1136 super().__init__(dtype=dtype) 1137 self._original_matrix = _ensure_4x4_projection(np.asarray(original_matrix), self.dtype) 1138 1139 @property 1140 def original_matrix(self) -> np.ndarray: 1141 """The original projection matrix that was inverted.""" 1142 return self._original_matrix 1143 1144 def as_matrix(self) -> np.ndarray: 1145 """ 1146 Returns a pseudo-inverse matrix for composition purposes. 1147 1148 Warning: This is the Moore-Penrose pseudo-inverse and may not 1149 produce geometrically meaningful results for all operations. 1150 """ 1151 return np.linalg.pinv(self._original_matrix) 1152 1153 def inverse(self) -> Projection: 1154 """Returns the original Projection.""" 1155 return Projection(self._original_matrix, dtype=self.dtype) 1156 1157 def __mul__(self, other: BaseTransform) -> BaseTransform: 1158 """Compose with another transform using pseudo-inverse.""" 1159 if isinstance(other, Identity): 1160 return self 1161 if isinstance(other, Transform): 1162 raise TypeError( 1163 f"Composition '{type(self).__name__} * Transform' " 1164 "is invalid (dimensional mismatch). " 1165 "InverseProjections (2D->3D) cannot post-multiply Transforms (3D->3D). " 1166 "Did you mean 'Transform * InverseProjection'?" 1167 ) 1168 if isinstance(other, (InverseProjection, InverseCompositeProjection)): 1169 raise TypeError( 1170 f"Composition '{type(self).__name__} * " 1171 f"{type(other).__name__}' is invalid " 1172 "(dimensional mismatch)." 1173 ) 1174 1175 return MatrixTransform(self.as_matrix() @ other.as_matrix(), dtype=self.dtype) 1176 1177 def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray: 1178 """ 1179 Unproject 2D/3D vectors using pseudo-inverse. 1180 1181 Args: 1182 vector: Nx2 (pixels), Nx3 (homogenous pixels), or Nx4. 1183 1184 Returns: 1185 np.ndarray: Transformed vectors (Nx3 or Nx4). 1186 """ 1187 vector = np.atleast_2d(vector) 1188 cols = vector.shape[1] 1189 1190 input_vec = None 1191 if cols == 2: 1192 # Nx2 pixels -> Nx3 homogeneous [u, v, 1] 1193 input_vec = np.hstack([vector, np.ones((vector.shape[0], 1), dtype=self.dtype)]) 1194 elif cols == 3: 1195 input_vec = vector 1196 elif cols == 4: 1197 input_vec = vector 1198 else: 1199 raise ValueError(f"Input must be Nx2, Nx3 or Nx4, got {vector.shape}") 1200 1201 # If input is 3D, and P_inv is 4x4, we need 4D input? 1202 # P_inv is 4x4 (from BaseTransform.as_matrix which pinv's 4x4 P). 1203 # We need to pad to 4D if it's 3D. 1204 if input_vec.shape[1] == 3: 1205 # Pad with 0? or 1? 1206 # Let's assume w=1 for "point-like" unprojection (ray). 
1207 input_vec = np.hstack([input_vec, np.ones((input_vec.shape[0], 1), dtype=self.dtype)]) 1208 1209 result = (self.as_matrix() @ input_vec.T).T 1210 1211 # User wants "homogenize when needed and dehomogenize". 1212 if cols < 4: 1213 return result[:, :3] 1214 return result 1215 1216 def unproject(self, pixels: np.ndarray, depths: np.ndarray) -> np.ndarray: 1217 """ 1218 Unproject 2D pixels to 3D points using depth values. 1219 1220 Args: 1221 pixels: Nx2 array of 2D pixel coordinates. 1222 depths: N array of depth values (Z coordinate in camera frame). 1223 1224 Returns: 1225 np.ndarray: Nx3 array of 3D points. 1226 1227 Note: This assumes a standard pinhole camera model where the 1228 projection matrix can be decomposed into K[R|t] form. 1229 """ 1230 pixels = np.atleast_2d(pixels) 1231 depths = np.atleast_1d(depths).flatten() 1232 1233 if pixels.shape[1] != 2: 1234 raise ValueError(f"Pixels must be Nx2, got {pixels.shape}") 1235 if len(depths) != len(pixels): 1236 raise ValueError(f"Depths length {len(depths)} must match pixels length {len(pixels)}") 1237 1238 # Extract K matrix (intrinsics) from projection matrix P = K[R|t] 1239 # For simple unprojection, we assume P[:3,:3] contains K*R 1240 # and use the pseudo-inverse approach 1241 projection_3x3 = self._original_matrix[:3, :3] 1242 projection_t = self._original_matrix[:3, 3] 1243 1244 # Homogeneous pixel coordinates scaled by depth 1245 hom_pixels = np.column_stack([pixels[:, 0] * depths, pixels[:, 1] * depths, depths]) 1246 1247 # Solve for 3D points: P[:3,:3] * X = hom_pixels - P[:3,3] 1248 # Using solve instead of inv for numerical stability and performance. 1249 points_3d = np.linalg.solve(projection_3x3, (hom_pixels - projection_t).T).T 1250 1251 return points_3d 1252 1253 def to_dict(self) -> dict[str, Any]: 1254 """Serialize inverse projection to a JSON-compatible dictionary.""" 1255 return { 1256 "type": "InverseProjection", 1257 "original_matrix": self._original_matrix.tolist(), 1258 "dtype": np.dtype(self.dtype).name, 1259 } 1260 1261 @classmethod 1262 def from_dict(cls, data: dict[str, Any]) -> InverseProjection: 1263 """Deserialize inverse projection from a dictionary.""" 1264 dtype = np.dtype(data.get("dtype", "float64")) 1265 return cls(original_matrix=np.array(data["original_matrix"]), dtype=dtype) 1266 1267 def __repr__(self) -> str: 1268 return f"InverseProjection(original_matrix_shape={self._original_matrix.shape})"
@register_transform
class CameraProjection(Projection):
    """
    A camera projection with strict Intrinsic-only parameters.

    Adheres to the architectural guideline that CameraProjection represents
    the internal geometry of the optical sensor (K, D) ONLY.

    Spatial pose (Extrinsics) must be managed via separate Transform objects.

    Structure:
    - K: 3x3 intrinsic matrix (focal length, principal point)
    - D: Distortion coefficients (OpenCV convention)

    Can be constructed from:
    - Explicit K parameters (with optional distortion)
    - Flexible aliases: K/intrinsic_matrix, D/dist_coeffs
    """

    def __init__(
        self,
        intrinsic_matrix: np.ndarray | list | None = None,
        dist_coeffs: list | np.ndarray | None = None,
        projection_model: ProjectionModel | str | None = None,
        image_size: tuple[int, int] | None = None,
        dtype: np.dtype = np.float64,
        # Flexible aliases (OpenCV-style)
        K: np.ndarray | list | None = None,
        D: list | np.ndarray | None = None,
    ):
        """
        Create a CameraProjection (Intrinsic-only).

        Args:
            intrinsic_matrix: 3x3 camera intrinsic matrix K.
            dist_coeffs: Distortion coefficients (OpenCV ordering: k1,k2,p1,p2,k3,...).
            projection_model: Camera model (PINHOLE, BROWN_CONRADY, KANNALA_BRANDT, etc.).
            image_size: Image dimensions (width, height) in pixels.
            dtype: Data type for matrices.
            K: Alias for intrinsic_matrix.
            D: Alias for dist_coeffs.
        """
        self._dtype = dtype

        # Handle aliases
        if K is not None:
            intrinsic_matrix = K
        if D is not None:
            dist_coeffs = D

        # Store image size
        self._image_size = image_size

        # Handle distortion coefficients
        if dist_coeffs is None:
            self._dist_coeffs = np.array([], dtype=dtype)
        else:
            self._dist_coeffs = np.asarray(dist_coeffs, dtype=dtype).flatten()

        # Handle projection model
        if projection_model is None:
            if len(self._dist_coeffs) > 0:
                self._projection_model = ProjectionModel.BrownConrady
            else:
                self._projection_model = ProjectionModel.Pinhole
        elif isinstance(projection_model, str):
            self._projection_model = ProjectionModel.from_string(projection_model)
        else:
            self._projection_model = projection_model

        if intrinsic_matrix is None:
            raise ValueError("Must provide 'intrinsic_matrix' or alias 'K'")

        self._intrinsic_matrix = np.asarray(intrinsic_matrix, dtype=dtype)
        if self._intrinsic_matrix.shape != (3, 3):
            raise ValueError(f"Intrinsic matrix must be 3x3, got {self._intrinsic_matrix.shape}")

        # The internal matrix is just K with a [0, 0, 0, 1] row to make it 4x4
        # (scale/intrinsic transform). Points are assumed to be in the camera frame.
        matrix_4x4 = np.eye(4, dtype=dtype)
        matrix_4x4[:3, :3] = self._intrinsic_matrix

        super().__init__(matrix=matrix_4x4, dtype=dtype)

    @classmethod
    def from_intrinsics_and_transform(cls, *args, **kwargs):
        """Disabled — CameraProjection is intrinsic-only. Use separate Transform objects."""
        raise NotImplementedError(
            "CameraProjection is now Intrinsic-only. Use separate Transform objects for Extrinsics."
        )

    @property
    def intrinsic_matrix(self) -> np.ndarray:
        """The 3x3 camera intrinsic matrix K."""
        return self._intrinsic_matrix

    @property
    def fx(self) -> float:
        """Focal length x."""
        return float(self._intrinsic_matrix[0, 0])

    @property
    def fy(self) -> float:
        """Focal length y."""
        return float(self._intrinsic_matrix[1, 1])

    @property
    def cx(self) -> float:
        """Principal point x."""
        return float(self._intrinsic_matrix[0, 2])

    @property
    def cy(self) -> float:
        """Principal point y."""
        return float(self._intrinsic_matrix[1, 2])

    @property
    def focal_length(self) -> tuple[float, float]:
        """Focal lengths (fx, fy) from the intrinsic matrix."""
        return (self.fx, self.fy)

    @property
    def principal_point(self) -> tuple[float, float]:
        """Principal point (cx, cy) from the intrinsic matrix."""
        return (self.cx, self.cy)

    @property
    def dist_coeffs(self) -> np.ndarray:
        """Distortion coefficients (OpenCV ordering: k1, k2, p1, p2, k3, ...)."""
        return self._dist_coeffs

    @property
    def distortion_coefficients(self) -> np.ndarray:
        """Alias for dist_coeffs."""
        return self._dist_coeffs

    @property
    def projection_model(self) -> ProjectionModel:
        """The camera projection model (PINHOLE, BROWN_CONRADY, KANNALA_BRANDT, etc.)."""
        return self._projection_model

    @property
    def image_size(self) -> tuple[int, int] | None:
        """Image dimensions (width, height) in pixels, if specified."""
        return self._image_size

    # OpenCV-style shortcuts
    @property
    def K(self) -> np.ndarray:
        """Alias for intrinsic_matrix (OpenCV-style)."""
        return self._intrinsic_matrix

    @property
    def D(self) -> np.ndarray:
        """Alias for dist_coeffs (OpenCV-style)."""
        return self._dist_coeffs

    def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray:
        """
        Project 3D points to 2D pixel coordinates using the full projection model.

        Dispatches to the correct projection function based on ``projection_model``.
        Each model implements the complete 3D → 2D pipeline including distortion.

        Args:
            vector: Nx3 (or Nx4 homogeneous) points in camera frame.

        Returns:
            np.ndarray: Nx2 pixel coordinates.
        """
        pts = np.atleast_2d(vector)
        if pts.shape[1] == 4:
            pts = pts[:, :3] / pts[:, 3:4]

        model = self._projection_model
        if model == ProjectionModel.Pinhole:
            return self._project_pinhole(pts)
        elif model == ProjectionModel.BrownConrady:
            return self._project_brown_conrady(pts)
        elif model == ProjectionModel.KannalaBrandt:
            return self._project_kannala_brandt(pts)
        elif model == ProjectionModel.Rational:
            return self._project_rational(pts)
        elif model == ProjectionModel.Division:
            return self._project_division(pts)
        elif model == ProjectionModel.MeiUnified:
            return self._project_mei_unified(pts)
        elif model == ProjectionModel.Fisheye62:
            return self._project_fisheye62(pts)
        else:
            raise NotImplementedError(f"Projection not implemented for {model}")

    # ------------------------------------------------------------------
    # Per-model projection implementations
    #
    # All methods use named numerical constants instead of magic numbers.
    # Polynomial evaluation uses Horner form: p(θ²) = ((k4·θ² + k3)·θ² + k2)·θ² + k1
    # ------------------------------------------------------------------

    def _get_padded_dist_coeffs(self, count: int) -> np.ndarray:
        """Return distortion coefficients padded with zeros to the requested length.

        Args:
            count: Number of coefficients to return.

        Returns:
            Array of length ``count`` with available coefficients filled in and
            the remainder set to zero.
        """
        coeffs = np.zeros(count, dtype=self.dtype)
        num_present = min(len(self._dist_coeffs), count)
        if num_present > 0:
            coeffs[:num_present] = self._dist_coeffs[:num_present]
        return coeffs

    def _project_pinhole(self, pts: np.ndarray) -> np.ndarray:
        """Ideal pinhole: normalize by z, apply K. No distortion."""
        z = np.where(np.abs(pts[:, 2]) < _DEPTH_EPS, _DEPTH_EPS, pts[:, 2])
        x = pts[:, 0] / z
        y = pts[:, 1] / z
        u = self.fx * x + self.cx
        v = self.fy * y + self.cy
        return np.column_stack([u, v])

    def _project_brown_conrady(self, pts: np.ndarray) -> np.ndarray:
        """Pinhole + Brown-Conrady radial/tangential distortion (OpenCV default).

        D = (k1, k2, p1, p2 [, k3])
        """
        z = np.where(np.abs(pts[:, 2]) < _DEPTH_EPS, _DEPTH_EPS, pts[:, 2])
        x = pts[:, 0] / z
        y = pts[:, 1] / z

        d = self._dist_coeffs
        if len(d) >= 4:
            r2 = x * x + y * y
            k1, k2, p1, p2 = d[0], d[1], d[2], d[3]
            k3 = d[4] if len(d) > 4 else 0.0

            # Horner form: 1 + r²·(k1 + r²·(k2 + r²·k3))
            radial = 1.0 + r2 * (k1 + r2 * (k2 + r2 * k3))
            x_tan = 2 * p1 * x * y + p2 * (r2 + 2 * x * x)
            y_tan = p1 * (r2 + 2 * y * y) + 2 * p2 * x * y
            x = x * radial + x_tan
            y = y * radial + y_tan

        u = self.fx * x + self.cx
        v = self.fy * y + self.cy
        return np.column_stack([u, v])

    def _project_kannala_brandt(self, pts: np.ndarray) -> np.ndarray:
        """Kannala-Brandt equidistant fisheye model (OpenCV cv2.fisheye).

        D = (k1, k2, k3, k4)

        Projects via θ_d = θ·(1 + k1·θ² + k2·θ⁴ + k3·θ⁶ + k4·θ⁸)
        where θ = atan2(r, z), and the distorted point is scaled by θ_d/r.
        """
        x, y, z = pts[:, 0], pts[:, 1], pts[:, 2]
        r = np.sqrt(x * x + y * y)
        theta = np.arctan2(r, z)

        k1, k2, k3, k4 = self._get_padded_dist_coeffs(4)

        theta2 = theta * theta
        # Horner form: θ·(1 + θ²·(k1 + θ²·(k2 + θ²·(k3 + θ²·k4))))
        theta_d = theta * (1 + theta2 * (k1 + theta2 * (k2 + theta2 * (k3 + theta2 * k4))))

        # Scale factor: θ_d / r (safe division for on-axis points where r → 0)
        safe_r = np.where(r < _RADIAL_EPS, 1.0, r)
        scale = np.where(r < _RADIAL_EPS, 1.0, theta_d / safe_r)
        x_d = x * scale
        y_d = y * scale

        u = self.fx * x_d + self.cx
        v = self.fy * y_d + self.cy
        return np.column_stack([u, v])

    def _project_rational(self, pts: np.ndarray) -> np.ndarray:
        """Rational polynomial model (OpenCV CALIB_RATIONAL_MODEL).

        D = (k1, k2, p1, p2, k3, k4, k5, k6)

        Radial: (1 + k1·r² + k2·r⁴ + k3·r⁶) / (1 + k4·r² + k5·r⁴ + k6·r⁶)
        Plus tangential distortion (p1, p2).
        """
        z = np.where(np.abs(pts[:, 2]) < _DEPTH_EPS, _DEPTH_EPS, pts[:, 2])
        x = pts[:, 0] / z
        y = pts[:, 1] / z

        k1, k2, p1, p2, k3, k4, k5, k6 = self._get_padded_dist_coeffs(8)

        r2 = x * x + y * y

        # Horner form for numerator and denominator
        numerator = 1.0 + r2 * (k1 + r2 * (k2 + r2 * k3))
        denominator = 1.0 + r2 * (k4 + r2 * (k5 + r2 * k6))
        safe_denom = np.where(np.abs(denominator) < _DENOM_EPS, _DENOM_EPS, denominator)
        radial = numerator / safe_denom

        x_tan = 2 * p1 * x * y + p2 * (r2 + 2 * x * x)
        y_tan = p1 * (r2 + 2 * y * y) + 2 * p2 * x * y
        x = x * radial + x_tan
        y = y * radial + y_tan

        u = self.fx * x + self.cx
        v = self.fy * y + self.cy
        return np.column_stack([u, v])

    def _project_division(self, pts: np.ndarray) -> np.ndarray:
        """Division undistortion model.

        D = (k1,)

        Distorted: x_d = x / (1 + k1·r²). Simple single-parameter wide-angle model.
        """
        z = np.where(np.abs(pts[:, 2]) < _DEPTH_EPS, _DEPTH_EPS, pts[:, 2])
        x = pts[:, 0] / z
        y = pts[:, 1] / z

        (k1,) = self._get_padded_dist_coeffs(1)

        r2 = x * x + y * y
        denom = 1.0 + k1 * r2
        safe_denom = np.where(np.abs(denom) < _DENOM_EPS, _DENOM_EPS, denom)
        x = x / safe_denom
        y = y / safe_denom

        u = self.fx * x + self.cx
        v = self.fy * y + self.cy
        return np.column_stack([u, v])

    def _project_mei_unified(self, pts: np.ndarray) -> np.ndarray:
        """Mei Unified omnidirectional camera model.

        D = (xi [, k1, k2])

        Projects onto the unit sphere, then divides by (z + xi) to model the
        mirror/sphere, applies radial distortion, then scales by focal
        length and principal point.
        """
        norm = np.sqrt(pts[:, 0] ** 2 + pts[:, 1] ** 2 + pts[:, 2] ** 2)
        safe_norm = np.where(norm < _NORM_EPS, _NORM_EPS, norm)
        x = pts[:, 0] / safe_norm
        y = pts[:, 1] / safe_norm
        z = pts[:, 2] / safe_norm

        xi, k1, k2 = self._get_padded_dist_coeffs(3)

        denom = z + xi
        safe_denom = np.where(np.abs(denom) < _DENOM_EPS, _DENOM_EPS, denom)
        x = x / safe_denom
        y = y / safe_denom

        # Radial distortion (unconditional — branch-free)
        r2 = x * x + y * y
        radial = 1.0 + r2 * (k1 + r2 * k2)  # Horner form
        x = x * radial
        y = y * radial

        u = self.fx * x + self.cx
        v = self.fy * y + self.cy
        return np.column_stack([u, v])

    def _project_fisheye62(self, pts: np.ndarray) -> np.ndarray:
        """Project Aria Fisheye62 model.

        D = (k0, k1, k2, k3, p0, p1)

        Equidistant-style with 4 radial + 2 tangential coefficients.
        """
        x, y, z = pts[:, 0], pts[:, 1], pts[:, 2]
        r = np.sqrt(x * x + y * y)
        theta = np.arctan2(r, z)

        k0, k1, k2, k3, p0, p1 = self._get_padded_dist_coeffs(6)

        theta2 = theta * theta
        # Horner form: θ·(1 + θ²·(k0 + θ²·(k1 + θ²·(k2 + θ²·k3))))
        theta_d = theta * (1 + theta2 * (k0 + theta2 * (k1 + theta2 * (k2 + theta2 * k3))))

        safe_r = np.where(r < _RADIAL_EPS, 1.0, r)
        scale = np.where(r < _RADIAL_EPS, 1.0, theta_d / safe_r)
        x_d = x * scale
        y_d = y * scale

        # Tangential distortion, computed from the radially distorted point.
        # Both components use the same (x_d, y_d), so the second term does not
        # see the first's update.
        r2_d = x_d * x_d + y_d * y_d
        x_t = x_d + 2 * p0 * x_d * y_d + p1 * (r2_d + 2 * x_d * x_d)
        y_t = y_d + p0 * (r2_d + 2 * y_d * y_d) + 2 * p1 * x_d * y_d

        u = self.fx * x_t + self.cx
        v = self.fy * y_t + self.cy
        return np.column_stack([u, v])

    def to_dict(self) -> dict[str, Any]:
        """Serialize camera projection to a JSON-compatible dictionary."""
        result = {
            "type": "CameraProjection",
            "intrinsic_matrix": self._intrinsic_matrix.tolist(),
            "dist_coeffs": self._dist_coeffs.tolist(),
            "projection_model": self._projection_model.value,
            "dtype": np.dtype(self.dtype).name,
        }
        if self._image_size is not None:
            result["image_size"] = list(self._image_size)
        return result

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> CameraProjection:
        """Deserialize camera projection from a dictionary."""
        dtype = np.dtype(data.get("dtype", "float64"))
        dist_coeffs = data.get("dist_coeffs", [])
        projection_model_str = data.get("projection_model", "Pinhole")
        image_size = data.get("image_size", None)
        if image_size is not None:
            image_size = tuple(image_size)

        return cls(
            intrinsic_matrix=np.array(data["intrinsic_matrix"]),
            dist_coeffs=dist_coeffs,
            projection_model=projection_model_str,
            image_size=image_size,
            dtype=dtype,
        )

    def __repr__(self) -> str:
        fx, fy = self.focal_length
        cx, cy = self.principal_point
        return f"CameraProjection(fx={fx:.1f}, fy={fy:.1f}, cx={cx:.1f}, cy={cy:.1f})"

    def inverse(self) -> InverseCameraProjection:
        """
        Returns an InverseCameraProjection for unprojection.

        Preserves the camera parameters (intrinsics) in the inverse object.
        """
        return InverseCameraProjection(self)
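A short sketch of intrinsic-only projection through a frame graph (numbers are illustrative; the Brown-Conrady model is inferred here because distortion coefficients are supplied)::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[450.0, 0.0, 316.0],
                  [0.0, 452.0, 238.5],
                  [0.0, 0.0, 1.0]])
    D = [-0.28, 0.07, 0.001, -0.0002, 0.0]        # k1, k2, p1, p2, k3

    cam = tf.CameraProjection(K=K, D=D, image_size=(640, 480))

    graph = tf.TransformGraph()
    graph.add_transform("camera", "image", cam)

    pts_cam = np.array([[0.2, -0.1, 4.0],
                        [1.5, 0.3, 10.0]])        # already in the camera frame
    pixels = tf.transform_points(pts_cam, graph, "camera", "image")   # Nx2 pixels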
@register_transform
class InverseCameraProjection(InverseProjection):
    """
    The inverse of a CameraProjection.

    Preserves the internal CameraProjection instance to maintain access to
    intrinsics (K) and distortion coefficients.
    """

    def __init__(self, camera_projection: CameraProjection):
        """
        Create an InverseCameraProjection.

        Args:
            camera_projection: The original CameraProjection instance.
        """
        self._camera_projection = camera_projection
        # Initialize parent with the matrix (pseudo-inverted by the parent's as_matrix)
        super().__init__(original_matrix=camera_projection.matrix, dtype=camera_projection.dtype)

    @property
    def camera_projection(self) -> CameraProjection:
        """The original CameraProjection instance."""
        return self._camera_projection

    # Shortcuts exposing the camera parameters
    @property
    def fx(self) -> float:
        """Focal length in x from the original CameraProjection."""
        return self._camera_projection.fx

    @property
    def fy(self) -> float:
        """Focal length in y from the original CameraProjection."""
        return self._camera_projection.fy

    @property
    def cx(self) -> float:
        """Principal point x from the original CameraProjection."""
        return self._camera_projection.cx

    @property
    def cy(self) -> float:
        """Principal point y from the original CameraProjection."""
        return self._camera_projection.cy

    @property
    def intrinsic_matrix(self) -> np.ndarray:
        """The 3x3 intrinsic matrix K from the original CameraProjection."""
        return self._camera_projection.intrinsic_matrix

    def inverse(self) -> CameraProjection:
        """Returns the original CameraProjection."""
        return self._camera_projection

    def to_dict(self) -> dict[str, Any]:
        """Serialize to dictionary using the contained CameraProjection."""
        return {
            "type": "InverseCameraProjection",
            "camera_projection": self._camera_projection.to_dict(),
        }

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> InverseCameraProjection:
        """Deserialize from dictionary."""
        cam_data = data["camera_projection"]
        if cam_data.get("type") != "CameraProjection":
            cam_data["type"] = "CameraProjection"

        camera_proj = CameraProjection.from_dict(cam_data)
        return cls(camera_proj)

    def __repr__(self) -> str:
        return f"InverseCameraProjection({self._camera_projection})"
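Continuing the CameraProjection sketch above, going back from pixels to camera-frame points via the typed inverse (illustrative depths; note that the inherited unproject() uses the pinhole decomposition and ignores distortion coefficients)::

    inv_cam = cam.inverse()                       # InverseCameraProjection
    pixels = np.array([[316.0, 238.5]])           # the principal point
    depths = np.array([4.0])
    pts_cam = inv_cam.unproject(pixels, depths)   # -> [[0.0, 0.0, 4.0]]
    K_back = inv_cam.intrinsic_matrix             # intrinsics remain accessible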
@register_transform
class OrthographicProjection(Projection):
    """
    Orthographic (parallel) projection — maps 3D to 2D without perspective.

    Unlike perspective ``CameraProjection``, this applies a pure affine
    mapping: the output pixel coordinates are a linear function of the input
    3D coordinates, with no division by depth.

    Axis conventions (default ``"top"`` / BEV):
    * **+x** (forward) → image **top** (smaller row)
    * **+y** (left) → image **left** (smaller col)

    Supported axis presets:
    * ``"top"`` — Bird's-eye view (drops Z)
    * ``"front"`` — Front view (drops X)
    * ``"side"`` — Side view (drops Y)

    Usage::

        ortho = OrthographicProjection("top", (-50, 50), (-50, 50), 0.1)
        graph.add_transform("ego", "bev", ortho)
        pixels = transform_points(pts, graph, "lidar", "bev")

    Args:
        axis: Projection axis preset (``"top"``, ``"front"``, ``"side"``).
        u_range: World-coordinate extent along the column axis (metres).
        v_range: World-coordinate extent along the row axis (metres).
        resolution: Metres per pixel.
        dtype: Numeric data type.
    """

    def __init__(
        self,
        axis: str = "top",
        u_range: tuple[float, float] = (-50.0, 50.0),
        v_range: tuple[float, float] = (-50.0, 50.0),
        resolution: float = 0.1,
        dtype: np.dtype = np.float64,
    ):
        if axis not in _ORTHO_AXIS_PRESETS:
            raise ValueError(f"Unknown axis '{axis}', must be one of {list(_ORTHO_AXIS_PRESETS)}")

        self._axis = axis
        self._u_range = tuple(u_range)
        self._v_range = tuple(v_range)
        self._resolution = float(resolution)

        u_idx, v_idx, flip_u, flip_v = _ORTHO_AXIS_PRESETS[axis]
        self._u_idx = u_idx
        self._v_idx = v_idx

        # Build the 3x4 affine projection matrix.
        # col = (u_max - world[u_idx]) / res if flip_u else (world[u_idx] - u_min) / res
        # row = (v_max - world[v_idx]) / res if flip_v else (world[v_idx] - v_min) / res
        inv_res = 1.0 / resolution
        mat = np.zeros((3, 4), dtype=dtype)

        if flip_u:
            mat[0, u_idx] = -inv_res
            mat[0, 3] = u_range[1] * inv_res
        else:
            mat[0, u_idx] = inv_res
            mat[0, 3] = -u_range[0] * inv_res

        if flip_v:
            mat[1, v_idx] = -inv_res
            mat[1, 3] = v_range[1] * inv_res
        else:
            mat[1, v_idx] = inv_res
            mat[1, 3] = -v_range[0] * inv_res

        # Third row: constant 1 (homogeneous w)
        mat[2, 3] = 1.0

        super().__init__(matrix=mat, dtype=dtype)

    # ------------------------------------------------------------------ #
    # Properties                                                         #
    # ------------------------------------------------------------------ #

    @property
    def axis(self) -> str:
        """Projection axis preset."""
        return self._axis

    @property
    def u_range(self) -> tuple[float, float]:
        """World-coordinate extent along the column axis (metres)."""
        return self._u_range

    @property
    def v_range(self) -> tuple[float, float]:
        """World-coordinate extent along the row axis (metres)."""
        return self._v_range

    @property
    def resolution(self) -> float:
        """Metres per pixel."""
        return self._resolution

    @property
    def grid_shape(self) -> tuple[int, int]:
        """Output image dimensions ``(H, W)`` in pixels."""
        W = int((self._u_range[1] - self._u_range[0]) / self._resolution)
        H = int((self._v_range[1] - self._v_range[0]) / self._resolution)
        return H, W

    @property
    def origin_pixel(self) -> tuple[int, int]:
        """Pixel coordinates ``(col, row)`` of the world origin ``(0, 0, 0)``."""
        px = self._apply(np.array([[0.0, 0.0, 0.0]]))[0]
        return int(px[0]), int(px[1])

    # ------------------------------------------------------------------ #
    # Core projection (override — NO perspective division)               #
    # ------------------------------------------------------------------ #

    def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray:
        """
        Project 3D points to 2D pixel coordinates (affine, no perspective).

        Args:
            vector: ``(N, 3)`` or ``(N, 4)`` array of 3D points.

        Returns:
            ``(N, 2)`` array of ``[col, row]`` pixel coordinates.
        """
        vector = np.atleast_2d(np.asarray(vector, dtype=self.dtype))

        if vector.shape[1] == 3:
            hom = np.hstack([vector, np.ones((vector.shape[0], 1), dtype=self.dtype)])
        elif vector.shape[1] == 4:
            hom = vector
        else:
            raise ValueError(f"Input must be Nx3 or Nx4, got {vector.shape}")

        # Affine projection: result[:, :2] are pixel coords, result[:, 2] == 1
        projected = (self.matrix @ hom.T).T
        return projected[:, :2]

    def project_points(self, points: np.ndarray | list | tuple) -> np.ndarray:
        """Alias for :meth:`_apply`."""
        return self._apply(points)

    # ------------------------------------------------------------------ #
    # Inverse                                                            #
    # ------------------------------------------------------------------ #

    def inverse(self) -> InverseOrthographicProjection:
        """
        Return the inverse projection.

        The inverse lifts 2D pixel coordinates back to 3D, placing them on the
        projection plane (the collapsed axis coordinate is set to 0).
        """
        return InverseOrthographicProjection(self)

    # ------------------------------------------------------------------ #
    # Serialization                                                      #
    # ------------------------------------------------------------------ #

    def to_dict(self) -> dict[str, Any]:
        """Serialize to a JSON-compatible dictionary."""
        return {
            "type": "OrthographicProjection",
            "axis": self._axis,
            "u_range": list(self._u_range),
            "v_range": list(self._v_range),
            "resolution": self._resolution,
            "dtype": np.dtype(self.dtype).name,
        }

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> OrthographicProjection:
        """Deserialize from a dictionary."""
        return cls(
            axis=data["axis"],
            u_range=tuple(data["u_range"]),
            v_range=tuple(data["v_range"]),
            resolution=data["resolution"],
            dtype=np.dtype(data.get("dtype", "float64")),
        )

    def __repr__(self) -> str:
        H, W = self.grid_shape
        return (
            f"OrthographicProjection(axis={self._axis!r}, "
            f"u_range={self._u_range}, v_range={self._v_range}, "
            f"res={self._resolution}, grid={W}x{H})"
        )
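A small sketch of the BEV grid geometry (illustrative values; with symmetric ranges the world origin lands at the grid centre, and the exact row/column orientation follows the axis preset described above)::

    import numpy as np
    import tgraph.transform as tf

    bev = tf.OrthographicProjection("top", u_range=(-50.0, 50.0),
                                    v_range=(-50.0, 50.0), resolution=0.1)

    bev.grid_shape      # (1000, 1000): (H, W) = extent / resolution
    bev.origin_pixel    # (500, 500): (col, row) of world (0, 0, 0)

    # z is dropped for the "top" preset; the output is [col, row] per point
    pixels = bev.project_points(np.array([[10.0, 5.0, 1.2]]))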
@register_transform
class InverseOrthographicProjection(InverseProjection):
    """
    Inverse of an :class:`OrthographicProjection`.

    Lifts 2D pixel coordinates back to 3D by inverting the affine mapping.
    The collapsed axis coordinate is set to 0 (i.e. the point is placed on
    the projection plane).
    """

    def __init__(self, ortho: OrthographicProjection):
        self._ortho = ortho
        super().__init__(original_matrix=ortho.matrix, dtype=ortho.dtype)

    @property
    def orthographic_projection(self) -> OrthographicProjection:
        """The original OrthographicProjection."""
        return self._ortho

    def _apply(self, vector: np.ndarray | list | tuple) -> np.ndarray:
        """
        Unproject 2D pixel coordinates to 3D (on the projection plane).

        Args:
            vector: ``(N, 2)`` pixel coords ``[col, row]``.

        Returns:
            ``(N, 3)`` array of 3D points (collapsed axis = 0).
        """
        vector = np.atleast_2d(np.asarray(vector, dtype=self.dtype))
        if vector.shape[1] != 2:
            raise ValueError(f"Expected Nx2 pixel coordinates, got {vector.shape}")

        cols = vector[:, 0]
        rows = vector[:, 1]

        u_idx, v_idx, flip_u, flip_v = _ORTHO_AXIS_PRESETS[self._ortho.axis]
        res = self._ortho.resolution
        u_range = self._ortho.u_range
        v_range = self._ortho.v_range

        # Invert the affine mapping
        if flip_u:
            world_u = u_range[1] - cols * res
        else:
            world_u = u_range[0] + cols * res

        if flip_v:
            world_v = v_range[1] - rows * res
        else:
            world_v = v_range[0] + rows * res

        # Build 3D points (collapsed axis = 0)
        N = len(cols)
        points = np.zeros((N, 3), dtype=self.dtype)
        points[:, u_idx] = world_u
        points[:, v_idx] = world_v
        return points

    def inverse(self) -> OrthographicProjection:
        """Return the original OrthographicProjection."""
        return self._ortho

    def to_dict(self) -> dict[str, Any]:
        """Serialize to a JSON-compatible dictionary."""
        return {
            "type": "InverseOrthographicProjection",
            "orthographic_projection": self._ortho.to_dict(),
        }

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> InverseOrthographicProjection:
        """Deserialize from a dictionary produced by to_dict."""
        ortho = OrthographicProjection.from_dict(data["orthographic_projection"])
        return cls(ortho)

    def __repr__(self) -> str:
        return f"InverseOrthographicProjection({self._ortho})"
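Continuing the BEV sketch, the typed inverse lifts pixels back onto the projection plane; the collapsed coordinate comes back as 0, so any original height is not recovered::

    inv_bev = bev.inverse()          # InverseOrthographicProjection
    inv_bev.inverse() is bev         # True: round-trips to the original projection
    # Lifting the grid-centre pixel (500, 500) places the point at the world
    # origin on the projection plane (z = 0 for the "top" preset).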
@register_transform
class CompositeProjection(Projection):
    """
    Represents a composition of a Projection (Intrinsics) and a Transform (Extrinsics).

    Equivalent to: Projection * Transform
        P_composite = K * T

    Projects from the source frame of T directly to 2D.

    Structure:
    - projection: The Projection component (applied last/leftmost)
    - transform: The Transform component (applied first/rightmost)
    """

    def __init__(self, projection: Projection, transform: Transform, dtype: np.dtype = np.float64):
        self._projection = projection
        self._transform = transform

        # Combined matrix for BaseTransform compatibility: K * T
        matrix = projection.as_matrix() @ transform.as_matrix()
        super().__init__(matrix=matrix, dtype=dtype)

    @property
    def projection(self) -> Projection:
        """The intrinsic Projection component (applied last in the chain)."""
        return self._projection

    @property
    def transform(self) -> Transform:
        """The extrinsic Transform component (applied first in the chain)."""
        return self._transform

    def inverse(self) -> InverseCompositeProjection:
        """Return the inverse as an InverseCompositeProjection (T_inv * K_inv)."""
        return InverseCompositeProjection(self._transform.inverse(), self._projection.inverse())

    def __mul__(self, other: BaseTransform) -> BaseTransform:
        if isinstance(other, Transform):
            # Composite * Transform = (K * T_old) * T_new = K * (T_old * T_new)
            new_transform = self._transform * other
            # T_old * T_new should itself be a Transform; fall back otherwise.
            if not isinstance(new_transform, Transform):
                return super().__mul__(other)

            return CompositeProjection(self._projection, new_transform, dtype=self.dtype)

        # Inherit strict checks from Projection
        return super().__mul__(other)

    def to_dict(self) -> dict[str, Any]:
        """Serialize CompositeProjection (projection + transform) to dictionary."""
        return {
            "type": "CompositeProjection",
            "projection": self._projection.to_dict(),
            "transform": self._transform.to_dict(),
            "dtype": np.dtype(self.dtype).name,
        }

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> CompositeProjection:
        """Deserialize CompositeProjection from dictionary."""
        dtype = np.dtype(data.get("dtype", "float64"))
        projection = deserialize_transform(data["projection"])
        transform = deserialize_transform(data["transform"])
        if not isinstance(projection, Projection):
            raise ValueError("CompositeProjection projection must be a Projection")
        if not isinstance(transform, Transform):
            raise ValueError("CompositeProjection transform must be a Transform")
        return cls(projection, transform, dtype=dtype)

    def __repr__(self) -> str:
        return f"CompositeProjection(projection={self._projection}, transform={self._transform})"
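A sketch of how the composition algebra produces this class (illustrative values; assumes Transform defaults to an identity rotation when only a translation is given)::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])
    cam = tf.CameraProjection(K=K)                               # image <- camera
    T_cam_from_robot = tf.Transform(translation=[0.1, 0.0, 0.5]) # camera <- robot

    proj_from_robot = cam * T_cam_from_robot                     # image <- robot
    # proj_from_robot is a CompositeProjection: .projection holds the intrinsics,
    # .transform holds the extrinsic part that is applied first.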
@register_transform
class InverseCompositeProjection(InverseProjection):
    """
    Represents the inverse of a CompositeProjection.

    Equivalent to: Transform * InverseProjection
        P_inv_composite = T * K_inv

    Unprojects from 2D to the source frame of T.

    Structure:
    - transform: The Transform component (applied last/leftmost)
    - projection: The InverseProjection component (applied first/rightmost)
    """

    def __init__(
        self, transform: Transform, projection: InverseProjection, dtype: np.dtype = np.float64
    ):
        self._transform = transform
        self._projection = projection

        # The parent class expects the *original* (forward) projection matrix,
        # i.e. P_comp = K * T_inv, which its as_matrix() then pseudo-inverts.
        # Reconstruct it from the stored components; fall back to identity if
        # either component cannot be inverted.
        try:
            orig_proj_mat = projection.inverse().as_matrix() @ transform.inverse().as_matrix()
            super().__init__(original_matrix=orig_proj_mat, dtype=dtype)
        except Exception:
            super().__init__(original_matrix=np.eye(4), dtype=dtype)

    @property
    def transform(self) -> Transform:
        """The extrinsic Transform component (applied last in the chain)."""
        return self._transform

    @property
    def projection(self) -> InverseProjection:
        """The inverse Projection component (applied first in the chain)."""
        return self._projection

    def as_matrix(self) -> np.ndarray:
        """Return the combined T * K_inv matrix."""
        return self._transform.as_matrix() @ self._projection.as_matrix()

    def inverse(self) -> CompositeProjection:
        """Return the inverse as a CompositeProjection (K * T_inv)."""
        return CompositeProjection(
            self._projection.inverse(), self._transform.inverse(), dtype=self.dtype
        )

    def __mul__(self, other: BaseTransform) -> BaseTransform:
        # Inherit strict checks from InverseProjection
        return super().__mul__(other)

    def __rmul__(self, other: BaseTransform) -> BaseTransform:
        # Handle Transform * InverseCompositeProjection:
        # T_new * (T_old * K_inv) = (T_new * T_old) * K_inv
        if isinstance(other, Transform):
            new_transform = other * self._transform
            if isinstance(new_transform, Transform):
                return InverseCompositeProjection(new_transform, self._projection, dtype=self.dtype)

        return NotImplemented

    def to_dict(self) -> dict[str, Any]:
        """Serialize InverseCompositeProjection to dictionary."""
        return {
            "type": "InverseCompositeProjection",
            "transform": self._transform.to_dict(),
            "projection": self._projection.to_dict(),
            "dtype": np.dtype(self.dtype).name,
        }

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> InverseCompositeProjection:
        """Deserialize InverseCompositeProjection from dictionary."""
        dtype = np.dtype(data.get("dtype", "float64"))
        transform = deserialize_transform(data["transform"])
        projection = deserialize_transform(data["projection"])
        if not isinstance(transform, Transform):
            raise ValueError("InverseCompositeProjection transform must be a Transform")
        if not isinstance(projection, InverseProjection):
            raise ValueError("InverseCompositeProjection projection must be an InverseProjection")
        return cls(transform, projection, dtype=dtype)

    def __repr__(self) -> str:
        return (
            f"InverseCompositeProjection("
            f"transform={self._transform}, "
            f"projection={self._projection})"
        )
Represents the inverse of a CompositeProjection.
Equivalent to: Transform * InverseProjection, i.e. P_inv_composite = T * K_inv.
Unprojects from 2D to the source frame of T.
Structure:
- transform: The Transform component (applied last/leftmost)
- projection: The InverseProjection component (applied first/rightmost)
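In practice an InverseCompositeProjection is obtained from the composition algebra rather than constructed directly. A minimal sketch, assuming CameraProjection.inverse() yields an InverseCameraProjection (intrinsics and pose below are illustrative)::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])
    cam = tf.CameraProjection(K=K)

    # Camera pose used to reposition unprojected points (camera -> world here).
    T = tf.Transform(translation=[0.0, 0.0, 1.5])

    # Transform * InverseProjection -> InverseCompositeProjection (2D -> 3D -> 3D)
    inv_comp = T * cam.inverse()      # cam.inverse() assumed to return an InverseCameraProjection
    print(type(inv_comp).__name__)    # expected: InverseCompositeProjection

    # Inverting recovers the forward CompositeProjection (K * T_inv).
    fwd = inv_comp.inverse()
    print(type(fwd).__name__)         # expected: CompositeProjection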
2166 def __init__( 2167 self, transform: Transform, projection: InverseProjection, dtype: np.dtype = np.float64 2168 ): 2169 self._transform = transform 2170 self._projection = projection 2171 2172 # Calculate matrix for BaseTransform compatibility 2173 # Matrix = T * K_inv 2174 transform.as_matrix() @ projection.as_matrix() 2175 # Pass a dummy matrix to super, we override everything anyway. 2176 # But for correctness, passing what we think is the "original" is hard. 2177 # Let's assume standard behavior. 2178 # super needs "original_matrix" that is the PROJECTION matrix. 2179 # P_comp = K * T_inv. 2180 # So original = (K * T_inv).matrix 2181 2182 # NOTE: We can't easily construct the unified projection 2183 # matrix just from pieces without logic, 2184 # but let's try. 2185 try: 2186 # K * T_inv 2187 orig_proj_mat = projection.inverse().as_matrix() @ transform.inverse().as_matrix() 2188 super().__init__(original_matrix=orig_proj_mat, dtype=dtype) 2189 except Exception: 2190 # Fallback 2191 super().__init__(original_matrix=np.eye(4), dtype=dtype)
Create an InverseProjection from the original projection matrix.
Args: original_matrix: The original 3x4 or 4x4 projection matrix. dtype: Data type for the matrix.
2193 @property 2194 def transform(self) -> Transform: 2195 """The extrinsic Transform component (applied first in the chain).""" 2196 return self._transform
The extrinsic Transform component (applied last/leftmost in the chain).
2198 @property 2199 def projection(self) -> InverseProjection: 2200 """The inverse Projection component (applied last in the chain).""" 2201 return self._projection
The inverse Projection component (applied first/rightmost in the chain).
2203 def as_matrix(self) -> np.ndarray: 2204 """Return the combined T * K_inv matrix.""" 2205 return self._transform.as_matrix() @ self._projection.as_matrix()
Return the combined T * K_inv matrix.
2207 def inverse(self) -> CompositeProjection: 2208 """Return the inverse as a CompositeProjection (K * T_inv).""" 2209 return CompositeProjection( 2210 self._projection.inverse(), self._transform.inverse(), dtype=self.dtype 2211 )
Return the inverse as a CompositeProjection (K * T_inv).
2227 def to_dict(self) -> dict[str, Any]: 2228 """Serialize InverseCompositeProjection to dictionary.""" 2229 return { 2230 "type": "InverseCompositeProjection", 2231 "transform": self._transform.to_dict(), 2232 "projection": self._projection.to_dict(), 2233 "dtype": np.dtype(self.dtype).name, 2234 }
Serialize InverseCompositeProjection to dictionary.
2236 @classmethod 2237 def from_dict(cls, data: dict[str, Any]) -> InverseCompositeProjection: 2238 """Deserialize InverseCompositeProjection from dictionary.""" 2239 dtype = np.dtype(data.get("dtype", "float64")) 2240 transform = deserialize_transform(data["transform"]) 2241 projection = deserialize_transform(data["projection"]) 2242 if not isinstance(transform, Transform): 2243 raise ValueError("InverseCompositeProjection transform must be a Transform") 2244 if not isinstance(projection, InverseProjection): 2245 raise ValueError("InverseCompositeProjection projection must be an InverseProjection") 2246 return cls(transform, projection, dtype=dtype)
Deserialize InverseCompositeProjection from dictionary.
126class ProjectionModel(Enum): 127 """ 128 Supported camera projection models. 129 130 Each member represents a complete 3D → 2D projection function, covering 131 both the ideal projection geometry and its associated distortion model. 132 133 Models: 134 135 - Pinhole: Ideal perspective projection, no distortion. 136 Parameters: fx, fy, cx, cy. 137 - BrownConrady: Pinhole + radial/tangential distortion. 138 D = (k1, k2, p1, p2, k3). OpenCV default, ROS ``plumb_bob``. 139 - KannalaBrandt: Fisheye / equidistant. 140 D = (k1, k2, k3, k4). ``cv2.fisheye``, ROS ``kannala_brandt``. 141 - Division: Simple wide-angle with single division coefficient. 142 D = (k1,). 143 - Rational: Full rational polynomial. 144 D = (k1, k2, p1, p2, k3, k4, k5, k6). ROS ``rational_polynomial``. 145 - Fisheye62: Project Aria fisheye model. 146 D = (k0, k1, k2, k3, p0, p1). 147 - MeiUnified: Unified omnidirectional camera model (Mei 2007). 148 D = (xi, k1, k2). Used by KITTI-360 fisheye cameras. 149 """ 150 151 Pinhole = "Pinhole" 152 BrownConrady = "BrownConrady" 153 KannalaBrandt = "KannalaBrandt" 154 Division = "Division" 155 Rational = "Rational" 156 Fisheye62 = "Fisheye62" 157 MeiUnified = "MeiUnified" 158 159 @classmethod 160 def from_string(cls, model_str: str) -> ProjectionModel: 161 """Convert a string to a ProjectionModel enum value. 162 163 Accepts ROS ``distortion_model`` names and legacy tgraph names. 164 """ 165 _aliases = { 166 # ROS camera_info distortion_model names 167 "plumb_bob": cls.BrownConrady, 168 "rational_polynomial": cls.Rational, 169 "kannala_brandt": cls.KannalaBrandt, 170 "fisheye62": cls.Fisheye62, 171 # Legacy tgraph names 172 "Fisheye": cls.KannalaBrandt, 173 "Omnidirectional": cls.Division, 174 "Pinhole+Polynomial": cls.BrownConrady, 175 } 176 if model_str in _aliases: 177 return _aliases[model_str] 178 # Exact value match 179 for model in cls: 180 if model.value == model_str: 181 return model 182 # Case-insensitive name match 183 lower = model_str.lower().replace("_", "").replace("-", "").replace("+", "") 184 for model in cls: 185 if model.name.lower() == lower: 186 return model 187 raise ValueError(f"Unknown projection model: {model_str}. Valid: {[m.value for m in cls]}")
Supported camera projection models.
Each member represents a complete 3D → 2D projection function, covering both the ideal projection geometry and its associated distortion model.
Models:
- Pinhole: Ideal perspective projection, no distortion. Parameters: fx, fy, cx, cy.
- BrownConrady: Pinhole + radial/tangential distortion. D = (k1, k2, p1, p2, k3). OpenCV default, ROS plumb_bob.
- KannalaBrandt: Fisheye / equidistant. D = (k1, k2, k3, k4). cv2.fisheye, ROS kannala_brandt.
- Division: Simple wide-angle with single division coefficient. D = (k1,).
- Rational: Full rational polynomial. D = (k1, k2, p1, p2, k3, k4, k5, k6). ROS rational_polynomial.
- Fisheye62: Project Aria fisheye model. D = (k0, k1, k2, k3, p0, p1).
- MeiUnified: Unified omnidirectional camera model (Mei 2007). D = (xi, k1, k2). Used by KITTI-360 fisheye cameras.
159 @classmethod 160 def from_string(cls, model_str: str) -> ProjectionModel: 161 """Convert a string to a ProjectionModel enum value. 162 163 Accepts ROS ``distortion_model`` names and legacy tgraph names. 164 """ 165 _aliases = { 166 # ROS camera_info distortion_model names 167 "plumb_bob": cls.BrownConrady, 168 "rational_polynomial": cls.Rational, 169 "kannala_brandt": cls.KannalaBrandt, 170 "fisheye62": cls.Fisheye62, 171 # Legacy tgraph names 172 "Fisheye": cls.KannalaBrandt, 173 "Omnidirectional": cls.Division, 174 "Pinhole+Polynomial": cls.BrownConrady, 175 } 176 if model_str in _aliases: 177 return _aliases[model_str] 178 # Exact value match 179 for model in cls: 180 if model.value == model_str: 181 return model 182 # Case-insensitive name match 183 lower = model_str.lower().replace("_", "").replace("-", "").replace("+", "") 184 for model in cls: 185 if model.name.lower() == lower: 186 return model 187 raise ValueError(f"Unknown projection model: {model_str}. Valid: {[m.value for m in cls]}")
Convert a string to a ProjectionModel enum value.
Accepts ROS distortion_model names and legacy tgraph names.
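A minimal sketch of the alias handling; the import path is an assumption (adjust it to wherever ProjectionModel is exported)::

    from tgraph.transform import ProjectionModel  # import path assumed

    # ROS camera_info names, legacy tgraph names, and case-insensitive spellings all resolve.
    assert ProjectionModel.from_string("plumb_bob") is ProjectionModel.BrownConrady
    assert ProjectionModel.from_string("kannala_brandt") is ProjectionModel.KannalaBrandt
    assert ProjectionModel.from_string("Fisheye") is ProjectionModel.KannalaBrandt   # legacy alias
    assert ProjectionModel.from_string("mei_unified") is ProjectionModel.MeiUnified  # case-insensitive name match

    ProjectionModel.from_string("no_such_model")  # raises ValueError listing the valid values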
2598class TransformGraph: 2599 """ 2600 Manages a graph of coordinate frames connected by spatial transformations. 2601 2602 Uses an undirected NetworkX graph with directional metadata on edges. 2603 Each edge stores: 2604 - transform: The BaseTransform object 2605 - parent: The source frame (defines the "natural" direction) 2606 - is_cache: Whether this is a cached shortcut (True) or added edge (False) 2607 - weight: 1.0 for added edges, 0.1 for cached shortcuts 2608 2609 Features: 2610 - Automatic path finding and transform composition 2611 - Lazy inversion when traversing against natural direction 2612 - Shortcut caching for O(1) repeated queries 2613 - Dependency-aware cache invalidation 2614 - JSON-compatible serialization 2615 """ 2616 2617 # Edge weights for path finding 2618 ADDED_EDGE_WEIGHT = 1.0 2619 CACHED_EDGE_WEIGHT = 1.0 2620 2621 def __init__(self): 2622 self._graph = nx.Graph() 2623 # Maps (source, target) added edges to list of cache edges that depend on them 2624 self._dependency_map: dict[tuple[str, str], list[tuple[str, str]]] = {} 2625 2626 @property 2627 def graph(self) -> nx.Graph: 2628 """Returns the internal NetworkX graph (read-only).""" 2629 return self._graph 2630 2631 @property 2632 def frames(self) -> list[str]: 2633 """Returns list of all frame IDs in the graph.""" 2634 return list(self._graph.nodes()) 2635 2636 @property 2637 def edges(self) -> list[tuple[str, str]]: 2638 """Returns list of all added edges as (reference_frame, target_frame) tuples.""" 2639 result = [] 2640 for u, v, data in self._graph.edges(data=True): 2641 if not data.get("is_cache", False): 2642 reference_frame = data["reference_frame"] 2643 target_frame = v if reference_frame == u else u 2644 result.append((reference_frame, target_frame)) 2645 return result 2646 2647 def has_frame(self, frame_id: str) -> bool: 2648 """Check if a frame exists in the graph.""" 2649 return frame_id in self._graph 2650 2651 def has_transform(self, source_frame: str, target_frame: str) -> bool: 2652 """Check if a direct transform (edge) exists between two frames.""" 2653 return self._graph.has_edge(source_frame, target_frame) 2654 2655 def add_transform( 2656 self, 2657 source_frame: str, 2658 target_frame: str, 2659 transform: BaseTransform, 2660 ) -> None: 2661 """ 2662 Add a transform between two frames. 2663 2664 API: add_transform(source, target, transform) 2665 - SOURCE: The domain frame (where vectors start). 2666 - TARGET: The codomain/reference frame (where vectors end). 2667 - TRANSFORM: Source→Target operator. 2668 2669 The transform maps Source coordinates to Target coordinates: 2670 P_target = transform * P_source 2671 2672 Args: 2673 source_frame: The source/domain frame ID. 2674 target_frame: The target/reference frame ID. 2675 transform: The transform from source to target. 2676 2677 Raises: 2678 ValueError: If an edge already exists between these frames. 2679 """ 2680 if self._graph.has_edge(source_frame, target_frame): 2681 raise ValueError( 2682 f"Transform between '{source_frame}' and '{target_frame}' already exists. " 2683 "Use update_transform() to modify it." 2684 ) 2685 2686 # Store edge between nodes. 2687 # We store "reference_frame" to indicate which node is the Target (Codomain) 2688 # of the transform. 2689 # If reference_frame = target_frame, then the transform is source -> target. 
2690 self._graph.add_edge( 2691 target_frame, 2692 source_frame, 2693 transform=transform, 2694 reference_frame=target_frame, 2695 is_cache=False, 2696 weight=self.ADDED_EDGE_WEIGHT, 2697 ) 2698 2699 def update_transform( 2700 self, 2701 source_frame: str, 2702 target_frame: str, 2703 transform: BaseTransform, 2704 ) -> None: 2705 """ 2706 Update an existing transform between two frames. 2707 2708 Automatically invalidates any cached shortcuts that depend on this edge. 2709 2710 Args: 2711 source_frame: The source frame ID. 2712 target_frame: The target frame ID. 2713 transform: The new transform from source to target. 2714 2715 Raises: 2716 ValueError: If no edge exists between these frames. 2717 """ 2718 if not self._graph.has_edge(target_frame, source_frame): 2719 raise ValueError( 2720 f"No transform between '{source_frame}' and '{target_frame}'. " 2721 "Use add_transform() to create it." 2722 ) 2723 2724 # Invalidate dependent caches 2725 self._invalidate_caches_for_edge(target_frame, source_frame) 2726 2727 # Update the transform 2728 self._graph[target_frame][source_frame]["transform"] = transform 2729 self._graph[target_frame][source_frame]["reference_frame"] = target_frame 2730 2731 def remove_transform(self, frame_a: str, frame_b: str) -> None: 2732 """ 2733 Remove a transform (edge) between two frames. 2734 2735 Args: 2736 frame_a: First frame ID. 2737 frame_b: Second frame ID. 2738 2739 Raises: 2740 ValueError: If no edge exists between these frames. 2741 """ 2742 if not self._graph.has_edge(frame_a, frame_b): 2743 raise ValueError(f"No transform between '{frame_a}' and '{frame_b}'.") 2744 2745 # Invalidate dependent caches 2746 self._invalidate_caches_for_edge(frame_a, frame_b) 2747 2748 # Remove the edge 2749 self._graph.remove_edge(frame_a, frame_b) 2750 2751 # Clean up isolated nodes 2752 for frame in [frame_a, frame_b]: 2753 if self._graph.degree(frame) == 0: 2754 self._graph.remove_node(frame) 2755 2756 def get_transform(self, source_frame: str, target_frame: str) -> BaseTransform: 2757 """ 2758 Get the transform from source_frame to target_frame. 2759 2760 Automatically finds the shortest path and composes transforms. 2761 Results are cached as shortcut edges for O(1) subsequent lookups. 2762 2763 Args: 2764 source_frame: The source frame ID. 2765 target_frame: The target frame ID. 2766 2767 Returns: 2768 BaseTransform: The composed transform T_source_to_target. 2769 2770 Raises: 2771 ValueError: If either frame doesn't exist or no path exists. 2772 """ 2773 if source_frame == target_frame: 2774 return Identity() 2775 2776 if source_frame not in self._graph: 2777 raise ValueError(f"Frame '{source_frame}' not found in graph.") 2778 if target_frame not in self._graph: 2779 raise ValueError(f"Frame '{target_frame}' not found in graph.") 2780 2781 # Check for direct edge (including cached shortcuts) 2782 if self._graph.has_edge(source_frame, target_frame): 2783 edge_data = self._graph[source_frame][target_frame] 2784 transform = edge_data["transform"] 2785 reference_frame = edge_data["reference_frame"] 2786 2787 if reference_frame == source_frame: 2788 # source is reference_frame (Target). Going to target_frame (Source). 2789 # Direction: Target -> Source. 2790 # Use inverse. 2791 return transform.inverse() 2792 return transform # source is Source. Going to reference_frame (Target). Use direct. 
2793 2794 # Find shortest path 2795 try: 2796 path = nx.shortest_path(self._graph, source_frame, target_frame, weight="weight") 2797 except nx.NetworkXNoPath: 2798 raise ValueError(f"No path from '{source_frame}' to '{target_frame}'.") 2799 2800 # Compose transforms along path 2801 composed_transform = Identity() 2802 added_edges = [] 2803 2804 for i in range(len(path) - 1): 2805 current_frame = path[i] 2806 next_frame = path[i + 1] 2807 2808 edge_data = self._graph[current_frame][next_frame] 2809 transform = edge_data["transform"] 2810 reference_frame = edge_data["reference_frame"] 2811 2812 if not edge_data.get("is_cache", False): 2813 # Normalize edge key (always smaller, larger) 2814 # Use str(frame) for consistent sorting across mixed types (int vs str vs UUID) 2815 sorted_frames = sorted([current_frame, next_frame], key=str) 2816 edge_key = tuple(sorted_frames) 2817 added_edges.append(edge_key) 2818 2819 # Traversal Logic: 2820 # If current is Ref (Target): Going to Source. Use Inverse. 2821 # If current is Source: Going to Ref (Target). Use Direct. 2822 if reference_frame == current_frame: 2823 step_transform = transform.inverse() 2824 else: 2825 step_transform = transform 2826 2827 # Compose in reverse order: new_step * accumulated 2828 # This ensures: (T3 * T2 * T1) transforms correctly 2829 composed_transform = step_transform * composed_transform 2830 2831 # Cache the result as a shortcut edge 2832 self._add_cache_edge(source_frame, target_frame, composed_transform, added_edges) 2833 2834 return composed_transform 2835 2836 def _add_cache_edge( 2837 self, 2838 source_frame: str, 2839 target_frame: str, 2840 transform: BaseTransform, 2841 added_edges: list[tuple[str, str]], 2842 ) -> None: 2843 """ 2844 Add a cached shortcut edge and register dependencies. 2845 2846 The transform is source→target, edge goes from target→source. 2847 """ 2848 self._graph.add_edge( 2849 target_frame, # Edge from ref (target) 2850 source_frame, # to source 2851 transform=transform, # source→target transform 2852 reference_frame=target_frame, # target is the reference_frame 2853 is_cache=True, 2854 weight=self.CACHED_EDGE_WEIGHT, 2855 ) 2856 2857 # Register cache dependency for all constituent edges 2858 # Use str() key for consistent sorting 2859 sorted_frames = sorted([source_frame, target_frame], key=str) 2860 cache_edge = tuple(sorted_frames) 2861 for added_edge in added_edges: 2862 if added_edge not in self._dependency_map: 2863 self._dependency_map[added_edge] = [] 2864 self._dependency_map[added_edge].append(cache_edge) 2865 2866 def _invalidate_caches_for_edge(self, frame_a: str, frame_b: str) -> None: 2867 """ 2868 Remove all cached edges that depend on the edge (frame_a, frame_b). 2869 """ 2870 sorted_frames = sorted([frame_a, frame_b], key=str) 2871 edge_key = tuple(sorted_frames) 2872 if edge_key in self._dependency_map: 2873 for cache_u, cache_v in self._dependency_map[edge_key]: 2874 if self._graph.has_edge(cache_u, cache_v) and self._graph[cache_u][cache_v].get( 2875 "is_cache", False 2876 ): 2877 self._graph.remove_edge(cache_u, cache_v) 2878 # Clear dependencies for this edge 2879 del self._dependency_map[edge_key] 2880 2881 def is_projection_frame(self, frame_id: str) -> bool: 2882 """ 2883 Check if a frame is a 2D projection frame (e.g., an Image frame). 2884 2885 Rule: A frame is a projection frame if ALL edges connected to it treat it 2886 as a projection space. 2887 - If transform maps INTO frame (frame is Target), transform must be a Projection. 
2888 - If transform maps OUT OF frame (frame is Source), transform must be an InverseProjection. 2889 """ 2890 if frame_id not in self._graph: 2891 return False 2892 2893 neighbors = list(self._graph.neighbors(frame_id)) 2894 if not neighbors: 2895 return False 2896 2897 for neighbor in neighbors: 2898 edge_data = self._graph[frame_id][neighbor] 2899 transform = edge_data["transform"] 2900 reference_frame = edge_data["reference_frame"] 2901 2902 # Use 'reference_frame' to determine direction. 2903 # reference_frame is the TARGET of the transform. 2904 2905 if reference_frame == frame_id: 2906 # Transform is Neighbor -> Frame (Source -> Target) 2907 # For Frame to be 2D, this must be a Projection (3D -> 2D) 2908 if not isinstance(transform, Projection): 2909 return False 2910 else: 2911 # Transform is Frame -> Neighbor (Source -> Target) 2912 # For Frame to be 2D, this must be an InverseProjection (2D -> 3D) 2913 if not isinstance(transform, InverseProjection): 2914 return False 2915 2916 return True 2917 2918 def _get_camera_intrinsics_and_pose(self, image_frame: str) -> tuple[np.ndarray, str]: 2919 """ 2920 Helper: Find the connected 3D camera frame and K matrix for an image frame. 2921 2922 Returns: 2923 (K, camera_frame_id) 2924 """ 2925 if image_frame not in self._graph: 2926 raise ValueError(f"Frame '{image_frame}' not found.") 2927 2928 for neighbor in self._graph.neighbors(image_frame): 2929 edge_data = self._graph[image_frame][neighbor] 2930 transform = edge_data["transform"] 2931 reference_frame = edge_data["reference_frame"] 2932 2933 K = None 2934 cam_frame = None 2935 2936 # Logic: If frame is projection frame, the neighbor must be the camera 2937 if isinstance(transform, CameraProjection): 2938 if reference_frame == image_frame: # Proj maps Neighbor->Image 2939 K = transform.intrinsic_matrix 2940 cam_frame = neighbor 2941 2942 elif isinstance(transform, InverseCameraProjection): 2943 if reference_frame != image_frame: # InvProj maps Image->Neighbor 2944 K = transform.intrinsic_matrix 2945 cam_frame = neighbor 2946 2947 if K is not None: 2948 return K, cam_frame 2949 2950 raise ValueError( 2951 f"Frame '{image_frame}' is not a valid projection frame connected to a camera." 2952 ) 2953 2954 def get_essential_matrix(self, image_frame_1: str, image_frame_2: str) -> np.ndarray: 2955 """ 2956 Compute the Essential Matrix E between two image frames. 2957 2958 E = [t]_x R 2959 where R, t describe method to transform points from Camera 1 to Camera 2. 2960 X2 = R X1 + t. 2961 """ 2962 _, c1 = self._get_camera_intrinsics_and_pose(image_frame_1) 2963 _, c2 = self._get_camera_intrinsics_and_pose(image_frame_2) 2964 2965 # Get transform from C1 to C2 2966 # T_c1_to_c2: converts X_c1 to X_c2. 2967 T_12 = self.get_transform(c1, c2) 2968 if not isinstance(T_12, Transform): 2969 raise ValueError(f"Transform between '{c1}' and '{c2}' is not a spatial Transform.") 2970 2971 R = quaternion.as_rotation_matrix(T_12.rotation) 2972 t = T_12.translation.flatten() 2973 2974 # Skew symmetric matrix of t 2975 t_x = skew(t) 2976 2977 return t_x @ R 2978 2979 def get_fundamental_matrix(self, image_frame_1: str, image_frame_2: str) -> np.ndarray: 2980 """ 2981 Compute the Fundamental Matrix F between two image frames. 
2982 2983 F = K2^-T E K1^-1 2984 x2^T F x1 = 0 2985 """ 2986 K1, _ = self._get_camera_intrinsics_and_pose(image_frame_1) 2987 K2, _ = self._get_camera_intrinsics_and_pose(image_frame_2) 2988 2989 E = self.get_essential_matrix(image_frame_1, image_frame_2) 2990 2991 K1_inv = np.linalg.inv(K1) 2992 K2_inv = np.linalg.inv(K2) 2993 2994 return K2_inv.T @ E @ K1_inv 2995 2996 def get_homography( 2997 self, 2998 image_frame_1: str, 2999 image_frame_2: str, 3000 plane_normal: np.ndarray, 3001 plane_distance: float, 3002 ) -> np.ndarray: 3003 """ 3004 Compute Homography H mapping pixels from image 1 to image 2 induced by a plane. 3005 3006 x2 ~ H x1 3007 3008 Plane equation in Camera 1 frame: n^T X = d 3009 H = K2 (R + t n^T / d) K1^-1 3010 3011 Args: 3012 image_frame_1: Source image frame. 3013 image_frame_2: Target image frame. 3014 plane_normal: Normal vector of the plane in Camera 1 frame (3,). 3015 plane_distance: Distance to the plane in Camera 1 frame (scalar). 3016 """ 3017 K1, c1 = self._get_camera_intrinsics_and_pose(image_frame_1) 3018 K2, c2 = self._get_camera_intrinsics_and_pose(image_frame_2) 3019 3020 # T_12: C1 -> C2 3021 T_12 = self.get_transform(c1, c2) 3022 R = quaternion.as_rotation_matrix(T_12.rotation) 3023 t = T_12.translation.flatten().reshape(3, 1) 3024 3025 n = np.asarray(plane_normal).reshape(3, 1) 3026 d = float(plane_distance) 3027 3028 H_euclidean = R + (t @ n.T) / d 3029 3030 return K2 @ H_euclidean @ np.linalg.inv(K1) 3031 3032 @staticmethod 3033 def estimate_skew(intrinsic_matrix: np.ndarray) -> float: 3034 """ 3035 Estimate the skew parameter from an intrinsic matrix K. 3036 3037 K = [[fx, s, cx], [0, fy, cy], [0, 0, 1]] 3038 Returns s. 3039 """ 3040 return float(intrinsic_matrix[0, 1]) 3041 3042 def clear_cache(self) -> None: 3043 """ 3044 Clear all cached shortcut transforms. 3045 3046 Removes edges marked with is_cache=True. 3047 """ 3048 edges_to_remove = [ 3049 (u, v) for u, v, data in self._graph.edges(data=True) if data.get("is_cache", False) 3050 ] 3051 self._graph.remove_edges_from(edges_to_remove) 3052 self._dependency_map.clear() 3053 3054 def get_connected_components(self) -> list[set[str]]: 3055 """ 3056 Get all connected components in the graph. 3057 3058 Returns: 3059 List of sets, where each set contains frame IDs of a connected component. 3060 """ 3061 return list(nx.connected_components(self._graph)) 3062 3063 def get_connected_nodes(self, frame_id: str) -> set[str]: 3064 """ 3065 Get the set of all nodes connected to the given frame (its connected component). 3066 3067 Args: 3068 frame_id: The frame to start searching from. 3069 3070 Returns: 3071 Set of connected frame IDs. 3072 3073 Raises: 3074 ValueError: If frame_id is not in the graph. 3075 """ 3076 if frame_id not in self._graph: 3077 raise ValueError(f"Frame '{frame_id}' is not in the graph.") 3078 return nx.node_connected_component(self._graph, frame_id) 3079 3080 def to_dict(self) -> dict[str, Any]: 3081 """ 3082 Serialize the entire graph to a dictionary. 3083 3084 Returns: 3085 Dict containing 'frames' and 'edges' (only explicit, non-cached edges). 3086 3087 Frame IDs that are not JSON-native (tuples, datetime, UUID) are encoded 3088 as tagged dicts with ``__type__`` and ``value`` keys so they survive 3089 ``json.dumps``/``json.loads`` roundtrip without information loss. 
3090 """ 3091 frames = [self._encode_frame_id(f) for f in self.frames] 3092 edges = [] 3093 for u, v, data in self._graph.edges(data=True): 3094 if not data.get("is_cache", False): 3095 transform = data["transform"] 3096 reference = data.get("reference_frame") 3097 source = v if reference == u else u 3098 edges.append( 3099 { 3100 "source": self._encode_frame_id(source), 3101 "target": self._encode_frame_id(reference), 3102 "transform": transform.to_dict(), 3103 } 3104 ) 3105 3106 return {"frames": frames, "edges": edges} 3107 3108 # ----------------------------------------------------------------------- 3109 # Frame ID Serialization Helpers 3110 # 3111 # Non-JSON-native Python types used as frame IDs are encoded as tagged 3112 # dicts so the graph can be losslessly transmitted as JSON (e.g., via 3113 # HTTP requests). 3114 # 3115 # Supported types: 3116 # tuple → {"type": "tuple", "value": [encoded elements...]} 3117 # datetime → {"type": "datetime", "value": "2026-01-01T12:00:00+00:00"} 3118 # datetime64 → {"type": "datetime64", "value": "2026-01-01T12:00:00.000000000", "unit": "ns"} 3119 # UUID → {"type": "uuid", "value": "550e8400-..."} 3120 # 3121 # Primitives (str, int, float, bool, None) pass through unchanged. 3122 # ----------------------------------------------------------------------- 3123 3124 @staticmethod 3125 def _encode_frame_id(frame_id: Any) -> Any: 3126 """Encode a frame ID into a JSON-safe representation. 3127 3128 Non-JSON-native types are wrapped in ``{"type": ..., "value": ...}`` 3129 tagged dicts. The encoding is recursive for compound types (tuples). 3130 """ 3131 import datetime as dt 3132 3133 if isinstance(frame_id, tuple): 3134 return { 3135 "type": "tuple", 3136 "value": [TransformGraph._encode_frame_id(item) for item in frame_id], 3137 } 3138 if isinstance(frame_id, np.datetime64): 3139 unit = np.datetime_data(frame_id)[0] 3140 return {"type": "datetime64", "value": str(frame_id), "unit": unit} 3141 if isinstance(frame_id, dt.datetime): 3142 return {"type": "datetime", "value": frame_id.isoformat()} 3143 if isinstance(frame_id, UUID): 3144 return {"type": "uuid", "value": str(frame_id)} 3145 # str, int, float, bool, None — JSON-native 3146 return frame_id 3147 3148 @staticmethod 3149 def _decode_frame_id(frame_id: Any) -> Any: 3150 """Decode a JSON-deserialized frame ID back to its original Python type. 3151 3152 Recognizes tagged dicts produced by ``_encode_frame_id`` and converts 3153 them back. Plain lists (from untagged JSON arrays) are converted to 3154 tuples for backward compatibility. 3155 """ 3156 import datetime as dt 3157 3158 if isinstance(frame_id, dict) and "type" in frame_id: 3159 type_tag = frame_id["type"] 3160 value = frame_id["value"] 3161 if type_tag == "tuple": 3162 return tuple(TransformGraph._decode_frame_id(item) for item in value) 3163 if type_tag == "datetime": 3164 return dt.datetime.fromisoformat(value) 3165 if type_tag == "datetime64": 3166 unit = frame_id.get("unit", "ns") 3167 return np.datetime64(value, unit) 3168 if type_tag == "uuid": 3169 return UUID(value) 3170 raise ValueError(f"Unknown frame ID type tag: {type_tag!r}") 3171 # Backward compatibility: plain JSON arrays → tuples 3172 if isinstance(frame_id, list): 3173 return tuple(TransformGraph._decode_frame_id(item) for item in frame_id) 3174 return frame_id 3175 3176 @classmethod 3177 def from_dict(cls, data: dict[str, Any]) -> TransformGraph: 3178 """ 3179 Deserialize a graph from a dictionary. 3180 3181 Args: 3182 data: Dictionary produced by to_dict(). 
3183 3184 Returns: 3185 New TransformGraph instance. 3186 """ 3187 graph = cls() 3188 for edge_data in data.get("edges", []): 3189 src = cls._decode_frame_id(edge_data["source"]) 3190 tgt = cls._decode_frame_id(edge_data["target"]) 3191 transform = deserialize_transform(edge_data["transform"]) 3192 graph.add_transform(src, tgt, transform) 3193 3194 return graph 3195 3196 def __repr__(self) -> str: 3197 """ 3198 String representation of the graph. 3199 Shows basic stats and the largest connected component. 3200 """ 3201 nodes = self.frames 3202 if not nodes: 3203 return "TransformGraph(Empty)" 3204 3205 edges = self.edges 3206 components = self.get_connected_components() 3207 3208 # Sort by size, descending 3209 components.sort(key=len, reverse=True) 3210 largest = components[0] 3211 largest_nodes = sorted(list(largest)) 3212 3213 return ( 3214 f"TransformGraph(\n" 3215 f" nodes={len(nodes)}, edges={len(edges)}, components={len(components)},\n" 3216 f" largest_component={largest_nodes}\n" 3217 f")" 3218 )
Manages a graph of coordinate frames connected by spatial transformations.
Uses an undirected NetworkX graph with directional metadata on edges. Each edge stores:
- transform: The BaseTransform object
- reference_frame: The target/reference frame of the stored transform (defines the "natural" direction)
- is_cache: Whether this is a cached shortcut (True) or an explicitly added edge (False)
- weight: Edge weight used for shortest-path queries (1.0 for both added edges and cached shortcuts)
Features:
- Automatic path finding and transform composition
- Lazy inversion when traversing against the natural direction
- Shortcut caching for O(1) repeated queries
- Dependency-aware cache invalidation
- JSON-compatible serialization
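A minimal sketch of the caching and invalidation behaviour (frame names and offsets are illustrative)::

    import tgraph.transform as tf

    graph = tf.TransformGraph()
    graph.add_transform('map', 'base', tf.Translation(x=1.0))
    graph.add_transform('base', 'lidar', tf.Translation(x=0.4))

    # The first query walks map -> base -> lidar and caches the composed shortcut.
    T_map_lidar = graph.get_transform('map', 'lidar')

    # Updating an edge invalidates every cached shortcut that depends on it,
    # so the next query re-composes from the new data.
    graph.update_transform('base', 'lidar', tf.Translation(x=0.5))
    T_map_lidar = graph.get_transform('map', 'lidar')

    # Cached shortcuts can also be dropped wholesale.
    graph.clear_cache()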
2626 @property 2627 def graph(self) -> nx.Graph: 2628 """Returns the internal NetworkX graph (read-only).""" 2629 return self._graph
Returns the internal NetworkX graph (read-only).
2631 @property 2632 def frames(self) -> list[str]: 2633 """Returns list of all frame IDs in the graph.""" 2634 return list(self._graph.nodes())
Returns list of all frame IDs in the graph.
2636 @property 2637 def edges(self) -> list[tuple[str, str]]: 2638 """Returns list of all added edges as (reference_frame, target_frame) tuples.""" 2639 result = [] 2640 for u, v, data in self._graph.edges(data=True): 2641 if not data.get("is_cache", False): 2642 reference_frame = data["reference_frame"] 2643 target_frame = v if reference_frame == u else u 2644 result.append((reference_frame, target_frame)) 2645 return result
Returns list of all added edges as (reference_frame, target_frame) tuples.
2647 def has_frame(self, frame_id: str) -> bool: 2648 """Check if a frame exists in the graph.""" 2649 return frame_id in self._graph
Check if a frame exists in the graph.
2651 def has_transform(self, source_frame: str, target_frame: str) -> bool: 2652 """Check if a direct transform (edge) exists between two frames.""" 2653 return self._graph.has_edge(source_frame, target_frame)
Check if a direct transform (edge) exists between two frames.
2655 def add_transform( 2656 self, 2657 source_frame: str, 2658 target_frame: str, 2659 transform: BaseTransform, 2660 ) -> None: 2661 """ 2662 Add a transform between two frames. 2663 2664 API: add_transform(source, target, transform) 2665 - SOURCE: The domain frame (where vectors start). 2666 - TARGET: The codomain/reference frame (where vectors end). 2667 - TRANSFORM: Source→Target operator. 2668 2669 The transform maps Source coordinates to Target coordinates: 2670 P_target = transform * P_source 2671 2672 Args: 2673 source_frame: The source/domain frame ID. 2674 target_frame: The target/reference frame ID. 2675 transform: The transform from source to target. 2676 2677 Raises: 2678 ValueError: If an edge already exists between these frames. 2679 """ 2680 if self._graph.has_edge(source_frame, target_frame): 2681 raise ValueError( 2682 f"Transform between '{source_frame}' and '{target_frame}' already exists. " 2683 "Use update_transform() to modify it." 2684 ) 2685 2686 # Store edge between nodes. 2687 # We store "reference_frame" to indicate which node is the Target (Codomain) 2688 # of the transform. 2689 # If reference_frame = target_frame, then the transform is source -> target. 2690 self._graph.add_edge( 2691 target_frame, 2692 source_frame, 2693 transform=transform, 2694 reference_frame=target_frame, 2695 is_cache=False, 2696 weight=self.ADDED_EDGE_WEIGHT, 2697 )
Add a transform between two frames.
API: add_transform(source, target, transform)
- SOURCE: The domain frame (where vectors start).
- TARGET: The codomain/reference frame (where vectors end).
- TRANSFORM: Source→Target operator.
The transform maps Source coordinates to Target coordinates: P_target = transform * P_source
Args: source_frame: The source/domain frame ID. target_frame: The target/reference frame ID. transform: The transform from source to target.
Raises: ValueError: If an edge already exists between these frames.
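A minimal sketch of the direction convention, applying the returned matrix by hand in homogeneous coordinates (frame names and values are illustrative)::

    import numpy as np
    import tgraph.transform as tf

    graph = tf.TransformGraph()
    # The stored transform maps robot (source) coordinates into world (target) coordinates.
    graph.add_transform('robot', 'world', tf.Translation(x=1.0, y=2.0))

    T_robot_to_world = graph.get_transform('robot', 'world')

    # P_world = transform * P_robot
    p_robot = np.array([0.0, 0.0, 0.0, 1.0])
    p_world = T_robot_to_world.as_matrix() @ p_robot
    print(p_world[:3])   # -> [1. 2. 0.]

    # Querying the opposite direction returns the lazily inverted transform.
    T_world_to_robot = graph.get_transform('world', 'robot')
    print((T_world_to_robot.as_matrix() @ p_world)[:3])   # back to [0. 0. 0.]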
2699 def update_transform( 2700 self, 2701 source_frame: str, 2702 target_frame: str, 2703 transform: BaseTransform, 2704 ) -> None: 2705 """ 2706 Update an existing transform between two frames. 2707 2708 Automatically invalidates any cached shortcuts that depend on this edge. 2709 2710 Args: 2711 source_frame: The source frame ID. 2712 target_frame: The target frame ID. 2713 transform: The new transform from source to target. 2714 2715 Raises: 2716 ValueError: If no edge exists between these frames. 2717 """ 2718 if not self._graph.has_edge(target_frame, source_frame): 2719 raise ValueError( 2720 f"No transform between '{source_frame}' and '{target_frame}'. " 2721 "Use add_transform() to create it." 2722 ) 2723 2724 # Invalidate dependent caches 2725 self._invalidate_caches_for_edge(target_frame, source_frame) 2726 2727 # Update the transform 2728 self._graph[target_frame][source_frame]["transform"] = transform 2729 self._graph[target_frame][source_frame]["reference_frame"] = target_frame
Update an existing transform between two frames.
Automatically invalidates any cached shortcuts that depend on this edge.
Args: source_frame: The source frame ID. target_frame: The target frame ID. transform: The new transform from source to target.
Raises: ValueError: If no edge exists between these frames.
2731 def remove_transform(self, frame_a: str, frame_b: str) -> None: 2732 """ 2733 Remove a transform (edge) between two frames. 2734 2735 Args: 2736 frame_a: First frame ID. 2737 frame_b: Second frame ID. 2738 2739 Raises: 2740 ValueError: If no edge exists between these frames. 2741 """ 2742 if not self._graph.has_edge(frame_a, frame_b): 2743 raise ValueError(f"No transform between '{frame_a}' and '{frame_b}'.") 2744 2745 # Invalidate dependent caches 2746 self._invalidate_caches_for_edge(frame_a, frame_b) 2747 2748 # Remove the edge 2749 self._graph.remove_edge(frame_a, frame_b) 2750 2751 # Clean up isolated nodes 2752 for frame in [frame_a, frame_b]: 2753 if self._graph.degree(frame) == 0: 2754 self._graph.remove_node(frame)
Remove a transform (edge) between two frames.
Args: frame_a: First frame ID. frame_b: Second frame ID.
Raises: ValueError: If no edge exists between these frames.
2756 def get_transform(self, source_frame: str, target_frame: str) -> BaseTransform: 2757 """ 2758 Get the transform from source_frame to target_frame. 2759 2760 Automatically finds the shortest path and composes transforms. 2761 Results are cached as shortcut edges for O(1) subsequent lookups. 2762 2763 Args: 2764 source_frame: The source frame ID. 2765 target_frame: The target frame ID. 2766 2767 Returns: 2768 BaseTransform: The composed transform T_source_to_target. 2769 2770 Raises: 2771 ValueError: If either frame doesn't exist or no path exists. 2772 """ 2773 if source_frame == target_frame: 2774 return Identity() 2775 2776 if source_frame not in self._graph: 2777 raise ValueError(f"Frame '{source_frame}' not found in graph.") 2778 if target_frame not in self._graph: 2779 raise ValueError(f"Frame '{target_frame}' not found in graph.") 2780 2781 # Check for direct edge (including cached shortcuts) 2782 if self._graph.has_edge(source_frame, target_frame): 2783 edge_data = self._graph[source_frame][target_frame] 2784 transform = edge_data["transform"] 2785 reference_frame = edge_data["reference_frame"] 2786 2787 if reference_frame == source_frame: 2788 # source is reference_frame (Target). Going to target_frame (Source). 2789 # Direction: Target -> Source. 2790 # Use inverse. 2791 return transform.inverse() 2792 return transform # source is Source. Going to reference_frame (Target). Use direct. 2793 2794 # Find shortest path 2795 try: 2796 path = nx.shortest_path(self._graph, source_frame, target_frame, weight="weight") 2797 except nx.NetworkXNoPath: 2798 raise ValueError(f"No path from '{source_frame}' to '{target_frame}'.") 2799 2800 # Compose transforms along path 2801 composed_transform = Identity() 2802 added_edges = [] 2803 2804 for i in range(len(path) - 1): 2805 current_frame = path[i] 2806 next_frame = path[i + 1] 2807 2808 edge_data = self._graph[current_frame][next_frame] 2809 transform = edge_data["transform"] 2810 reference_frame = edge_data["reference_frame"] 2811 2812 if not edge_data.get("is_cache", False): 2813 # Normalize edge key (always smaller, larger) 2814 # Use str(frame) for consistent sorting across mixed types (int vs str vs UUID) 2815 sorted_frames = sorted([current_frame, next_frame], key=str) 2816 edge_key = tuple(sorted_frames) 2817 added_edges.append(edge_key) 2818 2819 # Traversal Logic: 2820 # If current is Ref (Target): Going to Source. Use Inverse. 2821 # If current is Source: Going to Ref (Target). Use Direct. 2822 if reference_frame == current_frame: 2823 step_transform = transform.inverse() 2824 else: 2825 step_transform = transform 2826 2827 # Compose in reverse order: new_step * accumulated 2828 # This ensures: (T3 * T2 * T1) transforms correctly 2829 composed_transform = step_transform * composed_transform 2830 2831 # Cache the result as a shortcut edge 2832 self._add_cache_edge(source_frame, target_frame, composed_transform, added_edges) 2833 2834 return composed_transform
Get the transform from source_frame to target_frame.
Automatically finds the shortest path and composes transforms. Results are cached as shortcut edges for O(1) subsequent lookups.
Args: source_frame: The source frame ID. target_frame: The target frame ID.
Returns: BaseTransform: The composed transform T_source_to_target.
Raises: ValueError: If either frame doesn't exist or no path exists.
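A minimal sketch verifying that a multi-hop query equals the product of the per-edge matrices (frames and offsets are illustrative)::

    import numpy as np
    import tgraph.transform as tf

    T_ab = tf.Translation(x=1.0)                       # a -> b
    T_bc = tf.Transform(translation=[0.0, 2.0, 0.0])   # b -> c

    graph = tf.TransformGraph()
    graph.add_transform('a', 'b', T_ab)
    graph.add_transform('b', 'c', T_bc)

    # get_transform composes along the shortest path: T_a_to_c = T_bc * T_ab.
    T_ac = graph.get_transform('a', 'c')
    np.testing.assert_allclose(T_ac.as_matrix(), T_bc.as_matrix() @ T_ab.as_matrix())

    # Trivial queries return Identity; disconnected frames raise ValueError.
    assert isinstance(graph.get_transform('a', 'a'), tf.Identity)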
2881 def is_projection_frame(self, frame_id: str) -> bool: 2882 """ 2883 Check if a frame is a 2D projection frame (e.g., an Image frame). 2884 2885 Rule: A frame is a projection frame if ALL edges connected to it treat it 2886 as a projection space. 2887 - If transform maps INTO frame (frame is Target), transform must be a Projection. 2888 - If transform maps OUT OF frame (frame is Source), transform must be an InverseProjection. 2889 """ 2890 if frame_id not in self._graph: 2891 return False 2892 2893 neighbors = list(self._graph.neighbors(frame_id)) 2894 if not neighbors: 2895 return False 2896 2897 for neighbor in neighbors: 2898 edge_data = self._graph[frame_id][neighbor] 2899 transform = edge_data["transform"] 2900 reference_frame = edge_data["reference_frame"] 2901 2902 # Use 'reference_frame' to determine direction. 2903 # reference_frame is the TARGET of the transform. 2904 2905 if reference_frame == frame_id: 2906 # Transform is Neighbor -> Frame (Source -> Target) 2907 # For Frame to be 2D, this must be a Projection (3D -> 2D) 2908 if not isinstance(transform, Projection): 2909 return False 2910 else: 2911 # Transform is Frame -> Neighbor (Source -> Target) 2912 # For Frame to be 2D, this must be an InverseProjection (2D -> 3D) 2913 if not isinstance(transform, InverseProjection): 2914 return False 2915 2916 return True
Check if a frame is a 2D projection frame (e.g., an Image frame).
Rule: A frame is a projection frame if ALL edges connected to it treat it as a projection space.
- If transform maps INTO frame (frame is Target), transform must be a Projection.
- If transform maps OUT OF frame (frame is Source), transform must be an InverseProjection.
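A minimal sketch of this rule (frame names are illustrative)::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])

    graph = tf.TransformGraph()
    graph.add_transform('camera', 'image', tf.CameraProjection(K=K))  # 3D camera -> 2D image
    graph.add_transform('world', 'camera', tf.Translation(x=1.0))

    # 'image' only ever appears as the target of a Projection, so it is a 2D frame.
    assert graph.is_projection_frame('image')
    # 'camera' and 'world' are ordinary 3D frames.
    assert not graph.is_projection_frame('camera')
    assert not graph.is_projection_frame('world')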
2954 def get_essential_matrix(self, image_frame_1: str, image_frame_2: str) -> np.ndarray: 2955 """ 2956 Compute the Essential Matrix E between two image frames. 2957 2958 E = [t]_x R 2959 where R, t describe method to transform points from Camera 1 to Camera 2. 2960 X2 = R X1 + t. 2961 """ 2962 _, c1 = self._get_camera_intrinsics_and_pose(image_frame_1) 2963 _, c2 = self._get_camera_intrinsics_and_pose(image_frame_2) 2964 2965 # Get transform from C1 to C2 2966 # T_c1_to_c2: converts X_c1 to X_c2. 2967 T_12 = self.get_transform(c1, c2) 2968 if not isinstance(T_12, Transform): 2969 raise ValueError(f"Transform between '{c1}' and '{c2}' is not a spatial Transform.") 2970 2971 R = quaternion.as_rotation_matrix(T_12.rotation) 2972 t = T_12.translation.flatten() 2973 2974 # Skew symmetric matrix of t 2975 t_x = skew(t) 2976 2977 return t_x @ R
Compute the Essential Matrix E between two image frames.
E = [t]_x R, where R and t map points from Camera 1 to Camera 2: X2 = R X1 + t.
2979 def get_fundamental_matrix(self, image_frame_1: str, image_frame_2: str) -> np.ndarray: 2980 """ 2981 Compute the Fundamental Matrix F between two image frames. 2982 2983 F = K2^-T E K1^-1 2984 x2^T F x1 = 0 2985 """ 2986 K1, _ = self._get_camera_intrinsics_and_pose(image_frame_1) 2987 K2, _ = self._get_camera_intrinsics_and_pose(image_frame_2) 2988 2989 E = self.get_essential_matrix(image_frame_1, image_frame_2) 2990 2991 K1_inv = np.linalg.inv(K1) 2992 K2_inv = np.linalg.inv(K2) 2993 2994 return K2_inv.T @ E @ K1_inv
Compute the Fundamental Matrix F between two image frames.
F = K2^-T E K1^-1, which satisfies the epipolar constraint x2^T F x1 = 0.
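A minimal sketch of both epipolar matrices on a two-camera rig, assuming transform_points accepts (N, 3) points and returns (N, 2) pixels as in the Quick Start; the baseline and test point are illustrative::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])

    graph = tf.TransformGraph()
    graph.add_transform('cam1', 'img1', tf.CameraProjection(K=K))
    graph.add_transform('cam2', 'img2', tf.CameraProjection(K=K))
    # Extrinsics cam1 -> cam2: X2 = R X1 + t (pure baseline, camera 2 sits 0.2 m along +x).
    graph.add_transform('cam1', 'cam2', tf.Transform(translation=[-0.2, 0.0, 0.0]))

    E = graph.get_essential_matrix('img1', 'img2')
    F = graph.get_fundamental_matrix('img1', 'img2')

    # Project one 3D point (given in cam1 coordinates) into both images.
    X = np.array([[0.3, -0.1, 2.0]])
    x1 = np.append(tf.transform_points(X, graph, 'cam1', 'img1')[0], 1.0)
    x2 = np.append(tf.transform_points(X, graph, 'cam1', 'img2')[0], 1.0)

    # Epipolar constraint in homogeneous pixel coordinates.
    print(x2 @ F @ x1)   # ~0 up to numerical precision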
2996 def get_homography( 2997 self, 2998 image_frame_1: str, 2999 image_frame_2: str, 3000 plane_normal: np.ndarray, 3001 plane_distance: float, 3002 ) -> np.ndarray: 3003 """ 3004 Compute Homography H mapping pixels from image 1 to image 2 induced by a plane. 3005 3006 x2 ~ H x1 3007 3008 Plane equation in Camera 1 frame: n^T X = d 3009 H = K2 (R + t n^T / d) K1^-1 3010 3011 Args: 3012 image_frame_1: Source image frame. 3013 image_frame_2: Target image frame. 3014 plane_normal: Normal vector of the plane in Camera 1 frame (3,). 3015 plane_distance: Distance to the plane in Camera 1 frame (scalar). 3016 """ 3017 K1, c1 = self._get_camera_intrinsics_and_pose(image_frame_1) 3018 K2, c2 = self._get_camera_intrinsics_and_pose(image_frame_2) 3019 3020 # T_12: C1 -> C2 3021 T_12 = self.get_transform(c1, c2) 3022 R = quaternion.as_rotation_matrix(T_12.rotation) 3023 t = T_12.translation.flatten().reshape(3, 1) 3024 3025 n = np.asarray(plane_normal).reshape(3, 1) 3026 d = float(plane_distance) 3027 3028 H_euclidean = R + (t @ n.T) / d 3029 3030 return K2 @ H_euclidean @ np.linalg.inv(K1)
Compute Homography H mapping pixels from image 1 to image 2 induced by a plane.
x2 ~ H x1
Plane equation in the Camera 1 frame: n^T X = d. The induced homography is H = K2 (R + t n^T / d) K1^-1.
Args: image_frame_1: Source image frame. image_frame_2: Target image frame. plane_normal: Normal vector of the plane in Camera 1 frame (3,). plane_distance: Distance to the plane in Camera 1 frame (scalar).
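A minimal sketch of a plane-induced homography on the same illustrative two-camera rig; the plane z = 3 in the cam1 frame gives n = (0, 0, 1) and d = 3::

    import numpy as np
    import tgraph.transform as tf

    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])

    graph = tf.TransformGraph()
    graph.add_transform('cam1', 'img1', tf.CameraProjection(K=K))
    graph.add_transform('cam2', 'img2', tf.CameraProjection(K=K))
    graph.add_transform('cam1', 'cam2', tf.Transform(translation=[-0.2, 0.0, 0.0]))

    H = graph.get_homography('img1', 'img2',
                             plane_normal=np.array([0.0, 0.0, 1.0]),
                             plane_distance=3.0)

    # Map a pixel from image 1 to image 2: x2 ~ H x1 (dehomogenize afterwards).
    x1 = np.array([320.0, 240.0, 1.0])
    x2 = H @ x1
    print(x2[:2] / x2[2])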
3032 @staticmethod 3033 def estimate_skew(intrinsic_matrix: np.ndarray) -> float: 3034 """ 3035 Estimate the skew parameter from an intrinsic matrix K. 3036 3037 K = [[fx, s, cx], [0, fy, cy], [0, 0, 1]] 3038 Returns s. 3039 """ 3040 return float(intrinsic_matrix[0, 1])
Estimate the skew parameter from an intrinsic matrix K.
K = [[fx, s, cx], [0, fy, cy], [0, 0, 1]]; returns s.
3042 def clear_cache(self) -> None: 3043 """ 3044 Clear all cached shortcut transforms. 3045 3046 Removes edges marked with is_cache=True. 3047 """ 3048 edges_to_remove = [ 3049 (u, v) for u, v, data in self._graph.edges(data=True) if data.get("is_cache", False) 3050 ] 3051 self._graph.remove_edges_from(edges_to_remove) 3052 self._dependency_map.clear()
Clear all cached shortcut transforms.
Removes edges marked with is_cache=True.
3054 def get_connected_components(self) -> list[set[str]]: 3055 """ 3056 Get all connected components in the graph. 3057 3058 Returns: 3059 List of sets, where each set contains frame IDs of a connected component. 3060 """ 3061 return list(nx.connected_components(self._graph))
Get all connected components in the graph.
Returns: List of sets, where each set contains frame IDs of a connected component.
3063 def get_connected_nodes(self, frame_id: str) -> set[str]: 3064 """ 3065 Get the set of all nodes connected to the given frame (its connected component). 3066 3067 Args: 3068 frame_id: The frame to start searching from. 3069 3070 Returns: 3071 Set of connected frame IDs. 3072 3073 Raises: 3074 ValueError: If frame_id is not in the graph. 3075 """ 3076 if frame_id not in self._graph: 3077 raise ValueError(f"Frame '{frame_id}' is not in the graph.") 3078 return nx.node_connected_component(self._graph, frame_id)
Get the set of all nodes connected to the given frame (its connected component).
Args: frame_id: The frame to start searching from.
Returns: Set of connected frame IDs.
Raises: ValueError: If frame_id is not in the graph.
3080 def to_dict(self) -> dict[str, Any]: 3081 """ 3082 Serialize the entire graph to a dictionary. 3083 3084 Returns: 3085 Dict containing 'frames' and 'edges' (only explicit, non-cached edges). 3086 3087 Frame IDs that are not JSON-native (tuples, datetime, UUID) are encoded 3088 as tagged dicts with ``__type__`` and ``value`` keys so they survive 3089 ``json.dumps``/``json.loads`` roundtrip without information loss. 3090 """ 3091 frames = [self._encode_frame_id(f) for f in self.frames] 3092 edges = [] 3093 for u, v, data in self._graph.edges(data=True): 3094 if not data.get("is_cache", False): 3095 transform = data["transform"] 3096 reference = data.get("reference_frame") 3097 source = v if reference == u else u 3098 edges.append( 3099 { 3100 "source": self._encode_frame_id(source), 3101 "target": self._encode_frame_id(reference), 3102 "transform": transform.to_dict(), 3103 } 3104 ) 3105 3106 return {"frames": frames, "edges": edges}
Serialize the entire graph to a dictionary.
Returns: Dict containing 'frames' and 'edges' (only explicit, non-cached edges).
Frame IDs that are not JSON-native (tuples, datetime, UUID) are encoded as tagged dicts with "type" and "value" keys so they survive a json.dumps/json.loads round trip without information loss.
3176 @classmethod 3177 def from_dict(cls, data: dict[str, Any]) -> TransformGraph: 3178 """ 3179 Deserialize a graph from a dictionary. 3180 3181 Args: 3182 data: Dictionary produced by to_dict(). 3183 3184 Returns: 3185 New TransformGraph instance. 3186 """ 3187 graph = cls() 3188 for edge_data in data.get("edges", []): 3189 src = cls._decode_frame_id(edge_data["source"]) 3190 tgt = cls._decode_frame_id(edge_data["target"]) 3191 transform = deserialize_transform(edge_data["transform"]) 3192 graph.add_transform(src, tgt, transform) 3193 3194 return graph
Deserialize a graph from a dictionary.
Args: data: Dictionary produced by to_dict().
Returns: New TransformGraph instance.
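A minimal sketch of a JSON round trip, using a tuple frame ID to exercise the tagged frame-ID encoding (frame names are illustrative)::

    import json
    import tgraph.transform as tf

    graph = tf.TransformGraph()
    # A tuple frame ID, e.g. a (sensor, index) pair, survives the round trip.
    graph.add_transform(('camera', 0), 'world', tf.Translation(x=1.0))

    payload = json.dumps(graph.to_dict())
    restored = tf.TransformGraph.from_dict(json.loads(payload))

    assert restored.has_frame(('camera', 0))
    assert restored.has_transform(('camera', 0), 'world')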
24def register_transform(cls: type[BaseTransform]) -> type[BaseTransform]: 25 """ 26 Decorator to register a transform class for serialization. 27 28 Usage: 29 @register_transform 30 class MyTransform(BaseTransform): 31 ... 32 """ 33 _TRANSFORM_REGISTRY[cls.__name__] = cls 34 return cls
Decorator to register a transform class for serialization.
Usage::

    @register_transform
    class MyTransform(BaseTransform):
        ...
37def serialize_transform(transform: BaseTransform) -> dict[str, Any]: 38 """ 39 Serialize any transform to a JSON-compatible dictionary. 40 41 Args: 42 transform: Any BaseTransform subclass instance. 43 44 Returns: 45 Dict containing the serialized transform with a "type" key. 46 """ 47 return transform.to_dict()
Serialize any transform to a JSON-compatible dictionary.
Args: transform: Any BaseTransform subclass instance.
Returns: Dict containing the serialized transform with a "type" key.
190def deserialize_transform(data: dict[str, Any]) -> BaseTransform: 191 """ 192 Deserialize a transform from a dictionary. 193 194 Automatically determines the correct class from the "type" field 195 and calls its from_dict() method. 196 197 Args: 198 data: Dictionary previously created by serialize_transform() or to_dict(). 199 200 Returns: 201 BaseTransform: The deserialized transform instance. 202 203 Raises: 204 ValueError: If the transform type is not registered. 205 """ 206 transform_type = data.get("type") 207 if not transform_type: 208 raise ValueError("Missing 'type' field in transform data") 209 210 if transform_type not in _TRANSFORM_REGISTRY: 211 raise ValueError( 212 f"Unknown transform type: '{transform_type}'. " 213 f"Registered types: {list(_TRANSFORM_REGISTRY.keys())}" 214 ) 215 216 cls = _TRANSFORM_REGISTRY[transform_type] 217 return cls.from_dict(data)
Deserialize a transform from a dictionary.
Automatically determines the correct class from the "type" field and calls its from_dict() method.
Args: data: Dictionary previously created by serialize_transform() or to_dict().
Returns: BaseTransform: The deserialized transform instance.
Raises: ValueError: If the transform type is not registered.
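A minimal round-trip sketch; the import path for these helpers is an assumption (they may live in tgraph.transform or a dedicated serialization module)::

    from tgraph.transform import Translation, serialize_transform, deserialize_transform  # path assumed

    t = Translation(x=1.0, y=2.0)

    data = serialize_transform(t)            # dict with a "type" key, e.g. "Translation"
    restored = deserialize_transform(data)   # dispatches on "type" via the registry

    assert type(restored) is type(t)

    # Unregistered types are rejected.
    try:
        deserialize_transform({"type": "NotARealTransform"})
    except ValueError as err:
        print(err)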
50def from_roll_pitch_yaw( 51 roll: float = 0.0, 52 pitch: float = 0.0, 53 yaw: float = 0.0, 54) -> quaternion.quaternion: 55 """ 56 Create a quaternion from roll-pitch-yaw angles. 57 58 Uses the aerospace/robotics intrinsic **ZYX** (Tait-Bryan) convention: 59 yaw (Z) → pitch (Y) → roll (X). 60 61 For other conventions (ZYZ, XYZ, etc.), use scipy directly:: 62 63 from scipy.spatial.transform import Rotation as R 64 q_scipy = R.from_euler('ZYZ', [alpha, beta, gamma]) 65 66 Args: 67 roll: Rotation about X-axis in radians. 68 pitch: Rotation about Y-axis in radians. 69 yaw: Rotation about Z-axis in radians. 70 71 Returns: 72 quaternion.quaternion: The resulting quaternion. 73 74 Warning: 75 This function uses ``scipy.spatial.transform.Rotation`` with true 76 ZYX intrinsic ordering. It is **not** compatible with 77 ``quaternion.from_euler_angles(alpha, beta, gamma)``, which uses 78 ZYZ convention. 79 """ 80 scipy_rot = ScipyRotation.from_euler("ZYX", [yaw, pitch, roll]) 81 # scipy uses [x, y, z, w], numpy-quaternion uses [w, x, y, z] 82 x, y, z, w = scipy_rot.as_quat() 83 return quaternion.quaternion(w, x, y, z)
Create a quaternion from roll-pitch-yaw angles.
Uses the aerospace/robotics intrinsic ZYX (Tait-Bryan) convention: yaw (Z) → pitch (Y) → roll (X).
For other conventions (ZYZ, XYZ, etc.), use scipy directly::
from scipy.spatial.transform import Rotation as R
q_scipy = R.from_euler('ZYZ', [alpha, beta, gamma])
Args: roll: Rotation about X-axis in radians. pitch: Rotation about Y-axis in radians. yaw: Rotation about Z-axis in radians.
Returns: quaternion.quaternion: The resulting quaternion.
Warning:
This function uses scipy.spatial.transform.Rotation with true
ZYX intrinsic ordering. It is not compatible with
quaternion.from_euler_angles(alpha, beta, gamma), which uses
ZYZ convention.
86def as_roll_pitch_yaw( 87 q: quaternion.quaternion, 88) -> tuple[float, float, float]: 89 """ 90 Extract roll, pitch, yaw from a quaternion. 91 92 Uses the aerospace/robotics intrinsic **ZYX** (Tait-Bryan) convention. 93 94 For other conventions (ZYZ, XYZ, etc.), use scipy directly:: 95 96 from scipy.spatial.transform import Rotation as R 97 angles = R.from_quat([q.x, q.y, q.z, q.w]).as_euler('ZYZ') 98 99 Args: 100 q: The input quaternion. 101 102 Returns: 103 Tuple[float, float, float]: ``(roll, pitch, yaw)`` in radians. 104 """ 105 scipy_rot = ScipyRotation.from_quat([q.x, q.y, q.z, q.w]) 106 yaw, pitch, roll = scipy_rot.as_euler("ZYX") 107 return (roll, pitch, yaw)
Extract roll, pitch, yaw from a quaternion.
Uses the aerospace/robotics intrinsic ZYX (Tait-Bryan) convention.
For other conventions (ZYZ, XYZ, etc.), use scipy directly::
from scipy.spatial.transform import Rotation as R
angles = R.from_quat([q.x, q.y, q.z, q.w]).as_euler('ZYZ')
Args: q: The input quaternion.
Returns:
Tuple[float, float, float]: (roll, pitch, yaw) in radians.
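A minimal round-trip sketch, assuming both helpers are exported from tgraph.quaternion::

    import numpy as np
    from tgraph.quaternion import from_roll_pitch_yaw, as_roll_pitch_yaw  # export location assumed

    q = from_roll_pitch_yaw(roll=0.1, pitch=-0.2, yaw=1.5)   # intrinsic ZYX: yaw, then pitch, then roll
    roll, pitch, yaw = as_roll_pitch_yaw(q)

    np.testing.assert_allclose([roll, pitch, yaw], [0.1, -0.2, 1.5], atol=1e-9)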
2256class Pose: 2257 """ 2258 A user-friendly wrapper around Transform for pose representation. 2259 2260 Represents the pose of 'child_frame_id' relative to 'frame_id'. 2261 """ 2262 2263 def __init__( 2264 self, 2265 position: np.ndarray | list | tuple | None = None, 2266 orientation: quaternion.quaternion | np.ndarray | list | tuple | None = None, 2267 frame_id: str | int | UUID | None = None, 2268 child_frame_id: str | int | UUID | None = None, 2269 ): 2270 # Use ensure_rotation logic but for rvec too 2271 quat = quaternion.one 2272 if orientation is not None: 2273 if isinstance(orientation, quaternion.quaternion): 2274 quat = orientation.normalized() 2275 elif isinstance(orientation, (list, tuple, np.ndarray)): 2276 arr = np.array(orientation, dtype=np.float64).flatten() 2277 if arr.size == 4: 2278 quat = quaternion.quaternion(*arr).normalized() 2279 elif arr.size == 3: 2280 # Rotation vector 2281 theta = np.linalg.norm(arr) 2282 if theta > 1e-8: 2283 quat = quaternion.from_rotation_vector(arr) 2284 else: 2285 raise ValueError("Orientation must be quaternion (4) or rvec (3)") 2286 2287 self._transform = Transform(translation=position, rotation=quat) 2288 self.frame_id = frame_id 2289 self.child_frame_id = child_frame_id 2290 2291 @property 2292 def position(self) -> np.ndarray: 2293 """The 3D position [x, y, z] in the parent frame.""" 2294 return self._transform.translation.flatten() 2295 2296 @position.setter 2297 def position(self, value: np.ndarray | list | tuple): 2298 """Set the 3D position [x, y, z].""" 2299 self._transform.translation = ensure_translation(value, self._transform.dtype) 2300 2301 @property 2302 def orientation(self) -> quaternion.quaternion: 2303 """The orientation as a unit quaternion.""" 2304 return self._transform.rotation 2305 2306 @orientation.setter 2307 def orientation(self, value: quaternion.quaternion | np.ndarray | list | tuple): 2308 """Set the orientation from a quaternion or [w, x, y, z] array.""" 2309 self._transform.rotation = ensure_rotation(value, self._transform.dtype) 2310 2311 def as_transform(self) -> Transform: 2312 """Returns the underlying Transform object.""" 2313 return self._transform 2314 2315 @classmethod 2316 def from_transform( 2317 cls, 2318 tf: Transform, 2319 frame_id: str | int | UUID | None = None, 2320 child_frame_id: str | int | UUID | None = None, 2321 ) -> Pose: 2322 """Creates a Pose from a Transform.""" 2323 return cls( 2324 position=tf.translation.flatten(), 2325 orientation=tf.rotation, 2326 frame_id=frame_id, 2327 child_frame_id=child_frame_id, 2328 ) 2329 2330 def inverse( 2331 self, 2332 new_frame_id: str | int | UUID | None = None, 2333 new_child_frame_id: str | int | UUID | None = None, 2334 ) -> Pose: 2335 """ 2336 Returns the inverse pose. 
2337 2338 By default, swaps frame_id and child_frame_id: 2339 Inverse(T_A->B) = T_B->A 2340 """ 2341 # Default behavior: swap frames 2342 target_frame_id = new_frame_id if new_frame_id is not None else self.child_frame_id 2343 target_child_frame_id = ( 2344 new_child_frame_id if new_child_frame_id is not None else self.frame_id 2345 ) 2346 2347 return Pose.from_transform( 2348 self._transform.inverse(), 2349 frame_id=target_frame_id, 2350 child_frame_id=target_child_frame_id, 2351 ) 2352 2353 def compose(self, other): 2354 """Returns self * other""" 2355 # Logic: T_A_C = T_A_B * T_B_C 2356 new_frame_id = self.frame_id 2357 new_child_frame_id = None 2358 2359 if isinstance(other, Pose): 2360 # Strict Frame Check 2361 # Only check if both are explicitly defined (not None) 2362 if ( 2363 self.child_frame_id is not None 2364 and other.frame_id is not None 2365 and self.child_frame_id != other.frame_id 2366 ): 2367 raise ValueError( 2368 f"Frame mismatch in composition: " 2369 f"Pose 1 ends in '{self.child_frame_id}' but " 2370 f"Pose 2 starts in '{other.frame_id}'." 2371 ) 2372 2373 new_child_frame_id = other.child_frame_id 2374 return Pose.from_transform( 2375 self._transform * other.as_transform(), 2376 frame_id=new_frame_id, 2377 child_frame_id=new_child_frame_id, 2378 ) 2379 2380 return Pose.from_transform( 2381 self._transform * other, frame_id=new_frame_id, child_frame_id=new_child_frame_id 2382 ) 2383 2384 def __mul__(self, other: Pose | Transform) -> Pose: 2385 return self.compose(other) 2386 2387 def to_list(self) -> list[float]: 2388 """Returns [px, py, pz, qw, qx, qy, qz]""" 2389 q = self.orientation 2390 p = self.position 2391 return [ 2392 float(p[0]), 2393 float(p[1]), 2394 float(p[2]), 2395 float(q.w), 2396 float(q.x), 2397 float(q.y), 2398 float(q.z), 2399 ] 2400 2401 def to_matrix(self) -> np.ndarray: 2402 """Return the 4x4 homogeneous transformation matrix.""" 2403 return self._transform.as_matrix() 2404 2405 def __repr__(self) -> str: 2406 elements = [f"position={self.position!r}", f"orientation={self.orientation!r}"] 2407 if self.frame_id: 2408 elements.append(f"frame_id={self.frame_id!r}") 2409 if self.child_frame_id: 2410 elements.append(f"child_frame_id={self.child_frame_id!r}") 2411 return f"Pose({', '.join(elements)})"
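# Illustrative sketch of the frame bookkeeping in ``compose``/``__mul__`` and
# ``inverse``. Hypothetical helper, not public API; the frame names are arbitrary.
def _pose_composition_example() -> None:  # pragma: no cover - documentation only
    t_map_odom = Pose(position=[5.0, 0.0, 0.0], frame_id="map", child_frame_id="odom")
    t_odom_base = Pose(position=[1.0, 1.0, 0.0], frame_id="odom", child_frame_id="base_link")

    # T_map_base = T_map_odom * T_odom_base; the child frame of the left operand
    # must match the parent frame of the right operand when both are set.
    t_map_base = t_map_odom * t_odom_base
    assert t_map_base.frame_id == "map"
    assert t_map_base.child_frame_id == "base_link"

    # inverse() swaps parent and child by default: Inverse(T_map->base) = T_base->map.
    t_base_map = t_map_base.inverse()
    assert t_base_map.frame_id == "base_link"
    assert t_base_map.child_frame_id == "map"

    # Composing across mismatched frames raises ValueError.
    try:
        t_map_odom * Pose(frame_id="camera", child_frame_id="image")
    except ValueError:
        pass
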
def transform_points(
    points: np.ndarray,
    transform_object: BaseTransform | TransformGraph,
    source_frame: str | None = None,
    target_frame: str | None = None,
) -> np.ndarray:
    """
    Applies a transformation to a set of 3D points.

    Supports a polymorphic second argument:

    1. transform_points(points, transform)
       Directly applies a transform object.
    2. transform_points(points, graph, source_frame, target_frame)
       Uses the graph to find the transform from source to target.
       If the target is a projection frame (e.g. image), returns unnormalized 3D
       coordinates [u*z, v*z, z].

    Args:
        points: Nx3 points array.
        transform_object: BaseTransform object OR TransformGraph.
        source_frame: Source frame ID (required if using graph).
        target_frame: Target frame ID (required if using graph).

    Returns:
        np.ndarray: Nx3 array of transformed points.
    """
    points = np.atleast_2d(points)

    # CASE 1: TransformGraph
    if hasattr(transform_object, "get_transform"):
        graph: TransformGraph = transform_object  # type: ignore
        if source_frame is None or target_frame is None:
            raise ValueError(
                "When using TransformGraph, both 'source_frame' and "
                "'target_frame' must be provided. "
                "Usage: transform_points(points, graph, "
                "source_frame='A', target_frame='B')"
            )
        transform = graph.get_transform(source_frame, target_frame)
        # Recurse with resolved transform
        return transform_points(points, transform)

    # CASE 2: BaseTransform
    elif isinstance(transform_object, BaseTransform):
        transform = transform_object

        # Check for projection (special handling for "transform_points" vs "project_points")
        if isinstance(transform, Projection):
            # CameraProjection needs full model-dispatched projection (distortion-aware).
            # Other Projection subclasses (e.g., OrthographicProjection) are linear.
            if isinstance(transform, CameraProjection):
                pts = points
                if pts.shape[1] == 4:
                    pts = pts[:, :3] / pts[:, 3:4]
                elif pts.shape[1] != 3:
                    raise ValueError("Points must be Nx3 or Nx4")

                z = pts[:, 2]
                uv = transform._apply(pts)  # full model projection → [u, v]
                return np.column_stack([uv[:, 0] * z, uv[:, 1] * z, z])
            else:
                # Linear projection (OrthographicProjection etc.) — use matrix path
                N = points.shape[0]
                if points.shape[1] == 3:
                    hom_points = np.hstack([points, np.ones((N, 1), dtype=transform.dtype)])
                elif points.shape[1] == 4:
                    hom_points = points
                else:
                    raise ValueError("Points must be Nx3 or Nx4")
                res_hom = (transform.as_matrix() @ hom_points.T).T
                return res_hom[:, :3]

        if not isinstance(
            transform,
            (Transform, Rotation, Translation, Identity, MatrixTransform, InverseProjection),
        ):
            raise TypeError(
                f"Unsupported transform type: {type(transform).__name__}. "
                "Supported: Rigid transforms, InverseProjection, or Projection "
                "(for unnormalized 3D output)."
            )
        # InverseProjection accepts Nx2 (pixel coords) → Nx3
        if isinstance(transform, InverseProjection) and points.shape[1] == 2:
            return transform._apply(points)

        if points.shape[1] == 3:
            hom_points = np.hstack([points, np.ones((points.shape[0], 1), dtype=transform.dtype)])
            transformed_hom = (transform.as_matrix() @ hom_points.T).T
            return transformed_hom[:, :3]
        elif points.shape[1] == 4:
            transformed = (transform.as_matrix() @ points.T).T
            return transformed
        else:
            raise ValueError("Points must be Nx2 (for InverseProjection), Nx3, or Nx4")

    else:
        obj_type = type(transform_object).__name__
        raise TypeError(f"transform_object must be BaseTransform or TransformGraph, got {obj_type}")

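# Illustrative sketch for ``transform_points``. Hypothetical helper, not public API;
# the intrinsic matrix below is an arbitrary example.
def _transform_points_example() -> None:  # pragma: no cover - documentation only
    pts = np.array([[0.0, 0.0, 2.0], [1.0, -1.0, 4.0]])

    # Rigid transform: the result stays Nx3, expressed in the target frame.
    shifted = transform_points(pts, Translation(x=0.5, y=-0.25))
    assert shifted.shape == (2, 3)
    assert np.allclose(shifted[:, 0], pts[:, 0] + 0.5)

    # Projection target: the result is unnormalized [u*z, v*z, z], so the depth
    # column is preserved and the output can keep flowing through 3D operations.
    K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
    uvz = transform_points(pts, CameraProjection(K=K))
    assert np.allclose(uvz[:, 2], pts[:, 2])
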
def project_points(
    points: np.ndarray,
    transform_object: BaseTransform | TransformGraph,
    source_frame: str | None = None,
    target_frame: str | None = None,
) -> np.ndarray:
    """
    Projects 3D points to 2D coordinates (homogenized).

    Signatures:

    1. project_points(points, projection_transform)
    2. project_points(points, graph, source_frame, target_frame)

    Returns:
        np.ndarray: Nx2 array of pixel coordinates [u, v].
    """
    points = np.atleast_2d(points)

    # CASE 1: TransformGraph
    if hasattr(transform_object, "get_transform"):
        graph: TransformGraph = transform_object  # type: ignore
        if source_frame is None or target_frame is None:
            raise ValueError(
                "When using TransformGraph, both 'source_frame' and "
                "'target_frame' must be provided."
            )
        transform = graph.get_transform(source_frame, target_frame)
        return project_points(points, transform)

    # CASE 2: BaseTransform
    elif isinstance(transform_object, BaseTransform):
        transform = transform_object

        # If transform is a Projection, _apply() handles homogenization (division by z) -> 2D
        if isinstance(transform, Projection):
            return transform._apply(points)
        elif isinstance(transform, (Transform, Rotation, Translation, Identity, MatrixTransform)):
            raise TypeError(
                "Cannot project_points using a rigid transform. "
                "Target frame must be a projection frame."
            )
        else:
            # Fall back to _apply() for other transform types and validate the result is 2D.
            res = transform._apply(points)
            if res.shape[1] != 2:
                raise ValueError(
                    f"Transform returned {res.shape[1]}D points, expected 2D for project_points."
                )
            return res

    else:
        obj_type = type(transform_object).__name__
        raise TypeError(f"transform_object must be BaseTransform or TransformGraph, got {obj_type}")

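# Illustrative sketch for ``project_points``. Hypothetical helper, not public API;
# it assumes the example intrinsics below with no distortion configured, so the
# point on the optical axis lands at the principal point.
def _project_points_example() -> None:  # pragma: no cover - documentation only
    K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
    cam = CameraProjection(K=K)

    pts_cam = np.array([[0.0, 0.0, 2.0], [0.4, -0.2, 1.0]])
    uv = project_points(pts_cam, cam)
    assert uv.shape == (2, 2)  # pixel coordinates [u, v]
    assert np.allclose(uv[0], [320.0, 240.0])  # on-axis point -> principal point

    # A purely rigid transform cannot be a projection target and raises TypeError.
    try:
        project_points(pts_cam, Translation(x=1.0, y=0.0))
    except TypeError:
        pass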