User Guide

This page walks through everything you need to use TensorRSVD in practice: how to define your tensor, run the decomposition, read the output, choose good parameters, reconstruct the approximation, and switch backends.

Defining a Tensor as a Callable 

TensorRSVD represents tensors as Python callables rather than dense arrays. The callable must accept \(k\) positional arguments (one per mode) and return the tensor values at those coordinates.

Signature convention

def my_tensor(x0, x1, ..., x_{k-1}):
    ...
    return values  # same shape as x0

Each argument \(x_m\) is a NumPy array of normalized coordinates in \([0, 1]\). Index \(i_m\) in a dimension of size \(n_m\) maps to:

\[x_m = \frac{i_m}{n_m - 1}.\]

The function must be fully vectorized: it must operate element-wise on arrays without Python-level loops, and it must return an array of the same shape as its inputs.

Examples

import numpy as np

# Alternating-sign linear tensor (exact Tucker rank = k)
def alternating(x0, x1, x2):
    return x0 - x1 + x2

# 3-D Gaussian bump
def gaussian(x0, x1, x2):
    return np.exp(-(x0**2 + x1**2 + x2**2))

# Product of 1-D functions (rank-1 in every mode)
def rank1(x0, x1, x2):
    return np.sin(np.pi * x0) * np.cos(np.pi * x1) * (1 + x2)

Note

JAX backend: if you use backend="jax", the callable must be JAX-traceable. Replace np with jnp (import jax.numpy as jnp) and avoid Python control flow that depends on array values.

Running `ho_rsvd`

The single public entry point is tensorrsvd.ho_rsvd(). A minimal call looks like this:

import numpy as np
from tensorrsvd import ho_rsvd

def my_tensor(x0, x1, x2):
    return x0 - x1 + x2

U_list, S_list = ho_rsvd(
    tensor=my_tensor,
    tensor_shape=(32, 32, 32),
    dtype=np.float64,
    rank=3,
    num_oversamples=10,
    num_power_iterations=2,
    num_idxs=3,
)

Parameter guide

Parameter	Guidance
`tensor_shape`	Grid dimensions \((n_1, \ldots, n_k)\). This determines the coordinate grid and the shape of the linear operators.
`rank`	Tucker rank per mode. Pass a single `int` to use the same rank for all modes, or a list of `int` to specify per-mode ranks. Should be much smaller than \(\min(n_m, N_m)\) for memory savings to apply.
`num_oversamples`	Extra random vectors beyond `rank` (default: 10). A value of 5–10 almost always suffices. Higher values improve accuracy at the cost of more passes through the operator.
`num_power_iterations`	Number of power iterations (default: 0). Use 0 for speed, 1–2 for better accuracy when singular values decay slowly. Rarely worth going above 3.
`num_idxs`	Number of modes to decompose. Inferred automatically when `rank` is a list; required when `rank` is a scalar.
`backend`	`"numpy"` (default), `"jax"`, or `"cupy"`. See Switching Backends.

Interpreting the Output 

ho_rsvd() returns a pair (U_list, S_list):

U_list: A list of \(k\) factor matrices. U_list[m] has shape (n_m, rank_m) and contains the leading left singular vectors of the mode-\(m\) unfolding. The columns are orthonormal: U_list[m].T @ U_list[m] == I.
S_list: A list of \(k\) singular value arrays. S_list[m] has shape (rank_m,) and contains the mode-\(m\) singular values in non-increasing order.

Reading the singular values

The mode-\(m\) singular values quantify how much energy \(\mathcal{T}\) has along each direction captured by \(U_m\). A sharp decay indicates that only a few directions are needed; a flat spectrum suggests the tensor is not well approximated at the chosen rank.

import matplotlib.pyplot as plt

for m, S in enumerate(S_list):
    plt.semilogy(S, label=f"mode {m}")
plt.xlabel("index")
plt.ylabel("singular value")
plt.legend()
plt.title("Mode singular value spectra")
plt.show()

Reconstructing the Tensor 

ho_rsvd() returns the factor matrices but not the core tensor \(\mathcal{G}\). Use reconstruct() to compute the dense Tucker approximation in a single call:

import numpy as np
from tensorrsvd import ho_rsvd, reconstruct

U_list, S_list = ho_rsvd(
    tensor=my_tensor,
    tensor_shape=tensor_shape,
    dtype=np.float64,
    rank=3,
    num_oversamples=10,
    num_idxs=3,
)

T_hat = reconstruct(my_tensor, tensor_shape, U_list, dtype=np.float64)

# Materialize the original for comparison
grids = [np.arange(n) / max(n - 1, 1) for n in tensor_shape]
coords = np.meshgrid(*grids, indexing="ij")
T_true = my_tensor(*coords)

rel_err = np.linalg.norm(T_true - T_hat) / np.linalg.norm(T_true)
print(f"Relative error: {rel_err:.2e}")

reconstruct() applies the Tucker projection \(\hat{\mathcal{T}} = \mathcal{T} \times_0 P_0 \times_1 P_1 \cdots\), where \(P_m = U_m U_m^\top\), without explicitly forming the core tensor.

Note

Reconstruction requires materializing \(\mathcal{T}\) as a dense array, which defeats the memory savings for large tensors. In practice, the factor matrices and singular values are often sufficient for downstream tasks (dimensionality reduction, feature extraction, compression).

Switching Backends 

Pass backend="numpy" (default), "jax", or "cupy" to ho_rsvd():

Backend	Required package	Notes
`"numpy"`	(always available)	CPU, single-threaded. Default choice.
`"jax"`	`pip install ".[jax]"`	CPU / GPU / TPU. Operators are JIT-compiled on first call; subsequent calls are fast. Tensors must be JAX-traceable.
`"cupy"`	`pip install ".[cupy]"`	NVIDIA GPU only. Requires CUDA.

JAX example

import jax.numpy as jnp
from tensorrsvd import ho_rsvd

def gaussian_jax(x0, x1, x2):
    return jnp.exp(-(x0**2 + x1**2 + x2**2))

U_list, S_list = ho_rsvd(
    tensor=gaussian_jax,
    tensor_shape=(64, 64, 64),
    dtype=jnp.float32,
    rank=8,
    num_oversamples=10,
    num_power_iterations=2,
    num_idxs=3,
    backend="jax",
)

Warning

JAX returns its own array type (jaxlib.xla_extension.ArrayImpl). Convert to NumPy with numpy.array(U_list[0]) if you need standard NumPy arrays downstream.

Choosing Good Parameters 

Rank

Set rank to the expected Tucker rank of your tensor, or to the largest rank you can afford computationally. For unknown tensors, start with a conservative estimate and check the singular value decay.

Oversampling

The default num_oversamples=10 works well for most problems. Increase to 20–30 if you observe large errors or if the singular values decay slowly.

Power iterations

For tensors with slowly decaying singular values (flat spectra), increase num_power_iterations to 1 or 2. Each additional iteration adds two more passes through the operator but typically gives a significant accuracy boost. Beyond 3 iterations, improvements are usually marginal.

Grid size

Larger grids (bigger tensor_shape) increase the cost of each matrix–vector product. The total cost scales roughly as \(\mathcal{O}(k \cdot (r + p) \cdot q \cdot n_{\max} \cdot N_{\max})\), where \(q\) is num_power_iterations and \(N_{\max} = \prod_{j \ne m} n_j\).

User Guide

Defining a Tensor as a Callable

Running ho_rsvd

Interpreting the Output

Reconstructing the Tensor

Switching Backends

Choosing Good Parameters