Kernel Manifold Alignment (KEMA)

Performs Kernel Manifold Alignment for supervised or semi-supervised domain adaptation. Data from multiple domains are projected into a shared latent space that preserves the manifold structure of each domain while aligning same-class samples across domains.

The multidesign method splits the data by the subject variable and treats each resulting block as a domain. The hyperdesign method aligns the domains already defined by the hyperdesign object.

Usage

kema(data, y, ...)

# S3 method for class 'multidesign'
kema(
  data,
  y,
  subject,
  preproc = center(),
  ncomp = 2,
  knn = 5,
  sigma = 0.73,
  u = 0.5,
  kernel = coskern(),
  sample_frac = 1,
  use_laplacian = TRUE,
  solver = "regression",
  backend = "auto",
  backend_control = NULL,
  dweight = 0.1,
  rweight = 0,
  simfun = neighborweights::binary_label_matrix,
  disfun = NULL,
  lambda = 1e-04,
  centre_kernel = FALSE,
  ...
)

# S3 method for class 'hyperdesign'
kema(
  data,
  y,
  preproc = center(),
  ncomp = 2,
  knn = 5,
  sigma = NULL,
  u = 0.5,
  kernel = NULL,
  sample_frac = 1,
  use_laplacian = TRUE,
  solver = "regression",
  backend = "auto",
  backend_control = NULL,
  dweight = 0.1,
  rweight = 0,
  simfun = neighborweights::binary_label_matrix,
  disfun = NULL,
  lambda = 1e-04,
  centre_kernel = FALSE,
  ...
)

# Default S3 method
kema(data, ...)

Arguments

data: A hyperdesign or multidesign object. A hyperdesign already contains multiple data domains; a multidesign is split into domains by `subject`.
y: Name of the label variable to use for alignment (can contain NA for unlabeled samples)
...: Additional arguments (currently unused)
subject: Name of the subject variable that defines the domains/strata (multidesign method only)
preproc: Preprocessing function to apply to the data (default: center())
ncomp: Number of components to extract (default: 2)
knn: Number of nearest neighbors for graph construction (default: 5)
sigma: Kernel bandwidth parameter (default: 0.73 for the multidesign method, NULL for the hyperdesign method)
u: Trade-off parameter between data geometry and class alignment (0-1, default: 0.5)
kernel: Kernel function to use (default: coskern() for the multidesign method, NULL for the hyperdesign method)
sample_frac: Fraction of samples to use for kernel approximation (default: 1)
use_laplacian: Deprecated compatibility argument; ignored.
solver: Deprecated compatibility argument; accepted values are `"regression"` and `"exact"`, but both currently route to the original KEMA solver.
backend: Backend for the original eigensolver. One of `"auto"`, `"full_exact"`, `"reduced_exact"`, or `"operator_exact"`.
backend_control: Optional list controlling auto backend thresholds and fidelity checks (passed through to `kema_orig()`).
dweight: Deprecated compatibility argument; ignored.
rweight: Deprecated compatibility argument; ignored.
simfun: Deprecated compatibility argument; ignored.
disfun: Deprecated compatibility argument; ignored.
lambda: Regularization parameter for matrix conditioning (default: 0.0001)
centre_kernel: Deprecated compatibility argument; ignored.

Value

A multiblock_biprojector object containing:

s: Scores (embedded coordinates) for all samples
v: Primal vectors (feature weights) for out-of-sample projection
sdev: Standard deviations of the components
alpha: Dual coefficients in kernel space
Additional metadata for reconstruction and validation

Details

KEMA is designed for multi-domain data where you want to find a common representation that preserves both the intrinsic geometry of each domain and the class structure across domains. It supports semi-supervised learning with missing labels (NA values).
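
To make the semi-supervised part concrete, the sketch below shows one way same-class and different-class graphs can be built from partially labeled data in base R, with NA samples left unconnected. The toy labels and the names `W_s`, `W_d` and `laplacian()` are illustrative assumptions; the graphs that `kema_orig()` builds internally may differ in weighting or normalization.

```r
# Toy labels pooled across two domains; NA marks unlabeled samples.
y <- c("a", "b", NA, "a", "b", NA)

labeled <- !is.na(y)

# TRUE where two samples share a label; comparisons involving NA give NA,
# which we treat as "no known relationship".
same <- outer(y, y, "==")
same[is.na(same)] <- FALSE

# Similarity graph: connect labeled samples that share a class.
W_s <- same * 1
diag(W_s) <- 0

# Dissimilarity graph: connect labeled samples from different classes.
W_d <- (outer(labeled, labeled, "&") & !same) * 1
diag(W_d) <- 0

# Unnormalized graph Laplacian: degree matrix minus adjacency matrix.
laplacian <- function(W) diag(rowSums(W)) - W
L_s <- laplacian(W_s)
L_d <- laplacian(W_d)
```

In this sketch the unlabeled samples (positions 3 and 6) have all-zero rows in both Laplacians, so they would influence an alignment only through the geometry term.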

`kema()` delegates to `kema_orig()`, a paper-faithful implementation of the generalized eigenproblems from Tuia & Camps-Valls (2016). It solves the original objective $$K(L+\mu L_s)K\Lambda = \lambda K L_d K\Lambda$$ and, when `sample_frac < 1`, its reduced-rank REKEMA counterpart. In the paper's notation, $K$ is the block-diagonal kernel matrix over all domains, $L$ is the graph Laplacian encoding the geometry of each domain, and $L_s$ and $L_d$ are the Laplacians of the same-class and different-class graphs.

Legacy extension arguments remain in the API for backward compatibility but are ignored by the current implementation.
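
As a rough illustration of the eigenproblem above, the following base-R sketch builds toy versions of $K$, $L$, $L_s$ and $L_d$ and solves the generalized problem through a Cholesky reduction. It is not the `kema_orig()` code: the RBF kernel, the single pooled domain, the values of `mu` and `ridge`, and the choice to add the ridge to the right-hand side are all assumptions made for this example.

```r
set.seed(1)
n <- 8
X <- matrix(rnorm(n * 3), n, 3)      # toy data, one pooled domain
y <- rep(c("a", "b"), each = 4)      # fully labeled for simplicity

# RBF kernel matrix (assumed kernel, bandwidth 1).
D <- as.matrix(dist(X))
K <- exp(-D^2 / 2)

# Geometry graph: symmetrized k-nearest-neighbor adjacency.
knn <- 3
W <- matrix(0, n, n)
for (i in seq_len(n)) {
  nb <- order(D[i, ])[2:(knn + 1)]   # position 1 is the sample itself
  W[i, nb] <- 1
}
W <- pmax(W, t(W))

# Class graphs and unnormalized Laplacians.
same <- outer(y, y, "==")
W_s <- same * 1; diag(W_s) <- 0
W_d <- (!same) * 1
laplacian <- function(W) diag(rowSums(W)) - W
L <- laplacian(W); L_s <- laplacian(W_s); L_d <- laplacian(W_d)

mu <- 0.5       # hypothetical trade-off weight
ridge <- 1e-4   # hypothetical conditioning term

A <- K %*% (L + mu * L_s) %*% K
B <- K %*% L_d %*% K + ridge * diag(n)
A <- (A + t(A)) / 2                  # remove floating-point asymmetry
B <- (B + t(B)) / 2

# Reduce A v = lambda B v to a symmetric standard problem via B = t(R) %*% R.
R <- chol(B)
Rinv <- backsolve(R, diag(n))
C <- crossprod(Rinv, A %*% Rinv)
eig <- eigen((C + t(C)) / 2, symmetric = TRUE)

# Keep the smallest eigenvalues, skipping the (numerically) zero one that
# corresponds to a constant embedding in this toy setup.
ncomp <- 2
keep <- which(eig$values > 1e-6)
idx <- keep[order(eig$values[keep])][seq_len(ncomp)]

alpha <- Rinv %*% eig$vectors[, idx] # dual coefficients
scores <- K %*% alpha                # embedded coordinates
```

The Cholesky route keeps the reduced problem symmetric, which avoids the complex eigenvalues that a direct `eigen(solve(B, A))` call can produce from rounding error.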

References

Tuia, D., & Camps-Valls, G. (2016). Kernel manifold alignment for domain adaptation. PLoS ONE, 11(2), e0148655.

Examples