Linearly scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

In many data science applications, the objective is to extract appropriately ordered smooth low-dimensional data patterns from high-dimensional data sets. This is challenging since common sorting algorithms are primarily aiming at finding monotonic orderings in low-dimensional data, whereas typical dimension reduction and feature extraction algorithms are not primarily designed for extracting smooth low-dimensional data patterns. We show that when selecting the Euclidean smoothness as a pattern quality criterion, both of these problems (finding the optimal “crisp” data permutation and extracting the sparse set of permuted low-dimensional smooth patterns) can be efficiently solved numerically as one unsupervised entropy-regularized iterative optimization problem. We formulate and prove the conditions for monotonicity and convergence of this linearly scalable (in dimension) numerical procedure, with the iteration cost scaling of $𝒪 (D T^{2})$ , where $T$ is the size of the data statistics and $D$ is a feature space dimension. The efficacy of the proposed method is demonstrated through the examination of synthetic examples as well as a real-world application involving the identification of smooth bankruptcy risk minimizing transition patterns from high-dimensional economical data. The results showcase that the statistical properties of the overall time complexity of the method exhibit linear scaling in the dimensionality $D$ within the specified confidence intervals.

Keywords

entropy, regularization, permutation, economical data, unsupervised learning

Mathematical Subject Classification

Primary: 68Q32, 68T01, 68T99

Milestones

Received: 25 August 2023

Revised: 16 May 2024

Accepted: 4 June 2024

Published: 24 October 2024

Authors

Illia Horenko
Faculty of Mathematics RPTU Kaiserslautern-Landau 67663 Kaiserslautern Germany

Lukáš Pospíšil
Department of Mathematics, Faculty of Civil Engineering VSB – Technical University of Ostrava 70833 Ostrava Czech Republic

Vol. 19, No. 1, 2024

Illia Horenko and Lukáš Pospíšil

Abstract

Keywords

Mathematical Subject Classification

Milestones

Authors