Download this article
 Download this article For screen
For printing
Recent Issues
Volume 20, Issue 1
Volume 19, Issue 1
Volume 18, Issue 1
Volume 17, Issue 1
Volume 16, Issue 2
Volume 16, Issue 1
Volume 15, Issue 2
Volume 15, Issue 1
Volume 14, Issue 2
Volume 14, Issue 1
Volume 13, Issue 2
Volume 13, Issue 1
Volume 12, Issue 1
Volume 11, Issue 2
Volume 11, Issue 1
Volume 10, Issue 2
Volume 10, Issue 1
Volume 9, Issue 2
Volume 9, Issue 1
Volume 8, Issue 1
Volume 7, Issue 2
Volume 7, Issue 1
Volume 6, Issue 1
Volume 5, Issue 2
Volume 5, Issue 1
Volume 4, Issue 1
Volume 3, Issue 1
Volume 2, Issue 1
Volume 1, Issue 1
The Journal
About the journal
Ethics and policies
Peer-review process
 
Submission guidelines
Submission form
Editorial board
 
Subscriptions
 
ISSN 2157-5452 (electronic)
ISSN 1559-3940 (print)
 
Author index
To appear
 
Other MSP journals
Linearly scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

Illia Horenko and Lukáš Pospíšil

Vol. 19 (2024), No. 1, 87–102
Abstract

In many data science applications, the objective is to extract appropriately ordered smooth low-dimensional data patterns from high-dimensional data sets. This is challenging since common sorting algorithms are primarily aiming at finding monotonic orderings in low-dimensional data, whereas typical dimension reduction and feature extraction algorithms are not primarily designed for extracting smooth low-dimensional data patterns. We show that when selecting the Euclidean smoothness as a pattern quality criterion, both of these problems (finding the optimal “crisp” data permutation and extracting the sparse set of permuted low-dimensional smooth patterns) can be efficiently solved numerically as one unsupervised entropy-regularized iterative optimization problem. We formulate and prove the conditions for monotonicity and convergence of this linearly scalable (in dimension) numerical procedure, with the iteration cost scaling of 𝒪(DT2), where T is the size of the data statistics and D is a feature space dimension. The efficacy of the proposed method is demonstrated through the examination of synthetic examples as well as a real-world application involving the identification of smooth bankruptcy risk minimizing transition patterns from high-dimensional economical data. The results showcase that the statistical properties of the overall time complexity of the method exhibit linear scaling in the dimensionality D within the specified confidence intervals.

Keywords
entropy, regularization, permutation, economical data, unsupervised learning
Mathematical Subject Classification
Primary: 68Q32, 68T01, 68T99
Milestones
Received: 25 August 2023
Revised: 16 May 2024
Accepted: 4 June 2024
Published: 24 October 2024
Authors
Illia Horenko
Faculty of Mathematics
RPTU Kaiserslautern-Landau
67663 Kaiserslautern
Germany
Lukáš Pospíšil
Department of Mathematics, Faculty of Civil Engineering
VSB – Technical University of Ostrava
70833 Ostrava
Czech Republic