Binary and Boolean matrix factorizations

low-rank matrix factorization,; optimization; algorithms; binary; boolean; semi-definite programming; matrix tri-factorization; integer programming; clustering; topic modelling

Abstract :

[en] The processing of data has become an important part of many applications during the last years. Such applications include hyperspectral unmixing, recommender sys- tems (for example in services like Spotify and Netflix), computer vision, text mining and audio source separation, to name a few. Many models and algorithms have been proposed, including linear dimensionality reduction (LDR), the perceptron, a pre- cursor to modern neural networks, expectation-maximization, and the more recent algorithms for neural networks. Our focus in this thesis is on a subclass of ML models called matrix factorization models. These are unsupervised algorithms whose goal is to decompose a given data matrix into a product of two smaller matrices, referred to as factors. Both factors are typically significantly smaller than the initial data matrix. There are multiple applications for matrix factorizations. Compression is one of them, since the factors are smaller than the original matrix. In text mining, matrix factorization retrieves topics from a collection of documents. In recommender systems, it creates groupings (clusters) of data points with common characteristics. As an example, when the entries of the input matrix represent a measure of the interaction between a user (rows of the matrix) and a product (columns of the matrix), these clusters represent groups with similar tastes. In addition, depending on the application, constraints can be imposed to the factors. An example of a constrained model is nonnegative matrix factorization (NMF) where the elements of the factors need to be nonnegative. In this thesis, we focus on three models: (1) semi-binary Matrix Factorization (semi-bMF), where one factor has no constraints and the other can only have elements that are either 0 or 1, (2) Boolean matrix factorization (BMF) where both factors must only have elements that are either 0 or 1 and (3) Boolean matrix tri-factorization (BMTF), where we are factorizing into three factors. Furthermore, the matrix product used in the Boolean factorizations is the Boolean matrix product, which is different than the standard matrix product operation. This makes the computation harder but allows for better approximations and additional interpretability properties. For both models, we present new algorithms that are competitive with the state of the art and show that they can provide meaningful results in several applications, including topic modeling, image analysis, and clustering.

Disciplines :

Mathematics
Computer science

Author, co-author :

Kolomvakis, Christos ; Université de Mons - UMONS > Recherche > Service ERC Unit - Matrix Theory and Optimization

Language :

English

Title :

Binary and Boolean matrix factorizations

Defense date :

28 January 2026

Number of pages :

110

Institution :

UMONS - University of Mons [FPMS], Belgium

Degree :

Doctorat en Science de l’ingénieur et technologie

Promotor :

Gillis, Nicolas ; Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle

Vandaele, Arnaud ; Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle

President :

Lecron, Fabian ; Université de Mons - UMONS > Faculté Polytechnique > Service de Management de l'Innovation Technologique

Secretary :

Glineur, François; UCL - Catholic University of Louvain

Jury member :

De Lathauwer, Lieven; KU Leuven - Catholic University of Leuven

Miettinen, Pauli; UEF - University of Eastern Finland

Research unit :

F151 - Mathématique et Recherche opérationnelle

Research institute :

R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique

European Projects :

HE - 101085607 - eLinoR - Beyond Low-Rank Factorizations

Funders :

EU - European Union

Available on ORBi UMONS :

since 29 January 2026

Statistics

Number of views

76 (5 by UMONS)

Number of downloads

229 (5 by UMONS)

More statistics