Doctoral thesis (Dissertations and theses)
Binary and Boolean matrix factorizations
Kolomvakis, Christos
2026
 

Files


Full Text
Binary and Boolean matrix factorizations.pdf
Author postprint (2.72 MB) Creative Commons License - Public Domain Dedication
Download

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Keywords :
low-rank matrix factorization,; optimization; algorithms; binary; boolean; semi-definite programming; matrix tri-factorization; integer programming; clustering; topic modelling
Abstract :
[en] The processing of data has become an important part of many applications during the last years. Such applications include hyperspectral unmixing, recommender sys- tems (for example in services like Spotify and Netflix), computer vision, text mining and audio source separation, to name a few. Many models and algorithms have been proposed, including linear dimensionality reduction (LDR), the perceptron, a pre- cursor to modern neural networks, expectation-maximization, and the more recent algorithms for neural networks. Our focus in this thesis is on a subclass of ML models called matrix factorization models. These are unsupervised algorithms whose goal is to decompose a given data matrix into a product of two smaller matrices, referred to as factors. Both factors are typically significantly smaller than the initial data matrix. There are multiple applications for matrix factorizations. Compression is one of them, since the factors are smaller than the original matrix. In text mining, matrix factorization retrieves topics from a collection of documents. In recommender systems, it creates groupings (clusters) of data points with common characteristics. As an example, when the entries of the input matrix represent a measure of the interaction between a user (rows of the matrix) and a product (columns of the matrix), these clusters represent groups with similar tastes. In addition, depending on the application, constraints can be imposed to the factors. An example of a constrained model is nonnegative matrix factorization (NMF) where the elements of the factors need to be nonnegative. In this thesis, we focus on three models: (1) semi-binary Matrix Factorization (semi-bMF), where one factor has no constraints and the other can only have elements that are either 0 or 1, (2) Boolean matrix factorization (BMF) where both factors must only have elements that are either 0 or 1 and (3) Boolean matrix tri-factorization (BMTF), where we are factorizing into three factors. Furthermore, the matrix product used in the Boolean factorizations is the Boolean matrix product, which is different than the standard matrix product operation. This makes the computation harder but allows for better approximations and additional interpretability properties. For both models, we present new algorithms that are competitive with the state of the art and show that they can provide meaningful results in several applications, including topic modeling, image analysis, and clustering.
Disciplines :
Mathematics
Computer science
Author, co-author :
Kolomvakis, Christos ;  Université de Mons - UMONS > Recherche > Service ERC Unit - Matrix Theory and Optimization
Language :
English
Title :
Binary and Boolean matrix factorizations
Defense date :
28 January 2026
Number of pages :
110
Institution :
UMONS - University of Mons [FPMS], Belgium
Degree :
Doctorat en Science de l’ingénieur et technologie
Promotor :
Gillis, Nicolas  ;  Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle
Vandaele, Arnaud ;  Université de Mons - UMONS > Faculté Polytechnique > Service de Mathématique et Recherche opérationnelle
President :
Lecron, Fabian ;  Université de Mons - UMONS > Faculté Polytechnique > Service de Management de l'Innovation Technologique
Secretary :
Glineur, François;  UCL - Catholic University of Louvain
Jury member :
De Lathauwer, Lieven;  KU Leuven - Catholic University of Leuven
Miettinen, Pauli;  UEF - University of Eastern Finland
Research unit :
F151 - Mathématique et Recherche opérationnelle
Research institute :
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
European Projects :
HE - 101085607 - eLinoR - Beyond Low-Rank Factorizations
Funders :
EU - European Union
Available on ORBi UMONS :
since 29 January 2026

Statistics


Number of views
76 (5 by UMONS)
Number of downloads
229 (5 by UMONS)

Bibliography


Similar publications



Contact ORBi UMONS