My research covers a broad range of statistical learning and advanced machine learning methods that are needed in modern data analysis problems. The aim is to take into account for a) complexity of the data including latent low rank structures and subspaces, sparsity and missing values, or the sheer variety of the data, b) large scale settings which refers to high-dimensionality but also settings where the sample size is smaller or not much larger than the data dimension which make traditional asymptotically optimal methods perform poorly and c) dynamic nature of the data, or data velocity, where data accumulates or streams at fast pace and thus batch processing is not a viable solution.

A particular focus area is the analysis of *high-dimensional (HD)* sparse data,
i.e., data for which the sample size *n* is
smaller or not much larger than the dimension *p* of the data set, where *p* is
potentially very large. In such cases, the number of parameters to estimate can greatly
exceed the number of observations. For example, in genomic studies, *n* is often
the number of patients (only few tens!) and *p* is the number of genes (tens of thousands).
Yet a classifier needs to learn a high-dimensional parameter with a limited data.
This demands new approaches such as regularized optimization or imposing some structure on
the unknown parameter to reduce the number of unknowns.
Sparse HD data sets are becoming more common place in practice, particularly with the
development of areas such as genomics, multimedia imaging, or financial economics.

# Doctoral students

**
Elias Raninen** (6/2017- present)

*Topic*: classification, high-dimensional covariance estimation,
statistical learning with sparsity, regularization.

*M.Sc. student* (6/2016-5/2017). Thesis:Scaled sparse linear regression with the elastic net

**Muhammad Naveed Tabassum** (1/2016 - present)

*topic*: bioinformatics, compressed beamforming, sparse learning with sparsity

# Post-docs

**Nora Ouzir** (11/2018 - 12/2020)

*Topic*: ultrasound imaging, dictionary learning

**Jari Miettinen** (8/2017 - 7/2020)

*Topic*: Graph signal processing

**Ammar Mian** (10/2019 - 9(2020)

*Topic*: Machine learning methods on manifolds, pedestrian detection.

# Alumni

**Shahab Basiri**
(Reserach Scientist at Varian Medical Systems)

doctoral student 3/2014 - 6/2018. Thesis:
Robust large-scale statistical inference and ICA using bootstrapping

M.Sc. student 7/2012 – 2/2014. Thesis: Hypothesis Testing in Independent Component Analysis / Blind Source Separation

**Aqib Ejaz**
(Senior Engineer at Emberion Oy)

M.Sc. student, 12/2014 - 5/2015. Thesis:
Algorithms for Sparse Signal Recovery in Compressed Sensing

**Alireza Razavi** (Senior AI Development Engineer at Scania Group)

Post-doctoral fellow, 9/2011-8/2012.