Machine Learning and Data Analysis

Vadim V. Strijov · ORCID · Math-Net · Google Scholar

List of publications

2022

Isachenko R.V., Strijov V.V. Quadratic programming feature selection for multicorrelated signal decoding with partial least squares // Expert Systems with Applications, 2022, 207 : 117967. Article

Abstract: This paper investigates the dimensionality reduction problem for signal decoding. Its main application is brain-computer interface modeling. The challenge is high redundancy in the data description. The data combine time series of two origins: brain cortex signals in the design space and limb motion signals in the target space. High correlations among measurements of complex signals lead to multiple correlations. The paper studies correlations in the input and target spaces, which carry heterogeneous data. It proposes feature selection algorithms to construct a simple and stable forecasting model. The algorithms extend the ideas of the quadratic programming feature selection (QPFS) approach and select non-correlated features that are relevant to the target. The proposed methods take into account dependencies in both the design and target spaces and select features that fit both spaces jointly. The computational experiment was carried out using an electrocorticogram (ECoG) dataset. The obtained models predict hand motions using signals of the brain cortex. The partial least squares (PLS) regression model is used as the base model for dimensionality reduction. The best result is obtained by the PLS algorithm combined with QPFS dimensionality reduction.

BibTeX:

 
@article{IsachenkoStrijov2022Decoding, 
  author = {Isachenko, R. V. and Strijov, V. V.},
  title = {Quadratic programming feature selection for multicorrelated signal decoding with partial least squares},
  journal = {Expert Systems with Applications},
  year = {2022},
  volume = {207},
  pages = {117967},
  url = {/papers/isachenko2022qpfs_decoding.pdf},
  doi = {10.1016/j.eswa.2022.117967}
}

Grabovoy A.V., Gadaev T.S., Motrenko A.P., Strijov V.V. Numerical methods of sufficient sample size estimation for generalised linear models // Lobachevskii Journal of Mathematics, 2022, 43 : 2453-2462. Article

Abstract: This paper investigates the problem of cost reduction of data collection procedures. A sample set of minimum sufficient size must be collected to select an adequate regression or classification model. This sample set is modeled to follow the data generation hypotheses. Namely, the generalized linear regression models assume an independent and identically distributed target variable. The paper analyses several numerical methods of sample size estimation and compares them in practical terms. It includes statistical, heuristic, and Bayesian methods. The practical goal of sample set collection is modeling. Some methods involve analysis of the model parameters. The computational experiment includes widely used sample sets. The open-source code and the software are provided for practitioners to use in data collection planning.

BibTeX:

 
@article{Grabovoy2021SampleSize, 
  author = {Grabovoy, A. V. and Gadaev, T. S. and Motrenko, A. P. and Strijov, V. V.},
  title = {Numerical methods of sufficient sample size estimation for generalised linear models},
  journal = {Lobachevskii Journal of Mathematics},
  year = {2022},
  volume = {43},
  pages = {2453--2462},
  url = {http://links.springernature.com/f/a/W3ZeXWVkoEFhJMHWyIrhKQ  /AABE5gA /RgRljdB7P0RcaHR0cHM6Ly90cmVidWNoZXQucHVibGljLnNwcmluZ2VybmF0dXJlLmFwcC9nZXRfY29udGVudC85MDUyYjgyNS05MmFkLTQyMmItOWVjNy1jNzhmMmY1OGI3OGNXA3NwY0IKY6J7S6tjKeLdUlIUc3RyaWpvdkBwaHlzdGVjaC5lZHVYBAAABy0 },
  doi = {10.1134/S1995080222120125}
}

Grabovoy A.V., Strijov V.V. Probabilistic Interpretation of the Distillation Problem // Automation and Remote Control, 2022, 83(1) : 123-137. Article

Abstract: The article deals with methods for reducing the complexity of approximating models. A probabilistic substantiation of distillation and privileged teaching methods is proposed. General conclusions are given for an arbitrary parametric function with a predetermined structure. A theoretical basis is demonstrated for the special cases of linear and logistic regression. The analysis of the considered models is carried out in a computational experiment on synthetic samples and real data. The FashionMNIST and Twitter Sentiment Analysis samples are used as the real data.

BibTeX:

 
@article{Grabovoy2021Distilling, 
  author = {Grabovoy, A. V. and Strijov, V. V.},
  title = {Probabilistic Interpretation of the Distillation Problem},
  journal = {Automation and Remote Control},
  year = {2022},
  volume = {83},
  number = {1},
  pages = {123--137},
  url = {https://trebuchet.public.springernature.app/get_content/4df58851-23f3-4ec8-95f8-30eff603197f},
  doi = {10.1134/S000511792201009X}
}

Bazarova A.I., Grabovoy A.V., Strijov V.V. Analysis of the properties of probabilistic models in expert-augmented learning problems // Automation and Remote Control, 2022, 83 : 1527-1537. Article

Abstract: The paper deals with the construction of interpretable machine learning models. The approximation problem is solved for a set of shapes on a contour image. Assumptions that the shapes are second-order curves are introduced. When approximating the shapes, information about the type, location, and shape of curves as well as about the set of their possible transformations is used. Such information is called expert information, and the machine learning method based on expert information is called expert-augmented learning. It is assumed that the set of shapes is approximated by the set of local models. Each local model based on expert information approximates one shape on the contour image. To construct the models, it is proposed to map second-order curves into a feature space in which each local model is linear. Thus, second-order curves are approximated by a set of linear models. In a computational experiment, the problem of approximating an iris on a contour image is considered.

BibTeX:

 
@article{B2021BayesianDistilationRu, 
  author = {Bazarova, A. I. and Grabovoy, A. V. and Strijov, V. V.},
  title = {Analysis of the properties of probabilistic models in expert-augmented learning problems},
  journal = {Automation and Remote Control},
  year = {2022},
  volume = {83},
  pages = {1527--1537},
  url = {https://link.springer.com/epdf/10.1134/S00051179220100058?sharing_token=hAPcnuIqzQzbt4k9e1mK60ckSORA_DxfnEvY7GoQybYVd6LPNBk87BsZksMeOmQTQkPHqNC0C0hhH4wgkIwUBXiYnzpFiL-xlzke_QsjGa9T079qlMNETVn8oSyj0Oa8YO234_op_q_nnSelEgSihsbSeTNLMy5eQfqTNKRAu-E=},
  doi = {10.1134/S00051179220100058}
}

Gorpinich M., Bakhteev O.Y., Strijov V.V. Gradient Methods for Optimizing Metaparameters in the Knowledge Distillation Problem // Automation and Remote Control, 2022, 83(10) : 1544-1554. Article

Abstract: The paper investigates the distillation problem for deep learning models. Knowledge distillation is a metaparameter optimization problem in which information from a model of a more complex structure, called a teacher model, is transferred to a model of a simpler structure, called a student model. The paper proposes a generalization of the distillation problem for the case of optimization of metaparameters by gradient methods. Metaparameters are the parameters of the distillation optimization problem. The loss function for such a problem is the sum of the classification term and the cross-entropy between the responses of the student model and the teacher model. Assigning optimal metaparameters to the distillation loss function is a computationally difficult task. The properties of the optimization problem are investigated to predict the metaparameter update trajectory. An analysis of the trajectory of the gradient optimization of metaparameters is carried out, and their value is predicted using linear functions. The proposed approach is illustrated using a computational experiment on CIFAR-10 and Fashion-MNIST samples and synthetic data.

BibTeX:

 
@article{Gorpinich_2022, 
  author = {M. Gorpinich and O. Yu. Bakhteev and V. V. Strijov},
  title = {Gradient Methods for Optimizing Metaparameters in the Knowledge Distillation Problem},
  journal = {Automation and Remote Control},
  year = {2022},
  volume = {83},
  number = {10},
  pages = {1544--1554},
  url = {https://trebuchet.public.springernature.app/get_content/8c4414a5-9e0f-461f-b2f5-d406954a9017},
  doi = {10.1134/s00051179220100071}
}

Motrenko A., Simchuk E., Khairullin R., Inyakin A., Kashirin D., Strijov V.V. Continuous physical activity recognition for intelligent labour monitoring // Multimedia Tools and Applications, 2022, 81(4) : 4877-4895. Article

Abstract: The paper addresses the problem of human activity recognition based on data from wearable sensors. Human activity recognition depends on a wide context of actions. Activities cannot be recognized from the local shape of sensor signals only. We propose a solution to the problem of human activity recognition applied to labour monitoring. The solution is based on the hierarchical representation of activities as sets of low-level actions. Viewing activities as sequences of actions allows exploring activities in a more condensed representation than time series. The hierarchical representation provides an interpretable description of studied activities in terms of actions. To obtain this hierarchical representation, one must first solve the problem of low-level action recognition. Though widely studied, the problem of action recognition requires overcoming several difficulties. Firstly, we show that using noise-aware self-learning methods can significantly improve classification quality in human activity recognition. Since time series are human-labeled, errors are inevitable and abundant. Noisy labels significantly worsen classification quality. Noise-aware learning allows for relaxing requirements for labeling precision and lowering annotation costs. Secondly, we propose an algorithm of automatic pattern selection to generate low-level descriptions as an unsupervised alternative. The proposed method is based on Eamonn Keogh's time series indexing methods. We introduce local PCA projections to make the method more robust to spatial rotations of a wearable device.

BibTeX:

 
@article{motrenko2020continous, 
  author = {Motrenko, Anastasia and Simchuk, Egor and Khairullin, Renat and Inyakin, Andrey and Kashirin, Daniil and Strijov, Vadim Victor},
  title = {Continuous physical activity recognition for intelligent labour monitoring},
  journal = {Multimedia Tools and Applications},
  year = {2022},
  volume = {81},
  number = {4},
  pages = {4877--4895},
  url = {https://doi.org/10.1007/s11042-021-11288-y},
  doi = {10.1007/s11042-021-11288-y}
}

Neychev R.G., Shibaev I.A., Strijov V.V. Optimal spanning tree reconstruction in symbolic regression // Informatics and Applications, 2022. Article
