Fundamental Machine Learning Theorems

About

The goal of the course is to learn how to rigorously formulate machine learning problems and to show the role of the mathematical approach. The skills acquired in this course are the basis for writing Bachelor’s, Master’s and Ph.D works in machine learning and applied mathematics. The course includes discussions of axiomatic systems, theorems and their proofs that are relevant in machine learning. Their influence on practical applications is discussed.

Syllabus

Gauss-Markov theorem
Singular Value Decomposition theorem
Principal Component Analysis and Karhunen-Loeve Decomposition
Karush-Kuhn-Tucker theorem and optimization
Kolmogorov and Arnold theorems, Tsybenko universal approximator theorem, deep neural network theorem
The free lunch theorem in machine learning, Wolpert
Metric spaces: RKHS Aronzhine, Mercer’s theorem
Scheme theorem, Holland
Convolution theorem, Parseval’s theorem
PAC-learning, compression implies learning
Representation theorem
Theorems about boosting, bootstrap
Variational approximation
Takens’ embedding theorem
Green, Stokes, deRham theorems in geometric learning

Labworks

Presentation of the formulations, proofs and methods of application, significance, of theorems.

Grading

The quality of presentations and the quality of questions to the speaker from each student.

Prerequisites

Algebra and analysis of the third year of MIPT.