Ilya Tolstikhin

Recent news

I am joining the Brain team at Google AI, Zurich, starting from July 2018.
Wasserstein auto-encoders got an oral at ICLR 2018. Unfortunately, I am not traveling this year, but Sylvain will! Make sure to find him at the conference..
I am helping my colleagues Ruth Urner and Michael Hirsch to organize the Machine Learning Summer School (MLSS) 2017.
I will be teaching at Bocconi Summer School in Advanced Statistics and Probability on 'Statistical Causal Learning' together with David Lopez-Paz and Bernhard Schoelkopf.

Ilya Tolstikhin

Picture by Bob Williamson, Dagstuhl, 2016

Feel free to contact me: iliya[dot]tolstikhin[at]gmail[dot]com

Currently I am a research scientist at Brain team, Google AI, Zurich.

Between 2014 and 2018 I worked as a postdoc at the Empirical Inference Department of Max Planck Institute for Intelligent Systems, Tübingen, Germany.
I received a diploma (MSc equivalent) in 2010 from Lomonosov Moscow State University and PhD in 2014 from Dorodnicyn
Computing Center of Russian Academy of Sciences where I worked with Konstantin Vorontsov on statistical learning theory.

Currently my research is focused on unsupervised (deep) generative models and representation learning. I would like to hopefully solve these tasks to a certain
degree in an understandable way, that is using approaches which are well motivated and easily adjustable to various scenarios.

In past I worked a lot on statistical learning theory and theory of machine learning in general. Particularly, I was interested in tight data-dependent generalization
error and excess risk bounds in machine learning, which could shed some light on the process of learning, potentially leading to new and more accurate learning
algorithms. Also I was interested in tools, used to achieve these goals (which are extremely interesting and rich fields of research on their own), including concentration
of measure inequalities and empirical process theory. I am still interested in these topics.

Publications

Preprints

GeNet: Deep Representations for Metagenomics

Mateo Rojas-Carulla, Ilya Tolstikhin, Guillermo Luque, Nicholas Youngblut, Ruth Ley, Bernhard Schölkopf
On the Latent Space of Wasserstein Auto-Encoders

Paul K. Rubenstein, Bernhard Schoelkopf, Ilya Tolstikhin.
From optimal transport to generative modeling: the VEGAN cookbook

Olivier Bousquet, Sylvain Gelly, Ilya Tolstikhin, Carl-Johann Simon-Gabriel, Bernhard Schoelkopf.
Probabilistic Active Learning of Functions in Structural Causal Models

Paul K. Rubenstein, Ilya Tolstikhin, Philipp Hennig, Bernhard Schoelkopf.
Minimax Lower Bounds for Realizable Transductive Classification

Ilya Tolstikhin, David Lopez-Paz. (First inequality of (4) and that of (5) are wrong)

Conference papers (chronologically ordered)

Differentially Private Database Release via Kernel Mean Embeddings

Matej Balog, Ilya Tolstikhin, Bernhard Schoelkopf.

International Conference on Machine Learning (ICML), 2018.
Wasserstein Auto-Encoders, [GitHub]

Ilya Tolstikhin, Olivier Bousquet, Sylvain Gelly, Bernhard Schoelkopf.

ICLR 2018 (full oral).
AdaGAN: Boosting Generative Models, [GitHub]

Ilya Tolstikhin, Sylvain Gelly, Olivier Bousquet, Carl-Johann Simon-Gabriel, Bernhard Schoelkopf.

NIPS 2017.
Consistent Kernel Mean Estimation for Functions of Random Variables

Adam Scibior, Carl-Johann Simon-Gabriel, Ilya Tolstikhin, Bernhard Schoelkopf.

NIPS 2016.
Minimax Estimation of Maximum Mean Discrepancy with Radial Kernels

Ilya Tolstikhin, Bharath Sriperumbudur, Bernhard Schoelkopf.

NIPS 2016.
Permutational Rademacher Complexity: a New Complexity Measure for Transductive Learning

Ilya Tolstikhin, Nikita Zhivotovskiy, and Gilles Blanchard.

Algorithmic Learning Theory (ALT), 2015.
Towards a Learning Theory of Cause-Effect Inference

David Lopez-Paz, Krikamol Muandet, Bernhard Schölkopf, Iliya Tolstikhin.

International Conference on Machine Learning (ICML), 2015.
Localized Complexities for Transductive Learning

Ilya Tolstikhin, Gilles Blanchard, and Marius Kloft.

Conference on Learning Theory (COLT), 2014. (Full oral presentation)

Note: There was a minor mistake in the assumptions of the Corollary 15.
PAC-Bayes-Empirical-Bernstein Inequality

Ilya Tolstikhin, Yevgeny Seldin.

Advances in Neural Information Processing Systems (NIPS), 2013. (Spotlight presentation / acceptance ratio = 5%)
Localized excess risk bounds in combinatorial theory of overfitting, (in Russian)

Ilya Tolstikhin.

9th International Conference on Intelligent Information Processing (IIP), 2012.
The probability of overfitting for the compact and sparse sets of classifiers, (in Russian)

Ilya Tolstikhin.

8th International Conference on Intelligent Information Processing (IIP), 2010.
Exact generalization error bound for one particular model of classifiers , (in Russian)

Ilya Tolstikhin.

17th International student, postgraduate and young scientist conference "Lomonosov", 2010.

Journal papers (chronologically ordered)

Minimax Estimation of Kernel Mean Embeddings

Ilya Tolstikhin, Bharath Sriperumbudur, Krikamol Muandet.

Journal of Machine Learning Research (JMLR), to appear 2017.
On two approaches to concentration for sampling without replacement , (in Russian)

Tolstikhin Ilya.

Theory of Probability and Its Applications, 2016.
Combinatorial bounds on probability of overﬁtting based on clustering and coverage of classifiers , (in Russian)

Alexander Frey, Tolstikhin Ilya.

Machine Learning and Data Analysis (JMLDA), 2013.

Others

B0 matrix shim array design-optimization of the position, geometry and the number of segments of individual coil elements

Zivkovic I., Tolstikhin I., Schölkopf B., Scheffler K.

33rd Annual Scientific Meeting of the European Society for Magnetic Resonance in Medicine and Biology (ESMRMB), 2016.

PhD Thesis

Неравенства концентрации вероятностной меры в трансдуктивном обучении и PAC-Байесовском анализе
(Concentration inequalities applied to transductive learning and PAC-Bayesian analysis). [Text], [Synopsis]
(in Russian, translation to English not in progress...)

Computing Centre of Russian Academy of Sciences, 2014.

Abstract: The dissertation examines the role of concentration inequalities in efforts to improve performance bounds of supervised learning algorithms. The motivation to obtain tight generalization error and excess risk bounds in statistical learning theory comes from the belief that a deep understanding of a learning process might lead us to new useful ideas and more accurate algorithms. First part of the work studies concentration inequalities for one particular setting of dependent random variables: when they are sampled without replacement from the given finite population. We provide two novel Bernstein-style concentration inequalities for suprema of empirical processes and sampling without replacement. While these new inequalities may potentially have broad applications, we exemplify their significance in the second part of the work by studying the transductive setting of statistical learning theory. For which we provide an excess risk bound based on the localized complexity of the hypothesis class which holds under very mild assumptions. Finally, the third part of the work studies the PAC-Bayesian analysis, which is a general tool for data-dependent analysis in machine learning. We derive a new PAC-Bayes-Empirical-Bernstein inequality which is a powerful Bernstein-style concentration inequality depending only on empirical quantities. We show that in a number of interesting situations our new PAC-Bayes-Empirical-Bernstein bound can be significantly tighter than the state-of-the-art results.

Talks

Talks in English

Wasserstein Auto-Encoders: from optimal transport to generative modeling and beyond, [Slides], [Talk]

Deep Vision Seminars, March 2018, QUVA Lab, Amsterdam.
Implicit generative models: dual vs. primal approaches, [Slides]

Machine Learning Summer School, 2017, Tuebingen.

Workshop on Stochastic Processes and Probabilistic Models in Machine Learning, 2017, Moscow.
Statistical Causal Learning

Bocconi Summer School on Advanced Statistics and Probability. July 10-22, 2017, Como, Italy.

Together with David Lopez-Paz and Bernhard Schoelkopf.
Consistent Kernel Mean Estimation for Functions of Random Variables

Dagstuhl workshop, "New Directions for Learning with Kernels and Gaussian Processes", Schloss Dagstuhl, Germany, 2016.
On some properties of MMD and its relation to other distances

Dagstuhl workshop, "Foundations of Unsupervised Learning", Schloss Dagstuhl, Germany, 2016.
Minimax Estimation of Kernel Mean Embeddings [Poster]

Spring School "Structural Inference", Brodten, Germany, 2016.
Global and Local Complexity Measures for Transductive Learning [Talk], [Poster], [Slides]

Yandex School of Data Analysis Conference, “Machine Learning: Prospects and Applications”, Berlin, Germany, 2015.
Sampling without replacement: reduction to i.i.d. VS direct approach [Slides],
Dagstuhl workshop, “Machine Learning with Interdependent and Non-Identically Distributed Data”, Schloss Dagstuhl, Germany, 2015.
Localized Complexities for Transductive Learning, COLT 2014, [Slides], [Poster], [Talk (videolectures.net)].

Ilya Tolstikhin, Gilles Blanchard, Marius Kloft.
New Concentration Inequalities for Sampling without Replacement and an Application to Transductive Learning [Slides]
MPI for Intelligent Systems, Tubingen, 2014.
PAC-Bayes-Empirical-Bernstein Inequality, NIPS 2013, [Spotlight], [Poster], [Talk (videolectures.net)].

Ilya Tolstikhin, Yevgeny Seldin.
PAC-Bayesian Inequalities for Martingales, GRAAL, Laval University, 2013, [Slides].
PAC-Bayes-Empirical-Bernstein Inequality, GRAAL, Laval University, 2013, [Slides].

Talks in Russian

Dissertation defense, Computing Centre of Russian Academy of Sciences, October 16th, 2014, [Slides], [Video in Russian]
Localized Complexities and Fast Rates in Statistical Learning Theory, Joint MIPT/IUM seminar on Stochastic Analysis, 2014, [Slides], [Video in Russian]
PAC-Bayesian Inequalities, Joint MIPT/IUM seminar on Stochastic Analysis, 2013, [Slides]
Concentration Inequalities for Sampling without Replacement, Joint MIPT/IUM seminar on Stochastic Analysis, 2013, [Slides]

Activities

Journals review:

Journal of Machine Learning Research (JMLR);
Annals of Statistics

Conferences review:

International Conference on Machine Learning (ICML, 2013, 2015, 2016);
Neural Information Processing Systems (NIPS, 2015, 2016);
International Conference on Artificial Intelligence and Statistics (AISTATS, 2014, 2015, 2016);
Algorithmic Learning Theory (ALT, 2015).

PC Member:

NIPS Workshop on New Directions in Transfer and Multi-Task: Learning Across Domains and Tasks (2013);
Senior PC for Uncertainty in Artificial Intelligence (UAI, 2016).

Teaching

Instructor for the course “Machine Learning Theory” (together with Ruth Urner) Eberhard Karls Universität Tübingen	2016 - 2017
Teaching Assistant for the course “Machine Learning” Skolkovo Institute of Science and Technology	2013 - 2013
Tutorials for the course “Machine Learning” Lomonosov Moscow State University	2012 - 2013
Tutorials for the course “Machine Learning” Moscow Institute of Physics and Technology	2011 - 2012

Things I'm doing outside of office

Workouts and health tracking: I workout regularly and collect all kind of data I can get about my body and health. In future I hope to run some algorithms on this data to see
if I can build a reasonable model of fitness / health. All the info is available in this text file (except hurt rate listings), which is regularly updated. Feel free to use it ;)