Discrete choice models are used extensively in many disciplines where it is important to predict human behavior at a disaggregate level. This course is a follow up of the online course “Introduction to Discrete Choice Models”. We have selected some important advanced topics, that are presented in detail.

Course Details

Language English
Duration 6 weeks
Effort 5-6 hrs/week


The logit model is the workhorse of choice modelers. But it has some limitations. In particular, some assumptions used to derive it may not be consistent with the behavioral reality. It may lead to erroneous forecast. We illustrated using the so-called ""red bud-blue bus"" paradox, and Multivariate Extre Value models, addressing some of these issues, are introduced.

The sampling procedure used to collect choice data has a critical impact on the model estimation procedure. We introduce classical sampling procedures, and analyze in details the implications for model estimation.

In our quest to address the limitations of the logit model, we introduce a new family of models, based on ""mixtures"". We define what mixtures are, how they can be calculated. We investigate several important modeling assumptions that they can cover.

Random utility relies on the rationality assumption for the decision-makers. We show that human beings are not always consistent with this assumption, and may exhibit apparent irrationality. Hybrid choice models are able to capture subjective dimensions of the choice process, using variables that are called ""latent variables"".

Choices evolve over time. Individuals learn, develop habits. In order to capture that, it is necessary to observe individuals over time, and to collect so-called ""panel data"". The introduction of the time dimension into choice models has some econometrics implications, that we describe in detail.

Who needs choice models, when machine learning algorithms are so powerful and pervasive? In this last chapter, we introduce the similarities and differences between machine learning and discrete choice, and we discuss some potential limitations of machine learning in the context of the analysis of choice data.


What you will learn


  • Multivariate Extreme Value models

  • Sampling issues

  • Mixtures

  • Latent variables

  • Panel data

  • Discrete choice and machine learning



Statistics, linear regression, Introduction to Discrete Choice Models.



The course is structured into 6 weeks of about one to two-hour lectures:

  • Week 1. Multivariate Extreme Value Models

  • Week 2. Sampling

  • Week 3. Mixtures

  • Week 4. Latent variables

  • Week 5. Panel data

  • Week 6. Machine learning


Course instructors

Michel Bierlaire

Michel Bierlaire holds a PhD in Mathematical Sciences from the University of Namur, Belgium. Between 1995 and 1998, he was research associate and project manager at the Intelligent Transportation Systems Program of the Massachusetts Institute of Technolog…

École polytechnique fédérale de Lausanne

Free online courses from École polytechnique fédérale de Lausanne

EPFL is the Swiss Federal Institute of Technology in Lausanne. The past decade has seen EPFL ascend to the very top of European institutions of science and technology: it is ranked #1 in E…

95 instructors