Research

Below is a selection of research articles relating to the Automatic Statistician project.
The Automatic Statistician
Christian Steinruecken, Emma Smith, David Janz, James Lloyd, Zoubin Ghahramani
Automated Machine Learning (2019), Springer Series on Challenges in Machine Learning
pdf | web | bibtex
Scaling up the Automatic Statistician: Scalable Structure Discovery using Gaussian Processes
Hyunjik Kim, Yee Whye Teh
Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (2018)
pdf | supplementary pdf | arxiv | bibtex
Unsupervised automatic dataset repair
James Allingham
Master's Thesis. Computer Laboratory, University of Cambridge (2018).
pdf | bibtex
The Automatic Statistician: A Relational Perspective
Yunseong Hwang, Anh Tong, Jaesik Choi
ICML 2016: Proceedings of the 33rd International Conference on Machine Learning (2016).
pdf | supplementary pdf | arxiv | slides | bibtex-1 | bibtex-2
Statistical Model Criticism using Kernel Two Sample Tests
James Robert Lloyd, Zoubin Ghahramani
Advances in Neural Information Processing Systems 28 (2015)
pdf | code | bibtex
Automatic Construction and Natural-Language Description of Nonparametric Regression Models
James Robert Lloyd, David Duvenaud, Roger Grosse, Joshua B. Tenenbaum, Zoubin Ghahramani
Association for the Advancement of Artificial Intelligence (AAAI) Conference, 2014
pdf | code | examples | bibtex
Structure Discovery in Nonparametric Regression through Compositional Kernel Search
David Duvenaud, James Robert Lloyd, Roger Grosse, Joshua B. Tenenbaum, Zoubin Ghahramani
International Conference on Machine Learning, 2013
pdf | code | poster | bibtex
Exploiting compositionality to explore a large space of model structures
Roger Grosse, Ruslan Salakhutdinov, William T. Freeman, Joshua B. Tenenbaum
Proceedings of the 28th Conference in Uncertainty in Artificial Intelligence (2012)
pdf | slides | code | bibtex
The Automatic Statistician for Classification
Qiurui Charles He
Master's Thesis. Department of Engineering, University of Cambridge (2016).
code | bibtex | pdf (available on request) | slides (available on request)
Towards an Artificial Intelligence for Regression
Riaz Moola
Master's Thesis. Computer Laboratory, University of Cambridge (2015).
pdf | bibtex
Kernel Structure Discovery for Gaussian Process Classification
Nikola Mrkšić
Master's Thesis. Computer Laboratory, University of Cambridge (2014).
pdf | bibtex
The Automatic Statistician and Future Directions in Probabilistic Machine Learning
Zoubin Ghahramani
Presentation at the Machine Learning Summer School, Tübingen (2015)
slides | bibtex