Last Updated: 20160126

AAS 227: Special Session

213. Lectures in AstroStatistics

Wednesday, January 6 2016

10:00am-11:30am // Osceola 5
| Description | Schedule | Contacts | changelog |


A functional literacy in AstroStatistics is becoming a necessity for astronomers who are confronted with high-quality datasets from modern instruments. New astronomical datasets pose unprecedented data analytic challenges with complex data that aim to improve our understanding of the Universe, provided that they are carefully analyzed and uncertainties are accounted for correctly. This requires descriptive science-driven statistical models and methods that relate our best underlying physical processes to observables. The field of AstroStatistics is at this intersection of observational Astronomy, Statistics, and data science. Our session is aimed at making Astronomers familiar with newer techniques that are becoming available, with the goal of expanding the analysis toolkit that is available to them. We therefore review basic methods, covering topics like the least-squares fitting, likelihoods, Machine Learning concepts that allow classification and clustering, and Bayesian analysis, in a series of three lectures by experts in the field.

The speakers also participated in informal discussions during the Topics in AstroStatistics Splinter Session later in the afternoon.


Chair: Aneta Siemiginowska (Harvard-Smithsonian Center for Astrophysics, Cambridge, MA)

10:00 am: Edward Robinson (University of Texas, Austin, TX)

The Likelihood Function and Likelihood Statistics

The likelihood function is a necessary component of Bayesian statistics but not of frequentist statistics. The likelihood function can, however, serve as the foundation for an attractive variant of frequentist statistics sometimes called likelihood statistics. We will first discuss the definition and meaning of the likelihood function, giving some examples of its use and abuse - most notably in the so-called prosecutor's fallacy. Maximum likelihood estimation is the aspect of likelihood statistics familiar to most people. When data points are known to have Gaussian probability distributions, maximum likelihood parameter estimation leads directly to least- squares estimation. When the data points have non-Gaussian distributions, least-squares estimation is no longer appropriate. We will show how the maximum likelihood principle leads to logical alternatives to least squares estimation for non-Gaussian distributions, taking the Poisson distribution as an example.

The likelihood ratio is the ratio of the likelihoods of, for example, two hypotheses or two parameters. Likelihood ratios can be treated much like un-normalized probability distributions, greatly extending the applicability and utility of likelihood statistics. Likelihood ratios are prone to the same complexities that afflict posterior probability distributions in Bayesian statistics. We will show how meaningful information can be extracted from likelihood ratios by the Laplace approximation, by marginalizing, or by Markov chain Monte Carlo sampling.

Presentation slides [.pdf]
Rob Robinson is the W.B. Blakemore II Regents Professor in Astronomy at the University of Texas at Austin. He is the author of a book on Data Analysis for Scientists and Engineers, which is being published by Princeton University Press. A draft of the book is available online at until the actual book comes out.

10:30 am: Tom Loredo (Cornell Center for Astrophysics and Planetary Science, Ithaca, NY)

From least squares to multilevel modeling: A graphical introduction to Bayesian inference

This tutorial presentation will introduce some of the key ideas and techniques involved in applying Bayesian methods to problems in astrostatistics. The focus will be on the big picture: understanding the foundations (interpreting probability, Bayes's theorem, the law of total probability and marginalization), making connections to traditional methods (propagation of errors, least squares, chi-squared, maximum likelihood, Monte Carlo simulation), and highlighting problems where a Bayesian approach can be particularly powerful (Poisson processes, density estimation and curve fitting with measurement error). The "graphical" component of the title reflects an emphasis on pictorial representations of some of the math, but also on the use of graphical models (multilevel or hierarchical models) for analyzing complex data. Code for some examples from the talk will be available to participants, in Python and in the Stan probabilistic programming language.

Presentation slides [.pdf]
Tom Loredo is a Senior Research Associate and Lecturer at Cornell University, and is affiliated with both CAS/PSU and SAMSI. He is also co-chair of the LSST Informatics and Statistics Science Collaboration.
A Jupyter notebook illustrating the use of Stan in a typical astronomy problem is available at the git repository
A non-Python, non-GitHub user can download the entire notebook via the "Download Zip" button, which contains an HTML file that shows the rendered notebook with more instructions on how the notebook could be used.

11:00 am: Jessi Cisewski (Yale University, New Haven, CT)

Topics in Machine Learning for Astronomers

As astronomical datasets continue to increase in size and complexity, innovative statistical and machine learning tools are required to address the scientific questions of interest in a computationally efficient manner. I will introduce some tools that astronomers can employ for such problems with a focus on clustering and classification techniques. I will introduce standard methods, but also get into more recent developments that may be of use to the astronomical community.

Presentation slides [.pdf]
Jessi Cisewski is an Assistant Professor at the Department of Statistics at Yale University, and is leading the Astrostatistics Interest Group at the American Statistical Association.
Lectures on various topics of statistics and astronomy can also be found at the recent Summer School in Statistics for Astronomers held at Penn State.


Vinay Kashyap (vkashyap @ cfa . harvard . edu)
Aneta Siemiginowska (asiemiginowska @ cfa . harvard . edu)


2015-dec-17: set up page.
2015-dec-29: typos.
2016-jan-20: added slides for Robinson lecture.
2016-jan-21: added slides for Cisewski lecture.
2016-jan-26: added slides for Loredo lecture.



Wed 6 Jan 2016
Osceola 5

E. Robinson
T. Loredo
J. Cisewski