Directories | UChicago
Department of Statistics
5734 S. University Avenue
Chicago, IL 60637
The University of Chicago
©

Website Inquiries






Course Announcements Archive Through 2009

Last revised: 2/18/09

SPRING 2009

STATISTICS 20000. Elementary Statistics.
Sec 01: Naiqing Gu, MWF, 9:30-10:20 AM, Eckhart 133.
Sec 02: Xiaohui Chang, MWF, 12:30-1:20 PM, Eckhart 133.
PQ: Students with credit for Stat 220000 or 23400 not admitted.
Required Reading: Freedman, Pisani, and Purves, Statistics, 4th edition, W. W. Norton Press. ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. NOTE: STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. This
course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000=HDCP 22050. Stat Meth And Applications.
Sec 01: Debashis Mondal, MWF, 10:30–11:20 AM, Eckhart 133.
Sec 02: Winfried Barta, MWF, 1:30-2:20 PM, Eckhart 133.
PQ: 2 qtrs Calculus. Students who matriculate in the College after August 2008 may count either Stat 22000 or 23400, but not both, toward the 42 credits required for graduation.
Required Reading: Moore and McCabe, Introduction to the Practice of Statistics, 5th edition, W. H. Freeman. ISBN-10: 0716764008, ISBN-13: 978-0716764007.

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22200. Linear Models And Exper Design.
Sec 01: Linda Collins, MWF, 11:30 AM–12:20 PM.
PQ: Stat 22000 or consent of instructor.
Required Reading: Oehlert, G. W. (2000). A First Course in Design and Analysis of Experiments, W. H. Freeman. ISBN-10: 0-7167-3510-5 ISBN-13: 978-0-7167-3510-6.

This course covers principles and techniques for the analysis of experimental data and the planning of the statistical aspects of experiments. Topics include linear models, analysis of variance, randomization, blocking, factorial designs, confounding, and incorporation of covariate information. In addition to regular homework assignments and exams, students will be required to complete a project involving the design and analysis of an experiment.

Back to top

STATISTICS 23400. Statistical Models/Method-1.
Sec 01: David Degras, MWF, 11:30 AM–12:20 PM, Eckhart 133.
Sec 02: David Degras, MWF, 2:30–3:20 PM, Eckhart 133.
PQ: Math 13300, 15300 or 16300. Students who matriculate in the College after August 2008 may count either Stat 22000 or 23400, but not both, toward the 42 credits required for graduation.
Required Reading: Rossman (2006). Investigating Statistical Concepts, Applications, and Methods by Chance, Duxbury (Thomson Brooks/Cole). ISBN-10: 0495050644, ISBN-13: 978-D495050643.
Tanis and Hogg (2007). A Brief Course in Mathematical Statistics, Prentice Hall, ISBN-10: 0131751395, ISBN-13: 978-0131751392.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, hypergeometric, Poisson, exponential, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for proportions and means for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data.
Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course. See http://www.stat.uchicago.edu/~stat234 for more detailed information. Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly
discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24500. Statistical Theory/Method-2.
Sec 01: Debashis Mondal, TTH, 1:30 PM–2:50 PM, Eckhart 133.
PQ Multivariate Calculus (Math 19520 or 20000 or equivalent) and Linear Algebra (Math 19620 or 25500 or Stat 24300 or equivalent).
Required Reading: Rice, John A. (2007). Mathematical Statistics and Data Analysis, 3rd edition, Duxbury. ISBN-10: 0534399428, ISBN-13: 978-0534399429.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics—tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10–12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 24600. Statistical Theory/Method-3.
Sec 01: Yali Amit, TT, 10:30–11:50 AM, Eckhart 133.
PQ: Stat 24500 or equivalent or consent of instructor.
Required Reading: Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning, Springer Verlag. ISBN-10: 0387310738, ISBN-13: 978-0387310732.

This course will introduce a variety of modern statistical methods. We will start with the description of families of multivariate models—multivariate normal distribution, graphical models, log-linear models. We will then discuss inference for such models including the EM algorithm, Bayesian inference and Monte-carlo methods. The main application of the models will be in classification problems—i.e the 'generative approach'. We will also introduce some modern discriminative approaches for classification.

Back to top

STATISTICS 25100. Intro to Math Probability.
Sec 01: Michael J. Wichura, TTH, 4:30–5:50 PM, Eckhart 133.
PQ: Math 20000 or Math 20500 or consent of instructor.
Required Reading: Ross, Sheldon (January 7, 2009). A First Course in Probability, 8th edition, Prentice Hall. ISBN-10: 013603313X, ISBN-13: 978-0136033134.

The aim of the course is to provide an introduction to the concepts of probability. The course will cover the basic ideas used to describe aspects of randomness, such as events, random variables, independence, and conditional probability, with emphasis on the methods, calculation, and applications of probability. The topics treated are: combinatorics; probability models; rules of probability; conditional probability; independence; random variables; expectation and standard deviation; games of chance; common discrete distributions—uniform, binomial, hypergeometric, geometric, negative binomial, and Poisson—and their interrelationships; univariate and multivariate density and distribution functions, change of variable formulas for densities, common continuous distributions—beta, Cauchy, chisquare, exponential, gamma, lognormal, normal, uniform—and their interrelationships; moment generating functions, laws of large numbers and the central limit theorem. For more information, please visit http://www.stat.uchicago.edu/~wichura/Stat251/courseinfo.html.

Back to top

STATISTICS 26100=STAT 33600. Time Dependent Data.
Sec 01: Mei Wang, TTH, 1:30–2:50 PM, Eckhart 133.
PQ Stat 22400 or Stat 24500 or consent of instructor.
Required Reading: Shumway and Stoffer (2006). Time Series Analysis and Its Applications: With R Examples, 2nd edition, Springer. ISBN-10: 0387293175, ISBN-13: 978-
0387293172.

This course considers the modeling and analysis of data that are ordered in time. The main focus will be on quantitative observations taken at evenly spaced intervals and will include both time-domain and spectral approaches. Time permitting, statistical approaches to other data types, such as multiple time series, categorical observations or point processes, will be considered.

Back to top

STATISTICS 26700=CHSS 32900, HIPS 25600, STAT 36700, History of Statistics.
Sec 01: Stephen M. Stigler, MWF, 9:30–10:20 AM, Eckhart 117.
PQ: A course in Statistics.
Required Reading: Stigler, Stephen M. (1990). The History of Statistics: The Measurement of Uncertainty Before 1900, Belknap Press of Harvard University Press. ISBN-10:
067440341X, ISBN-13: 978-0674403413. Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 30200. Mathematical Statistics-2.
Sec 01: Matthew Stephens, MW, 1:30–2:50 PM, Eckhart 117.
PQ: Stat 30100 or consent of instructor.
Required Reading: Berger, James O., Statistical Decision Theory and Bayesian Analysis, 2nd edition.

This course continues the development of Mathematical Statistics, with an emphasis on Bayesian inference. Key topics include Decision theory, The Likelihood principle, Foundations and philosophy of probability, Prior information, exchangeability, De Finetti's theorem, Bayesian inference (estimation and hypothesis testing), Bayesian computation, admissibility and Stein's paradox. The mathematical level will generally be at that of an easy advanced calculus course. We will assume familiarity with standard statistical distributions (e.g. Normal, Poisson, Binomial, Exponential), with the laws of probability, expectation, conditional expectation etc. Concepts will be illustrated mainly by instructive "toy" examples, where calculations can be done by hand. However, we will also study more complex, practical applications of Bayesian statistics. Although methods of computation will be discussed, the primary focus will be on concepts, and not on computation.

Back to top

STATISTICS 31300. Intro: Stochastic Processes-2.
Sec 01: Per A. Mykland, TTH, 10:30–11:50 AM, Eckhart 117.
PQ: Stat 31200 or consent of instructor.
Required Reading: Ross, S. (1996). Stochastic Processes, 2nd edition., Wiley.

This course is a continuation of Statistics 31300: Introduction to Stochastic Processes I. Topics to be discussed will include rates of convergence for Markov chains, introduction to MCMC, generating functions, Galton-Watson processes, continuous-time Markov chains, martingales, and Brownian motion. There will be weekly homework assignments, and midterm and final exams.

Back to top

STATISTICS 33200=HSTD 43200. Causal Inference.
Sec 01: Tyler VanderWeele, Thu, 3:00–5:50 PM, BSLC arr.
PQ Stat 22400-22600 or HSTD 32400-32700 or equivalent or consent of instructor.
Required Reading: None.

The course will be concerned with the process of drawing causal inferences from observational data in the biomedical and social sciences. The course will introduce a number of fundamental concepts in causal inference and cover methods of estimating causal effects for both point- and time-varying exposures. Concepts and methods that will be covered include: confounding, potential outcomes, propensity scores, causal directed acyclic graphs, inverse probability of treatment weighting, marginal structural models, mediation, interaction, sensitivity analysis, and instrumental variables.

Back to top

STATISTICS 33600=STATISTICS 26100. Time Dependent Data.
Sec 01: Mei Wang, TTH, 1:30 PM–2:50PM, Eckhart 202.
PQ: Stat 22400 or 24500 or consent of instructor.
Required Reading: Shumway and Stoffer (2006). Time Series Analysis and Its Applications: With R Examples, 2nd edition, Springer. ISBN-10: 0387293175, ISBN-13: 978-0387293172.

This course considers the modeling and analysis of data that are ordered in time. The main focus will be on quantitative observations taken at evenly spaced intervals and will include both time-domain and spectral approaches. Time permitting, statistical approaches to other data types, such as multiple time series, categorical observations or point processes, will be considered.

Back to top

STATISTICS 33620. Statistical Methods for Spatial-Temporal Processes.
Sec 01: Michael L. Stein, TTH, 9:00–10:20 AM, Eckhart 117.
PQ: Consent of instructor.
Required Reading: None.

This course will investigate emerging approaches in the statistical modeling and analysis of large spatial-temporal datasets, with a focus on examples from the atmospheric and environmental sciences. Some of the topics we might cover include various extensions of the Kalman filter, hierarchical Bayesian approaches that incorporate physical understanding of the underlying process and spectral methods for spatial-temporal processes. Students will be expected to read papers in the current literature and discuss them in class. Classwork will include code development, likely done in groups and perhaps building on existing software.

Back to top

STATISTICS 33700=BUSF 41914, ECON 31500. Multivariate Time Series Analylsis.
Sec 01: Ruey-Shiong Tsay, 8:30–11:30 AM.
PQ Business 41910=Stat 33700 and Econ 31500.
Required Reading: None.
Course website: http://faculty.chicagobooth.edu/ruey.tsay/teaching/mts
A useful web site for U.S. data: Fed. Res. at St Louis: http://research.stlouisfed.org/fred2/

Course Objective:

    1. To study the basic theory of multivariate processes
    2. To gain experience in analyzing multivariate time series data
    3. To learn multivariate time series models, including vector AR and ARMA models with exogenous variables
    4. To understand co-integration and error-correction models
    5. To study structural specification of a vector process
    6. To learn state-space models and Kalman filter.

No textbook is assigned. Lecture Notes available on course web.

Grading:

Mid-term (40%), Final Exam (40%), and homework assignments (20%).

Computing:

The key software packages are SCA and S-Plus, but you may use any software of your choice.

Course Outline: All topics include applications

    1. Transfer function models
    2. Stationary vector autoregressive and moving average models
    3. Estimation, modeling, and forecasting
    4. Unit-root, co-integration and error-correction models
    5. Multivariate Seasonal models
    6. Structural specification
    7. State-space model and Kalman filter
    8. Multivariate volatility models if time permits.

Back to top

STATISTICS 34700. Generalized Linear Models.
Sec 01: Peter McCullagh, TTH, 3:00–4:20 PM, Eckhart 133.
PQ: Stat 34300 or consent of instructor.
Required Reading: Venables & Ripley, Modern Applied Statistics w/S, 4th edition, Springer. ISBN: 0-387-
95457-0.
Cox & Snell (2000). Applied Statistics, Chapman & Hall/CRC. ISBN: 0-412-16570-8.

This is an applied course for students who are familiar with linear models at the level of Draper and Smith or Weisberg. The following topics will be covered:

  • Factors, variates, contrasts, interactions
  • Exponential-family models: variance function
  • Definition of a generalized linear model: link functions
  • Analysis of deviance
  • Specific examples of GLMs
  • Logistic and probit regression
  • Cumulative logistic models
  • Log-linear models and contingency tables
  • Inverse linear models
  • Quasi-likelihood and least squares; estimating functions
  • Over-dispersion
  • Partially linear models.

Back to top

STATISTICS 35500. Statistical Genetics.
Sec 01: Mary Sara McPeek, Fri, 1:30–4:00 PM, Eckhart 117.
PQ: Human Genetics 47100, Stat 24400-24500 or equivalent recommended. Students w/o this background should consult instructor.
Required Reading: There is no textbook.

This is an advanced course in statistical genetics. It is recommended that students have either Human Genetics 471 or Stat 244 and 245 as prerequisites. This is a discussion course and student presentations will be required. Topics vary and may include, but are not limited to, statistical problems in association mapping, linkage mapping, population genetics, microarray analysis, and genetic models for complex traits.

Back to top

STATISTICS 36700=STAT 26700, CHSS 32900, HIPS 25600. History of Statistics.
Sec 01: Stephen M. Stigler, MWF, 9:30–10:20AM, Eckhart 117.
PQ: A course in Statistics.
Required Reading: Stigler, Stephen M. (1990). The History of Statistics: The Measurement of Uncertainty Before 1900, Belknap Press of Harvard University Press. ISBN-10: 067440341X, ISBN-13: 978-0674403413. Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 37710=CMSC 35400. Machine Learning.
Sec 01: Partha Niyogi, TTH, 12:00–1:20PM, Ryerson 276.
PQ: CMSC 25000 or 35000 or consent of instructor.

Back to top

STATISTICS 38500=MATH 38500. Advanced Topics: Probability.
Sec 01: Steven Lalley, TTH, 3:30–4:20 PM, Eckhart 117.
PQ:
Required Reading None.

Topics for Statistics 38500 will include:

  • Continuous-time martingales
  • Brownian motion
  • Levy processes
  • Ito integral and stochastic calculus
  • Stochastic differential equations and diffusions

Back to top

STATISTICS 43800. Statistical Inference for Financial Data.
Sec 01: Per A. Mykland, TTH, 1:30–2:50 PM, Eckhart 117.
Note Class starts May 5 (1/2 qtr).
PQ Stat 49210 or equivalent.
Required Reading: None.

This is a course in the analysis of high frequency, tick-by-tick data. The application is mainly in finance, but the course may also be of interest to other applications with high frequency data, such as neural science. We cover the estimation of volatility and related quantities, such as leverage effect and realized regressions, including under microstructure noise. Tools include process convergence and contiguity, with and without microstructure noise.

Back to top

STATISTICS 49210=ECON 31401. Learning, Filtering and Pricing.
Sec 01: Lars P. Hansen, TTH, 1:30–2:50 PM, Eckhart 117.
Note This class meets 5 weeks only. Class starts on March 31.
PQ: STAT 39000 or equivalent.
Required Reading: TBA

This is a five-week course on structural economic models of asset prices. It will develop and apply martingale and statistical methods to contrast implications of alternative models. In additional it will explore the impact of learning and model uncertainty on prices in decentralized asset markets.

Back to top

WINTER 2009

College Courses

STATISTICS 20000.  Elementary Statistics.
Sec 01: Wei Biao Wu, MWF 9:30-10:20 AM, Eckhart 133.
PQ:  Math 10600 or equivalent.
Required reading:  Freedman, Pisani, and Purves, Statistics, 4th edition, W. W. Norton Press. ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. NOTE: STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. This course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000. Statistical Methods and Their Applications.
Sec 01:  Peter McCullagh, MWF 10:30-11:20 AM, Eckhart 133.
PQ:  2 QTRS Calculus.
Required reading:  Moore and McCabe, Introduction to the Practice of Statistics, 5th edition. W. H. Freeman.  ISBN-10: 0716764008, ISBN-13: 978-0716764007.

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22600=HSTD 32600. Analysis of Categorical Data.
Sec 01: Mei Wang, TTH, 1:30-2:50 PM, Eckhart 133.
PQ: STAT 22000 or equivalent.
Required reading: Agresti, A. (2007). An introduction to Categorical Data Analysis, 2nd edition, Wiley.

It is expected that the students have a good understanding of basic descriptive statistics such as means, variances and expectation, of the inferential notions of estimate, confidence intervals and significance or hypothesis testing. Familiarity with one statistical package, e.g. R, Splus, SAS, SPSS, Stata or Minitab, and ability to access Web sites and to download files from the Web are required. The free statistical package R will be used in this course for Winter 2007.

This course is an introduction to the theory and applications of statistical methods for investigating the relationships among discrete variables. The course will present methods for analyzing categorical data, including standard methods for contingency tables such as odds ratios,
tests of independence and various measures of association, generalized linear models for binary data and count data, logistic regression for binomial data, loglinear models for Poisson data, and models for paired samples with categorical responses. The statistical techniques discussed will be presented by many real examples involving physical, biological and social science data.

Back to top

STATISTICS 22700=HSTD32700. Biostatistical Methods.
Sec 01: Ronald A. Thisted, TTH, 10:30-11:50 AM, BSLC room to be announced.
PQ: HSTD 32400/STAT 22400 or STAT 24500 or equivalent; or consent of instructor.
Required Reading: Collett, D. (2003). Modelling Binary Data, 2nd edition, Boca Raton, Chapman & Hall/CRC.
Collett, D. (2004). Modelling Survival Data in Medical Research, 2nd edition, London, Chapman & Hall.

This course is designed to provide students with tools for analyzing categorical, count, and time-to-event data frequently encountered in medicine, public health and related biological and social sciences. The course will emphasize application of methods rather than statistical theory, including recognition of the appropriate methods, interpretation and presentation of results. Methods covered include: contingency table analysis, logistic regression, log-linear (Poisson) regression, conditional logistic regression, regression methods for ordinal data, Kaplan-Meier survival curves, parametric survival models, and Cox proportional-hazards survival analysis.

Back to top

STATISTICS 23400. Statistical Models and Methods.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Nina Singhal Hinrichs, TTH, 9:00-10:20 AM,Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300.
Required Reading: Chance and Rossman (2005). Investigating Statistical Concepts, Applications, and Methods, 1st edition. Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.
Tanis and Hogg (2008). A Brief Course in Mathematical Statistics, Pearson/Prentice Hall, ISBN: 0-1317-5139-5.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, hypergeometric, Poisson, exponential, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for proportions and means for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course. See http://www.stat.uchicago.edu/~stat234 for more detailed information.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24300. Numerical Linear Algebra.
Sec 01: Lynne Butler, MW, 3:00-4:20 PM, Pick 016.
PQ:  Multivariate Calculus (MATH 19520 or 20000 or equivalent).
Required Reading: Strang, Gilbert (2006). Linear Algebra and Its Applications, 4th edition, Brooks/Cole.

In this course students learn the ideas and methods of linear algebra by understanding them geometrically, justifying them algebraically, and using them to solve problems in various disciplines. The ideas and methods covered are used in advanced courses on quantum mechanics, econometrics, and linear statistical models. Topics covered include: Gaussian elimination; vector spaces and orthogonality; eigenvectors and eigenvalues; diagonalization of real symmetric (and complex Hermitian) matrices; the spectral theorem; QR, Cholesky and Singular Value Decompositions.

This is a first course in linear algebra but, unlike matrix algebra courses, it prepares students for graduate work in statistics, physical chemistry, physics and economics.

Back to top

STATISTICS 24400. Statistical Theory and Methods I.
Sec 01: Stephen Stigler, TTH, 10:30-11:50 AM, Eckhart 133.
PQ:  MATH 19600, 20100, or 20500.
Recommended reading: Detailed notes will be available in the first quarter. The text is Rice, John A., Mathematical Statistics and Data Analysis, 3rd edition, Duxbury.

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the basics -- tools from probability and the elements of statistical theory. Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology. Examples will be drawn from the social, physical, and biological sciences. There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited and brief, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation. The computer will be used in the second quarter.

Back to top

STATISTICS 24500. Statistical Theory and Methods II.
Sec 01:  Wei Biao Wu,  TTH, 1:30-2:50 PM, Eckhart 133.
PQ:  STAT 24400 or consent of instructor.
Required reading: Rice, J. A. (2006). Mathematical Statistics and Data Analysis, 3rd edition, Brooks/Cole.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 24700=CPNS 32100. Math/Stats Methods for Neuroscience-2.
Sec 01:  William Van Drongelen, WF, 1:30-2:50 PM, BSLC 401.
PQ:  STAT 24400 or consent of instructor.
Required reading: Rice, J. A. (1995). Mathematical Statistics and Data Analysis, 2nd edition, Duxbury.

This course deals with the application of non-linear methods in signal processing and dynamical systems theory to issues in neuroscience. Data analysis with Matlab is again emphasized.

The third course in this sequence is an elective course in one of the quantitative sciences relevant to neuroscience that can be selected by the student in consultation with the program chair.

Back to top

CANCELLED
STATISTICS 25200=STAT 31200.  Introduction to Stochastic Processes I.

Back to top

STATISTICS 25300=STAT 31700. Introduction to Probability Models.
Sec 01:  Mei Wang, TTH, 9:00-10:20 AM,  Eckhart 117.
PQ:  STAT 25100 or STAT 24400 or equivalent. Consent of instructor.
Required reading: Ross, R., (2007). Introduction to Probability Models, 9th edition.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Another appropriate title for the course could be "an Introduction to Applied Stochastic Processes." Following a very brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

Graduate Courses

STATISTICS 30100. Mathematical Statistics I.
Sec 01:  Mary Sara McPeek, TTH, 1:30-2:50 PM,  Eckhart 117.
PQ:  STAT 30400 or consent of instructor.
Required Reading: Casella and Berger, Statistical Inference, 2nd edition.
Ferguson, A Course in Large Sample Theory.

This course is part of a two-quarter sequence on the theory of statistics. Topics will include exponential families, quadratic forms of multivariate normal, asymptotics of order statistics, sufficiency and completeness, the likelihood function, methods of point estimation, and asymptotic properties of maximum likelihood estimates. Other topics (e.g. Bayesian methods and methods for dependent observations) may be covered if time permits.

Back to top

STATISTICS 31200.  Introduction to Stochastic Processes 1.
Sec 01:  Steven P. Lalley, TTH, 10:30-11:50 AM, Eckhart 117.
PQ:  STAT 25100 or consent of instructor.
Required reading: Ross, S. (1996). Stochastic Processes, 2nd edition, Wiley.

Stochastic processes provide models for random events that evolve in time and may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chains, renewal theory, random walks, queueing and branching processes.

Back to top

STATISTICS 31700=STAT 25300.   Introduction to Probability Models.
Sec 01:  Mei Wang, TTh, 9:00-10:20 AM,  Eckhart 117.
PQ:  STAT 24400 or STAT 25100.
Required reading: Ross, R., (2007). Introduction to Probability Models, 9th edition.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Another appropriate title for the course could be "an Introduction to Applied Stochastic Processes." Following a very brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 32500=GSBC 41902.  Statistical Inference.
Sec 01:  Timothy Conley, Mon, 1:30-4:30 PM, GSB to be announced.
PQ:  Business 41901=STAT 32500
Required reading: DeGroot and Scherviah, Probability and Statistics. Lecture notes will be provided in the form of a CoursePack.

This Ph.D.-level course is the second in a two-quarter sequence with Business 41901. The central topic is statistical inference. The topics covered include Bayesian inference, classical estimation, decision theory, MCMC methods and Hierarchical models. The use of Hierarchical models is a focus in applications.

Back to top

STATISTICS 32600=GSBC 37904=ECON 31601. Marketing Topics.
Sec 01: Peter Rossi, MW, 5:00-6:20 PM, GSB to be announced.

Back to top

STATISTICS 33900=FINM 33100.  Financial Data Analysis.
Sec 01: Staff, Mon, 6:30-9:30 PM, Ryerson 251.
PQ: Math Finance & Stat Students only; Crosslisted with STAT 33900.
Recommended textbooks:
Applied Linear Regression (3rd edition) by Sanford Weisberg (Wiley)
Time Series: Applications to Finance by Ngai Hang Chan (Wiley)
Statistical Analysis of Financial Data in S-Plus by Rene A. Carmona (Springer-Verlag)
Mathematical Statistics and Data Analysis by John A. Rice (Duxbury)
The Basics of S-plus, by Andreas Krause and Melvin Olson (Springer-Verlag)

Note that only part of each book will be used in the course. However, in the course of your career, you will find all of them useful to have on your shelf for further reading and reference.

Back to top

STATISTICS 34500.  Design and Analysis of Experiments.
Sec 01:  Michael L. Stein, MW, 1:30-2:50 PM, Eckhart 133.
PQ: STAT 34300
Required Reading:  Mead, R., The Design of Experiments, Cambridge.
Recommended Reading: West, B. D., Welch, K. B. and Galecki, A. T., Linear Mixed Models., Chapman & Hall/CRC.

An introduction to the methodology and application of linear models in experimental design. A major focus of the course will be the basic principles of experimental design, such as blocking, randomization and incomplete layouts. Both standard designs, such as fractional factorials and incomplete block designs, as well as nonstandard designs, will be studied within this context. The analysis of these experiments will be developed as well, with particular emphasis on careful model formulation and the role of fixed and random effects. Time permitting, additional topics may include the use of covariates in the analysis of designed experiments, spatial analysis of field trials and Bayesian approaches to analysis of experimental data.

Course work will include the planning, execution and analysis of an experiment by the class.

Back to top

STATISTICS 34920. Introduction to Nonparametric Regression.
Sec 01:  David Degras, TTh, 10:30-11:50 AM, Room to be announced.
PQ: STAT 22400 or STAT 24500 or STAT 34300 or consent of instructor.
Required reading:   Bowman, Adrian W. and Adelchi, Azzalini (1997). Applied Smoothing Techniques for Data Analysis, Oxford Science Publications.

This course provides an introduction to nonparametric regression models and methods. Theoretical and practical aspects will be presented, among which the construction of nonparametric estimators (kernel, spline, projection, local polynomial), the selection of smoothing parameters, some asymptotic theory (consistency, convergence rates, asymptotic distribution), and the construction of confidence regions. According to time left and to students' interests, further topics such as minimax estimation or adaptive estimation may be addressed. The methods will be implemented and illustrated in the R language environment.

Back to top

STATISTICS 35700=HSTD 31001.  Epidemiologic Methods.
Sec 01:  Diane Lauderdale, TTh, 12:00-1:20 PM, BH W229.
PQ: HSTD 30900 or consent of instructor.
Required reading:

This course expands on the material presented in "Principles of Epidemiology," further exploring issues in the conduct of epidemiologic studies. The student will learn the application of both stratified and multivariate methods to the analysis of epidemiologic data. The final project will be to write the "specific aims" and "methods" sections of a research proposal on a topic of the student's choice.

Back to top

STATISTICS 37000.  Algebraic Methods in Statistics.
Sec 01:  Mathias Drton, MWF, 10:30-11:20 AM, Eckhart 117.
PQ:
Required reading: Drton, M. Sturmfels, B., and Sullivant, S., Lectures on Algebraic Statistics.

Topics will include:

  • Markov bases for contingency table analysis
  • Likelihood ratio tests
  • Conditional independence models
  • Hidden variable models
  • Bayesian marginal likelihood integrals

Back to top

STATISTICS 38300.  Measure-Theoretic Probability-III.
Sec 01:  Michael J. Wichura, MWF, 12:30-1:20 PM, Eckhart 117.
PQ: STAT 38100 or consent of instructor.
Required reading: No text book is required; notes will be distributed in class.

Topics for Stat 38300 will include:

  • The Hahn and Jordan decomposition theorems
  • Modes of convergence: with probability one, in probability, and in mean; uniform integrability
  • L2-spaces: projections; representation of linear functionals
  • The Radon-Nikodym theorem: absolute continuity, Radon-Nikodym derivatives; likelihood ratios; Lebesgue decompositions
  • Conditional probability: regular conditional probability distributions
  • Conditional expectation: given sub-sigma fields, and given measurable functions
  • Martingales: definitions and examples, transformations
  • Stopping times; optional sampling
  • Martingale limit and closure theorems
  • Backward submartingales
  • Continuous-time martingales: convergence, closure, optional sampling

Back to top

STATISTICS 47630.  Monte Carlo Methods.
Sec 01:  Steven P. Lalley, TTh, 1:30-2:50 PM, Eckhart 117.
PQ: STAT 24600 or STAT 31200 or consent of instructor. Stat 47620 will meet the first 5 weeks of the term. Course runs first 5 weeks of quarter
Recommended reading: Liu, Jun, Monte Carlo Strategies in Scientific Computing. Papers to be announced later.

This will be a brief introduction to several useful techniques of simulation:

  • importance sampling and sequential importance sampling
  • MCMC (Markov chain Monte Carlo)
  • Gibbs sampling
  • perfect sampling

The utility of these methods will be illustrated by a number of substantial examples, including

  • self-avoiding random walks
  • enumeration of contingency tables with fixed margins
  • simulation of Ising models
  • code-breaking

Back to top

STATISTICS 47900. Stochastic Models for Memory and Learning.
Sec 01: Yali Amit , TTh, 3:00-4:20 PM, Eckhart 117. Course begins 6th week.
PQ: Consent of instructor.
Required reading: None

This 5 week course will cover a some of the literature analyzing learning and memory in large neuronal populations as stochastic processes. First we will discuss models with discrete time dynamics, discrete binary neurons and finite state synapses, and derive bounds on memory capacity, learning and forgetting times. This will only involve discrete time Markov chain analysis and some ideas from mean-field analysis. Second we will introduce continuous integrate and fire neurons and continuous time dynamics. Using mean-field methods we will analyze the stability of large networks with random connections, and the behavior of the networks after learning. People interested in probability will be exposed to a rich collection of stochastic models waiting to be analyzed in a rigorous mathematical framework (the mean field analysis is only approximate.) People interested in neuroscience will be exposed to interesting models and some intriguing connections to experimental data.

Back to top

AUTUMN 2008


College Courses

STATISTICS 20000.  Elementary Statistics.
Sec 01: Keith Worsley, MWF 9:30-10:20 AM, Eckhart 133.
Sec 02: Phillip Lynch, MWF 12:30-1:20 PM, Eckhart 133.
PQ: Math 10600 or equivalent.
Required reading: Freedman, Pisani and Purves (2007). Statistics, 4th edition, Norton. ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. Not open to students with credit for STAT 22000 or 23400.

This course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000=HDCP 22050.  Stat Meth And Applications.
Sec 01: Han Xiao, MWF 10:30-11:20 AM, Eckhart 133.
Sec 02: Xiaoquan Wen, MWF 1:30-2:20 PM, Eckhart 133.
PQ: 2 QTRS Calculus.
Required reading: Moore, McCabe and Craig (2009). Introduction to the Practice of Statistics, 6th edition, W. H. Freeman. ISBN-10: 1429216220 ISBN-13: 978-1429216227

Students who matriculate in the College after September 2008 may count either STAT 22000 or STAT 23400, but not both, toward the forty-two credits required for graduation.

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22400=HSTD 32400. Applied Regression Analysis.
Sec 01:  Vanja Dukic, TTh 10:30-11:50 AM, Eckhart 133
PQ: HSTD 32700 or STAT 22000 or STAT 23400 or STAT 24400 or consent of instructor.
Required reading: Chatterjee, Samprit and Hadi, Ali S., Regression Analysis by Example, 4th edition.

This course is an introduction to the methods and applications of fitting and interpreting multiple regression models. The main emphasis is on the method of least squares. Topics include the examination of residuals, the transformation of data, strategies and criteria for the selection of a regression equation, the use of dummy variables, and tests of fit. The techniques discussed will be illustrated by many real examples involving biological and social science data. Examples and exercises will be implemented in a statistical software package "Stata", but familiarity with Stata is not required.

Back to top

STATISTICS 23400. Statistical Models and Methods.
Sec 01: Linda Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: David Degras, TTh, 3:00-4:20 PM, Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300, or equivalent.
Required Reading: Buntinas and Funk (2005). Statistics for the Sciences, Duxbury (Thomson Brooks/Cole). ISBN-10: 0534387748 ISBN-13: 978-0534387747.

Students who matriculate in the College after September 2008 may count either STAT 22000 or STAT 23400, but not both, toward the forty-two credits required for graduation.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, hypergeometric, Poisson, exponential, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for proportions and means for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Students with AP Calculus credit for any of these prerequisite courses may also enroll. Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course. See http://www.stat.uchicago.edu/~stat234 for more detailed information.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19520-19620 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24400. Statistical Theory/Method-1.
Sec 01: Mathias Drton, TTH, 1:30-2:50 PM, Eckhart 133
Sec 02: Michael Stein, MW, 1:30-2:50 PM, Stuart 101
PQ:  Multivariate Calculus (Math 19520 or 20000 or equivalent and Linear Algebra (Math 19620), 25500 or Stat 24300 or equivalent).
Required Reading: Rice, John A. (2007). Mathematical Statistics and Data Analysis, 3rd edition, Duxbury.

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the essential tools from probability needed for study of statistical theory and the basic elements of statistical theory.

Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology.

There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation.

The mathematics prerequisites are listed as general guidance. You should be comfortable with multivariate calculus through partial differentiation and multiple integration. Linear algebra is generally used only in 24500 and not 24400.

Back to top

Graduate Courses

STATISTICS 30400.  Distribution Theory.
Sec 01: Dan Nicolae, MWF, 1:30-2:20 PM, Eckhart 117
PQ: STAT 24500 or MATH 25000 or equivalent
Recommended reading: Severini, T. (2005). Elements of Distribution Theory, Cambridge University Press.

This course covers the basics of distribution theory. Topics include:

  • Distribution functions and their inverses, quantile functions, Q/Q plots, change-of-variables for probability densities
  • Expectation, variance, median, mode of random variables
    Basics of measure theory, including Fubini's theorem and interchangeability of limits and integrals
  • Moment generating functions and characteristic functions, including power series expansion, inversion formulas, uniqueness theorems, and convergence in distribution
  • Cumulants and cumulant generating functions: examples, properties
  • Different concepts of convergence for random variables
  • Limit theorems, including the weak law of large numbers, the central limit theorem.

Back to top

STATISTICS 30700=CMSC 37800.  Numerical Computation.
Sec 01: Ronald Thisted, TTh, 9:00-10:20 AM, Eckhart 117
PQ: Linear Algebra (Stat 34300 or equivalent) and some previous experience with Statistics.
Required reading: Thisted, Ronald A. Elements of Statistical Computation, CRC/Chapman & Hall.
Recommended, but not required:  

  • Gentle, James. Random Number Generation and Monte Carlo Methods, 2nd edition,    Springer.
  • Watkins, David S. Fundamentals of Matrix Computations, 2nd edition, Wiley.
  • Scheinerman, Edward. C++ for Mathematicians. CRC/Chapman and Hall.

This course starts with a presentation of the fundamental algorithms for the solution of linear equations, the decomposition of matrices, and finite dimensional eigenvalue problems. Applications to least squares/regression will be presented, emphasizing use of existing numerical software. The course will also discuss optimization problems and introduce the basic principles of simulation-based methods.

Topics include:

  • Gaussian elimination and back-substitution
  • LU decomposition. (General/Symmetric)
  • Singular value decomposition. (Symmetric)
  • Householder orthogonalization and QR factorization. (Symmetric).
  • Iterative methods: Jacobi and Gauss Seidel.
  • Optimization: Newton-Raphson and quasi-Newton.
  • Uniform random number generation.
  • Simulating specific distributions
  • Monte Carlo methods

By the end of the course students should be able to apply these algorithms in their research work.

Backto top

STATISTICS 32300=HSTD 43000. Bayesian Methods and Computation.
Sec 01:  Vanja Dukic, TTh, 3:00-4:20 PM, Eckhart 117.
PQ:  STAT 301-302; 343; 312-313; OR consent of instructor.
Required reading: Tanner. Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 3rd edition, Springer.

This course will cover basics of modern statistical computation, with emphasis on Bayesian computational methods. It will begin with the introduction to Bayesian statistics, and cover normal and non-normal approximation to likelihood and posterior distributions, the EM algorithm, data augmentation and Markov Chain Monte Carlo (MCMC) methods. Time permitting, we will conclude with some recent developments in the MCMC area, such as perfect and adaptive sampling methods. Biostatistical and environmental examples will be given throughout the course. There will be weekly homeworks, and students will be expected to complete a project by the end of the course. There will be no final exam, but there will be an in-class final project presentation. Algorithms can be implemented in any language, but familiarity with R and Matlab will be assumed.

Back to top

STATISTICS 33100.  Sample Surveys.
Sect 01: Kirk Wolter, TTh, 10:30-11:50 AM, Eckhart 117
PQ: Consent of instructor
Required reading: Wolter, K.M. (2007). Introduction to Variance Estimation, 2nd edition, Springer-Verlag, New York.
Cochran, W.G. (1977). Sampling Techniques, 3rd edition, John Wiley & Sons, New York.

This is an introductory course to the statistics and methodology of sample surveys. Topics include

  • basic methods of sample selection,
  • determining sample size, stratification,
  • general estimators (Horvitz-Thompson, ratio, generalized regression, calibration)
  • domain estimation,
  • nonresponse,
  • nonsampling error,
  • multiple-stage sampling,
  • a national sampling frame for area probability surveys,
  • telephone surveys,
  • questionnaire design,
  • variance estimation for complex surveys,
  • analysis of contingency tables, and
  • regression analysis for survey data.

The course will be of interest to students who anticipate a research career that designs, collects, and analyzes survey data in fields such as economics, education, healthcare, marketing, psychology, sociology, and statistics.

Back to top

STATISTICS 33500=GSBC 41910. Time-Series Analysis/Forecast.
Sec 01:  Ruey-Shiong Tsay, Thu, 8:30-11:30 AM, GSB HC3B
PQ:  Business 41901 or consent of instructor.
Required reading: No textbook. Some selected reference books are given below:

  • Hamilton, J. (1994). Time Series Analysis, Princeton University Press.
  • Tsay, Ruey S. (2005). Analysis of Financial Time Series, 2nd edition, Wiley.
  • A Course in Time Series Analysis, ed. Pena, Tiao and Tsay, Wiley, 2001.

GSB Honor Code: This course requires students to follow the GSB Honor Code and Standards of Scholarship in examinations and homework assignments. The GSB Honor Code requires students to sign the following pledge, "I pledge my honor that I have not violated the Honor Code during this examination," on every examination.

Course Ob jective:

  • To introduce time series analysis for econometric and financial applications
  • To discuss time series forecasting
  • To gain experience in model building
  • To assess the impacts of interventions and outliers
  • To understand state-space models and Kalman filter
  • To study MCMC methods and their applications in time series analysis
  • To discuss unit-root theory, trend-stationarity, testing and applications.

Lecture: Thursdays 8:30 to 11:30 am, starting September 25
Lecture Notes:
Outline of the lecture and some supplementary material will be posted on the web:
http://faculty.chicagogsb.edu/ruey.tsay/teaching/uts/.

Back to top

STATISTICS 33610. Asymptotics for Time Series.
Sec 01:  Wei Biao Wu, MW, 1:30-2:50 PM, Eckhart 117.
PQ:  Business 30200 and STAT 31300 or consent of instructorf instructor.
Required reading:
Fan, J. and Yao, Q., (2003). Nonlinear time series, nonparametric and parametric methods. Springer, New York.
Wu, W. B. (2005): Nonlinear system theory: Another look at dependence. Proceedings of the National Academy of Sciences USA. 102, 14150--14154.
Wu, W. B. and Zhibiao Zhao (2007) Inference of Trends in Time Series. Journal of the Royal Statistical Society, Series B, 69, 391--410

I will present a systematic asymptotic theory for time series analysis. In particular, I will discuss asymptotics for sample mean, sample variances, banded covariance matrices estimates, inference of trends, periodograms, spectral density estimates, quantile estimation, nonparametric estimates, VaR and long-range dependent processes. Some asymptotic theory for non-stationary processes and functional linear models will also be presented.

Back to top

STATISTICS 34300.  Applied Linear Stat Methods.
Sec 01: Mathias Drton, TTh, 9:00-10:20 AM, Eckhart 133
PQ: STAT 24500 or equivalent and Linear Algebra (Stat 24300 or equivalent).
Optional Reading: Venables, W.N. and Ripley, B.D. (1999). Modern Applied Statistics with S-Plus (3rd ed). Springer-Verlag.
Required Reading: Weisberg, S. (2005). Applied Linear Regression, Third Edition. John Wiley & Sons. Software: Splus or R.

Statistics 34300 is an intensive course in the theory and methods of linear regression and related techniques of statistical modelling. It is intended primarily for graduate students in Statistics and related fields.

The course is also open to undergraduates and others who have a solid understanding of matrix algebra and basic statistical theory. Thorough familiarity with the simple linear regression model is expected.

The course will review linear regression with a single predictor, and will cover the multiple-predictor case; least-squares estimation; associated distribution theory; estimation, confidence intervals and tests; regression with errors in the predictors; weighted least squares, assessing lack of fit; residual analysis; regression diagnostics; transformations; model building; collinearity; subset-selection methods, including stepwise regression; prediction; nonlinear least squares.

Back to top

STATISTICS 35000=HSTD 30900, ENST 27400, PPHA 36400.  Principles of Epidemiology.
Sec 01: Kurina Lianne , MW, 10:30-11:50 AM, BSLC arr.
PQ: Introductory Statistics
Required reading:
Gordis, Leon. Epidemiology, 4th Ed. 2008. Saunders (available at the Barnes and Noble campus bookstore or at Amazon.com)
Original epidemiological articles (available on course website)

Epidemiology is the study of the distribution and determinants of health and disease in human populations. This course introduces the basic principles of epidemiologic study design, analysis and interpretation through lectures, assignments, and critical appraisal of both classic and contemporary research articles.

OBJECTIVES

  • To be able to critically read and understand epidemiologic studies.
  • To be able to calculate and interpret measures of disease occurrence and measures of disease-exposure associations.
  • To understand the contributions of epidemiology to clinical research, medicine, and public health.

Back to top

STATISTICS 35201=HSTD 32901. Intro to Clinical Trials.
Sec 01: James Dignam, Tu, 3:00-5:50 PM, BH W230. Other ‘field trips’ as discussed below, will be recommended as additions or substitutes for class meetings.
PQ: Introductory statistics course (HSTD 32100, STAT 22000, or similar), ability to use a personal computer, or, permission of instructor.
Required reading: Piantadosi, S. Clinical Trials: A Methodologic Perspective, 2nd Edition 2005, Wiley Interscience, New York.
Reference: Friedman, L.M., Furberg, C.D., DeMets, D.L. Fundamentals of Clinical Trials., 3rd ed., 1998, Springer, New York.
Cook, T.D., DeMets, D.L. Introduction to Statistical Methods for Clinical Trials, 2007, Chapman & Hall/CRC, London.
Other readings: Selected articles for class discussion will be identified and distributed the prior week. A reading list of these methodological articles, commentaries, and clinical trial report articles will be developed and distributed.

This course will review major components of clinical trial conduct, including the formulation of clinical hypotheses and study endpoints, trial design, development of the research protocol, trial progress monitoring, analysis, and the summary and reporting of results. Other aspects of clinical trials to be discussed include ethical and regulatory issues in human sub jects research, data quality control, meta-analytic overviews and consensus in treatment strategy resulting from clinical trials, and the broader impact of clinical trials on public health.

The course will be conducted partly via lectures and partly in a ‘reading’ format. Designated individuals may take the lead in covering a main topic, with participation and input by all. Similarly, presentation of materials in the special topics portion of the class meeting will be shared among all.

Back to top

STATISTICS 35581.  Statistical Methods and Gene Networks.
Sec 01: Matthew Stephens, TTh, 1:30-2:50 PM, Eckhart 117.
PQ: STAT 24400 or consent of instructor.
Required reading: No textbook required.

This is an advanced graduate seminar-style course in statistical genetics. We will read and critique published papers that use statistical methods to infer gene networks. We expect to cover a range of different types of data types (eg gene expression data, CHIP-chip data, literature mining) and statistical approaches (eg Bayes Networks, similarity measures), with an emphasis on understanding the details of the statistical approach used. Students will be expected to be familiar and facile with a range of statistical concepts, including statistical modelling, maximum likelihood and Bayesian inference, and hypothesis testing. Students will be expected to read papers in advance of each class, and to take turns in presenting and leading the discussion.

Back to top

STATISTICS 36800. Wavelets in Statistics.
Sec 01: Debashis Mondal, TTh, 12:00-1:20 PM, Eckhart 117.
PQ: STAT 30200 or consent of instructor.
Required reading: None
Recommended reading: Vidakovic, Brani (1999). Statistical Modeling by Wavelets, Wiley, ISBN 0471293652, 9780471293651.

This course introduces wavelet methods in statistics. Topics include

  • wavelet transform, shrinkage estimators, non-parametric regression,
  • density estimation, deconvolution, inverse problems, and
  • selected applications in time series analysis.

Back to top

STATISTICS 36900=HSTD 33300.  Longitudinal Data Analysis.
Sec 01:  Paul Rathouz, TTh, 9:00-10:20 AM, BSLC 313.
PQ: Statistics 220 (introductory statistical methods), Health Studies 321 or equivalent, Health Studies 324 / Statistics 224 (Applied Regression Analysis) or equivalent, and Health Studies 327 / Statistics 227 (Biostatistical Methods) or equivalent; or permission of instructor. Facility with the Stata software package is assumed.
Required reading: (Available in bookstore.)
* Fitzmaurice GM, Laird NM, Ware JH. (2004). Applied Longitudinal Analysis. Hoboken, NJ: Wiley-Interscience.

Other references: (* Indicates text on reserve in Eckhart Library.)
* Diggle PJ, Heagerty P, Liang K-Y, & Zeger SL. (2002). Analysis of Longitudinal Data, 2nd edn. Oxford: Oxford University Press.
* Hedeker DR & Gibbons RD. (2006). Longitudinal data analysis. Hoboken, NJ: Wiley-Interscience. Hand D, Crowder M. (1996). Practical Longitudinal Data Analysis. London: Chapman & Hall.
Lindsey JK. (1999). Models for repeated measurements. New York: Oxford University Press.
Littel RC, Milliken GA, Stroup WA, Wolfinger RD. (1996). SAS System for Mixed Models. Cary, NC: SAS Institute.
* McCulloch CE, Searle SR. (2001). Generalized, Linear, and Mixed Models. New York: John Wiley & Sons.
Verbeke G, Molenberghs G. (2000). Linear mixed models for longitudinal data. New York: Springer.

Longitudinal data consist of multiple measures over time on a sample of individuals. This type of data occurs extensively in both observational and experimental biomedical and public health studies, as well as in studies in sociology and applied economics. This course will provide an introduction to the principles and methods for the analysis of longitudinal data. Emphasis will be on data analysis and interpretation. Supporting statistical theory will be given at a level appropriate for an advanced Master’s student in Statistics. Problems will be motivated by applications in epidemiology, clinical medicine, health services research, and disease natural history studies.

Back to top

STATISTICS 37910=CMSC 35510.  Statistical Methods in Computer Vision.
Sec 01: Yali Amit, TTh, 10:30-11:50 AM, Ryerson 277.
PQ: Consent of instructor.
Required reading: Instructor’s notes and selected papers.

The first part of this course will provide an introduction to pattern recognition, and an overview of statistical modeling of high dimensional data and parameter estimation methods. Sub jects include graphical models, mixture models, Maximum likelihood and Bayesian estimation, the EM algorithm and its variants. These methods will then be employed in a number of computer vision algorithms involving ob ject detection and recognition. Class work will consist of some theoretical assignments and implementation of some of the algorithms in code on real data. In addition we will present a brief introduction to C++ and provide the students with a set of tools to facilitate the programming assignments.

Back to top

STATISTICS 38100.  Measure-Theoretic Probability I.
Sec 01: Michael Wichura, MWF, 2:30-3:20 PM, Eckhart 117.
PQ: STAT 31300 or consent of instructor.
Required Reading: There is no text required or recommended. Notes will be provided.

This course is the first of a three quarter sequence presenting a careful development of some topics from measure and probability. Topics to be covered in 381 include: classes of sets -- fields, sigmafields, monotone classes, pi and lambda systems; probabilities and general measures; independence and the Borel-Cantelli lemmas; measurable functions; induced measures, distribution and inverse distribution functions; integration with respect to measures -- basic properties, change of variable, indefinite integration, densities; integration to the limit -- MCT, DCT, and friends; laws of large numbers, applications to probability and statistics; transition probabilities and product measures.

Back to top

STATISTICS 39000=FINM 34500. Stochastic Calculus I.
Sec 01: Per Mykland, M, 6:30-9:30 PM, Ryerson 251.
PQ: Enrollment in Mathematical Finance M. Sc. program or consent of instructor.
Required Reading:  Stochastic Calculus for Finance I-II, by Steven E. Shreve (required).
A Course in Financial Calculus, by Alison Etheridge (highly recommended).
`The basics of S-PLUS'', 3rd edition, by the same authors (A. Krause and M. Olson), ISBN 0-387-95456-2 (required).

This course is an introduction to stochastic calculus as it is relevant to the pricing and hedging of options and other derivative securities. It is the first of a two-quarter sequence offered in collaboration by the Department of Statistics and the master's program in Mathematical Finance. The main topics to be covered are:

  • The Fundamental Theorem of Asset Pricing
  • Martingales
  • Brownian Motion
  • The Ito Integral and Ito's Formula
  • The Black-Sholes Formula
  • Girsanov's Formula
  • Currency Options
  • The Martingale Representation and Hedging

There will be weekly homework assignments, and midterm and final exams. The course assistants will conduct weekly help sessions on Friday afternoons.

Back to top

STATISTICS 47620. Simulation Methods.
Sec 01: Steven P. Lalley, MWF, 10:30-11:20 AM, Eckhart 117
PQ: STAT 24600 or STAT 31200 or consent of instructor. Stat 47620 will be taught starting October 27th, 2008 and credits will be 50.
Required Reading: Monte Carlo Strategies in Scientific Computing by Jun Liu.

This will be a brief introduction to several useful techniques of simulation:

  • importance sampling
  • MCMC (Markov chain Monte Carlo)
  • Gibbs sampling
  • perfect sampling

The utility of these methods will be illustrated by a number of substantial examples, including

  • enumeration of contingency tables with fixed margins
  • simulation of Ising models
  • code-breaking.

Back to top

SPRING 2008

College Courses

STATISTICS 20000.  Elementary Statistics.
Sec 01: Dan Wang, MWF 9:30-10:20 AM, Eckhart 133.
Sec 02: Wenlong Wang, MWF 12:30-1:20 PM, Eckhart 133.
PQ:  Math 10500 or equivalent.
Required reading: Statistics, 4th edition, by Freedman, Pisani, Purves 2007, Norton.
ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. NOTE: STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. This course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000. Statistical Methods and Their Applications.
Sec 01:  Zuoheng Wang, MWF 10:30-11:20 AM, Harper Memorial 140.
Sec 02:  Shali Wu, MWF 1:30-2:20 PM, Eckhart 133.
PQ:  2 QTRS Calculus.
Required reading: Introduction to the Practice of Statistics, 5th edition by Moore and McCabe
2006, W. H. Freeman. ISBN-10: 0716764008, ISBN-13: 978-0716764007

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22200 Linear Models and Experimental.
Sec 01: Linda Collins, TTh, 9:00-10:20 AM, Eckhart 133.
PQ: STAT 22000 or consent of instructor.
Required Reading: Oehlert, G. W. (2000) A First Course in Design and Analysis of Experiments. W. H. Freeman. ISBN-10: 0-7167-3510-5 ISBN-13: 978-0-7167-3510-6.

This course covers principles and techniques for the analysis of experimental data and the planning of the statistical aspects of experiments. Topics include linear models, analysis of variance, randomization, blocking, factorial designs, confounding, and incorporation of covariate information.

In addition to regular homework assignments and exams, students will be required to complete a project involving the design and analysis of an experiment of their own.

Back to top

STATISTICS 23400. Statistical Models and Methods.
Sec 01: Pending, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Michael Finegold, MWF, 2:30-3:20 PM, Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300.
Required Reading:

Investigating Statistical Concepts, Applications, and Methods by Chance and Rossman 2006, Duxbury (Thomson Brooks/Cole). ISBN-10: 0495050644, ISBN-13: 978-D495050643.

A Brief Course in Mathematical Statistics by Tanis and Hogg 2007, Prentice Hall, ISBN-10: 0131751395, ISBN-13: 978-0131751392.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, hypergeometric, Poisson, exponential, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for proportions and means for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course. See http://www.stat.uchicago.edu/~stat234 for more detailed information.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24500. Statistical Theory and Methods 2.
Sec 01:  Debashis Mondal, TTh, 1:30 PM, Eckhart 133.
PQ:  STAT 23400 and STAT 23500 or STAT 24400 or consent of instructor.
Required reading: Mathematical Statistics and Data Analysis, 3rd edition, Rice, John A. 2007, Duxbury. ISBN-10: 0534399428, ISBN-13: 978-0534399429.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 24600. Statistical Theory and Methods 3.
Sec 01: Yali Amit,  TTh, 10:30-11:50 AM, Eckhart 133.
PQ: STAT 24400 and STAT 24500 or consent of instructor.
Required Reading: Pattern Recognition and Machine Learning, Bishop, Christopher M. 2006, Springer Verlag. ISBN-10: 0387310738, ISBN-13: 978-0387310732.

This course will introduce a variety of modern statistical methods. We will start with the description of families of multivariate models - multivariate normal distribution, graphical models, log-linear models. We will then discuss inference for such models including the EM algorithm, Bayesian inference and Monte-carlo methods. The main application of the models will be in classification problems - i.e the 'generative approach'. We will also introduce some modern discriminative approaches for classification.

Back to top

STATISTICS 25100.  Intro to Math Probability.
Sec 01: Michael J. Wichura, TTh, 12:00-1:20 PM, Eckhart 312.
PQ: MATH 20000 or MATH 20400 or consent of instructor.
Required Reading: A First Course in Probability, 7th edition, Ross, S. 2005, Pearson/Prentice Hall. ISBN-10: 0131856626, ISBN-13: 978-0131856622.

The aim of the course is to provide an introduction to the concepts of probability. The course will cover the basic ideas used to describe aspects of randomness, such as events, random variables, independence, and conditional probability, with emphasis on the methods, calculation, and applications of probability. The topics treated are: combinatorics; probability models; rules of probability; conditional probability; independence; random variables; expectation and standard deviation; games of chance; common discrete distributions—uniform, binomial, hypergeometric, geometric, negative binomial, and Poisson—and their interrelationships; univariate and multivariate density and distribution functions, change of variable formulas for densities, common continuous distributions—beta, Cauchy, chisquare, exponential, gamma, lognormal, normal, uniform—and their interrelationships; moment generating functions, laws of large numbers and the central limit theorem.

For more information, please visit http://www.stat.uchicago.edu/~wichura/Stat251/courseinfo.html

Back to top

STATISTICS 26100=STATISTICS 33600. Time Dependent Data.
Sec 01: MIchael Stein, TTH, 1:30-2:50 PM, Eckhart 202.
PQ: STAT 24400 or STAT 24500.
Required Reading: Time Series Analysis and Its Applications: With R Examples, 2nd edition
Shumway and Stoffer 2006, Springer. ISBN-10: 0387293175, ISBN-13: 978-0387293172.

This course considers the modeling and analysis of data that are ordered in time. The main focus will be on quantitative observations taken at evenly spaced intervals and will include both time-domain and spectral approaches. Time permitting, statistical approaches to other data types, such as categorical observations or point processes, will be considered.

Back to top

STATISTICS 26700=CHSS 32900, HIPS 25600, STAT 36700.
Sec 01: Stephen Stigler, MWF, 9:30-10:20 AM, Eckhart 117.
PQ: A course in Statistics.
Required Reading: The History of Statistics: The Measurement of Uncertainty Before 1900.
Stigler, Stephen M. 1990, Belknap Press of Harvard University Press. ISBN-10: 067440341X,
ISBN-13: 978-0674403413. Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

Graduate Courses

STATISTICS 30200. Mathematical Statistics 2.
Sec 01:  Wei Biao Wu, MW, 1:30-2:50 PM,  Eckhart 117.
PQ:  STAT 30100 or consent of instructor.
Required Reading: Mathematical Statistics, 2nd ed. 2003. Corr. 4th printing edition (October 5, 2007). Jun Shao, Springer. ISBN-10:0387953825, ISBN-13: 978-0387953823.

This course continues the development of mathematical statistics. Topics of importance include: statistical decision theory, admissability and the Neyman-Pearson lemma, formal hypothesis testing, comparison with conditional frequentist and Bayesian viewpoints, ancillarity, interval estimation, UMP tests and MLR, unbiased tests, score statistics, generalized likelihood ratio tests and asymptotics, minimax estimation, empirical Bayes and "shrinkage".

Back to top

STATISTICS 31300.  Introduction to Stochastic Processes 2.
Sec 01: Per A. Mykland, TTH, 10:30-11:50 AM,  Eckhart 117.
PQ:  STAT 31200 or consent of instructor.
Required reading:

Back to top

STATISTICS 32200. Bayesian Data Analysis.
Sec 01: Matthew Stephens, MW, 1:30-2:50 PM, Eckhart 308.
PQ:  Consent of instructor.
Reading: There is no required text, but "Bayesian Theory" by Bernardo and Smith is background reading.

This course is aimed at graduate students in statistics, and others with the necessary statistical background. We will assume familiarity with standard statistical distributions (e.g. Normal, Poisson, Binomial, Exponential), with the laws of probability, and concepts of statistical inference (maximum likelihood estimation, hypothesis testing, confidence intervals, etc), and basic familiarity with the R statistical package.

The course will cover foundations of Bayesian statistics, including axiomatic development, exchangeability, De Finetti's theorem, Jeffreys and improper priors, decision theory, Bayesian hypothesis testing and Bayes factors. Concepts will be illustrated mainly by instructive "toy" examples, where calculations can be done by hand. However, we will also study more complex, practical applications of Bayesian statistics. Although methods of computation will be discussed, the primary focus will be on concepts, and not on computation.

Back to top

STATISTICS 33200=HSTD 43200. Causal Inference.
Sec 01: Tyler Vanderweele, TTH, 10:30-11:50 AM, BSLC arr.
PQ: STAT 22400-22600 or HSTD 32400-32700 or equivalent or consent of instructor.
Required reading: 

Back to top

STATISTICS 34700. Generalized Linear Models.
Sec 01:  Peter McCullagh, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: STAT 34300 or consent of instructor.
Required Reading: Generalized Linear Models, 2nd edition, McCullagh and Nelder 1990, Chapman & Hall/CRC. ISBN-10: 0412317605, ISBN-13: 978-0412317606.
Recommended Reading: Modern Applied Statistics with S, 4th edition, Venables and Ripley 2003, Springer. ISBN-10: 0387954570, ISBN-13: 978-0387954578.
Applied Statistics: Principles and Examples, Cox and Snell 19821 Chapman & Hall/CRC. ISBN-10: 0412165708, ISBN-13: 978-0412165702.

This is an applied course for students who are familiar with linear models at the level of Draper and Smith or Weisberg. The following topics will be covered:

Factors, variates, contrasts, interactions
Exponential-family models: variance function
Definition of a generalized linear model: link functions
Analysis of deviance
Specific examples of GLMs

logistic and probit regression
cumulative logistic models
log-linear models and contingency tables
inverse linear models

Quasi-likelihood and least squares; estimating functions
Over-dispersion
Partially linear models

Back to top

STATISTICS 35500. Statistical Genetics.
Sec 01: Mary Sara McPeek, F, 1:30-4:10 PM, Eckhart 117.
PQ: Human Genetics 471 and Statistics 244 and 245. Students who do not meet the prerequisites may enroll on a P/NP basis with consent of the instructor.
Reading: There is no textbook.

This is an advanced course in statistical genetics. Prerequisites are Human Genetics 471 and Statistics 244 and 245. Students who do not meet the prerequisites may enroll on a P/NP basis with consent of the instructor. This is a discussion course and student presentations will be required. Topics vary and may include, but are not limited to, statistical problems in association mapping, linkage mapping, population genetics, microarray analysis, and genetic models for complex traits.

Back to top

STATISTICS 36700=CHSS 32900, HIPS 25600, STAT 26700.
Sec 01: Stephen Stigler, MWF, 9:30-10:20 AM, Eckhart 117.
PQ: A course in Statistics.
Required Reading: Stephen M. Stigler, The History of Statistics: The Measurement of Uncertainty Before 1900. (Cambridge, Mass.: Harvard University Press, 1986.) (Available in paperback.) Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 38600. Topics in Stochastic Processes.
Sec 01: Steven P. Lalley, TTH, 3:00-4:20 PM, Eckhart 117.
PQ: Consent of instructor.
Recommended Reading: Stochastic Differential Equations by Oksendal (PG)
Brownian Motion and Stochastic Calculus by Karatzas and Shreve (R)
Foundations of Modern Probability by Kallenberg (X)

Topics:

Wiener process (Brownian motion)
Weak Convergence in Function Space
Ito Integral and Ito Calculus
Introduction to Diffusion Processes

Back to top

STATISTICS 43800. Statistical Inference for Financial Data.
Sec 01: Per A. Mykland, TTH, 1:30-2:50 PM, Eckhart 117.
PQ: STAT 30200, 34300 and 38300 or consent of instructor.
Required Reading:

 

WINTER 2008

HSTD 43501.  Theory and Methods for Multivariate and Longitudinal Data.
Sec 01:  Paul Rathouz and Tyler VanderWeele, MW 1:30-2:50 pm, BSLC Arr.
PQ: Statistics 304, 301, 302, 343, 347. Statistics Statistics 244, 245, 246 with experience or coursework in matrix linear algebra may be substituted for Statistics 304, 301, 302.
Required reading: Instructor’s notes and selected papers.

This course presents a theoretical treatment of methods for multivariate and longitudinal data. The course covers both continuous and categorical data. Focus will be primarily on likelihood-based methods and their direct extensions. The first two-thirds of the course will cover mean and covariance models for multivariate normal data. Applications and special cases will include Hotelling's T-test,
multivariate linear regression, linear mixed and growth curve models, linear structural equations models and graphical models. The last one-third of the course will focus on categorical outcomes and will include generalized linear mixed models, structural equations models for categorical data and generalized linear marginal models. Readings will be taken from selected texts and original articles in the statistical literature. Students should expect four homework sets focused on theory and programming tasks related to methods developed in the course, as well as a final programming project.

Topic List (number of lectures) [expected instructor]
----------
(1) Review of Theory for Multivariate Normal Distribution [tv]
(1) Wishart Distribution and Hotelling's T-test [tv]
(4) General Linear Model for Correlated Data [pr]

  • Models for the mean and variance-covariance of multivariate processes
  • Likelihood methods for multivariate linear regression models
  • Hypothesis tests for variance-covariance parameters in non-standard situations

(7) Applications and Special Cases of the General Linear Model for Correlated Data [tv]

  • Linear Mixed Models and Growth Curve Models
  • Structural Equations Models
  • Multivariate Normal Graphical Models

(6) Generalized Linear Mixed Models -- Likelihood Methods [pr]

  • Canonical link models and the EM algorithm
  • Numerical integration techniques
  • MC integration

(time permitting) Structural Equations Models for Categorical Data [tv]

(time permitting) Generalized linear marginal models for categorical data (see Section 8.2 in DHLZ, Zhao and Prentice, Heagerty) [pr]

College Courses

STATISTICS 20000.  Elementary Statistics.
Sec 01: Wei Biao Wu, MWF 9:30-10:20 AM, Eckhart 133.
PQ:  Math 10600 or equivalent.
Required reading:  Freedman, Pisani, and Purves, Statistics, 4th edition. W. W. Norton Press. ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. NOTE: STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. This course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000. Statistical Methods and Their Applications.
Sec 01:  Debashis Mondal, MWF 10:30-11:20 AM, Eckhart 133.
PQ:  2 QTRS Calculus.
Required reading:  Moore and McCabe, Introduction to the Practice of Statistics, 5th edition. W. H. Freeman.  ISBN-10: 0716764008, ISBN-13: 978-0716764007.

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22600=HSTD 32600. Analysis of Categorical Data.
Sec 01: Mei Wang, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: STAT 22000 or equivalent.
Required reading: Agresti, A. An introduction to Categorical Data Analysis. Wiley, 2nd ed., 2007.

It is expected that the students have a good understanding of basic descriptive statistics such as means, variances and expectation, of the inferential notions of estimate, confidence intervals and significance or hypothesis testing. Familiarity with one statistical package, e.g. R, Splus, SAS, SPSS, Stata or Minitab, and ability to access Web sites and to download files from the Web are required. The free statistical package R will be used in this course for Winter 2007.

This course is an introduction to the theory and applications of statistical methods for investigating the relationships among discrete variables. The course will present methods for analyzing categorical data, including standard methods for contingency tables such as odds ratios,
tests of independence and various measures of association, generalized linear models for binary data and count data, logistic regression for binomial data, loglinear models for Poisson data, and models for paired samples with categorical responses. The statistical techniques discussed will be presented by many real examples involving physical, biological and social science data.

Back to top

STATISTICS 22700=HSTD32700. Biostatistical Methods.
Sec 01: Ronald A. Thisted, TTH, 10:30-11:50 AM, BSLC 202.
PQ: HSTD 32400/STAT 22400 or STAT 24500 or equivalent; or consent of instructor.
Required reading: Collett, D. (2003). Modelling Binary Data, Second Edition. Boca Raton, Chapman & Hall/CRC.
Collett, D. (2004). Modelling Survival Data in Medical Research, Second Edition. London, Chapman & Hall.

This course is designed to provide students with tools for analyzing categorical, count, and time-to-event data frequently encountered in medicine, public health and related biological and social sciences. The course will emphasize application of methods rather than statistical theory, including recognition of the appropriate methods, interpretation and presentation of results. Methods covered include: contingency table analysis, logistic regression, log-linear (Poisson) regression, conditional logistic regression, regression methods for ordinal data, Kaplan-Meier survival curves, parametric survival models, and Cox proportional-hazards survival analysis.

Back to top

STATISTICS 23400. Statistical Models and Methods.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Nina Singhal Hinrichs, MWF, 2:30-3:20 PM, Ryerson 276 (as of 1/9/08).
PQ: Mathematics 13300, 15300 or 16300.
Required Reading: Chance and Rossman (2005). Investigating Statistical Concepts, Applications, and Methods, First Edition. Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.
Tanis and Hogg (2008). A Brief Course in Mathematical Statistics, Pearson/Prentice Hall, ISBN: 0-1317-5139-5.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, hypergeometric, Poisson, exponential, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for proportions and means for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course. See http://www.stat.uchicago.edu/~stat234 for more detailed information.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24400. Statistical Theory and Methods I.
Sec 01: Mathias Drton, TTH, 10:30-11:50 AM, Eckhart 133.
PQ:  MATH 19600, 20100, or 20500.
Required reading: Rice, John A. (2007). Mathematical Statistics and Data Analysis, Third Edition, by (Duxbury).

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the basics -- tools from probability and the elements of statistical theory. Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology.

There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited and brief, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation. Statistical software will be used for simulations and data analysis.

Back to top

STATISTICS 24500. Statistical Theory and Methods II.
Sec 01:  Wei Biao Wu,  TTH, 1:30-2:50 PM, Eckhart 133.
PQ:  STAT 24400 or consent of instructor.
Required reading: Rice, J. A. (2006). Mathematical Statistics and Data Analysis, 3rd ed., Brooks/Cole.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 24700=CPNS 32100. Math/Stats Methods for Neuroscience-2.
Sec 01:  William Van Drongelen, WF, 1:30-2:50 PM, BSLC 401.
PQ:  STAT 24400 or consent of instructor.
Required reading: Rice, J. A. (1995). Mathematical Statistics and Data Analysis, 2nd ed., Duxbury.

This course deals with the application of non-linear methods in signal processing and dynamical systems theory to issues in neuroscience. Data analysis with Matlab is again emphasized.

The third course in this sequence is an elective course in one of the quantitative sciences relevant to neuroscience that can be selected by the student in consultation with the program chair.

Back to top

STATISTICS 25200=STAT 31200.  Introduction to Stochastic Processes I.
Sec 01:  Steven P. Lalley, TTH, 10:30-11:50 AM, Ryerson 277 .
PQ:  STAT 25100 or consent of instructor
Required reading: Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

Stochastic processes provide models for random events that evolve in time and may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chains, renewal theory, random walks, queueing and branching processes.

Back to top

 

Graduate Courses

STATISTICS 30100. Mathematical Statistics I.
Sec 01:  Mary Sara McPeek, TTH, 1:30-2:50 PM,  Eckhart 117.
PQ:  STAT 30400 and MATH 20500 or consent of instructor.
Required Reading: Casella and Berger. Statistical Inference, 2nd ed.
Ferguson. A Course in Large Sample Theory.

This course is part of a two-quarter sequence on the theory of statistics. Topics will include exponential families, quadratic forms of multivariate normal, asymptotics of order statistics, sufficiency and completeness, the likelihood function, methods of point estimation, and asymptotic properties of maximum likelihood estimates. Other topics (e.g. Bayesian methods and methods for dependent observations) may be covered if time permits.

Back to top

STATISTICS 30600. Adv. Statistical Inference 1.
Sec 01:  Peter McCullagh, TTH, 10:30-11:50 AM,  Eckhart 117.
PQ:  Consent of instructor.
Suggested reading: Tensor Methods in Statistics by P. McCullagh.
Principles of Statistical Inference by L. Pace and A. Salvan.
Statistical Models by A.C. Davison.

The focus of the course will be on parametric statistical models and likelihood.

Topics for discussion include

  • Statistical models: definition by example
  • Likelihood and likelihood ratio statistic
  • Asymptotic approximation of distributions
  • Edgeworth and related approximations
  • Cumulant calculations and tensor calculus
  • Marginal likelihood and REML estimation
  • Projection and Best linear prediction: splines
  • Processes, exchangeable processes, modulated processes, regression processes
  • Bayesian models

Back to top

STATISTICS 31200=STAT 25200.   Introduction to Stochastic Processes 1.
Sec 01:  Steven P. Lalley, TTH, 10:30-11:50 AM, Ryerson 277.
PQ:  STAT 25100 or consent of instructor.
Required Reading: Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

Stochastic processes provide models for random events that evolve in time and may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chains, renewal theory, random walks, queueing and branching processes.

Back to top

STATISTICS 31700=STAT 25300.   Introduction to Probability Models.
Sec 01:  Mei Wang, TTH, 9:00-10:20 AM,  Eckhart 133.
PQ:  STAT 25100 or STAT 24400 or equivalent. Consent of instructor.
Required reading: Ross, R., Introduction to Probability Models, 9th ed., (2007).

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Another appropriate title for the course could be "an Introduction to Applied Stochastic Processes." Following a very brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 32500=GSBC 41902.  Statistical Inference.
Sec 01:  Nicholas G. Polson, Tue 8:30-11:30 AM, GSB C10.
PQ:  Business 41901=STAT 32500
Required reading: DeGroot and Scherviah, Probability and Statistics. Lecture notes will be provided in the form of a CoursePack.

This Ph.D.-level course is the second in a two-quarter sequence with Business 41901. The central topic is statistical inference. The topics covered include Bayesian inference, classical estimation, decision theory, MCMC methods and Hierarchical models. The use of Hierarchical models is a focus in applications.

Back to top

STATISTICS 34500.  Design and Analysis of Experiments.
Sec 01:  Michael L. Stein, MW, 1:30-2:50 PM, Eckhart 133.
PQ: STAT 34300
Reading:  Mead, R., The Design of Experiments. Cambridge.
West, B.D., Welch, K.B. and Galecki, A.T., Linear Mixed Models. Chapman & Hall/CRC.

An introduction to the methodology and application of linear models in experimental design. A major focus of the course will be the basic principles of experimental design, such as blocking, randomization and incomplete layouts. Both standard designs, such as fractional factorials and incomplete block designs, as well as nonstandard designs, will be studied within this context. The analysis of these experiments will be developed as well, with particular emphasis on careful model formulation and the role of fixed and random effects. Time permitting, additional topics may include the use of covariates in the analysis of designed experiments, spatial analysis of field trials and Bayesian approaches to analysis of experimental data.

Course work will include the planning, execution and analysis of an experiment by the class.

Back to top

STATISTICS 35540. Population Genetics and Metagenomics.
Sec 01:  Nicholas Eriksson, MW, 1:30-2:50 PM, Eckhart 117.
Required reading: Selected papers.

In this course we will focus on the biological, statistical, and computational challenges involved in the analysis of metagenomic data. In metagenomics and population genetics, the goal is to understand the patterns of variation in genomes within species and between related species. New sequencing technologies allow us to gather data on unexplored populations, but require new methods of analysis. We will look at the connections between these problems and more established fields such as phylogenetics, sequence assembly, and population genetics.

Topics will be chosen from recent papers from the literature; some of these papers will be presented by seminar participants.

Back to top

STATISTICS 35600=HSTD 33100. Applied Survival Analysis.
Sec 01:  James Dignam, TTH, 10:30-11:50 AM, BSLC 305.
PQ: HSTD 32100; STAT 22000; or equivalent, and HSTD 32400/STAT 22400 or equivalent; or consent of instructor.
Required reading:

This course will provide an introduction to the principles and methods for the analysis of time-to-event data. This type of data occurs extensively in both observational and experimental biomedical and public health studies, as well as in industrial applications. While some theoretical statistical detail is given (at the level appropriate for a Master's student in statistics), the primary focus will be on data analysis. Problems will be motivated from an epidemiologic and clinical perspective, concentrating on the analysis of cohort data and time-to-event data from controlled clinical trials.

Back to top

STATISTICS 35700=HSTD 31001.   Epidemiologic Methods.
Sec 01:  Diane Lauderdale, TTH, 12:00-1:20 PM, BH W229.
PQ: HSTD 30900 or consent of instructor.
Required reading:

This course expands on the material presented in "Principles of Epidemiology," further exploring issues in the conduct of epidemiologic studies. The student will learn the application of both stratified and multivariate methods to the analysis of epidemiologic data. The final project will be to write the "specific aims" and "methods" sections of a research proposal on a topic of the student's choice.

Back to top

STATISTICS 37300.  Graphical Models and Algebraic Statistics.
Sec 01:  Mathias Drton, TTH, 1:30-2:50 PM, Eckhart 202.
PQ: Consent of instructor.
Required reading:  None.

In graphical modelling, one associates a statistical model with a graph: nodes represent variables and edges indicate dependencies between variables. This framework is popular in many applied areas and encompasses models such as Bayesian networks, log-linear models, phylogenetic tree models, multivariate regression, factor analysis, and structural equation models. Topics to be discussed in this course include conditional independence, directed and undirected graphical models, hidden variables, and statistical inference in the different model classes. We will also discuss algebraic techniques that are useful in the study of graphical models.

Back to top

STATISTICS 38300.  Measure-Theoretic Probability-3.
Sec 01:  Michael J. Wichura, MWF, 12:30-1:20 PM, Eckhart 117.
PQ: STAT 38100 or consent of instructor.
Required reading: No text book is required; notes will be distributed in class.

Topics for Stat 38300 will include:

  • The Hahn and Jordan decomposition theorems
  • Modes of convergence: with probability one, in probability, and in mean; uniform integrability
  • L2-spaces: projections; representation of linear functionals
  • The Radon-Nikodym theorem: absolute continuity, Radon-Nikodym derivatives; likelihood ratios; Lebesgue decompositions
  • Conditional probability: regular conditional probability distributions
  • Conditional expectation: given sub-sigma fields, and given measurable functions
  • Martingales: definitions and examples, transformations
  • Stopping times; optional sampling
  • Martingale limit and closure theorems
  • Backward submartingales
  • Continuous-time martingales: convergence, closure, optional sampling

Back to top

STATISTICS 39000=FINM 34500.  Stochastic Calculus-1.
Sec 01:  Jostein Paulsen, W, 6:00-9:00 PM, Ryerson 251.
PQ: Math Finance Students Only. The course, Mathematical Foundations of Option Pricing is very useful. Otherwise, the more mathematical analysis you know, the better.
Required Reading:  Tomas Bjork: Arbitrage Theory in Continuous Time. Oxford University Press
Steven Shreve: Stochastic Calculus for Finance II: Continuous-Time Models. Springer Finance.

In the class we will give a more or less in depth coverage of the following topics. Notes are provided by the instructor.

• Basic stochastic calculus including martingales, filtrations, Brownian motion, the
Itô integral, Itô's formula and stochastic differential equations.

  • Option pricing, both with deterministic interest rate and with stochastic interest rate. Also option hedging.
  • Forwards and futures
  • Foreign exchange derivative pricing
  • Interest rate models including classical models, the Heath-Jarrow-Morton approach and the Market Models.
  • Some credit risk models

Back to top

STATISTICS 47900. Stochastic Models for Memory and Learning.
Sec 01: Yali Amit , TTH, 3:00-4:20 PM, Eckhart 117. Course begins 6th week.
PQ: Consent of instructor.
Required reading: None

This 5 week course will cover a some of the literature analyzing learning and memory in large neuronal populations as stochastic processes. First we will discuss models with discrete time dynamics, discrete binary neurons and finite state synapses, and derive bounds on memory capacity, learning and forgetting times. This will only involve discrete time Markov chain analysis and some ideas from mean-field analysis. Second we will introduce continuous integrate and fire neurons and continuous time dynamics. Using mean-field methods we will analyze the stability of large networks with random connections, and the behavior of the networks after learning. People interested in probability will be exposed to a rich collection of stochastic models waiting to be analyzed in a rigorous mathematical framework (the mean field analysis is only approximate.) People interested in neuroscience will be exposed to interesting models and some intriguing connections to experimental data.

Back to top

STATISTICS 48400. Adv. Topics in Probability 1.
Sec 01: Steven P.  Lalley, TTH, 3:00-4:20 PM, Eckhart 117. Course held 1st-5th weeks of quarter.
PQ: Basic probability and linear algebra, some elementary complex variable theory.
Recommended reading:

  • "Methodologies in spectral analysis of large dimensional random matrices, a review" by Z. D. Bai , Statistica Sinica 9 (1999), 611-677.
  • "High dimensional statistical inference and random matrices" by I. Johnstone, http://front.math.ucdavis.edu/0611.5589.
  • "Introduction to the Random Matrix Theory: Gaussian Unitary Ensemble and Beyond" by Yan V. Fyodorov, http://front.math.ucdavis.edu/0412.4717.
  • "Orthogonal Polynomials and Random Matrices: A Riemann-Hilbert Approach" by P. Deift (mainly ch. 5), http://www.ams.org/bookstore.

This 5-week course will introduce the basic theory of large random matrices. The first part of the course will be devoted to "bulk spectral" properties of Wigner and sample covariance matrices
(that is, the empirical distribution of their eigenvalues), leading to the Wigner semi-circle law and the Marchenko-Pastur theorem. The second part will focus on the top of the spectrum, and will (I hope) give a mostly complete derivation of the "Tracy-Widom" distribution.

Back to top

AUTUMN 2007

College Courses

STATISTICS 20000.  Elementary Statistics.
Sec 01: Xinghua Zheng, MWF 9:30-10:20 AM, Eckhart 133
Sec 02: Minsun Song, MWF 12:30-1:20 PM, Eckhart 133
PQ:  Math 10600 or equivalent.
Required reading:  Freedman, Pisani, and Purves, Statistics, 4th edition. W. W. Norton Press. ISBN-10: 0393929728, ISBN-13: 978-0393929720.

This course meets one of the general education requirements in the mathematical sciences. NOTE: STAT 20000 may not be used in the statistics major. It is recommended for students who do not plan to take advanced statistics courses. This course introduces statistical concepts and methods for the collection, presentation, analysis, and interpretation of data. Elements of sampling, simple techniques for analysis of means, proportions, and linear association are used to illustrate both effective and fallacious uses of statistics.

Back to top

STATISTICS 22000. Statistical Methods and Their Applications.
Sec 01:  Shali Wu, MWF 10:30-11:20 AM, Eckhart 133
Sec 02:  Marcin Hitczenko, MWF 1:30-2:20 PM, Eckhart 133
PQ:  2 QTRS Calculus.
Required reading:  Moore and McCabe, Introduction to the Practice of Statistics, 5th edition. W. H. Freeman.  ISBN-10: 0716764008, ISBN-13: 978-0716764007.

This course introduces statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

Back to top

STATISTICS 22400=HSTD 32400. Applied Regression Analysis.
Sec 01:  Vanja Dukic, TTH 10:30-11:50 AM, Eckhart 133
PQ: HSTD 32700 or STAT 22000 or STAT 23400 or STAT 24400 or consent of instructor.
Required reading:

This course is an introduction to the methods and applications of fitting and interpreting multiple regression models. The main emphasis is on the method of least squares. Topics include the examination of residuals, the transformation of data, strategies and criteria for the selection of a regression equation, the use of dummy variables, and tests of fit. The techniques discussed will be illustrated by many real examples involving biological and social science data. Examples and exercises will be implemented in a statistical software package "Stata", but familiarity with Stata is not required.

Back to top

STATISTICS 23400. Statistical Models/Method-1.
Sec 01:  Linda Collins, MWF 11:30-12:20 PM, Eckhart 133
Sec 02: Nicholas Eriksson, MWF 2:30-3:20 PM, Eckhart 133
PQ: MATH 13300, 15300 or 16300
Required reading: Chance and Rossman, Investigating Statistical Concepts, Applications, and Methods.  Thompson, Brooks/Cole, ISBN-10: 0495050644, ISBN-13: 978-0495050643.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Back to top

STATISTICS 24400. Statistical Theory/Method-1.
Sec 01:  Michael Stein, TTH, 1:30-2:50 PM, Eckhart 133
PQ:  MATH 19600, 20100, or 20500
Required Reading: Rice, John A. (2007). Mathematical Statistics and Data Analysis, Third Edition, by (Duxbury).

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the essential tools from probability needed for study of statistical theory and the basic elements of statistical theory.

Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology.

There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation.

The mathematics prerequisites are listed as general guidance. You should be comfortable with multivariate calculus through partial differentiation and multiple integration.

Back to top

Graduate Courses

STATISTICS 30400.  Distribution Theory.
Sec 01: Dan Nicolae, MWF, 1:30-2:20 PM, Eckhart 117
PQ: STAT 24500 or MATH 25000 or equivalent
Recommended reading: Severini, T. (2005). Elements of Distribution Theory. Cambridge University Press.

This course covers the basics of distribution theory. Topics include:

  • Distribution functions and their inverses, quantile functions, Q/Q plots, change-of-variables for probability densities
  • Expectation, variance, median, mode of random variables
    Basics of measure theory, including Fubini's theorem and interchangeability of limits and integrals
  • Moment generating functions and characteristic functions, including power series expansion, inversion formulas, uniqueness theorems, and convergence in distribution
  • Cumulants and cumulant generating functions: examples, properties
  • Different concepts of convergence for random variables
  • Limit theorems, including the weak law of large numbers, the central limit theorem.

Back to top

STATISTICS 30700=CMSC 37800.  Numerical Computation.
Sec 01: Ronald Thisted, TTH, 10:30-11:50 AM, Ryerson 276
PQ: Stat 34300 (concurrent enrollment OK) or consent of instructor.
Required reading: Thisted, Ronald A. Elements of Statistical Computation. CRC/Chapman & Hall.
Recommended, but not required:
   Gentle, James. Random Number Generation and Monte Carlo Methods. Second edition.
   Springer.
   Watkins, David S. Fundamentals of Matrix Computations. Second edition.
   Wiley.
   Scheinerman, Edward. C++ for Mathematicians. CRC/Chapman and Hall.

This course starts with a presentation of the fundamental algorithms for the solution of linear equations, the decomposition of matrices, and finite dimensional eigenvalue problems. Applications to least squares/regression will be presented, emphasizing use of existing numerical software. The course will also discuss optimization problems and introduce the basic principles of simulation-based methods.

Topics include:

  • Gaussian elimination and back-substitution
  • LU decomposition. (General/Symmetric)
  • Singular value decomposition. (Symmetric)
  • Householder orthogonalization and QR factorization. (Symmetric).
  • Iterative methods: Jacobi and Gauss Seidel.
  • Optimization: Newton-Raphson and quasi-Newton.
  • Uniform random number generation.
  • Simulating specific distributions
  • Monte Carlo methods

By the end of the course students should be able to apply these algorithms in their research work.

Back to top

STATISTICS 32301=HSTD 43001. Advanced Bayesian Methods.
Sec 01:  Vanja Dukic, W, 12:30-3:20 PM, BH W230
PQ:  STAT 32300/HSTD 43000 or STAT 30100-30200, 31200-31300 and consent of instructor
Required reading: Notes and manuscripts will be distributed in class.

This class is a continuation of the Bayesian Topics (Stat 32300/HSTD 43000). We will move beyond the material learned there (the basics of Bayesian statistics and computation (importance sampling, EM, MCEM, data augmentation, Metropolis-Hastings and Gibbs sampling). In particular, we will focus on extensions to MCMC geared for dealing with high-dimensional problems with potential multimodality (simulated tempering, sequential Monte Carlo, Hamiltonian MCMC, Langevin MCMC). We will also discuss issues and algorithms for model comparison (transdimensional MCMC and algorithms for computation of normalizing constants). Algorithms can be implemented in any language, but familiarity with R or Matlab will be assumed. The class will have a seminar format.

Back to top

STATISTICS 32400=GSBC 41901. Probability and Statistics.
Sec 01:  Nicholas Polson, Tue, 8:30-11:30 AM, HC3B
PQ:  One year of Calculus
Required reading: The text for the course is DeGroot and Schervish, Probability and Statistics. Lecture notes will be available in the form of a CoursePack.

Back to top

STATISTICS 33100.  Sample Surveys.
Sect 01: Kirk Wolter, TTH, 10:30-11:50 AM, Eckhart 117
PQ: Consent of instructor
Required reading: Wolter, K.M. (2007). Introduction to Variance Estimation, 2nd Edition, Springer-Verlag, New York.

This is an introductory course to the statistics and methodology of sample surveys. Topics include

  • basic methods of sample selection,
  • determining sample size, stratification,
  • general estimators (Horvitz-Thompson, ratio, generalized regression, calibration)
  • domain estimation,
  • nonresponse,
  • nonsampling error,
  • multiple-stage sampling,
  • a national sampling frame for area probability surveys,
  • telephone surveys,
  • questionnaire design,
  • variance estimation for complex surveys,
  • analysis of contingency tables, and
  • regression analysis for survey data.

The course will be of interest to students who anticipate a research career that designs, collects, and analyzes survey data in fields such as economics, education, healthcare, marketing, psychology, sociology, and statistics.

Back to top

STATISTICS 33900=FINM 33100. Financial Data Analysis.
Sec 01:  Per A. Mykland, W, 6:00-9:00 PM, Eckhart 202
PQ:  Math Finance Students only
Required reading:
Recommended textbooks:
Applied Linear Regression (3rd edition) by Sanford Weisberg (Wiley)
Time Series: Applications to Finance by Ngai Hang Chan (Wiley)
Statistical Analysis of Financial Data in S-Plus by Rene A. Carmona (Springer-Verlag)
Mathematical Statistics and Data Analysis by John A. Rice (Duxbury)
The Basics of S-plus, by Andreas Krause and Melvin Olson (Springer-Verlag)

Note that only part of each book will be used in the course. However, in the course of your career, you will find all of them useful to have on your shelf for further reading and reference.

Back to top

STATISTICS 34300.  Applied Linear Stat Methods.
Sec 01: Mathias Drton, TTH, 9:00-10:20 AM, Eckhart 133
PQ: STAT 24500 and MATH 25000 or equivalent
Optional Reading: Venables, W.N. and Ripley, B.D. (1999). Modern Applied Statistics with S-Plus (3rd ed). Springer-Verlag.
Required Reading: Weisberg, S. (2005). Applied Linear Regression, Third Edition. John Wiley & Sons. Software: Splus or R.

Statistics 34300 is an intensive course in the theory and methods of linear regression and related techniques of statistical modelling. It is intended primarily for graduate students in Statistics and related fields.

The course is also open to undergraduates and others who have a solid understanding of matrix algebra and basic statistical theory. Thorough familiarity with the simple linear regression model is expected.

The course will review linear regression with a single predictor, and will cover the multiple-predictor case; least-squares estimation; associated distribution theory; estimation, confidence intervals and tests; regression with errors in the predictors; weighted least squares, assessing lack of fit; residual analysis; regression diagnostics; transformations; model building; collinearity; subset-selection methods, including stepwise regression; prediction; nonlinear least squares.

Back to top

STATISTICS 35000=HSTD 33300, ENST 27400, PPHA 36400.  Principles of Epidemiology.
Sec 01: Kurina Lianne , TTH, 9:00-10:20 AM, BSLC 240
PQ: Introductory Statistics
Required reading:

Back to top

STATISTICS 35600.  CANCELLED .

Back to to

STATISTICS 36900=HSTD 33300.  Longitudinal Data Analysis.
Sec 01:  Paul Rathouz, TTH, 9:00-10:20 AM, BSLC 313
PQ: HSTD 32100; STAT 22000 or equivalent and HSTD 32400-STAT 22400 or equivalent; or consent of instructor.
Required reading: *Fitzmaurice GM, Laird NM, Ware JH. (2004). Applied Longitudinal Analysis. Hoboken, NJ: Wiley.
Other references: (* Indicates text on reserve in Eckhart Library.)
   *Diggle PJ, Heagerty P, Liang K-Y, & Zeger SL. (2002). Analysis of Longitudinal Data, 2nd edn.
Oxford: Oxford University Press.
   *Hedeker DR & Gibbons RD. (2006). Longitudinal data analysis. Hoboken, NJ: Wiley-Interscience.
Hand D, Crowder M. (1996). Practical Longitudinal Data Analysis. London: Chapman & Hall.
   *Lindsey JK. (1999). Models for repeated measurements. New York: Oxford University Press.
Littel RC, Milliken GA, Stroup WA, Wolfinger RD. (1996). SAS System for Mixed Models. Cary,
NC: SAS Institute.

Back to top

STATISTICS 37610.  Monte Carlo Methods in Scientific Computing.
Sec 01:  Samuel Kou, TTH, 1:30-2:50 PM, Eckhart 117
PQ: Consent of instructor
Required reading: Jun S. Liu (2001). Monte Carlo Strategies in Scientific Computing. Springer.

This course introduces students to modern Monte Carlo methods and their applications in scientific computing. In addition to the classical topics, such as Metropolis-Hastings algorithm, Gibbs sampler, importance sampling and sequential Monte Carlo, special topics include Swendsen-Wang algorithm, multicanonical sampling, histogram method, multigrid sampling, parallel tempering, equi-energy sampler, fragment-regrowth sequential sampling, dimension expansion Monte Carlo, DNA sequence alignment and motif finding, lattice protein folding, and polymer models on lattice.

Back to top

STATISTICS 37910=CMSC 35510.  Statistical Methods in Computer Vision.
Sec 01: Yali Amit, TTH, 10:30-11:50 AM, Ryerson 277
PQ: Consent of instructor
Required reading: Instructor’s notes and selected papers.

The first part of this course will provide an introduction to pattern recognition, and an overview of statistical modeling of high dimensional data and parameter estimation methods. Sub jects include graphical models, mixture models, Maximum likelihood and Bayesian estimation, the EM algorithm and its variants. These methods will then be employed in a number of computer vision algorithms involving ob ject detection and recognition. Class work will consist of some theoretical assignments and implementation of some of the algorithms in code on real data.

Back to top

STATISTICS 38100.  Measure-Theoretic Probability I.
Sec 01: Michael Wichura, MWF, 2:30-3:20 PM, Eckhart 117
PQ: STAT 31300 or consent of instructor
Required Reading: There is no text required or recommended. Notes will be provided.

This course is the first of a three quarter sequence presenting a careful development of some topics from measure and probability. Topics to be covered in 381 include: classes of sets -- fields, sigmafields, monotone classes, pi and lambda systems; probabilities and general measures; independence and the Borel-Cantelli lemmas; measurable functions; induced measures, distribution and inverse distribution functions; integration with respect to measures -- basic properties, change of variable, indefinite integration, densities; integration to the limit -- MCT, DCT, and friends; laws of large numbers, applications to probability and statistics; transition probabilities and product measures.

Back to top

SUMMER 2007

STATISTICS 22000. Stat Meth And Applications.
Sec 01:  David Matteson, TTH, 1:00-3:00 PM, Eckhart 117
PQ: Math 15200 or equivalent
Required reading:   Moore, D. S. and McCabe, G. P. (2006). Introduction to the Practice of Statistics, 5th Edition. Freeman.

This course is an introduction to statistical techniques and methods of data analysis, including the use of computers. Examples are drawn from the biological, physical, and social sciences. Students are required to apply the techniques discussed to data drawn from actual research. Topics include data description, graphical techniques, exploratory data analyses, random variation and sampling, one- and two-sample problems, the analysis of variance, linear regression, and analysis of discrete data.

SPRING 2007

STATISTICS 22200. Linear Models and Experimental Design.
Sec 01: Mei Wang, TTH, 9:00-10:20 AM, Eckhart 133.
PQ: STAT 22000 or consent of instructor.
Required Reading: Oehlert, G. W. (2000) A First Course in Design and Analysis of Experiments. W. H. Freeman. ISBN: 0-7167-3510-5.

This course will introduce the student to the major statistical issues in the design of experiments and the analysis of experimental data. The major topics will be

  • The basic principles of experimental design: randomization, blocking,
    and balance.
  • Important classes of designs: matched pairs, complete factorial designs,
    and fractional factorial designs.
  • Analysis of variance and inference from experimental data.

In addition to regular homework assignments and exams, students will be required to complete a project involving the design and analysis of an experiment of their own.

Back to top

STATISTICS 23400. Statistical Models and Methods.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Oli Atlason, MWF, 2:30-3:20 PM, Eckhart 133.
PQ: Math 13300, 15300 or 16300.
Required Reading: Chance and Rossman (2005). Investigating Statistical Concepts, Applications, and Methods, First Edition.  Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top


STATISTICS 24500. Statistical Theory/Method-2.
Sec 01: Stephen M. Stigler, TTH, 1:30-2:50 PM, Location: Eckhart 133.
PQ: STAT 23400 and STAT 23500 or STAT 24400 or consent of instructor.
Required Reading: Rice, J. A. Mathematical Statistics and Data Analysis, 3rd ed., Duxbury Press. ISBN: 0-534-20934-3.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top


STATISTICS 24600.  Statistics Theory/Method-3.
Sec 01: Yali Amit, TTH, 10:30-11:50 AM, Eckhart 133.
PQ: STAT 24400 and STAT 24500 or consent of instructor.
Required Reading:  Wasserman, Larry (2003). All of Statistics, Springer. ISBN: 0-387-40272-1.

This course will introduce a variety of modern statistical methods. We will start with the description of families of multivariate models (multivariate normal distribution, graphical models, log-linear models). We will then discuss Missing data problems and the EM algorithm, Bayesian inference, Monte-carlo simulations, bootstrap and some modern classification techniques.

Back to top


STATISTICS 25100.  Intro to Math Probability.
Sec 01 and 02: Michael J. Wichura and Gregory F. Lawler.  Professor Wichura will teach both sections 01 and 02 for the first half of the course; Professor Lawler will teach both sections
the second half of the course.

Time: TTH, 12:00-1:20 PM (Section 01). TTH, 3:00-4:20 PM (Section 02).
Location: Eckhart 312.
PQ: MATH 20000 or MATH 20500 or consent of instructor.
Required Reading: Ross, A First Course in Probability, 7th Edition. Pearson/Prentice Hall. ISBN 0-13-185662-6.

The aim of the course is to provide an introduction to the concepts of probability. The course will cover the basic ideas used to describe aspects of randomness, such as events, random variables, independence, and conditional probability, with emphasis on the methods, calculation, and applications of probability. The topics treated are: combinatorics; probability models; rules of probability; conditional probability; independence; random variables; expectation and standard deviation; games of chance; common discrete distributions—uniform,
binomial, hypergeometric, geometric, negative binomial, and Poisson—and their interrelationships; univariate and multivariate density and distribution functions, change of variable formulas for densities, common continuous distributions—beta, Cauchy, chisquare, exponential, gamma, lognormal, normal, uniform—and their interrelationships; moment generating functions, laws of large numbers and the central limit theorem.

Midterm: Please note that there will be a COMMON midterm for both sections, to be held THURSDAY EVENING of the sixth week (May 3), from 6:00-8:00PM

For more information, please visit http://www.stat.uchicago.edu/~wichura/Stat251/courseinfo.html

Back to top

STATISTICS 26700=STAT 36700=HIPS 25600=CHSS 32900.  History of Statistics.  
Sec 01: Stephen M. Stigler, MWF, 9:30-10:20 AM, Eckhart 203.
PQ:  A course in Statistics.
Recommended Reading: Stephen M. Stigler, The History of Statistics: The Measurement of Uncertainty Before 1900. (Cambridge, Mass.: Harvard University Press, 1986.) (Available in paperback.) Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 30200.  Mathematical Statistics-2.
Sec 01: Wei-Biao Wu, MW, 1:30-2:50 PM, Eckhart 117.
PQ: STAT 30100 or consent of instructor.
Required Reading: Casella and Berger. Statistical Inference, Second edition. Duxbury Press. ISBN: 0-534-24312-6

This course continues the development of mathematical statistics. Topics of importance include: statistical decision theory, admissability and the Neyman-Pearson lemma, formal hypothesis testing, comparison with conditional frequentist and Bayesian viewpoints, ancillarity, interval estimation, UMP tests and MLR, unbiased tests, score statistics, generalized likelihood ratio tests and asymptotics, minimax estimation, empirical Bayes and "shrinkage".

Back to top

STATISTICS 31300.  Intro: Stochastic Processes-2.
Sec 01: Steven P. Lalley, TTH, 10:30-11:50 AM, Eckhart 117.
PQ: STAT 31200 or consent of instructor.
Required Reading: Notes will be posted online.

This course is a continuation of Statistics 31300: Introduction to Stochastic Processes I. Topics to be discussed will include rates of convergence for Markov chains, introduction to MCMC, generating functions, Galton-Watson processes, continuous-time Markov chains, martingales, and Brownian motion. There will be weekly homework assignments, and midterm and final exams.

Back to top

STATISTICS 32300.  Bayesian Methods in Biostatistics.
Sec 01: Vanja Dukic, Wed, 12:30-3:20 PM, BSLC
PQ: STAT 30100-30200, 24400-24500, 34300, 31200-31300, consent of instructor.
Required Reading: Tanner, 3rd ed., Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, Springer.

This course will cover basics of modern statistical computation, with emphasis on Bayesian computational methods. It will begin with the introduction to Bayesian statistics, and cover normal and non-normal approximation to likelihood and posterior distributions, the EM algorithm, data augmentation and Markov Chain Monte Carlo (MCMC) methods. Time permitting, we will conclude with some recent developments in the MCMC area, such as perfect and adaptive sampling methods. Biostatistical and environmental examples will be given throughout the course. There will be weekly homeworks, and students will be expected to complete a project by the end of the course. There will be no final exam, but there will be an in-class final project presentation. Algorithms can be implemented in any language, but familiarity with R and Matlab will be assumed.

Back to top

STATISTICS 33200=HSTD 43200.  Causal Inference.
Sec 01: Tyler J. VanderWeele, MW, 10:30-11:50 AM , BSLC
PQ: The course is intended for both masters and doctoral students in statistics and in the social sciences.  The course will be accessible to anyone with a firm understanding of linear and logistic regression (e.g. STAT 224/226 or HSTD 324/327)  though students would benefit from a more sophisticated understanding of statistics.  Supplementary handouts will be provided covering the proofs and technical details concerning methods presented for more theoretically-oriented students.
Required Reading: None.  If you want to read further on certain topics, take a look at Judea Pearl's book "Causality".

The course will be concerned with the process of drawing causal inferences from observational data in the biomedical and social sciences.  The course will introduce a number of fundamental concepts in causal inference and cover methods of estimating causal effects for both point- and time-varying exposures.  Concepts and methods that will be covered include: confounding, potential outcomes, propensity scores, directed acyclic graphs, inverse probability of treatment weighting, marginal structural models and structural nested models.  Time permitting, the course may also briefly survey a number of other topics such as instrumental variables, the estimation of direct and indirect effects, sensitivity analysis and bounds for causal effects.

STATISTICS 33700=GSBC 41914=ECON 31500.  Multivariate Time Series Anal.
Sec 01: Ruey-Shiong Tsay, Wed, 8:30-11:30 AM
PQ: Business 41910 or equivalent; = STAT 33700 and ECON 31500
Required Reading: None.

Course website: http://faculty.chicagogsb.edu/ruey.tsay/teaching/mts

A useful web site for U.S. data: Fed. Res. at St Louis: http://research.stlouisfed.org/fred2/

Course Objective:
1. To study the basic theory of multivariate processes
2. To gain experience in analyzing multivariate time series data
3. To learn multivariate time series models, including vector AR and ARMA models with exogenous variables
4. To understand co-integration and error-correction models
5. To study structural specification of a vector process
6. To learn state-space models and Kalman filter.

No textbook is assigned. Lecture Notes available on course web.

Grading:
Mid-term (40%), Final project (40%), and homework assignments (20%).

Computing:
The key software packages are SCA and S-Plus, but you may use any software of your choice.

Course Outline: All topics include applications
1. Transfer function models
2. Stationary vector autoregressive and moving average models
3. Estimation, modeling, and forecasting
4. Unit-root, co-integration and error-correction models
5. Multivariate Seasonal models
6. Structural specification
7. State-space model and Kalman filter
8. Multivariate volatility models if time permits.


STATISTICS 34700.  Generalized Linear Models.
Sec 01: Peter McCullagh, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: STAT 34300 or consent of instructor.
Required Reading: McCullagh & Nelder, Generalized Linear Models, 2nd edition. Chapman & Hall/CRC. ISBN: 0-412-31760-5.
Recommended Reading: Venables & Ripley, Modern Applied Statistics w/S, 4th edition, Springer. ISBN: 0-387-95457-0.
Cox & Snell (2000), Applied Statistics, Chapman & Hall/CRC. ISBN: 0-412-16570-8.

STATISTICS 35500.  Statistical Genetics.
Sec 01: Mary Sara McPeek, Friday, 1:30-4:00 PM, Eckhart 117.
PQ: Consent of instructor.
Required Reading: None.

This is an advanced course in statistical genetics. Prerequisites are Human Genetics 47100 and Statistics 24400 and 24500.  Students who do not meet the prerequisites may enroll on a P/NP basis with consent of the instructor. This is a discussion course and student presentations will be required. Topics vary and may include, but are not limited to, statistical problems in association mapping, linkage mapping, population genetics, microarray analysis, and genetic models for complex traits.

STATISTICS 36700=STAT 26700=CHSS 32900=HIPS 25600.  History of Statistics.
Sec 01: Stephen M. Stigler, MWF, 9:30-10:20 AM, Eckhart 203.
PQ: A course in Statistics.
Recommended Reading: Stephen M. Stigler, The History of Statistics: The Measurement of Uncertainty Before 1900. (Cambridge, Mass.: Harvard University Press, 1986.) (Available in paperback.) Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

STATISTICS 37800.  Statistical Computing.
Sec 01: Dongping Fang, TTH, 1:30-2:50 PM, Eckhart 117.
PQ: Experience with R or Splus programming.
Required Reading: None.

This course emphasizes practical aspects of designing statistical algorithms for implementation. Throughout the course, some basic computer arithmetic, accuracy, linear algebra and optimization methods are covered, some selected algorithms like linear regressions, multinomial logistic regression, neural networks, classification tree, clustering will also be introduced and used for illustration. Note that this course doesn't cover program languages like C++, Java, etc.

References:

  • Lange, K. (1999). Numerical Analysis for Statisticians. Springer.

  • Press, W. H., Teukolsky, S. A., Vetterling, W. T., and Flannery, B. P. (1992). Numerical Recipes in C: The Art of Scientific Computing. (or Numerical Recipes in Fortran: The Art of Scientific Computing.) Second Edition. Cambridge University Press.

  • Thisted, R. A. (1988). Elements of Statistical Computing. Chapman and Hall.

  • Gray, R. (2003), Advanced Statistical Computing (BIO 248 cd Course Notes, http://www.stat.wisc.edu/~mchung/teaching/stat471/stat_computing.pdf)

  • Micah Altman, M., Gill, J., and McDonald, M. P. (2004). Numerical Issues in Statistical Computing for the Social Scientist. John Wiley & Sons


STATISTICS 38600.  Topics in Stochastic Processes.
Sec 01: Steven P. Lalley, TTH, 3:00-4:20 PM,Eckhart 117
PQ: STAT 23400 and STAT 23500 or STAT 24400 or consent of instructor.
Required Reading:
Topics:

Wiener process (Brownian motion)
Weak Convergence in Function Space
Ito Integral and Ito Calculus
Introduction to Diffusion Processes

Recommended Reading:

Stochastic Differential Equations by Oksendal (PG)
Brownian Motion and Stochastic Calculus by Karatzas and Shreve (R)
Foundations of Modern Probability by Kallenberg (X)


WINTER 2007

STATISTICS 22600=HSTD 32600. Analysis of Categorical Data.
Sec 01: Mei Wang, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: STAT 22000 or equivalent.
It is expected that the students have a good understanding of basic descriptive statistics such as means, variances and expectation, of the inferential notions of estimate, confidence intervals and significance or hypothesis testing. Familiarity with one statistical package, e.g. Stata, Sas, Splus or R, Spss, Minitab and ability to access Web sites and to download files from the Web are required. Stata will be used in this course for Winter 2007.
Required Reading:
Agresti, A. An introduction to Categorical Data Analysis. Wiley, 1996.

This course is an introduction to the theory and applications of statistical methods for investigating the relationships among discrete variables. The course will present methods for analyzing categorical data, standard methods for contingency tables such as odds ratios, tests of independence and various measures of association, generalized linear models for binary data and count data, logistic regression for binomial data, loglinear models for Poisson data. The statistical techniques discussed will be presented by many real examples involving both physical and social science data.

Back to top

STATISTICS 22700=HSTD32700. Biostatistical Methods.
Sec 01: Ronald A. Thisted, TTH, 10:30-11:50 AM, BSLC Arr
PQ: HSTD 32400/STAT 22400 or equivalent; or consent of instructor.
Reading:

This course is designed to provide students with tools for analyzing categorical, count and time-to-event data frequently encountered in medicine, public health and related biological and social sciences. The course will emphasize application of the methodology rather than statistical theory, including recognition of the appropriate methods, interpretation and presentation of results. Methods covered include: contingency table analysis, Kaplan-Meier survival analysis, Cox proportional-hazards survival analysis, logistic regression, Poisson regression.

Back to top

STATISTICS 23400.  Statistical Models and Methods.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300
Required Reading: Chance and Rossman (2005). Investigating Statistical Concepts, Applications, and Methods, First Edition.  Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.
Downing and Clark (1996). Forgotten Statistics. Barron's Educational Series, ISBN: 0-8120-9713-0

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24400. Statistical Theory and Methods I.
Sec 01: Michael Stein, TTH, 10:30-11:50 AM, Eckhart 133.
PQ:  MATH 19600, 20100, or 20500
Required Reading: Rice, John A. (2007). Mathematical Statistics and Data Analysis, Third Edition, by (Duxbury).

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the basics -- tools from probability and the elements of statistical theory. Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology.

There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited and brief, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation.

The mathematics prerequisites are listed as general guidance. You should be comfortable with multivariate calculus through partial differentiation and multiple integration.

Back to top

STATISTICS 24500. Statistical Theory and Methods II.
Sec 01: Wei Biao Wu, TTH, 1:30-2:50 PM, Eckhart 133 .
PQ:  STAT 24400 or consent of instructor.
Required Reading: Rice, J. A. (1995). Mathematical Statistics and Data Analysis, 2nd ed., Duxbury.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 25200=STAT 31200.  Introduction to Stochastic Processes I.
Sec 01: Steven P. Lalley/Per A. Mykland, TTH, 10:30-11:50 AM, Eckhart 312.
PQ:  STAT 25100 or consent of instructor
Required Reading: Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

Stochastic processes provide models for random events that evolve in time and may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chains, renewal theory, random walks, queueing and branching processes.

Back to top

STATISTICS 25300=STAT 31700.  Introduction to Probability Models.
Sec 01: Mei Wang, TTH, 9:00-10:20 AM, Eckhart 133.
PQ:  Consent of instructor
Required Reading:Ross, R. (2003). Introduction to Probability Models, 9th ed.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Following a brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 30100. Mathematical Statistics I
Sec 01: Mary Sara McPeek, TTH, 1:30-2:50 PM, Eckhart 117.
PQ:  STAT 30400 and MATH 20500 or consent of instructor
Required Reading: Casella and Berger. Statistical Inference, 2nd ed.
Ferguson. A Course in Large Sample Theory.

This course is part of a two-quarter sequence on the theory of statistics. Topics will include exponential families, quadratic forms of multivariate normal, asymptotics of order statistics, sufficiency and completeness, the likelihood function, methods of point estimation, and asymptotic properties of maximum likelihood estimates. Other topics (e.g. Bayesian methods and methods for dependent observations) may be covered if time permits.

Back to top

STATISTICS 31200=STAT 25200.  Introduction to Stochastic Processes I.
Sec 01: Steven P. Lalley/Per A. Mykland , TTH, 10:30-11:50 AM, Eckhart 312.
PQ:  STAT 25100 or consent of instructor
Required Reading: Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

Stochastic processes provide models for random events that evolve in time and may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chains, renewal theory, random walks, queueing and branching processes.

Back to top

STATISTICS 31700=STAT 25300.  Introduction to Probability Models.
Sec 01: Mei Wang, TTH, 9:00-10:20 AM, Eckhart 133.
PQ:  Consent of instructor
Required Reading: Ross, R. (2003). Introduction to Probability Models, 9th ed. or most current.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Following a brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 32500=GSBC 41902.  Statistical Inference.
Sec 01: Nicholas G. Polson, Wed 8:30-11:30 AM, GSB C10.
PQ:  Business 41901=STAT 32500
Reading: DeGroot and Scherviah, Probability and Statistics. Lecture notes will be provided in the form of a CoursePack.

This Ph.D.-level course is the second in a two-quarter sequence with Business 41901. The central topic is statistical inference. The topics covered include Bayesian inference, classical estimation, decision theory, MCMC methods and Hierarchical models. The use of hierarchical models is a focus in applications.

Back to top

STATISTICS 33500=GSBC 41910=ECON 31400.  Time Series Analysis.
Sec 01:  Jeffrey R. Russell, Fri 1:30-4:30 PM, GSB HP3B.
PQ:  Business 41901 or consent of Instructor
Reading: Hamilton, “Time Series Analysis”, is available at the bookstore.
Software: We will use the student version of the Eviews software.
Here is the the company: http://www.eviews.com/
The student version is here: http://www.eviews.com/eviews4/eviews41s/evstud41.html

Topics Covered:

  • Difference Equations and Lag Operators
  • Stationary ARMA models
  • Maximum Likelihood Estimation and Inference
  • Spectral Analysis
  • Forecasting and Forecast Evaluation
  • GARCH and Stochastic Volatility Models for Time Varying Volatility
  • Unit Roots and Time Trends
  • Vector Autoregressions (VARs)
  • Cointegration

Grades: Grades will be determined by homework (15%), a midterm (35%) and a final
exam (50%).

Back to top

STATISTICS 33800.  Statistical Inference for Financial Data.
Sec 01:  Per A. Mykland, TTH, 1:30-2:50 PM, Ryerson 358.
PQ:  Consent of instructor
Required Reading:

Financial data is commonly modeled by diffusion, jump-diffusion, and related models, and it is usually supposed that observation is discrete. The course is concerned with inference in such settings. We shall be reading papers, and also get some of the mathematical background from the texts. We shall not focus so much on the financial application, but rather the econometrics of these data.

The format is a mixture of lectures and student presentations.

The course is primarily intended for second year graduate students in Statistics, and also students with similar background in Econometrics or Finance. It is recommended, but not absolutely required, that students have taken Stat 30400-30100-30200 and either Stat 31200-31300 or Stat 38100-38300. Equivalent courses are also OK. If you have never taken a finance course, you may consider taking one concurrently (whether in the GSB, Economics, or Statistics), though this is not required.

Back to top

STATISTICS 34500.  Design and Analysis of Experiments.
Sec 01:   Michael L. Stein, MW, 1:30-2:50 PM, Eckhart 133.
PQ: STAT 34300
Reading: Mead, R., The Design of Experiments.

An introduction to the methodology and application of linear models in experimental design. A major focus of the course will be the basic principles of experimental design, such as blocking, randomization and incomplete layouts. Both standard designs, such as fractional factorials and incomplete block designs, as well as nonstandard designs, will be studied within this context. The analysis of these experiments will be developed as well, with particular emphasis on careful model formulation and the role of fixed and random effects. Time permitting, additional topics may include the use of covariates in the analysis of designed experiments, spatial analysis of field trials and Bayesian approaches to analysis of experimental data.

Course work will include the planning, execution and analysis of an experiment by the class.

Back to top

STATISTICS 35201=HSTD 32901. Introduction to Clinical Trials.
Sec 01:  James Dignam, TTH, 1:30-2:50 PM, BSLC.
PQ: HSTD 32100; STAT 22000; introductory statistics; or consent of instructor
Reading:

This course will review major components of clinical trial conduct, including the formulation of clinical hypotheses and study endpoints, trial design, development of the research protocol, trial progress monitoring, analysis, and the summary and reporting of results. Other aspects of clinical trials to be discussed include ethical and regulatory issues in human subjects research, data quality control, meta-analytic overviews and consensus in treatment strategy resulting from clinical trials, and the broader impact of clinical trials on public health.

Back to top

STATISTICS 35700=HSTD 31001. Epidemiologic Methods.
Sec 01:  Diane Lauderdale, Ronald Thisted, TTH, 12:00-1:20 PM, Location: BSLC 202.
PQ: HSTD 30900 or consent of instructor
Reading:

This course expands on the material presented in "Principles of Epidemiology," further exploring issues in the conduct of epidemiologic studies. The student will learn the application of both stratified and multivariate methods to the analysis of epidemiologic data. The final project will be to write the "specific aims" and "methods" sections of a research proposal on a topic of the student's choice.

Back to top

STATISTICS 37700.  Statistical Machine Learning.
Sec 01: Gilles Blanchard, TTH, 3:00-4:20 PM, Eckhart 117.
PQ: Consent of instructor
Reading: The course will borrow material from different sources, including:

L. Devroye, L. Gyorfi, G. Lugosi, A probabilistic Theory of Pattern recognition, Springer

R.Duda, P.Hart, D.Stork, Pattern classification, Wiley

Advanced lectures on machine learning, Springer lecture notes in computer science (2 volumes)

There is no *required* reading. The following source is recommended:

G. Lugosi, Pattern classification and learning theory.
in Principles of nonparametric learning, Springer, p. 1-56, also available at
http://tornasol.upf.es/~lugosi/lecturenotes.ps.

This course will have two main focuses: presenting different methods of machine learning and introducing the main mathematical tools of statistical learning theory, which allow to study these methods from a statistical perspective.

Machine learning applies to complex data for which standard statistical approaches are not appropriate, for example in very high dimension or for higly structured, non numerical data. We will introduce several popular machine learning methods, such as nearest-neighbor, decision trees, Support Vector Machines and other kernel methods, ensemble and boosting methods.

Statistical learning theory is a generic statistical and mathematical framework into which the performance of the above methods can be studied, and for which the notion of model complexity plays a central role. We will introduce traditional learning theory, Vapnik-Chervonenkis dimension, as well as some more recent developments such as Rademacher complexities and randomized approaches.

Depending on the interest of the students the emphasis can shift on one or the other part. A solid background in basic probability and undergraduate level math is expected.

Back to top

Statistics 39100=FINM 34600.  Stochastic Calculus/Finance II
Sec 01: Steven P. Lalley/Per A. Mykland, W, 6:00-9:00 PM, Eckhart 202.
PQ: Math Finance Students Only
Required Reading:  Stochastic Calculus for Finance I-II, by Steven E. Shreve (required).
A Course in Financial Calculus, by Alison Etheridge (highly recommended).
`The basics of S-PLUS'', 3rd edition, by the same authors (A. Krause and M. Olson), ISBN 0-387-95456-2 (required).

This course is an introduction to stochastic calculus as it is relevant to the pricing and hedging of
options and other derivative securities. It is the first of a two-quarter sequence offered in collaboration
by the Department of Statistics and the master's program in Mathematical Finance. The main topics to
be covered are:

  • The Fundamental Theorem of Asset Pricing
  • Martingales
  • Brownian Motion
  • The Ito Integral and Ito's Formula
  • The Black-Sholes Formula
  • Girsanov's Formula
  • Currency Options
  • The Martingale Representation and Hedging

There will be weekly homework assignments, and midterm and final exams. The course assistants will conduct weekly help sessions on Friday afternoons.

Back to top

STATISTICS 47900. Stochastic Models for Memory and Learning.
Instructor: Yali Amit

Time: MW, 2:30-3:50 PM
Location: Eckhart 117
PQ: Consent of instructor
Required Reading: None

This 5 week course will cover a some of the literature analyzing learning and memory in large neuronal populations as stochastic processes. First we will discuss models with discrete time dynamics, discrete binary neurons and finite state synapses, and derive bounds on memory capacity, learning and forgetting times. This will only involve discrete time Markov chain analysis and some ideas from mean-field analysis. Second we will introduce continuous integrate and fire neurons and continuous time dynamics. Using mean-field methods we will analyze the stability of large networks with random connections, and the behavior of the networks after learning. People interested in probability will be exposed to a rich collection of stochastic models waiting to be analyzed in a rigorous mathematical framework (the mean field analysis is only approximate.) People interested in neuroscience will be exposed to interesting models and some intriguing connections to experimental data.

Back to top

AUTUMN 2006

STATISTICS 22400=HSTD 32400. Applied Regression Analysis.
Sec 01: Vanja M. Dukic, TTH, 10:30-11:50 AM,Eckhart 133.
PQ: STD 32700 or STAT 22000 or STAT 23400 or STAT 24400 or consent of instructor.
Required Reading: Chatterjee, Hadi, and Price, Regression Analysis by Example, 4th Edition. (Note: the 3rd edition can also be used.)
STATA help sessions:
Note that these sessions are optional, and are designed to help you get familiar with Stata enough to solve homework problems (bring your questions!). All sessions within one week are identical. All notes and worksheets used in these sessions will be posted on this website. We have 3 sessions, in hope that each of you can fit one every week into your schedules:
Mondays 12pm-1pm in BSLC 018.
Tuesdays 12pm-1pm in BSLC 018.
Wednesdays 2pm-3pm in BSLC 018.

This course is an introduction to the methods and applications of fitting and interpreting multiple regression models. The main emphasis is on the method of least squares. Topics include the examination of residuals, the transformation of data, strategies and criteria for the selection of a regression equation, the use of dummy variables, and tests of fit. The techniques discussed will be illustrated by many real examples involving biological and social science data. Examples and exercises will be implemented in a statistical software package "Stata", but familiarity with Stata is not required.

Back to top

STATISTICS 23400.  Statistical Models and Methods.
Sec 01:  Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Omar De la Cruz, MWF, 2:30-3:20 PM, Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300
Required Reading: Chance and Rossman (2005). Investigating Statistical Concepts, Applications, and Methods, First Edition.  Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Univariate calculus and computer simulation are used throughout the course to investigate statistical concepts and their mathematical underpinnings. One full year of univariate calculus is a prerequisite for the course (Math 13300, 15300, or 16300). Familiarity with at least limits, derivatives and integrals of polynomial and exponential functions, change of variable (substitution) in definite integrals, max-min problems, use of summation notation, and sequences and series as well as a willingness to explore ideas mathematically are key to your success in this course.

Stat 23400 takes the mathematical prerequisite quite seriously. Enrolling concurrently in either Math 13300, 15300, or 16300 while taking Stat 23400 is very strongly discouraged. Further, students who do not feel strong mathematically, may want to wait until completing their entire mathematical requirement (e.g., Math 19500-19600 for Economics majors) before enrolling in Stat 23400. Economics majors are strongly encouraged to delay taking Stat 23400 until the quarter just before enrolling in their required econometrics course (Econ 21000), for which Stat 23400 is a prerequisite. Thus, delaying Stat 23400 until at least late in the second year or even early in the third year of the Economics degree program should not be considered unusual.

Back to top

STATISTICS 24400. Statistical Theory and Methods I.
Sec 01:  Mathias Drton, TTH, 1:30-2:50 PM, Kent 107.
PQ:  MATH 19600, 20100, or 20400
Required Reading: Rice, John A. (1995). Mathematical Statistics and Data Analysis, Second Edition, by (Duxbury).

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the basics -- tools from probability and the elements of statistical theory. Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology.

There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited and brief, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation. Statistical software will be used for simulations and data analysis.

Back to top

STATISTICS 30400. Distribution Theory.
Sec 01:  Dan Nicolae, MWF, 1:30-2:20 PM, Eckhart 117.
PQ: STAT 24500 & MATH 25000 or equivalent
Recommended Reading: Severini, T. (2005). Elements of Distribution Theory. Cambridge University Press.

This course covers the basics of distribution theory. Topics include:

  • Distribution functions and their inverses, quantile functions, Q/Q plots, change-of-variables for probability densities
  • Expectation, variance, median, mode of random variables
    Basics of measure theory, including Fubini's theorem and interchangeability of limits and integrals
  • Moment generating functions and characteristic functions, including power series expansion, inversion formulas, uniqueness theorems, and convergence in distribution
  • Cumulants and cumulant generating functions: examples, properties
  • Different concepts of convergence for random variables
  • Limit theorems, including the weak law of large numbers, the central limit theorem.

Back to top

STATISTICS 30700=CMSC 37800. Numerical Computation.
Sec 01: Yali Amit, MWF, 10:30-11:20 PM, Eckhart 117.
PQ: Consent of instructor and elementary programming experience
Required Reading:  Watkins, Fundamentals of Matrix Computation, Wiley.

We present a rigorous development of the fundamental algorithms for the decomposition of matrices, with applications to least squares problems, and finite dimensional eigenvalue problems. Most of the relevant mathematics will be developed rigorously. Some basics of iterative methods and non-linear optimization will be presented. Time permitting we will also describe the Fast Fourier Transform and the Discrete Wavelet transform. Some basics of C++ programming will be introduced through the implementation of the algorithms.

Topics include:

  1. Gaussian elimination and back-substitution
  2. LU decomposition. (General/Symmetric)
  3. Singular value decomposition. (general/Symmetric)
  4. Householder orthogonalization and QR factorization. (General/Symmetric).
  5. Jacobi transformations for symmetric matrices.
  6. QR decompositions.
  7. Generalized inverses.
  8. Elementary eigenvalue methods for symmetric matrices
  9. Iterative methods: Jacobi and Gauss Seidel.
  10. Conjugate gradient.
  11. Dynamic programming.
  12. The FFT and the Discrete wavelet transform.

A common set of C++ objects will be defined at the start of the course, meant to deal with arrays, and the students will write code for most of the algorithms in C++.

Back to top

STATISTICS 32400=GSBC 41901. Probability and Statistics.
Sec 01:  Nicholas G. Polson, Th, 8:30-11:30 AM, Lecture Hall C10.
PQ: 1 Year of Calculus.
Required Reading: DeGroot and Schervish. Probability and Statistics. Lecture notes will be available in the form of a CoursePack.

This Ph.D.-level course (in addition to 41902) provides a thorough introduction to Classical and Bayesian statistical theory. The two-quarter sequence provides the necessary probability and statistical background for many of the advanced courses in the GSB curriculum. The central topic of Business 41901 is probability. Basic concepts in probability are covered. An introduction to martingales is given. Homework assignments are given throughout the quarter.

Back to top

STATISTICS 32600=GSBC 37904=ECON 31601. Marketing Topics: Bayesian Applications in Marketing and MicroEconometrics.
Sec 01:  Peter E. Rossi, W, 6:00-9:00, Lehman Brothers Classroom-HPC02.
PQ:
Required Reading: 

This course will cover a comprehensive introduction to Bayesian inference with special emphasis on micro-data and marketing applications. The course will be based on the instructor's textbook. Topics include: Bayesian Essentials, Practical MCMC methods, Hierarchical Models, Non-standard Priors, Models for data with Discrete Components, Bayesian treatment of Simultaneity, and Dirichlet Process Priors. The course will also emphasize statistical computing in R. For all models and topics discussed in the course, examples of R/C code will be provided. In the homeworks, students will be asked to modify existing R code and write their own code to extend some of the ideas covered in class. R is the most important statistical language which is similar to, but with more extensive capababilites, as MATLAB.

Back to top

STATISTICS 33100. Sample Surveys.
Sec 01: Kirk Wolter, TTH, 10:30-11:50 AM,Eckhart 117.
PQ: Consent of instructor
Reading:  Cochran, W. G. (1977). Sampling Techniques, Third Edition. Wiley; Wolter, K.M. (1985). Introduction to Variance Estimation, Springer-Verlag, New York.

This is an introductory course to the statistics and methodology of sample surveys. Topics include

  • basic methods of sample selection,
  • determining sample size, stratification,
  • general estimators (Horvitz-Thompson, ratio, generalized regression, calibration),
  • domain estimation,
  • nonresponse,
  • nonsampling error,
  • multiple-stage sampling,
  • a national sampling frame for area probability surveys,
  • telephone surveys,
  • questionnaire design,
  • variance estimation for complex surveys,
  • analysis of contingency tables, and
  • regression analysis for survey data.

The course will be of interest to students who anticipate a research career that designs, collects, and analyzes survey data in fields such as economics, education, healthcare, marketing, psychology, sociology, and statistics.

Back to top

STATISTICS 34300. Applied Linear Stat Methods.
Sec 01: Mathias Drton,TTH, 9:00-10:20 AM, Eckhart 133.
PQ: STAT 24500 & MATH 25000 or equivalent
Optional Reading:  Venables, W.N. and Ripley, B.D. (1999). Modern Applied Statistics with S-Plus (3rd ed). Springer-Verlag.
Required Reading: Weisberg, S. (2005). Applied Linear Regression, Third Edition. John Wiley & Sons.
Software: Splus or R.

Statistics 34300 is an intensive course in the theory and methods of linear regression and related techniques of statistical modelling. It is intended primarily for graduate students in Statistics and related fields.

The course is also open to undergraduates and others who have a solid understanding of matrix algebra and basic statistical theory. Thorough familiarity with the simple linear regression model is expected.

The course will review linear regression with a single predictor, and will cover the multiple-predictor case; least-squares estimation; associated distribution theory; estimation, confidence intervals and tests; regression with errors in the predictors; weighted least squares, assessing lack of fit; residual analysis; regression diagnostics; transformations; model building; collinearity; subset-selection methods, including stepwise regression; prediction; nonlinear least squares.

Back to top

STATISTICS 35000=HSTD 30900=ENST 27400=PPHA 36400. Principles of Epidemiology.
Sec 01:  Lianne Kurina, TTH, 9:00-10:20 AM, BSLC (to be announced).
PQ: Introductory statistics recommended, may be taken concurrently
Required Reading: 

Epidemiology is the study of the distribution and determinants of health and disease in human populations. This course introduces the basic principles of epidemiologic study design, analysis, and interpretation, through lectures, assignments, and critical appraisement of both classic and contemporary research articles. The course objectives include: (1) To be able to critically read and understand epidemiologic studies; (2) To be able to calculate and interpret measures of disease occurrence and measures of disease-exposure associations; and (3) To understand the contributions of epidemiology to clinical research, medicine and public health.

Back to top

STATISTICS 36900=HSTD 33300. Longitudinal Data Analysis.
Sec 01:  Paul Rathouz, TTH, 9:00-10:20 AM, BSLC (to be announced).
PQ: HSTD 32100; STAT 22000; or equivalent, and HSTD 32400/STAT 22400 or equivalent; or consent of instructor
Required Reading: Diggle, P.J., Heagerty, P., Liang, K.-Y., & Zeger, S.L. (2002). Analysis of Longitudinal Data, Second Edition. Oxford: Oxford University Press.

Longitudinal data consist of multiple measures over time on a sample of individuals. This type of data occurs extensively in both observational and experimental biomedical and public health studies, as well as in studies in sociology and applied economics. This course will provide an introduction to the principles and methods for the analysis of longitudinal data. While some theoretical statistical detail is given (at the level appropriate for a Master's student in Statistics), the primary focus will be on data analysis and interpretation. Problems will be motivated by applications in epidemiology, clinical medicine, health services research, and disease natural history studies .

Back to top

STATISTICS 38100. Measure-Theoretic Probability I.
Sec 01: Michael J. Wichura, MWF, 2:30-3:20 PM, Eckhart 117.
PQ: STAT 31300 or consent of instructor
Required Reading:  There is no text required or recommended. Notes will be provided.

This course is the first of a three quarter sequence presenting a careful development of some topics from measure and probability. Topics to be covered in 381 include: classes of sets -- fields, sigmafields, monotone classes, pi and lambda systems; probabilities and general measures; independence and the Borel-Cantelli lemmas; measurable functions; induced measures, distribution and inverse distribution functions; integration with respect to measures -- basic properties, change of variable, indefinite integration, densities; integration to the limit -- MCT, DCT, and friends; laws of large numbers, applications to probability and statistics; transition probabilities and product measures.

Back to top

SPRING 2006

STATISTICS 22200. Linear Models and Experimental Design.
Sec 01:  Mei Wang, TTH, 9:00-10:20 AM, Eckhart 133.
PQ: Statistics 22000 or consent of instructor
Required Reading: Oehlert, G. W. (2000) A First Course in Design and Analysis of Experiments. W. H. Freeman.  ISBN: 0-7167-3510-0
Optional Reading: Hamilton, L. C. (2003) Statistics with Stata (Updated for Version 7). Duxbury Press.

This course will introduce the students to the major statistical issues in the design of experiments and the analysis of experimental data. The major topics will be

  • The basic principles of experimental design: randomization, blocking, and balance.
  • Important classes of designs: matched pairs, complete factorial designs, Latin squares, Graeco-Latin squares, fractional factorial designs and confounding.
  • Analysis of covariance and inference from experimental data.

In addition to regular homework assignments and exams, students will be required to complete a project involving the design and analysis of an experiment of their own.

Back to top

STATISTICS 23400.  Statistical Models and Methods I.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Linda B. Collins, MWF, 2:30-3:20 PM, Eckhart 133.
PQ: Mathematics 13300, 15300 or 16300
Required Reading: Chance and Rossman (2006). Investigating Statistical Concepts, Applications, and Methods, 1st edition.  Duxbury (Thomson Brooks/Cole), ISBN: 0-4950-5064-4.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Back to top

STATISTICS 24600. Statistical Theory and Methods III.
Sec 01:  Marc Coram, TTH, 10:30-11:50 AM, Eckhart 133.
PQ: Statistics 23400 and Statistics 23500 or Statistics 24400 and 24500, or consent of instructor
Required reading:
Givens and Hoeting (2005). Computational Statistics. 1st edition. Wiley.  ISBN: 0471-46124-5

This course is the third part of a sequence that introduces mathematical statistics and probability. In this quarter, the emphasis is on modern computer-intensive statistical methods.

In the first part of the course, the impact of missing data on statistical analyses is considered and algorithms for iterative maximum likelihood estimation, such as the EM and the Newton-Raphson algorithms, and for Bayesian computation, such as Data Augmentation, are introduced in certain examples. Additionally, Markov Chain Monte Carlo methods are outlines in a more general framework of Bayesian inference.

The second part of the course presents an overview of the bootstrap and related nonparametric methods for assessing statistical accuracy.

Knowledge of probability distributions, random variables, and estimation techniques such as maximum likelihood from Statistics 24400 and 24500 is essential for this class.

Back to top

STATISTICS 25100. Introduction to Mathematical Probability.
Sec 01:   Michael Wichura, TTH, 4:30-5:50 PM, Eckhart 133.
PQ:  Math 20000 or 20400 or consent of instructor
Reading: Ross, Sheldon (2006), A First Course in Probability, Seventh Edition. Pearson/Prentice-Hall. ISBN: 0-13-185662-6

Probability theory originated in the consideration of gambling problems, but has become an important tool for scientists, engineers, medical practitioners, lawyers, and people working in business. A wide variety of phenomena are characterized by randomness and uncertainty, which is measured by probability. Probability models also play a fundamental role in the statistical analysis of data. The aim of the course is to provide an introduction to the elementary concepts of probability. The first part of the course will introduce the student to the basic ideas used to describe aspects of randomness, such as events, random variables, independence, and conditional probability. The remainder of the course focuses on the methods, calculation, and applications of probability. The topics treated are: combinatorics, discrete and continuous distributions, density functions, distribution theory, calculation and interpretation of moments, covariance and correlation, the classical central limit theorem.

Back to top

STATISTICS 26100  Time Dependent Data.
Sec 01:  Michael Stein, MWF 10:30-11:30 AM, Eckhart 117.
PQ: Students will need some exposure to linear modeling (Statistics 22400, 24500, or consent of instructor).
Reading: Janacek ((2001), Practical Time Series, Arnold, London. ISBN: 0-340-71999-0.
Chatfield, The Analysis of Time Series, Chapman & Hall/CRC.  ISBN: 1-58488-317-0.

This course considers the modeling and analysis of data that are ordered in time. The main focus will be on quantitative observations taken at evenly spaced intervals and will include both time-domain and spectral approaches. Time permitting, statistical approaches to other data types, such as categorical observations or point processes, will be considered.

Back to top

STATISTICS 30200. Mathematical Statistics II.
Sec 01: Marc Coram, TTH, 3:00-4:20 PM,   Eckhart 117
PQ:  Statistics 30100 or consent of instructor
Reading:  Casella and Berger. Statistical Inference, Second edition. Duxbury Press. ISBN: 0534243126

This course continues the development of mathematical statistics. Topics of importance include: statistical decision theory, admissability and the Neyman-Pearson lemma, formal hypothesis testing, comparison with conditional frequentist and Bayesian viewpoints, ancillarity, interval estimation, UMP tests and MLR, unbiased tests, score statistics, generalized liklihood ratio tests and asymptotics, minimax estimation, empirical Bayes and "shrinkage".

Back to top

STATISTICS 30800. Advanced Statistical Inference II.
Sec 01:  Dan Nicolae, TTH, 9:00-10:20 AM, Eckhart 117.
PQ: Consent of instructor
Reading: None

The focus of the course will be on non-parametric statistics and statistical learning.

Topics for discussion include

  • Classical Nonparametric Statistics
  • Empirical Likelihood
  • Resampling Techniques
  • Supervised and Unsupervised Learning

Actual topics covered will depend on interests of students and instructor.

Back to top

STATISTICS 31300. Introduction to Stochastic Processes II.
Sec 01:  Steven Lalley, TTH, 10:30-11:50 AM, Eckhart 117.
PQ: Statistics 31200 or consent of instructor
Reading:  Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

This course is a continuation of Statistics 31300: Introduction to Stochastic Processes I. Topics to be discussed will include generating functions, Galton-Watson processes, continuous-time Markov chains, martingales, and Brownian motion. There will be weekly homework assignments, and midterm and final exams.

Back to top

STATISTICS 34700. Generalized Linear Models.
Sec 01: Peter McCullagh, MW, 1:30-2:50 PM, Ryerson 352.
PQ: Statistics 34300 or consent of instructor
Reading:  McCullagh & Nelder (1989). Generalized Linear Models, Second Edition. Chapman & Hall/CRC.  ISBN: 0-412-31760-5
Venables and Ripley. Modern Applied Statistics W/S, Fourth Edition. Springer. ISBN: 0-387-95457-0
Cox & Snell (2000). Applied Statistics, Chapman & Hall/CRC. ISBN: 0-412-16570-8

This is an applied course for students who are familiar with linear models at the level of Draper and Smith or Weisberg. The following topics will be covered:

Factors, variates, contrasts, interactions
Exponential-family models: variance function
Definition of a generalized linear model: link functions
Analysis of deviance
Specific examples of GLMs

logistic and probit regression
cumulative logistic models
log-linear models and contingency tables
inverse linear models

Quasi-likelihood and least squares; estimating functions
Over-dispersion
Partially linear models.

Back to top

STATISTICS 35500. Statistical Genetics.
Sec 01: Mary Sara McPeek, F, 1:30-3:50 PM, Eckhart 117.
PQ: Human Genetics 47100 and Statistics 24400 and 24500 or consent of instructor
Reading:

This is an advanced course in statistical genetics. Prerequisites are Human Genetics 471 and Statistics 244 and 245. Students who do not meet the prerequisites may enroll on a P/NP basis with consent of the instructor. This is a discussion course and student presentations will be required. Topics vary and may include, but are not limited to, statistical problems in linkage mapping, association mapping, map construction, and genetic models for complex traits.

Back to top

STATISTICS 36300. Statistics for Dependent Data.
Sec 01: Wei Biao Wu, MW, 1:30-2:50 PM, Eckhart 117.
PQ: Statistics 30400, 38100 and 31200
Reading: See below.

Many deep statistical results have been obtained under the assumption that the observations are independent. However, applications from engineering, economics, geophysics, meteorology and other natural sciences suggest that the observations are dependent, and the dependence is the rule rather than the exception. So a statistical theory is needed to infer the underlying dependence structure for the purpose of interpretation, inference and prediction. In the course I will provide a novel look at the issue of dependence and introduce new dependence measures so that based on which a statistical theory can be built. In particular I will discuss parametric and nonparametric inference, time and frequency domain approaches, empirical processes, asymptotic representations, robust inference, unit root testing and other issues. New research problem will be proposed.

Readings:

  1. Masanobu Taniguchi, Yoshihide Kakizawa (2000) Asymptotic Theory of Statistical Inference for Time Series (Springer Series in Statistics)
  2. Fan J and Yao Q (2003) Nonlinear Time Series : Nonparametric and Parametric Methods (Springer Series in Statistics)
  3. Wu W. B. (2005) "Nonlinear system theory: Another look at dependence", Proceedings of the National Academy of Sciences 102, 14150--14154
  4. Wu W.B. (2005) On the Bahadur representation of sample quantiles for dependent
    sequences. Ann. Stat. 1934-1963
  5. Wei Biao Wu and Jan Mielniczuk (2002) Kernel density estimation for linear processes. Ann. Stat.1441-1459
  6. Tong, Howell (1990) "Non-linear time series: a dynamical system approach" Oxford University Press

Back to top

STATISTICS 37300. Graphical Models and Algebraic Statistics.
Sec 01: Mathias Drton, TTH, 1:30-2:50 PM, Eckhart 117.
PQ:   Consent of instructor
Reading: The course will draw on material from different sources, which will include:
  • D. Edwards (2000). Introduction to graphical modelling. 2nd edition. Springer, New York.
  • L. Pachter, B. Sturmfels (2005). Algebraic statistics for computational biology, Cambridge University Press.

The goal of this course is to provide an introduction to graphical models and algebraic techniques that are useful in the study of these models.

In graphical modelling, one associates, in a mathematically rigorous way, a statistical model with a graph: nodes represent variables and edges indicate some form of dependence between variables. This framework encompasses many classic statistical models and is popular in many applied areas, including in particular computational biology. Since the parameterizations and constraints defining graphical models are of polynomial nature, algebraic techniques introduced in the course can be applied to gain insight into structural and computational properties of a model.

A few weeks into the course students will form small research groups, each of which will work on a course project. Depending on student interests projects may be of applied, methodological, or theoretical nature. While some basic mathematical or statistical maturity is expected, students with different backgrounds are welcome to attend the class, but all are expected to work hard and collaborate with each other.

Back to top

STATISTICS 38500. Advanced Topics in Probability: Brownian Motion and Stochastic Calculus.
Sec 01: Steven Lalley, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: Consent of instructor.
Reading: None.

Topics for Statistics 38500 will include:

Continuous-time martingales
Brownian motion
Weak convergence and Donsker's functional central limit theorem
Ito integral and stochastic calculus
Stochastic differential equatioins and diffusions

Back to top

WINTER 2006

STATISTICS 22600=HSTD 32600.  Analysis of Categorical Data.
Sec 01:  Peter Radchenko, TTH 3:00-4:20 PM ,  Eckhart 133.
PQ: STAT 22000 or equivalent. It is expected that the students have a good understanding of basic descriptive statistics such as means, variances and expectation, of the inferential notions of estimate, confidence intervals and significance or hypothesis testing. Familiarity with a statistical package such as Stata, Sas, Splus, Spss, or Minitab is required.
Reading: Agresti, A. (1996). An introduction to Categorical Data Analysis, Wiley,
Optional Reading: Hamilton, L. C. (2003). Statistics with Stata (Updated for Version 8), Duxbury Press, 2003.

This course is an introduction to the theory and applications of statistical methods for investigating the relationships among discrete variables. It will present methods for analyzing categorical data, standard methods for contingency tables such as odds ratios, tests of independence and various measures of association, generalized linear models for binary data and count data, logistic regression for binomial data, loglinear models for Poisson data. The statistical techniques discussed will be presented by many real examples involving both physical and social science data.

Back to top

STATISTICS 22700=HSTD 32700.  Biostatistical Methods.
Sec 01: Ronald A. Thisted,TTH 10:30-11:50 AM, BSLC 202.
PQ: HSTD 32400/STAT 22400; or equivalent; or consent of instructor
Reading:

This course is designed to provide students with tools for analyzing categorical, count and time-to-event data frequently encountered in medicine, public health and related biological and social sciences. The course will emphasize application of the methodology rather than statistical theory, including recognition of the appropriate methods, interpretation and presentation of results. Methods covered include: contingency table analysis, Kaplan-Meier survival analysis, Cox proportional-hazards survival analysis, logistic regression, Poisson regression.

Back to top

STATISTICS 23400.  Statistical Models and Methods I.
Sec 01: Linda B. Collins, MWF, 9:30-10:20 AM, Kent 107.
PQ: MATH 13300, 15300 or 16300
Required Reading: Chance and Rossman (2006). Investigating Statistical Concepts, Applications, and Methods, Duxbury (Thomson Brooks/Cole), ISBN: 0-495-01655-1.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena. Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400..

Back to top

STATISTICS 23500.  Statistical Models and Methods II.
Sec 01: Linda B. Collins, MWF, 11:30-12:20 PM, Eckhart 133.
PQ: STAT 23400 or consent of instructor
Required Reading: Tamhane and Dunlop (2000). Statistics and Data Analysis: from Elementary to Intermediate, Prentice Hall, ISBN: 0-13-744426-5.

This is the second quarter of a two-quarter sequence. This course continues the presentation basic ideas of probability theory and statistics begun in STAT 23400, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Topics include repeated-sampling frequentist inference; consisting of methods for count data, ANOVA, and multiple regression, as well as an introduction to Bayesian inference. Additional topics, such as experimental design, non-parametric statistics and maximum likelihood estimation are introduced as time permits. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus, but is less demanding than that required by STAT 24400-24500. Other than the mathematical level, the content of the two sequences are similar.

Back to top

STATISTICS 24500.  Statistical Theory and Methods II.
Sec 01:  Mathias Drton, TTH 1:30-2:50 PM, Eckhart 133.
PQ: STAT 24400 or consent of instructor
Recommended Reading:  Rice, J. A. (1995). Mathematical Statistics and Data Analysis, 2nd ed., Duxbury.

This is the second quarter of a two-quarter sequence. Enrollment in the second quarter alone is permitted, although not recommended. The first quarter covered the basics -- tools from probability and the elements of statistical theory. The second quarter will cover statistical methodology, including the data transformation, regression phenomena, t-tests, analysis of variance, linear regression, correlation, and some multivariate distribution theory. Some principles of data analysis will be introduced, and an attempt will be made to present ANOVA and regression in a unified framework. Much of the material is covered in chapters 10-12 and 14 of the text, but other viewpoints and derivations will be introduced as well. The computer will be used in the second quarter. Some mathematical maturity will be assumed, to the level of calculus. The computer will be used for data analysis and simulation.

Back to top

STATISTICS 25300=STAT 31700. Introduction to Probability Models.
Sec 01: Mei Wang, TTH, 10:30-11:50 AM, Eckhart 117.
PQ: Consent of instructor
Reading:  Ross, R. (2003). Introduction to Probability Models, 8th ed.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Following a brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 30100.  Mathematical Statistics I.
Sec 01:  Mary Sara McPeek, TTH, 1:30-2:50 PM, Eckhart 117.
PQ:  STAT 30400 or consent of instructor
Reading:  Casella and Berger. Statistical Inference, 2nd ed.
Ferguson. A Course in Large Sample Theory.

This course is part of a two-quarter sequence on the theory of statistics. Topics will include exponential families, quadratic forms of multivariate normal, asymptotics of order statistics, sufficiency and completeness, the likelihood function, methods of point estimation, and asymptotic properties of maximum likelihood estimates. Other topics (e.g. Bayesian methods and methods for dependent observations) may be covered if time permits.

Back to top

STATISTICS 31200. Introduction to Stochastic Processes I.
Sec 01:  Per A. Mykland, TTH, 10:30-11:50 AM, Eckhart 133.
PQ:  STAT 25100 or consent of instructor
Reading:  Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

Stochastic processes provide models for random events that evolve in time which may include substantial dependence among observations at different times. The goal of this course is to present a variety of useful models including Markov chain, renewal theory, Brownian motions etc.

Back to top

STATISTICS 31700=STAT 25300.  Introduction to Probability Models.
Sec 01:  Mei Wang, TTH, 10:30-11:50 AM, Eckhart 117.
PQ: Consent of instructor
Reading:  Ross, R. (2003). Introduction to Probability Models, 8th ed.

This course introduces stochastic processes as models for a variety of phenomena in the physical and biological sciences. Following a brief review of basic concepts in probability the course will introduce stochastic processes that are popular in applications in sciences, such as discrete time Markov chain, the Poisson process, continuous time Markov process, renewal process and Brownian motion.

Back to top

STATISTICS 33800.  Statistical Inference for Financial Data
Sec 01: Per A. Mykland, TTH, 1:30-2:50 PM, Ryerson 277.
PQ: Consent of instructor
Reading: 

Financial data is commonly modeled by diffusion, jump-diffusion, and related models, and it is usually supposed that observation is discrete. The course is concerned with inference in such settings. We shall be reading papers, and also get some of the mathematical background from the texts. We shall not focus so much on the financial application, but rather the econometrics of these data.

The format is a mixture of lectures and student presentations.

The course is primarily intended for second year graduate students in Statistics, and also students with similar background in Econometrics or Finance. It is recommended, but not absolutely required, that students have taken Stat 30400-30100-30200 and either Stat 31200-31300 or Stat 38100-38300. Equivalent courses are also OK. If you have never taken a finance course, you may consider taking one concurrently (whether in the GSB, Economics, or Statistics), though this is not required.

Back to top

STATISTICS 34500.  Design & Analysis of Experiments.
Sec 01: Michael Stein, MW, 1:30-2:50 PM, Eckhart 133.
PQ: STAT 34300 or  consent of instructor
Reading: Mead, R., The Design of Experiments.
Recommended Reading:  Cox, D. R. and Solomon, P. J. Components of Variance.

An introduction to the methodology and application of linear models in experimental design. A major focus of the course will be the basic principles of experimental design, such as blocking, randomization and incomplete layouts. Many of the standard designs, such as fractional factorial, incomplete block and split unit designs will be studied within this context. The analysis of these experiments will be developed as well, with particular emphasis on the role of fixed and random effects. Time permitting, additional topics may include response surface analysis, the use of covariates in the analysis of designed experiments, and spatial analysis of field trials.

Back to top

STATISTICS 37900=CMSC 35500=CMSC 25050.  Topics in Computer Vision.
Sec 01: Yali Amit, WF, 2:30-3:50 PM, Ryerson 277.
PQ: Consent of instructor
Reading: Amit, Y. (2002). 2d Object Detection and Recognition: Models, Algorithms and Networks, MIT Press.

Deformable models for detecting objects in images will be discussed in detail. One-dimensional models to identify object contours and boundaries; two-dimensional models for image matching; sparse models for efficient detection of objects in complex scenes. Various mathematical tools needed to define the models and the associated algorithms will be developed. Applications include, detecting contours in medical images, matching brains, detecting faces in images and more. Methods for object recognition and classification related to the sparse detection models will be covered with applications to handwritten character recognition and recognition of rigid objects in scenes. Neural network implementations of some of the algorithms will be presented and some connections to the functions of the biological visual system will be discussed.

Back to top

STATISTICS 38300.  Measure-Theoretic Probability III.
Sec 01: Michael Wichura, MWF, 2:30-3:20 PM, Eckhart 117.
PQ:  STAT 38100 or consent of instructor
Reading:  No text book is required; notes will be distributed in class.

Topics for Stat 38300 will include:

  • The Hahn and Jordan decomposition theorems
  • Modes of convergence: with probability one, in probability, and in mean; uniform integrability
  • L2-spaces: projections; representation of linear functionals
  • The Radon-Nikodym theorem: absolute continuity, Radon-Nikodym derivatives; likelihood ratios; Lebesgue decompositions
  • Conditional expectation: given sub-sigma fields, and given measurable functions
  • Conditional probability: regular conditional probability distributions
  • Martingales: definitions and examples, transformations
  • Stopping times; optional sampling
  • Martingale limit and closure theorems
  • Backward submartingales
  • Continuous-time martingales: convergence, closure, optional sampling.

Back to top

 

STATISTICS 39100=FINM 34600.  Stochasti Calculus/Finance-2.
Sec 01: Jostein Paulsen, W, 6:00-9:00 PM, Eckhart 202.
PQ:  Statistics 39000, FinMath 34500, MathFin Stds
Reading:  Notes are made available on the internet.

The course will cover several issues in mathematical finance with main emphasis on fixed income models. Topics covered are

  • Models for short term interest rates
  • The Heath-Jarrow-Morton methodology
  • Forward measures, forward and future prices
  • Multivariate stochastic calculus
  • Change of numeraire, pricing of stock options with stochastic interest rates
  • Multivariate extensions of interest rate models
  • The LIBOR market model. Pricing of caps, caplets and swaptions

Back to top

AUTUMN 2005

STATISTICS 22400=HSTD 32400. Applied Regression Analysis.
Sec 01: Vanja Dukic, TTH 10:30-11:50 AM, Eckhart 133.
PQ: HSTD 32700 or STAT 22000 or 23400 or 24400 or consent of instructor
Required Reading: "Regression Analysis by Example," (3rd Edition, Chatterjee, Hadi, and Price).

This course is an introduction to the methods and applications of fitting and interpreting multiple regression models. The main emphasis is on the method of least squares. Topics include the examination of residuals, the transformation of data, strategies and criteria for the selection of a regression equation, the use of dummy variables, and tests of fit. The techniques discussed will be illustrated by many real examples involving biological and social science data. Examples and exercises will be implemented in a statistical software package "Stata", but familiarity with Stata is not required.

Back to top

STATISTICS 23400. Statistical Models and Methods I.
Sec 01: Yali Amit, MWF, 11:30-12:20 PM, Eckhart 133.
Sec 02: Mathias Drton, MWF, 2:30-3:20 PM, Eckhart 133.
PQ: Math 13300, 15300 Or 16300
Required Reading: Tamhane and Dunlop (2000). Statistics and Data Analysis: from Elementary to Intermediate." Prentice Hall, ISBN: 0-13-744426-5.

This course presents basic ideas of probability theory and statistics, and is recommended for students throughout the natural and social sciences who want a broad background in statistical methodology and exposure to probability models and the statistical concepts underlying the methodology. Probability is developed for the purpose of modeling outcomes of random phenomena.

Random variables and their expectations are studied; including means and variances of linear combinations, and an introduction to conditional expectation. Binomial, Poisson, normal and other standard probability distributions are considered. Some probability models are studied mathematically and others via simulation on a computer. Sampling distributions and related statistical methods are explored mathematically, studied via simulation and illustrated on data. Statistical methods for describing data and making inferences based on samples from populations are presented. Methods include, but are not limited to, inference for means and variances for one- and two-sample problems, correlation and simple linear regression. Graphical and numerical data description are used for exploration, communication of results, and comparing mathematical consequences of probability models and data. Mathematics is employed to the level of univariate calculus and is less demanding than that required by STAT 24400.

Back to top

STATISTICS 24400. Statistical Theory and Methods I.
Sec 01: Stephen M. Stigler, TTH, 1:30-2:50 PM, Kent 107.
PQ:  MATH 19600, 20100, or 20400
Recommended Reading: Detailed notes will be distributed in the first quarter. The recommended (but not required in the first quarter) text is "Mathematical Statistics and Data Analysis (2nd edition, 1995)", by John A. Rice (Duxbury).

This is the first quarter of a two-quarter sequence. Enrollment in the first quarter alone is permitted, although not recommended. The first quarter will cover the basics -- tools from probability and the elements of statistical theory. Topics will include the definitions of probability and random variables, binomial and other discrete probability distributions, normal and other continuous probability distribution, joint probability distributions and the transformation of random variables, principles of inference (including Bayesian inference), maximum likelihood estimation, hypothesis testing and confidence intervals, likelihood ratio tests, multinomial distributions and chi-square tests. Some large sample theory will be included. The emphasis will be upon statistical theory, specifically upon concepts and tools that are useful for understanding and applying statistical methodology. Examples will be drawn from the social, physical, and biological sciences. There is no enforced prerequisite in probability or statistics, although the pace is such that students may find it useful to have taken a previous elementary course. The coverage of topics in probability will be limited and brief, so that those who have taken a course in probability will find reinforcement rather than redundancy. The second quarter will cover statistical methodology, including some multivariate analysis, the analysis of variance, the regression phenomenon, linear regression analysis, data analysis, and correlation. The computer will be used in the second quarter.

Back to top

STATISTICS 30400. Distribution Theory.
Sec 01: Dan Nicolae, MWF, 1:30-2:20 PM, Eckhart 117.
PQ: STAT 24500 & MATH 25000 or equivalent
Recommended Reading: Severini, T. (2005). "Elements of Distribution Theory." Cambridge University Press.

This course covers the basics of distribution theory. Topics include:

  • Distribution functions and their inverses, quantile functions, Q/Q plots, change-of-variables for probability densities
  • Expectation, variance, median, mode of random variables
    Basics of measure theory, including Fubini's theorem and interchangeability of limits and integrals
  • Moment generating functions and characteristic functions, including power series expansion, inversion formulas, uniqueness theorems, and convergence in distribution
  • Cumulants and cumulant generating functions: examples, properties
  • Different concepts of convergence for random variables
  • Limit theorems, including the weak law of large numbers, the central limit theorem.

Back to top

STATISTICS 30700=CMSC 37800. Numerical Computation.
Sec 01: Robert Kirby, MWF, 11:30-12:20 PM, Ryerson 277.
PQ: Consent of instructor and elementary programming experience
Required Reading:  "Numerical linear algebra" by Trefethen & Bau (SIAM Press)
Recommended Reading: "Applied linear algebra" by Demmel (SIAM Press)
"Matrix Computations" by Golub & van Loan, (3rd edition, Johns Hopkins Press).

This course covers numerical linear algebra, including matrix decompositions, direct and iterative methods for the solution of linear systems, and eigenvalue problems. Homework will involve a combination of mathematical and programming assignments.

Back to top

STATISTICS 33100. Sample Surveys.
Sec 01: Kirk Wolter, TTH, 10:30-11:50 AM, Eckhart 117.
PQ: Consent of instructor
Reading:  Cochran, W. G. (1977). "Sampling Techniques," 3rd edition. Wiley

This is an introductory course to the statistics and methodology of sample surveys. Topics include

  • basic methods of sample selection,
  • determining sample size, stratification,
  • general estimators (Horvitz-Thompson, ratio, generalized regression, calibration),
  • domain estimation,
  • nonresponse,
  • nonsampling error,
  • multiple-stage sampling,
  • a national sampling frame for area probability surveys,
  • telephone surveys,
  • questionnaire design,
  • variance estimation for complex surveys,
  • analysis of contingency tables, and
  • regression analysis for survey data.

It will be of interest to students who anticipate a research career that designs, collects, and analyzes survey data in fields such as economics, education, healthcare, marketing, psychology, sociology, and statistics.

Back to top

STATISTICS 34300. Applied Linear Stat Methods.
Sec 01: Wei Biao Wu, TTH, 3:00-4:20 PM, Eckhart 133.
PQ: STAT 24500 & MATH 25000 or equivalent
Required Reading:  Venables, W.N. and Ripley, B.D. (1999). "Modern Applied Statistics with S-Plus (3rd ed)". Springer-Verlag.
Weisberg, S. (2005). "Applied Linear Regression, Third Edition". John Wiley & Sons.
Software: Splus or R.

Statistics 34300 is an intensive course in the theory and methods of linear regression and related techniques of statistical modelling. It is intended primarily for graduate students in Statistics and related fields.

The course is also open to undergraduates and others who have a solid understanding of matrix algebra and basic statistical theory. Thorough familiarity with the simple linear regression model is expected.

The course will review linear regression with a single predictor, and will cover the multiple-predictor case; least-squares estimation; associated distribution theory; estimation, confidence intervals and tests; regression with errors in the predictors; weighted least squares, assessing lack of fit; residual analysis; regression diagnostics; transformations; model building; collinearity; subset-selection methods, including stepwise regression; prediction; nonlinear least squares.

Back to top

STATISTICS 35000=HSTD 30900, ENST 27400, PPHA 36400. Principles of Epidemiology.
Sec 01: Kurina Lianne, TTH, 9:00-10:20 AM, BSLC 240.
PQ: Introductory Statistics
Required Reading:  Gordis, Leon (2004). "Epidemiology, 3rd Ed". Elsevier Science.

Epidemiology is the study of the distribution and determinants of health and disease in human populations. This course introduces the basic principles of epidemiologic study design, analysis and interpretation, through lectures, assignments, and critical appraisement of both classic and contemporary research articles.

Back to top

STATISTICS 35900. Statistics in Neuroscience.
Sec 01: Yali Amit, TTH, 10:30-11:50 AM, Ryerson 277.
PQ:  Consent of instructor
Required Reading:   None.

This course will provide a survey of recent literature on the statistical analysis of neural recordings. Estimation of receptive fields, variations on reverse correlation, correlation analysis, decoding using Bayesian methods, time dependent models for analysis of spike train data. The course will involve student presentation of papers. Introductory lectures on the biological background and the relevant statistical tools will be provided.

Back to top

STATISTICS 36900=HSTD 33300. Longitudinal Data Analysis.
Sec 01: Paul Rathouz, TTH, 9:00-10:20 AM, BSLC 313.
PQ: HSTD 32100/STAT 22200 or equiv and HSTD 32400/STAT 22400 or equiv OR consent of instructor
Required Reading:  Diggle PJ, Heagerty P, Liang K-Y, & Zeger SL. (2002). "Analysis of Longitudinal Data, 2nd edn". Oxford: Oxford University Press.

Longitudinal data consist of multiple measures over time on a sample of individuals. This type
of data occurs extensively in both observational and experimental biomedical and public health studies, as well as in studies in sociology and applied economics. This course will provide an introduction to the principles and methods for the analysis of longitudinal data. Emphasis will be on data analysis and interpretation. Supporting statistical theory will be given at a level appropriate for an advanced Master’s student in Statistics. Problems will be motivated by applications in epidemiology, clinical medicine, health services research, and disease natural history studies.

Back to top

STATISTICS 37800. Statistical Computing.
Sec 01: Kenneth Wilder, MWF, 12:30-1:20 PM, Ryerson 277.
PQ: Consent of instructor
Required Reading:  Accelerated C++, by Andrew Koenig and Barbara E. Moo

Students will be introduced to and gain experience in using a variety of computational tools that are useful for large statistical computing projects. These include R, python, C++, and various Unix utilities. The primary tool will be C++. The emphasis will be on choosing and being able to use the right tools for a project taking into account issues including< execution speed, memory usage, coding ease and portability.

Students will work on Linux operating systems, but no prior Linux/Unix experience is assumed.

Back to top

STATISTICS 38100. Measure-Theoretic Probability I.
Sec 01: Michael J. Wichura, MWF, 2:30-3:20 PM, Eckhart 117.
PQ: STAT 31300 or consent of instructor
Required Reading:  There is no text required or recommended. Notes will be provided.

This course is the first of a three quarter sequence presenting a careful development of some topics from measure and probability. Topics to be covered in 381 include: classes of sets -- fields, sigmafields, monotone classes, pi and lambda systems; probabilities and general measures; independence and the Borel-Cantelli lemmas; measurable functions; induced measures, distribution and inverse distribution functions; integration with respect to measures -- basic properties, change of variable, indefinite integration, densities; integration to the limit -- MCT, DCT, and friends; laws of large numbers, applications to probability and statistics; transition probabilities and product measures.

Back to top

STATISTICS 38200. Measure-Theoretic Probability-II  (Probabilistic Aspects of Combinatorial Optimization).
Sec 01: Peter Radchenko, TTH, 3:00-4:20 PM, Eckhart 117.
PQ:  Statistics 31300 or consent of instructor
Recommended Reading:  “Probability theory and combinatorial optimization” by Michael Steele (CBMS-NSF regional conference series in applied mathematics, 1997).

This course is an introduction to the modern probability theory that is directly applicable to combinatorial optimization.  We will consider simple stochastic models for the values of the problem inputs and concentrate on understanding the behavior of the objective function.

Applications will include:

  • Long-common-subsequence problem
  • Longest-increasing-subsequence problem
  • Traveling salesman problem
  • Minimal spanning trees
  • Steiner trees
  • Minimal matching problem
  • Nearest-neighbor-link problem

Probability topics will include:

  • Martingale inequalities
  • Subadditive sequences and functionals
  • Kingman’s subadditive ergodic theorem
  • Poissonization and de-Poissonization
  • Talagrand’s isoparametric theory of concentration inequalities

Back to top

STATISTICS 39000=FINM 34500. Stochastic Calculus I.
Sec 01: Per Mykland, W 6:00-9:00 PM, Eckhart 202.
PQ: Enrollment in Mathematical Finance M. Sc. program or consent of instructor
Required Reading:  Stochastic Calculus for Finance I-II, by Steven E. Shreve (required).
A Course in Financial Calculus, by Alison Etheridge (highly recommended).
`The basics of S-PLUS'', 3rd edition, by the same authors (A. Krause and M. Olson), ISBN 0-387-95456-2 (required).

This course is an introduction to stochastic calculus as it is relevant to the pricing and hedging of
options and other derivative securities. It is the first of a two-quarter sequence offered in collaboration
by the Department of Statistics and the master's program in Mathematical Finance. The main topics to
be covered are:

  • The Fundamental Theorem of Asset Pricing
  • Martingales
  • Brownian Motion
  • The Ito Integral and Ito's Formula
  • The Black-Sholes Formula
  • Girsanov's Formula
  • Currency Options
  • The Martingale Representation and Hedging

There will be weekly homework assignments, and midterm and final exams. The course assistants will conduct weekly help sessions on Friday afternoons.

Back to top

SPRING 2005

STATISTICS 22200. Linear Models and Experimental Design.
Sec 01:  Mei Wang, TTH, 1:30-2:50 PM, Eckhart 133.
Prerequisites: Statistics 22000 or equivalent.
Required Textbook: Oehlert, G. W. (2000) A First Course in Design and Analysis of Experiments. W. H. Freeman.
Optional Textbook: Hamilton, L. C. (2003) Statistics with Stata (Updated for Version 7). Duxbury Press.

This course will introduce the students to the major statistical issues in the design of experiments and the analysis of experimental data. The major topics will be

  • The basic principles of experimental design: randomization, blocking, and balance.
  • Important classes of designs: matched pairs, complete factorial designs, Latin squares, Graeco-Latin squares, fractional factorial designs and confounding.
  • Analysis of covariance and inference from experimental data.

In addition to regular homework assignments and exams, students will be required to complete a project involving the design and analysis of an experiment of their own.

Back to top

STATISTICS 24600. Statistical Theory and Methods III: Computational Statistics.
Sec 01:  Marc Coram, TTH, 10:30-11:50 AM, Eckhart 133.
Prerequisites: Statistics 24400 and 24500, or equivalent.
Recommended textbooks:
Main textbook:
Computational Statistics, Givens and Hoeting. Wiley 2005
Additional textbooks:
Gelman, A. Carlin, B., Stern, H. S., Rubin, D.B., Bayesian Data Analysis (2nd ed), Chapman & Hall, 2004.
Little, R. J. A., Rubin, D. B., Statistical Analysis with Missing Data (2nd ed), Wiley, 2002.
Rice, J. A., Matematical Statistics and Data Analysis (2nd ed), Duxbury Press, 1995.
Efron, B. and Tibshirani, R. J., An Introduction to the Bootstrap, Chapman & Hall, 1993.
Venables W. N., Ripley B. D., Modern Applied Statistics with S (4th ed), Springer, 1999.

This course is the third part of a sequence that introduces mathematical statistics and probability. In this quarter, the emphasis is on modern computer-intensive statistical methods.

In the first part of the course, the impact of missing data on statistical analyses is considered and algorithms for iterative maximum likelihood estimation, such as the EM and the Newton-Raphson algorithms, and for Bayesian computation, such as Data Augmentation, are introduced in certain examples. Additionally, Markov Chain Monte Carlo methods are outlines in a more general framework of Bayesian inference.

The second part of the course presents an overview of the bootstrap and related nonparametric methods for assessing statistical accuracy.

Knowledge of probability distributions, random variables, and estimation techniques such as maximum likelihood from Statistics 24400 and 24500 is essential for this class.

Back to top

STATISTICS 25100. Introduction to Mathematical Probability.
Sec 01:  Michael Wichura, TTH, 3:00-4:20 PM, Eckhart 133.
Prerequisites: Math 20000 or 20300 (or equivalent) and completion of one of the Common Core sequences in the Biological or Physical Sciences, or permission from the instructor. No prior exposure to probability theory is required or assumed.
Textbook: Ross, Sheldon (2002), A First Course in Probability, Sixth Edition. Prentice-Hall.

Probability theory originated in the consideration of gambling problems, but has become an important tool for scientists, engineers, medical practitioners, lawyers, and people working in business. A wide variety of phenomena are characterized by randomness and uncertainty, which is measured by probability. Probability models also play a fundamental role in the statistical analysis of data. The aim of the course is to provide an introduction to the elementary concepts of probability. The first part of the course will introduce the student to the basic ideas used to describe aspects of randomness, such as events, random variables, independence, and conditional probability. The remainder of the course focuses on the methods, calculation, and applications of probability. The topics treated are: combinatorics, discrete and continuous distributions, density functions, distribution theory, calculation and interpretation of moments, covariance and correlation, the classical central limit theorem

Back to top

STATISTICS 26700/36700, HIPS 25600, CHSS 32900. History of Statistics.
Sec 01:  Stephen Stigler, TTH, 9:00-10:20 AM, Eckhart 133.
Prerequisites: A course in statistics.
Textbook: Stephen M. Stigler, The History of Statistics: The Measurement of Uncertainty Before 1900. (Cambridge, Mass.: Harvard University Press, 1986) (Available in paperback). Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 30200. Mathematical Statistics II.
Sec 01:  Marc Coram, TTH, 3:00-4:20 PM, Eckhart 117.
Prerequisites: Statistics 30100.
Textbook: Casella and Berger: Statistical Inference.

This course continues the development of mathematical statistics. Topics of importance include: statistical decision theory, admissability and the Neyman-Pearson lemma, formal hypothesis testing, comparison with conditional frequentist and Bayesian viewpoints, ancillarity, interval estimation, UMP tests and MLR, unbiased tests, score statistics, generalized liklihood ratio tests and asymptotics, minimax estimation, empirical Bayes and "shrinkage"

Back to top

STATISTICS 30800. Advanced Statistical Inference II.
Sec 01:  Michael Stein, TTH, 1:30-2:50 PM, Eckhart 117.
Prerequisites: Statistics 30200 or equivalent.
Textbook: None.

The focus of the course will be on inference for parametric statistical models.

Suggested topics for discussion include

  • Likelihood and likelihood ratio statistic
  • Asymptotic approximation of distributions
  • Bayesian and frequentist model selection
  • Shrinkage estimation
  • Inference for stochastic processes
  • Nonstandard asymptotic regimes (e.g., fixed domain asymptotics)

Actual topics covered will depend on interests of students and instructor.

Back to top

STATISTICS 31300. Introduction to Stochastic Processes II.
Sec 01:  Steven Lalley, TTH, 10:30-11:50 AM, Eckhart 117.
Prerequisites: Statistics 31200 or consent of instructor.
Textbook: Ross, S. (1996). Stochastic Processes, 2nd ed., Wiley.

This course is a continuation of Statistics 31300: Introduction to Stochastic Processes I. Topics to be discussed will include generating functions, Galton-Watson processes, continuous-time Markov chains, martingales, and Brownian motion. There will be weekly homework assignments, and midterm and final exams

Back to top

STATISTICS 34700. Generalized Linear Models.
Sec 01: Peter McCullagh, MW, 1:30-2:50 PM, Ryerson 358.
Prerequisites: Statistics 344 or equivalent.
Textbooks: McCullagh & Nelder (1989). Generalized Linear Models, Second Edition. Chapman & Hall.
Cox & Snell (1981). Applied Statistics, Chapman & Hall.
Venables and Ripley Modern Applied Statistics with S.

This is an applied course for students who are familiar with linear
models at the level of Draper and Smith or Weisberg. The following
topics will be covered:

Factors, variates, contrasts, interactions
Exponential-family models: variance function
Definition of a generalized linear model: link functions
Analysis of deviance
Specific examples of GLMs

logistic and probit regression
cumulative logistic models
log-linear models and contingency tables
inverse linear models

Quasi-likelihood and least squares; estimating functions
Over-dispersion
Partially linear models.

Back to top

STATISTICS 35500. Statistical Genetics.
Sec 01: Mary Sara McPeek, F, 1:30-2:50 PM, Eckhart 117.
This is an advanced course in statistical genetics. Prerequisites are Human Genetics 471 and Statistics 244 and 245. Students who do not meet the prerequisites may enroll on a P/NP basis with consent of the instructor. This is a discussion course and student presentations will be required. Topics vary and may include, but are not limited to, statistical problems in linkage mapping, association mapping, map construction, and genetic models for complex traits.

Back to top

STATISTICS 36700/26700, HIPS 25600, CHSS 32900. History of Statistics.
Sec 01: Stephen Stigler, TTH, 9:00-10:20 PM, Eckhart 117.
Prerequisites: A course in statistics.
Textbook: Stephen M. Stigler, The History of Statistics: The Measurement of Uncertainty Before 1900. (Cambridge, Mass.: Harvard University Press, 1986) (Available in paperback). Other materials will be distributed in class or by web.

This course will cover topics in the history of statistics, from the eleventh century to the middle of the twentieth century. The emphasis will be upon the period 1650 to 1950, and upon the mathematical developments in the theory of probability and how they came to be used in the sciences, both to quantify uncertainty in observational data and as a conceptual framework for scientific theories. The course will include broad views of the development of the subject, and closer looks at specific people and investigations, including reanalyses of historical data. Topics will include: Early probability; Probability in seventeenth century medicine; Inverse probability in inference; Statistical methods in early geodesy; The introduction of least squares; Statistics in social science; Statistics in early biology and psychology; Simulation; Statistics and the evaluation of social programs; Maximum likelihood estimation; Statistics and nineteenth century forensic science; Statistics and medical science. Major figures who will be examined include Pascal, various Bernoullis, De Moivre, Laplace, Boscovich, Gauss, Quetelet, Galton, Edgeworth, Karl Pearson, Gosset, Fisher, Neyman.

Back to top

STATISTICS 37900 CMSC 25050/CMSC 35500. Topics in Computer Vision.
Sec 01: Yali Amit, TTH, 9:00-10:20 AM, Ryerson 277.
Prerequisites: Consent of instructor.
Textbooks: Yali Amit, 2d Object Detection and Recognition: models, algorithms and networks, MIT Press, 2002.

Deformable models for detecting objects in images will be discussed in detail. One-dimensional models to identify object contours and boundaries; dense two-dimensional models for image matching and classification; sparse models for efficient detection of objects in complex scenes. The course will describe a unified statistical framework for these models. The different mathematical and statistical tools needed to define the models and the associated algorithms will be developed in class. Applications include, detecting contours in medical images, matching brains, detecting faces and other objects in images, reading license plates and zipcodes.

Neural network implementations of some of the algorithms will be presented and some connections to the functions of the biological visual system will be discussed.

Back to top

STATISTICS 38500. Advanced Topics in Probability.
Sec 01: Michael Wichura, WF, 11:45-1:05 PM, Eckhart 117.
Prerequisites: Consent of instructor.
Textbook: None.

Topics for Statistics 38500 will include:

Continuous-time martingales: convergence, closure, optional sampling
Brownian motion
Imbedding
Functional central timit theorem
Functional law of the iterated logarithm Ito integration
Other topics related to Brownian motion

Back to top

STATISTICS 43900. Inference and Simulation for Spatial Point Processes.
Sec 01: Jesper Moller, Department of Mathematical Sciences. Aalborg University, TTH, 3:00-4:20 PM, Eckhart 206.
Prerequisites: At least an elementary knowledge about statistical inference. Some knowledge about Markov chains, Poisson processes and stochastic processes in general will be advantegeous but not required.
Textbook: Møller, J. and Waagepetersen, R.W. (2003). Inference and Simulation for Spatial Point Processes, Chapman & Hall/CRC Press.

Spatial point processes are used to model point patterns where the points typically are positions or centres of objects in a two- or three-dimensional region. The points may be decorated with marks (such as sizes or types of the objects) whereby marked point processes are obtained. The areas of applications are manifold and include astronomy, ecology, forestry, geography, image analysis, and spatial epidemiology. For more than 30 years spatial point processes have been a major area of research in spatial statistics. It is expected that research in spatial point processes will continue to be of importance as new technology makes huge amounts of spatial point process data available and new applications emerge.

Some of the earliest applications of computational methods in statistics are related to spatial point processes. In the last decade computational methods, and particularly Markov Chain Monte Carlo (MCMC) methods, have undergone major developments. This course is concerned with simulationbased inference for spatial point processes, with an emphasis on MCMC methods. Many of the MCMC methods apply as well in other areas of statistics.

Some keywords: Bayesian inference, Cox processes, Markov chain Monte Carlo, Markov point processes, maximum likelihood, Metropolis-Hastings algorithm, perfect simulation, Poisson point processes, pseudo likelihood, simulation-based inference.

Back to top

Winter 2005

Back to top

Autumn 2004

Back to top

Spring 2004

Back to top

Winter 2004

Back to top

Autumn Quarter 2003

Back to top

Summer 2003

Back to top

Spring 2003

Back to top

Winter 2003

Back to top

Autumn 2002

Back to top

Spring 2002

Back to top

Winter 2002

Back to top

Autumn 2001

Back to top

Winter 2001

Back to top

Autumn 2000

Back to top

Spring 2000

Back to top

Winter 2000

Back to top

Autumn 1999

Back to top

Spring 1999

Back to top

Winter 1999

Back to top

Autumn 1998

Back to top

Spring 1998

Back to top

Winter 1998

Back to top

Autumn 1997

Back to top

Spring 1997

Back to top

Winter 1997

Back to top

Autumn 1996

Back to top

Spring 1996

Back to top

Winter 1996

Back to top

Autumn 1995

Back to top

Spring 1995

Back to top

Winter 1995

Back to top

Autumn 1994


About the image