
Stephen Raudenbush
· Lewis-Sebring Distinguished Service Professor, Department of Sociology, the College, and the Harris School of Public Policy StudiesUniversity of Chicago · Sociology
Active 1983–2025
About
Stephen Raudenbush is the Lewis-Sebring Distinguished Service Professor in the Department of Sociology, the College, and the Harris School of Public Policy Studies at the University of Chicago. He holds a B.A., Ed.M., and Ed.D. from Harvard University. His research interests include the sociology of education and quantitative methods, with a focus on statistical models for child and youth development within social settings such as classrooms, schools, and neighborhoods. Raudenbush is best known for developing hierarchical linear models, which have broad applications in the design and analysis of longitudinal and multilevel research. His current work involves studying the development of literacy and math skills in early childhood with implications for instruction, as well as methods for assessing school and classroom quality. He is a member of the National Academy of Sciences and the American Academy of Arts and Sciences, and has received the American Educational Research Association award for Distinguished Contributions to Educational Research.
Research topics
- Computer Science
- Mathematics education
- Psychology
- Artificial Intelligence
- Developmental psychology
- Machine Learning
- Sociology
- Social Science
- Econometrics
- Management science
- Cognitive psychology
- Data science
- Medicine
- Engineering
- Linguistics
- Mathematics
- Statistics
Selected publications
Journal of Educational and Behavioral Statistics · 2025-12-10
articleSenior authorConsider the conventional multilevel model <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">Y</mml:mi> <mml:mo>=</mml:mo> <mml:mi mathvariant="normal">C</mml:mi> <mml:mi mathvariant="normal">γ</mml:mi> <mml:mo>+</mml:mo> <mml:mi mathvariant="normal">Zu</mml:mi> <mml:mo>+</mml:mo> <mml:mi mathvariant="normal">e</mml:mi> </mml:mrow> </mml:math> where <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">γ</mml:mi> </mml:mrow> </mml:math> represents fixed effects and <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">u</mml:mi> <mml:mo>,</mml:mo> <mml:mi mathvariant="normal">e</mml:mi> <mml:mo stretchy="false">)</mml:mo> </mml:mrow> </mml:math> are multivariate normal random effects. The continuous outcomes <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">Y</mml:mi> </mml:mrow> </mml:math> and covariates <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">C</mml:mi> </mml:mrow> </mml:math> are fully observed with a subset <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">Z</mml:mi> </mml:mrow> </mml:math> of <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">C</mml:mi> </mml:mrow> </mml:math> . The parameters are <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">θ</mml:mi> <mml:mo>=</mml:mo> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">γ</mml:mi> <mml:mo>,</mml:mo> <mml:mi>var</mml:mi> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">u</mml:mi> <mml:mo stretchy="false">)</mml:mo> <mml:mo>,</mml:mo> <mml:mi mathvariant="normal">var</mml:mi> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">e</mml:mi> <mml:mo stretchy="false">)</mml:mo> <mml:mo stretchy="false">)</mml:mo> </mml:mrow> </mml:math> . Dempster, Rubin and Tsutakawa framed the estimation as a missing data problem, where <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">Y</mml:mi> <mml:mo>,</mml:mo> <mml:mi mathvariant="normal">u</mml:mi> <mml:mo stretchy="false">)</mml:mo> </mml:mrow> </mml:math> are the complete data and the random effects <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi>u</mml:mi> </mml:mrow> </mml:math> are conceived as missing data. Viewed in this way, the Expectation-Maximization (EM) algorithm has proven to be a natural and popular approach to estimation. However, when <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">C</mml:mi> </mml:mrow> </mml:math> is partially observed or subject to measurement error, it is natural to formulate a multilevel model for <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">C</mml:mi> </mml:mrow> </mml:math> that includes random effects, <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi>ν</mml:mi> </mml:mrow> </mml:math> . In this article, we extend this thinking to allow estimation of the joint distribution of data <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">Y</mml:mi> <mml:mo>=</mml:mo> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">Y</mml:mi> <mml:mo>,</mml:mo> <mml:mi mathvariant="normal">C</mml:mi> <mml:mo stretchy="false">)</mml:mo> <mml:mo>=</mml:mo> <mml:mo stretchy="false">(</mml:mo> <mml:msub> <mml:mrow> <mml:mi mathvariant="normal">Y</mml:mi> </mml:mrow> <mml:mrow> <mml:mi mathvariant="normal">o</mml:mi> </mml:mrow> </mml:msub> <mml:mo>,</mml:mo> <mml:msub> <mml:mrow> <mml:mi mathvariant="normal">Y</mml:mi> </mml:mrow> <mml:mrow> <mml:mi mathvariant="normal">m</mml:mi> </mml:mrow> </mml:msub> <mml:mo stretchy="false">)</mml:mo> </mml:mrow> </mml:math> and random effects <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline" overflow="scroll"> <mml:mrow> <mml:mi mathvariant="normal">b</mml:mi> <mml:mo>=</mml:mo> <mml:mo stretchy="false">(</mml:mo> <mml:mi mathvariant="normal">u</mml:mi> <mml:mo>,</mml:mo> <mml:mi mathvariant="normal">v</mml:mi> <mml:mo stretchy="false">)</mml:mo> </mml:mrow> </mml:math> from observed data
Statistics in Medicine · 2024-10-15 · 1 citations
articleOpen accessSenior authorof the experimental patients accessed e-assist while no controls were provided the access. Of interest are the average causal effect of assignment to treatment and the complier average causal effect as well as the variation of these causal effects across physicians. Each physician generates probabilities of screening for experimental compliers (experimental patients who accessed e-assist), control compliers (controls who would have accessed e-assist had they been assigned to e-assist), and never takers (patients who would have avoided e-assist no matter what). Estimating physician-specific probabilities jointly over physicians poses novel challenges. We address these challenges by maximum likelihood, factoring a "complete-data likelihood" uniquely into the conditional distribution of screening and partially observed compliance given random effects and the distribution of random effects. We marginalize this likelihood using adaptive Gauss-Hermite quadrature. The approach is doubly iterative in that the conditional distribution defies analytic evaluation. Because the small sample size per physician constrains estimability of multiple random effects, we reduce their dimensionality using a shared random effects model having a factor analytic structure. We assess estimators and recommend sample sizes to produce reasonably accurate and precise estimates by simulation, and analyze data from a trial of a CRCS intervention.
Hierarchical Linear Models (HLM) and Multilevel Causal Inference
2023-01-01
report1st authorCorrespondingThis seminar introduces you to the theory and practice of multilevel modeling, and the logic of causal inference at multiple levels of analysis. The first day introduces you to two-level and three-level analysis, with an emphasis on how to build and interpret HLMs with a theoretically-rigorous foundation to guide estimation and inference. The second day considers how modern methods of causal inference can be applied to multilevel experimental and quasi-experimental designs. The new updated version of the HLM software will be used to illustrate model building and inference in a hands-on way. An official Instats certificate of completion with Professor Raudenbush is provided at the conclusion of the seminar. The seminar offers 2 ECTS Equivalent points for European PhD students.
Figshare · 2023-01-01
datasetOpen accessSenior authorWe consider two-level models where a continuous response <i>R</i> and continuous covariates <i>C</i> are assumed missing at random. Inferences based on maximum likelihood or Bayes are routinely made by estimating their joint normal distribution from observed data Robs and Cobs. However, if the model for <i>R</i> given <i>C</i> includes random coefficients, interactions, or polynomial terms, their joint distribution will be nonstandard. We propose a family of unique factorizations involving selected “provisionally known random effects” <i>u</i> such that h(Robs,Cobs|u) is normally distributed and <i>u</i> is a low-dimensional normal random vector; we approximate h(Robs,Cobs)=∫h(Robs,Cobs|u)g(u)du via adaptive Gauss-Hermite quadrature. For polynomial models, the approximation is exact but, in any case, can be made as accurate as required given sufficient computation time. The model incorporates random effects as explanatory variables, reducing bias due to measurement error. By construction, our factorizations solve problems of compatibility among fully conditional distributions that have arisen in Bayesian imputation based on the Gibbs Sampler. We spell out general rules for selecting <i>u</i>, and show that our factorizations can support fully compatible Bayesian methods of imputation using the Gibbs Sampler. Supplementary materials for this article are available online.
Figshare · 2023-01-01
datasetOpen accessSenior authorAbstract–We consider two-level models where a continuous response <i>R</i> and continuous covariates <i>C</i> are assumed missing at random. Inferences based on maximum likelihood or Bayes are routinely made by estimating their joint normal distribution from observed data <i>R<sub>obs</sub> </i> and <i>C<sub>obs</sub> </i>. However, if the model for <i>R</i> given <i>C</i> includes random coefficients, interactions, or polynomial terms, their joint distribution will be nonstandard. We propose a family of unique factorizations involving selected “provisionally known random effects” <i>u</i> such that h(Robs,Cobs|u) is normally distributed and <i>u</i> is a low-dimensional normal random vector; we approximate h(Robs,Cobs)=∫h(Robs,Cobs|u)g(u)du via adaptive Gauss-Hermite quadrature. For polynomial models, the approximation is exact but, in any case, can be made as accurate as required given sufficient computation time. The model incorporates random effects as explanatory variables, reducing bias due to measurement error. By construction, our factorizations solve problems of compatibility among fully conditional distributions that have arisen in Bayesian imputation based on the Gibbs Sampler. We spell out general rules for selecting <i>u</i>, and show that our factorizations can support fully compatible Bayesian methods of imputation using the Gibbs Sampler.
Journal of Computational and Graphical Statistics · 2023-07-13 · 7 citations
articleSenior author–We consider two-level models where a continuous response R and continuous covariates C are assumed missing at random. Inferences based on maximum likelihood or Bayes are routinely made by estimating their joint normal distribution from observed data Robs and Cobs . However, if the model for R given C includes random coefficients, interactions, or polynomial terms, their joint distribution will be nonstandard. We propose a family of unique factorizations involving selected “provisionally known random effects” u such that h(Robs,Cobs|u) is normally distributed and u is a low-dimensional normal random vector; we approximate h(Robs,Cobs)=∫h(Robs,Cobs|u)g(u)du via adaptive Gauss-Hermite quadrature. For polynomial models, the approximation is exact but, in any case, can be made as accurate as required given sufficient computation time. The model incorporates random effects as explanatory variables, reducing bias due to measurement error. By construction, our factorizations solve problems of compatibility among fully conditional distributions that have arisen in Bayesian imputation based on the Gibbs Sampler. We spell out general rules for selecting u, and show that our factorizations can support fully compatible Bayesian methods of imputation using the Gibbs Sampler.
Hierarchical Linear Models (HLM) and Multilevel Causal Inference
2022-01-01
report1st authorCorrespondingThis seminar, taught by Professor Raudenbush, will introduce you to the theory and practice of multilevel modeling, and the logic of causal inference at multiple levels of analysis. The first day introduces you to two-level and three-level analysis, with an emphasis on how to build and interpret HLMs with a theoretically-rigorous foundation that will guide estimation and inference. The second day considers how modern methods of causal inference can be applied to multilevel experimental and quasi-experimental designs. The new and substantially updated version of the HLM software will be used to illustrate model building and inference in a hands-on way. All participants will receive a free 60-day trial license to this exciting update to the classic HLM software from Scientific Software International, and a 20% discount on a future purchase of HLM. An official Instats certificate of completion with Professor Raudenbush is provided at the conclusion of the seminar. The seminar offers 2 ECTS Equivalent points for European PhD students.
Psychological Science · 2021 · 30 citations
Senior authorCorresponding- Psychology
- Linguistics
- Developmental psychology
Early linguistic input is a powerful predictor of children's language outcomes. We investigated two novel questions about this relationship: Does the impact of language input vary over time, and does the impact of time-varying language input on child outcomes differ for vocabulary and for syntax? Using methods from epidemiology to account for baseline and time-varying confounding, we predicted 64 children's outcomes on standardized tests of vocabulary and syntax in kindergarten from their parents' vocabulary and syntax input when the children were 14 and 30 months old. For vocabulary, children whose parents provided diverse input earlier as well as later in development were predicted to have the highest outcomes. For syntax, children whose parents' input substantially increased in syntactic complexity over time were predicted to have the highest outcomes. The optimal sequence of parents' linguistic input for supporting children's language acquisition thus varies for vocabulary and for syntax.
Effects of double-dose algebra on college persistence and degree attainment
Proceedings of the National Academy of Sciences · 2021-06-28 · 13 citations
articleOpen accessCorrespondingSignificance To obtain high-paying jobs, students from low-income backgrounds need to complete college at higher rates. Could remedial math coursework in high school help them do so? Some have argued that adolescence is too late to intervene, or that schools cannot overcome systemic social problems that hinder student success. Yet, evidence from Chicago suggests otherwise. We find that ninth graders assigned to “double-dose algebra”—an algebra class plus a second class designed to bolster algebra skills—were more likely to stay in, and complete, college compared to similar students who did not take double-dose algebra. However, when students were placed in double-dose classes with much-lower-skilled peers the program had no effect.
American Journal of Sociology · 2020-07-01
paratextOpen access
Recent grants
NIH · $399k · 2001
Enviromental & Biological Variation and Language Growth
NIH · $34.1M · 2002–2020
Frequent coauthors
- 387 shared
Robert J. Sampson
Harvard University
- 374 shared
Felton J. Earls
- 347 shared
Jeanne Brooks‐Gunn
Columbia University
- 35 shared
J. Douglas Willms
Learning Partnership
- 21 shared
Anthony S. Bryk
- 17 shared
Sean F. Reardon
Stanford University
- 14 shared
Jeanne Brooks‐Gunn
- 12 shared
Guanglei Hong
Awards & honors
- American Educational Research Association award for Distingu…
- Member of the National Academy of Sciences
- Member of the American Academy of Arts and Sciences
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Stephen Raudenbush
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup