
Joseph Ramsey
· Special Faculty and Director of Research ComputingCarnegie Mellon University · Philosophy
Active 1978–2024
Research topics
- Computer Science
- Mathematics
- Data Mining
- Machine Learning
- Artificial Intelligence
- Statistics
- Econometrics
- Data science
- Software engineering
- Programming language
- World Wide Web
Selected publications
Causal-learn: Causal Discovery in Python
arXiv (Cornell University) · 2023 · 26 citations
- Computer Science
- Computer Science
- Programming language
Causal discovery aims at revealing causal relations from observational data, which is a fundamental task in science and engineering. We describe $\textit{causal-learn}$, an open-source Python library for causal discovery. This library focuses on bringing a comprehensive collection of causal discovery methods to both practitioners and researchers. It provides easy-to-use APIs for non-specialists, modular building blocks for developers, detailed documentation for learners, and comprehensive methods for all. Different from previous packages in R or Java, $\textit{causal-learn}$ is fully developed in Python, which could be more in tune with the recent preference shift in programming languages within related communities. The library is available at https://github.com/py-why/causal-learn.
Causal Discovery for Observational Sciences Using Supervised Machine Learning
Journal of Data Science · 2023 · 7 citations
- Computer Science
- Machine Learning
- Artificial Intelligence
Causal inference can estimate causal effects, but unless data are collected experimentally, statistical analyses must rely on pre-specified causal models. Causal discovery algorithms are empirical methods for constructing such causal models from data. Several asymptotically correct discovery methods already exist, but they generally struggle on smaller samples. Moreover, most methods focus on very sparse causal models, which may not always be a realistic representation of real-life data generating mechanisms. Finally, while causal relationships suggested by the methods often hold true, their claims about causal non-relatedness have high error rates. This non-conservative error trade off is not ideal for observational sciences, where the resulting model is directly used to inform causal inference: A causal model with many missing causal relations entails too strong assumptions and may lead to biased effect estimates. We propose a new causal discovery method that addresses these three shortcomings: Supervised learning discovery (SLdisco). SLdisco uses supervised machine learning to obtain a mapping from observational data to equivalence classes of causal models. We evaluate SLdisco in a large simulation study based on Gaussian data and we consider several choices of model size and sample size. We find that SLdisco is more conservative, only moderately less informative and less sensitive towards sample size than existing procedures. We furthermore provide a real epidemiological data application. We use random subsampling to investigate real data performance on small samples and again find that SLdisco is less sensitive towards sample size and hence seems to better utilize the information available in small datasets.
Frequent coauthors
- 46 shared
Clark Glymour
- 24 shared
Peter Spirtes
Carnegie Mellon University
- 15 shared
Rubén Sánchez-Romero
Rutgers Sexual and Reproductive Health and Rights
- 12 shared
Catherine Hanson
- 10 shared
D. Mastrovito
Allen Institute
- 9 shared
S.J. Hanson
Griffith University
- 8 shared
Kun Zhang
Xiamen University
- 8 shared
Bryan Andrews
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Joseph Ramsey
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup