Welcome to the new AACRAO website. While the site is live, development is ongoing, visitors can expect updates and new features in the weeks ahead.

You must be logged in to view this content. Please login or learn more about joining AACRAO. If you have any questions or experience any issues logging in, please reach out to us atmembership@aacrao.org.

Log In Join

Accurately forecasting course enrollment rates in higher education is of great concern in order to minimize unnecessary administrative costs as well as burden to both students and faculty. This research aimed to first recreate course enrollment predictions based on a conditional probability analysis using student data from San Diego State University (SDSU) and then to improve upon those predictions by applying classification and regression trees (CART) and random forest. The authors incorporated student demographic and academic information into algorithms to ascertain their influence on improving course enrollment prediction accuracy. They used these strategies to predict enrollment in General Chemistry at SDSU, a course with historically varied and large enrollment numbers in the multiple hundreds per semester. The authors then determined which factors were the most influential to the General Chemistry enrollment number using a variable importance metric derived from tree-based algorithms.

shao-l--sm

Lucy Shao is a Ph.D. student in Biostatistics at the University of California, San Diego. Previously, she worked at Analytic Studies & Institutional Research at San Diego State University on student learning and
institutional operation. Currently, her research focuses on causal inference and high-dimensional statistics. Lucy received an M.S. in Statistics from UC San Diego and a B.S. in Applied Mathematics from UCLA.

ieong-m--sm

Martin Ieong is a Data Scientist at Revolution Medicines, where he analyzes genomic and bioinformatics data in the field of cancer research. He previously worked in the software development industry, providing
technical assistance and statistical knowledge to create data science solutions. Martin received an M.S. in Statistics from San Diego State University, and a B.A. from UC San Diego in Linguistics.

levine-r-a--sm

Richard A. Levine is Professor of Statistics at San Diego State University and Faculty Advisor overseeing the Statistical Modeling Group in SDSU Analytic Studies and Institutional Research. He is
former Chair of the SDSU Department of Mathematics and Statistics and past Editor of the Journal of Computational and Graphical Statistics. He is Associate Editor for Statistics of the Notices of the American Mathematical
Society and is a fellow of the American Statistical Association. Professor Levine received his Ph.D. in Statistics from Cornell University.

stronach-j--sm

Jeanne Stronach is an experienced and effective higher education leader with 20 years of experience in institutional research. Specializing in successful team-building with internal and external partners, innovative
resource management and effective strategic planning. Achievements include building self-service visualizations to support data-informed decision-making as well as spearheading a cross-divisional Data Champions program to build data community and
boost data literacy. Jeanne earned her B.A. at UC San Diego and her M.A. at DePaul University.

fan-j--smJuanjuan Fan, Ph.D., is a Professor of Statistics and Data Science in the Department of Mathematics and Statistics, and serves as a Faculty Advisor at the Analytic Studies & Institutional Research (ASIR), at San
Diego State University. Her research interests include survival analysis, decision trees and random forests, and observational study data. Working with her students and collaborators, she has published many papers assessing student success
studies and solving various problems in educational data mining. Professor Fan received her Ph.D. in Biostatistics from the University of Washington.

Breiman, L., J. H. Friedman, R. A. Olshen, and C. J. Stone. 1984. Classification and Regression Trees. Boca Raton, FL: Taylor & Francis Group.

Kraft, C. R., and J. P. Jarvis. 2005. An Adaptive Model for Predicting Course Enrollment, Master’s (thesis). Clemson University, Clemson, SC.

Breiman, L. 2001. Random forests. Machine Learning. 45: 5–32.

University of California. 1999. Educating the Next Generation of Californians in a Research University Context: University of
California Graduate and Undergraduate Enrollment Planning Through 2010
. CA: Author.

University of Virginia. 1999. Methodology for School Enrollment Projections. Charlottesville, VA: Demographics Research Group, Weldon Cooper Center for Public Service.

Malley, J., J. Kruppa, A. Dasgupta, K. Malley, and A. Ziegler. 2012. Probability machines: Consistent probability estimation using nonparametric learning machines. Methods of
Information in Medicine
. 51: 74–81.

James, G., D. Witten, T. Hastie, and R. Tibshirani. 2013. Classification. In An Introduction to Statistical Learning: with Applications in R. New York: Springer.

Wikipedia: The Free Encyclopedia. 2020. Decision tree. April 18. San Francisco: Wikimedia Foundation.

Breiman, L., and A. Cutler. 2018. randomForest: Breiman and Cutler’s Random Forests for Classification and Regression.

Ishwaran, H., and U. B. Kogalur. 2020. RandomForestSRC Fast Unified Random Forests for Survival, Regression, and Classification (RF-SRC).

Therneau, T., B. Atkinson, and B. Ripley. 2019. rpart: Recursive Partitioning and Regression Trees.

Soltys, M., H. Dang, H., G. R. Reilly, and K. Soltys. 2021. Enrollment predictions with machine learning. Strategic Enrollment Management Quarterly. 9(2): 11–18.

Related Posts