Course Title:
Advanced Statistics for Data Science Specialization
Course Description:
Fundamental concepts in probability, statistics and linear models are primary building blocks for data science work. Learners aspiring to become biostatisticians and data scientists will benefit from the foundational knowledge being offered in this specialization. It will enable the learner to understand the behind-the-scenes mechanism of key modeling tools in data science, like least squares and linear regression. This specialization starts with Mathematical Statistics bootcamps, specifically concepts and methods used in biostatistics applications. These range from probability, distribution, and likelihood concepts to hypothesis testing and case-control sampling. This specialization also linear models for data science, starting from understanding least squares from a linear algebraic and mathematical perspective, to statistical linear models, including multivariate regression using the R programming language. These courses will give learners a firm foundation in the linear algebraic treatment of regression modeling, which will greatly augment applied data scientists' general understanding of regression models.
Course instructional level:
AdvanceCourse Duration:
3 Month/6 MonthHours: 45
Course Images
Course coordinator:
Brian Caffo, PhDPrerequisites, if any:
NACourse coordinator's profile(s):
Brian Caffo, PhD is a professor in the Department of Biostatistics at the Johns Hopkins University Bloomberg School of Public Health. He graduated from the Department of Statistics at the University of Florida in 2001. He works in the fields of computational statistics and neuroinformatics and co-created the SMART ( www.smart-stats.org) working group. He has been the recipient of the Presidential Early Career Award for Scientist ( PECASE) and Engineers and Bloomberg School of Public Health Golden Apple and AMTRA teaching awards.Course Contents:
Course Outcomes:
- Learn about probability, expectations, conditional probabilities, distributions, confidence intervals, bootstrapping, binomial proportions, and more.
- Understand the matrix algebra of linear regression models.
- Learn about canonical examples of linear models to relate them to techniques that you may already be using.