"Statistical Learning for Data Science" is an advanced course designed to equip working professionals with the knowledge and skills necessary to excel in the field of data science. Through comprehensive instruction on key topics such as shrink methods, parametric regression analysis, generalized linear models, and general additive models, students will learn how to apply resampling methods to gain additional information about fitted models, optimize fitting procedures to improve prediction accuracy and interpretability, and identify the benefits and approach of non-linear models. This course is the perfect choice for anyone looking to upskill or transition to a career in data science.
This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others. With performance-based admissions and no application process, the MS-DS is ideal for individuals with a broad range of undergraduate education and/or professional experience in computer science, information science, mathematics, and statistics. Learn more about the MS-DS program at https://hua.dididi.sbs/degrees/master-of-science-data-science-boulder.
Welcome to our Resampling, Selection, and Splines class! In this course, we will dive deep into these key topics in statistical learning and explore how they can be applied to data science. The module provides an introductory overview of the course and introduces the course instructor.
涵盖的内容
6个视频3篇阅读材料1个讨论话题
显示有关单元内容的信息
6个视频•总计83分钟
Course Introduction•3分钟
Generalized Linear Models: Part 1•15分钟
Generalized Linear Models: Part 2•20分钟
Parametric vs Non-Parametric Regression•25分钟
General Additive Models Part 1•10分钟
General Additive Models Part 2•11分钟
3篇阅读材料•总计21分钟
Course Updates and Accessibility Support•1分钟
Earn Academic Credit for your Work!•10分钟
Course Support•10分钟
1个讨论话题•总计10分钟
Introduce Yourself!•10分钟
Generalized Least Squares
第 2 单元•小时 后完成
单元详情
In this module, we will turn our attention to generalized least squares (GLS). GLS is a statistical method that extends the ordinary least squares (OLS) method to account for heteroscedasticity and serial correlation in the error terms. Heteroscedasticity is the condition where the variance of the errors is not constant across all levels of the predictor variables, while serial correlation is the condition where the errors are correlated across time or space. GLS has many practical applications, such as in finance for modeling asset returns, in econometrics for modeling time series data, and in spatial analysis for modeling spatially correlated data. By the end of this module, you will have a good understanding of how GLS works and when it is appropriate to use it. You will also be able to implement GLS in R using the gls() function in the nlme package.
涵盖的内容
1个视频1篇阅读材料1个编程作业1个非评分实验室
显示有关单元内容的信息
1个视频•总计21分钟
Generalized Least Squares•21分钟
1篇阅读材料•总计30分钟
Generalized Least Squares (GLS): Relations to OLS & WLS•30分钟
1个编程作业•总计60分钟
Generalized Least Squares•60分钟
1个非评分实验室•总计60分钟
Generalized Least Squares Practice•60分钟
Shrink Methods
第 3 单元•小时 后完成
单元详情
In this module, we will explore ridge regression, LASSO, and principal component analysis (PCA). These techniques are widely used for regression and dimensionality reduction tasks in machine learning and statistics.
涵盖的内容
7个视频3篇阅读材料3个编程作业
显示有关单元内容的信息
7个视频•总计101分钟
L1 and L2 Norms•12分钟
Ridge Regression: Part 1•7分钟
Ridge Regression: Part 2•8分钟
Ridge Regression: Part 3•18分钟
LASSO •13分钟
Principle Component Analysis (PCA) Overview•19分钟
PCA in Terms of SVD•23分钟
3篇阅读材料•总计105分钟
Ridge Regression•30分钟
LASSO•30分钟
Principle Component Analysis•45分钟
3个编程作业•总计120分钟
Ridge Regression•60分钟
LASSO•30分钟
Principle Component Analysis•30分钟
Cross-Validation
第 4 单元•小时 后完成
单元详情
This week, we will be exploring the concept of cross-validation, a crucial technique used to evaluate and compare the performance of different statistical learning models. We will explore different types of cross-validation techniques, including k-fold cross-validation, leave-one-out cross-validation, and stratified cross-validation. We will discuss their strengths, weaknesses, and best practices for implementation. Additionally, we will examine how cross-validation can be used for model selection and hyperparameter tuning.
涵盖的内容
1个视频1篇阅读材料1个编程作业
显示有关单元内容的信息
1个视频•总计7分钟
Cross-Validation•7分钟
1篇阅读材料•总计90分钟
Summary•90分钟
1个编程作业•总计90分钟
Cross-Validation•90分钟
Bootstrapping
第 5 单元•小时 后完成
单元详情
For our final module, we will explore bootstrapping. Bootstrapping is a resampling technique that allows us to gain insights into the variability of statistical estimators and quantify uncertainty in our models. By creating multiple simulated datasets through resampling, we can explore the distribution of sample statistics, construct confidence intervals, and perform hypothesis testing. Bootstrapping is particularly useful when parametric assumptions are hard to meet or when we have limited data. By the end of this week, you will have an understanding of bootstrapping and its practical applications in statistical learning.
涵盖的内容
1个视频1篇阅读材料1个编程作业
显示有关单元内容的信息
1个视频•总计10分钟
Bootstrapping•10分钟
1篇阅读材料•总计60分钟
Summary•60分钟
1个编程作业•总计90分钟
Bootstrapping•90分钟
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
攻读学位
课程 是 University of Colorado Boulder提供的以下学位课程的一部分。如果您被录取并注册,您已完成的课程可计入您的学位学习,您的学习进度也可随之转移。
查看符合条件的学位
攻读学位
课程 是 University of Colorado Boulder提供的以下学位课程的一部分。如果您被录取并注册,您已完成的课程可计入您的学位学习,您的学习进度也可随之转移。
CU Boulder is a dynamic community of scholars and learners on one of the most spectacular college campuses in the country. As one of 34 U.S. public institutions in the prestigious Association of American Universities (AAU), we have a proud tradition of academic excellence, with five Nobel laureates and more than 50 members of prestigious academic academies.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.