返回到 AI Workflow: Data Analysis and Hypothesis Testing
IBM

AI Workflow: Data Analysis and Hypothesis Testing

This is the second course in the IBM AI Enterprise Workflow Certification specialization.  You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones.   In this course you will begin your work for a hypothetical streaming media company by doing exploratory data analysis (EDA).  Best practices for data visualization, handling missing data, and hypothesis testing will be introduced to you as part of your work.  You will learn techniques of estimation with probability distributions and extending these estimates to apply null hypothesis significance tests. You will apply what you learn through two hands on case studies: data visualization and multiple testing using a simple pipeline.   By the end of this course you should be able to: 1.  List several best practices concerning EDA and data visualization 2.  Create a simple dashboard in Watson Studio 3.  Describe strategies for dealing with missing data 4.  Explain the difference between imputation and multiple imputation 5.  Employ common distributions to answer questions about event probabilities 6.  Explain the investigative role of hypothesis testing in EDA 7.  Apply several methods for dealing with multiple testing   Who should take this course? This course targets existing data science practitioners that have expertise building machine learning models, who want to deepen their skills on building and deploying AI in large enterprises. If you are an aspiring Data Scientist, this course is NOT for you as you need real world expertise to benefit from the content of these courses. What skills should you have? It is assumed that you have completed Course 1 of the IBM AI Enterprise Workflow specialization and have a solid understanding of the following topics prior to starting this course: Fundamental understanding of Linear Algebra; Understand sampling, probability theory, and probability distributions; Knowledge of descriptive and inferential statistical concepts; General understanding of machine learning techniques and best practices; Practiced understanding of Python and the packages commonly used in data science: NumPy, Pandas, matplotlib, scikit-learn; Familiarity with IBM Watson Studio; Familiarity with the design thinking process.

状态:Machine Learning
状态:Python Programming
高级设置课程小时

精选评论

PM

5.0评论日期:Apr 2, 2020

More practicality and assignment should me there. Which is more helpful for the learners.

RM

5.0评论日期:Jul 6, 2020

Very Informative and Labs for Hands-on session was useful.

所有审阅

显示:18/18

Olivier Roncalez
1.0
评论日期:May 6, 2020
Jonathan Venezia
1.0
评论日期:May 27, 2020
Pralay Maity
5.0
评论日期:Apr 2, 2020
Mahjube Chavoshi
3.0
评论日期:May 18, 2020
Rangarajan me16s058
5.0
评论日期:Jul 7, 2020
PERAM MAHENDRA REDDY
5.0
评论日期:Aug 22, 2024
Vaibhav Kumar
5.0
评论日期:Sep 12, 2022
Rafail Mahammadli
5.0
评论日期:Oct 5, 2020
Théophile Pace
4.0
评论日期:Apr 29, 2021
SALVADOR LINARES MORCILLO
4.0
评论日期:Sep 15, 2020
Shoaib Qureshi
4.0
评论日期:Dec 12, 2020
Dang Ha Gia Huy
4.0
评论日期:Apr 17, 2025
BHAVANA gubbi
3.0
评论日期:Aug 30, 2020
Brunello Bonanni
3.0
评论日期:Apr 24, 2021
Pertti Viitamaki
3.0
评论日期:Aug 12, 2020
SUPARNA CHATTERJEE
3.0
评论日期:Dec 17, 2020
Gaurav Saxena
3.0
评论日期:Aug 3, 2020
Vasyl Rudiuk
2.0
评论日期:Jul 1, 2020