This comprehensive course guides students through the complete data analytics workflow using Python, combining programming fundamentals with advanced statistical analysis. The curriculum is structured across five interconnected modules that build upon each other, using real-world datasets to provide practical, hands-on experience.
Starting with programming fundamentals, you'll learn essential Python concepts while working with real datasets like public library revenue and restaurant safety inspections. The course introduces the Jupyter Notebook environment and transitions students from spreadsheet-based analysis to powerful programmatic approaches. Students master core programming concepts including variables, functions, and control flow structures.
This course helps you bridge the gap between theoretical knowledge and practical application, enabling you to become proficient in using Python for comprehensive data analysis, from basic data manipulation to advanced statistical modeling and forecasting.
This module is an introduction to Python programming, designed for beginners with no prior coding experience. You will explore the fundamental concepts and practices that underpin programming languages, with a specific focus on their application in data manipulation and analysis.
涵盖的内容
24个视频10篇阅读材料4个作业1个编程作业3个非评分实验室
显示有关单元内容的信息
24个视频•总计97分钟
Welcome to Course 3•6分钟
Generative AI in this course•2分钟
Module 1 introduction•2分钟
Computer programming•4分钟
Navigating the Jupyter notebook environment•4分钟
Input, processing, output•3分钟
Python or a spreadsheet?•4分钟
Types and expressions•3分钟
Printing and comments•4分钟
Storing information: variables•4分钟
Debugging with variables•5分钟
Creating lists•4分钟
List operations•5分钟
Taking action: calling functions•5分钟
State•3分钟
Control flow•2分钟
Comparison•5分钟
Branching code: if & else•4分钟
Repeating actions: for loops•4分钟
Indentation•5分钟
Branching code: elif•5分钟
Repeating actions: range•4分钟
Execution order•3分钟
Your first graded lab•3分钟
10篇阅读材料•总计78分钟
Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!•2分钟
[Optional] Practice with types, expressions, and printing•5分钟
Variable names•15分钟
[Optional] Practice with variables•10分钟
[Optional] Practice with lists and functions•10分钟
[Optional] Practice with comparisons and if statements•5分钟
[Optional] Practice with branching code•5分钟
[Optional] Practice with loops•5分钟
Python cheat sheet•20分钟
Module 1 lecture notes•1分钟
4个作业•总计230分钟
Module 1 quiz•30分钟
Lesson 1 quiz•10分钟
Lesson 2 quiz•10分钟
Lesson 3 quiz•180分钟
1个编程作业•总计80分钟
Retail sales analysis•80分钟
3个非评分实验室•总计130分钟
Module 1 lecture code•10分钟
Air quality in New York City - Data Exploration•60分钟
Air quality in New York City - Looping Through Data•60分钟
Data structures and descriptive statistics
第 2 单元•小时 后完成
单元详情
This module introduces essential data analysis techniques using Python and the pandas library. You will learn how to import and work with data efficiently, leveraging DataFrames and Series to manipulate, filter, and analyze datasets. The module covers fundamental concepts such as vectorization for performance optimization, distinguishing between attributes and methods, and performing descriptive statistics. Additionally, you will explore data visualization techniques and segmentation methods to extract meaningful insights from structured data.
涵盖的内容
19个视频9篇阅读材料4个作业1个编程作业4个非评分实验室
显示有关单元内容的信息
19个视频•总计71分钟
Module 2 introduction•1分钟
Beyond lists•4分钟
Importing modules•4分钟
Pandas•2分钟
Reading CSV into Python•5分钟
DataFrames•4分钟
Attributes and methods•2分钟
Selecting columns•5分钟
Counts, sums, & histograms•4分钟
Sorting•3分钟
Sorting by multiple columns•3分钟
Filtering•5分钟
Filtering by multiple conditions•2分钟
Selecting rows•5分钟
Central tendency, variability, and skewness•5分钟
Categorical data•5分钟
Correlation•6分钟
Segmentation by one feature•3分钟
Segmentation by multiple features•5分钟
9篇阅读材料•总计101分钟
About the 2016 New Coder Survey data set•5分钟
[Optional] Practice with DataFrames•20分钟
Dictionaries and NumPy arrays•20分钟
[Optional] Practice with sorting•10分钟
[Optional] Practice with filtering•10分钟
Selection in Pandas•10分钟
[Optional] Practice with descriptive statistics•15分钟
Python Cheat Sheet•10分钟
Module 2 lecture notes•1分钟
4个作业•总计60分钟
Module 2 quiz•30分钟
Lesson 1 quiz•10分钟
Lesson 2 quiz•10分钟
Lesson 3 quiz•10分钟
1个编程作业•总计80分钟
Retail sales - Expanding your analysis•80分钟
4个非评分实验室•总计120分钟
Module 2 lecture code•30分钟
Practice Lab: Buenos Aires Subway - Data structures•30分钟
Practice Lab: Buenos Aires Subway - Sorting and filtering•30分钟
Practice Lab: Buenos Aires subway - Descriptive statistics•30分钟
Visualization
第 3 单元•小时 后完成
单元详情
This module focuses on data visualization using Python, covering essential tools and techniques for creating effective visuals. You will learn to generate visualizations directly from pandas DataFrames and Series, as well as use popular libraries like matplotlib and Seaborn to develop custom plots. The module explores various visualization types, from basic line graphs and bar charts to advanced distribution and categorical plots. Additionally, you will learn how to enhance readability through styling, annotations, and design choices to highlight trends, patterns, and anomalies in data.
涵盖的内容
18个视频4篇阅读材料4个作业1个编程作业4个非评分实验室
显示有关单元内容的信息
18个视频•总计75分钟
Module 3 introduction•1分钟
Plotting with matplotlib•6分钟
Colors, grids, & saving plots•4分钟
Text & annotations•5分钟
Ticks & spines•5分钟
Grouped column charts•6分钟
Stacked column charts •3分钟
Scatter plots•4分钟
Method chaining•4分钟
Plotting with Seaborn•5分钟
Themes & palettes•3分钟
Box plots•5分钟
Histograms•4分钟
Other charts•5分钟
Combining charts•4分钟
Matplotlib subplots•4分钟
Looping with subplots•3分钟
Seaborn pairplot•3分钟
4篇阅读材料•总计41分钟
Reading Documentation•15分钟
[Optional] Practice with method chaining•15分钟
Python Cheat Sheet•10分钟
Module 3 lecture notes•1分钟
4个作业•总计80分钟
Module 3 quiz•30分钟
Lesson 1 quiz•10分钟
Lesson 2 quiz•10分钟
Lesson 3 quiz•30分钟
1个编程作业•总计80分钟
Exploring Australia's coral reefs•80分钟
4个非评分实验室•总计90分钟
Module 3 lecture code•30分钟
Practice Lab: Flight delays and cancellations - Matplotlib Charts•20分钟
Practice Lab: Flight delays and cancellations - Plotting with Seaborn•20分钟
Practice Lab: Flight delays and cancellations - Histograms and rugplots•20分钟
Inferential Statistics
第 4 单元•小时 后完成
单元详情
This module introduces statistical inference and regression modeling using Python. You will learn to construct confidence intervals, perform hypothesis testing with t-tests, and simulate data using NumPy. The module covers both simple and multiple linear regression, guiding you through model development, interpretation of key metrics (such as R-squared, p-values, and coefficients), and prediction of new data points. Additionally, you will explore methods to encode categorical variables, evaluate model performance using error metrics, and refine regression models with the help of Large Language Models (LLMs).
涵盖的内容
20个视频6篇阅读材料4个作业1个编程作业4个非评分实验室
显示有关单元内容的信息
20个视频•总计89分钟
Module 4 introduction•2分钟
Confidence intervals•6分钟
One-sample t-tests•5分钟
Two-sample t-tests•4分钟
Simulation: uniform•5分钟
Simulation: normal•4分钟
What is linear regression?•6分钟
Choosing an independent variable•3分钟
Training the model•5分钟
Interpreting the output of a regression model•5分钟
Prediction•5分钟
Multiple linear regression•4分钟
Training a multiple linear regression model •5分钟
Interpreting multiple linear regression•4分钟
Encoding categorical data•4分钟
Modeling with categorical data•5分钟
Prediction: Multiple Linear Regression•3分钟
Evaluating your model•5分钟
LLMs for model iteration•6分钟
The linear regression process•3分钟
6篇阅读材料•总计56分钟
[Optional] Practice with confidence intervals•10分钟
[Optional] Practice with simulating random outcomes•15分钟
Nonlinear transformations•10分钟
Interaction features•10分钟
Python Cheat Sheet•10分钟
Module 4 lecture notes•1分钟
4个作业•总计60分钟
Module 4 quiz•30分钟
Lesson 1 quiz•10分钟
Lesson 2 quiz•10分钟
Lesson 3 quiz•10分钟
1个编程作业•总计80分钟
Analyzing Car CO₂ Emissions•80分钟
4个非评分实验室•总计120分钟
Module 4 lecture code•30分钟
Practice Lab: London housing prices - Confidence intervals and hypothesis testing•30分钟
Practice Lab: London housing prices - Linear regression•30分钟
Practice Lab: London housing prices - Regression with categorical data•30分钟
Time series
第 5 单元•小时 后完成
单元详情
This module explores working with time series data in Python, focusing on DateTime objects, indexing, and visualization. You will learn to manipulate time-based data, apply descriptive statistics, and segment time series by key date features. The module covers resampling and reshaping techniques, as well as using simple and multiple linear regression to model trends and seasonality. Additionally, you will evaluate forecasting models using appropriate error metrics to assess their performance.
涵盖的内容
14个视频5篇阅读材料4个作业2个编程作业5个非评分实验室
显示有关单元内容的信息
14个视频•总计61分钟
Module 5 introduction•1分钟
DateTimes•5分钟
Using DateTimes as indices•4分钟
Line charts•4分钟
Formatting date axis labels•6分钟
Moving average•5分钟
Percent change•5分钟
Segmentation•5分钟
Multiple line charts: reshaping•6分钟
Resampling•5分钟
Forecasting with the trend•6分钟
Forecasting with seasonality•5分钟
Error metrics for forecasting•4分钟
Your next steps•1分钟
5篇阅读材料•总计36分钟
[Optional] Practice with Datetimes•10分钟
[Optional] Practice with reshaping (pivoting) dataframes•10分钟
Python Cheat Sheet•10分钟
Module 5 lecture notes•1分钟
Acknowledgments•5分钟
4个作业•总计50分钟
Module 5 quiz•30分钟
Lesson 1 quiz•10分钟
Lesson 2 quiz•5分钟
Lesson 3 quiz•5分钟
2个编程作业•总计160分钟
Analyzing Chlorophyll levels in Australian Coral Reefs•80分钟
Capstone: Loan Interest Rates•80分钟
5个非评分实验室•总计160分钟
Module 5 lecture code•30分钟
Practice Lab: Flight delays and cancellations - Plotting the time series•20分钟
Practice Lab: Flight delays and cancellations - Working with time series data•20分钟
Practice Lab: Flight delays and cancellations - Linear regression with time series•30分钟
(Optional) Installing Python on your computer•60分钟
DeepLearning.AI is an education technology company that develops a global community of AI talent.
DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.