This course offers students an opportunity to learn fundamentals of computation required to understand and analyze real world data. The course helps students to work with modern data structures, apply data cleaning and data wrangling operations. The course covers conceptual and practical applications of probability and distribution, cluster analysis, text analysis and time series analysis.
This course is Part 1 of 2.
In this module, we will focus on Python programming fundamentals. The aim is to help you understand Python's basic syntax, data types, and operators, enabling the creation of simple programs. Additionally, we will cover the use of if statements, loops, and proper indentation to control program flow, fostering a foundational understanding of essential control structures in Python programming.
涵盖的内容
5个视频6篇阅读材料2个作业1个讨论话题
显示有关单元内容的信息
5个视频•总计19分钟
Course Overview•2分钟
Meet Your Faculty•1分钟
Python Fundamentals•8分钟
Control Structures Pt 1•3分钟
Control Structures Pt 2•6分钟
6篇阅读材料•总计83分钟
Course Introduction•2分钟
Syllabus - Foundations of Data Analytics Part 1•5分钟
Academic Integrity•1分钟
Python Fundamentals•40分钟
Control Structures Pt 1•10分钟
Control Structures Pt 2•25分钟
2个作业•总计20分钟
Module 1 Assess Your Learning: Python Fundamentals•10分钟
Module 1 Assess Your Learning: Control Structures•10分钟
1个讨论话题•总计20分钟
Meet Your Fellow Learners•20分钟
Data Structures
第 2 单元•小时 后完成
单元详情
In this module we will dive into the diverse landscape of Python data structures, including lists, dictionaries, sets, tuples, and arrays. By exploring real-world use cases, you will uncover the unique strengths and weaknesses of each data structure. You will gain insights into recognizing and understanding the characteristics of these structures, empowering you to make informed choices when tackling programming challenges. Through hands-on practice, you will develop the skills to select and apply the most suitable data structure to efficiently solve a wide range of problems, enhancing their proficiency in Python programming.
涵盖的内容
2个视频2篇阅读材料1个作业
显示有关单元内容的信息
2个视频•总计13分钟
List, Tuples, and Arrays•8分钟
Dictionaries•5分钟
2篇阅读材料•总计200分钟
List, Tuples, and Arrays•170分钟
Dictionaries•30分钟
1个作业•总计10分钟
Module 2 Assess Your Learning: Lists, Tuples, Arrays & Dictionaries•10分钟
Modern Data Structures and Data Wrangling
第 3 单元•小时 后完成
单元详情
In this module we will introduce DataFrames, a pivotal tool in data manipulation and analysis. You will grasp the fundamental concepts of DataFrames, learning how to create, manipulate, and access data efficiently. You will gain essential skills for basic data exploration–including summarizing data, indexing, and slicing, enabling them to extract meaningful insights. Furthermore, this module equips learners with the expertise to clean and preprocess data, covering handling missing values, filtering data, merging/joining datasets, and transforming data for analysis readiness. By the end of this module, you will harness DataFrames for advanced data analysis, mastering group-wise operations, aggregation, and statistical analysis.
涵盖的内容
3个视频3篇阅读材料2个作业
显示有关单元内容的信息
3个视频•总计19分钟
Introduction to DataFrames•9分钟
Data Cleaning and Transformation•5分钟
Data Aggregation•5分钟
3篇阅读材料•总计195分钟
Introduction to DataFrames•160分钟
Data Cleaning and Transformation•10分钟
Data Aggregation•25分钟
2个作业•总计20分钟
Module 3 Assess Your Learning: DataFrames•10分钟
Module 3 Assess Your Learning: Data Cleaning, Transformation and Aggregation•10分钟
Exploratory Data Analysis
第 4 单元•小时 后完成
单元详情
This module will equip you with a comprehensive toolkit for proficient data exploration and analysis. It covers the essential techniques and tools for effectively summarizing data sets, encompassing statistical summaries, data visualization, and data cleaning methods. You will learn how to identify and assess missing data, outliers, and anomalies, vital tasks during the initial exploratory phase of data analysis. Furthermore, you will develop the ability to uncover patterns, relationships, and trends within the data using various visualizations, including scatter plots, histograms, and correlation matrices, enabling you to extract valuable insights and make informed decisions from the data.
涵盖的内容
2个视频2篇阅读材料1个作业
显示有关单元内容的信息
2个视频•总计14分钟
Data Exploration Techniques•5分钟
Visualization Methods for Pattern Recognition•9分钟
2篇阅读材料•总计102分钟
Data Exploration Techniques•100分钟
Visualization Methods for Pattern Recognition•2分钟
1个作业•总计10分钟
Module 4 Assess Your Learning: Visualization Methods for Pattern Recognition•10分钟
Introduction to Clustering Techniques
第 5 单元•小时 后完成
单元详情
In this module, we will delve into the fundamental concepts of clustering, a critical component of data analysis and pattern recognition. You will learn to recognize the importance of clustering and its role in identifying meaningful groups within data. You will explore key concepts, including data similarity, distance metrics, and the objective of grouping similar data points together. Additionally, the module equips you with the skills to assess the quality of clustering results through evaluation metrics like silhouette score and Dunn index, as well as visual inspection of clustering plots. By the end of this module, you will be proficient in understanding, applying, and evaluating clustering techniques for effective data analysis and pattern recognition.
涵盖的内容
4个视频4篇阅读材料2个作业
显示有关单元内容的信息
4个视频•总计24分钟
Clustering Techniques•4分钟
Pattern Recognition using Clustering Techniques•3分钟
Understanding Clustering Fundamentals•7分钟
Cluster Validity•11分钟
4篇阅读材料•总计135分钟
Clustering Techniques•30分钟
Pattern Recognition using Clustering Techniques•30分钟
Understanding Clustering Fundamentals•15分钟
Cluster Validity•60分钟
2个作业•总计20分钟
Module 5 Assess Your Learning: Clustering Techniques•10分钟
Module 5 Assess Your Learning: Cluster Validity•10分钟
Clustering Algorithms
第 6 单元•小时 后完成
单元详情
This module will provide you with a comprehensive exploration of clustering algorithms, enabling proficiency in this crucial data analysis technique. You will identify various clustering algorithms, such as k-means, hierarchical clustering, and DBSCAN, along with their underlying principles and assumptions. The expertise you will gain will help you determine the most suitable clustering algorithm based on data characteristics and objectives, and you will learn to implement these algorithms using programming languages like Python and tools such as scikit-learn. The clustering quality can be evaluated using internal and external validation methods, as discussed in Week 5.
涵盖的内容
4个视频4篇阅读材料3个作业
显示有关单元内容的信息
4个视频•总计14分钟
K-Means Clustering•6分钟
Hierarchical Clustering•4分钟
DBSCAN•2分钟
Choosing Appropriate Clustering Algorithms•2分钟
4篇阅读材料•总计4分钟
K-Means Clustering•1分钟
Hierarchical Clustering•1分钟
DBSCAN•1分钟
Choosing Appropriate Clustering Algorithms•1分钟
3个作业•总计30分钟
Module 6 Assess Your Learning: K-Means Clustering•10分钟
Module 6 Assess Your Learning: Hierarchical Clustering•10分钟
Module 6 Assess Your Learning: DBSCAN•10分钟
Introduction to Time Series Analysis
第 7 单元•小时 后完成
单元详情
In this module, you will explore the realm of time series data, gaining a comprehensive understanding of its characteristics, components (trend, seasonality, and noise), and prevalent sources across diverse domains. Through effective visualization techniques and descriptive statistics, you will acquire the skills to recognize patterns and trends within time series data.
涵盖的内容
4个视频5篇阅读材料3个作业
显示有关单元内容的信息
4个视频•总计17分钟
Time Series Analysis•2分钟
Components of Time Series•3分钟
Visual Analytics of Time Series•5分钟
Time Series Data Techniques•7分钟
5篇阅读材料•总计20分钟
Time Series Analysis•1分钟
Components of Time Series•3分钟
Visual Analytics of Time Series•3分钟
Time Series Data Techniques•3分钟
Congratulations!•10分钟
3个作业•总计30分钟
Module 7 Assess Your Learning: Time Series Analysis•10分钟
Module 7 Assess Your Learning: Visual Analytics of Time Series•10分钟
Module 7 Assess Your Learning: Time Series Data Techniques•10分钟
Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I purchase the Certificate?
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.