Welcome to this course on applied natural language processing in engineering. This course is designed to provide you with an in-depth understanding of NLP, a pivotal area of artificial intelligence that empowers computers to comprehend, interpret, and generate human language. Throughout this course, you will explore a wide range of topics, from fundamental NLP tasks like text classification and Named Entity Recognition (NER) to advanced techniques in neural machine translation and optimization methods critical for machine learning. We will delve into the complexities of teaching language to machines, addressing challenges like ambiguity, grammar, and cultural nuances. By the end of this part 1 course, you will have a foundational understanding of how modern NLP systems work - particularly those involving machine learning and deep learning. These topics will equip you to build, analyze and improve NLP systems across many applications.
This module provides an in-depth exploration of Natural Language Processing (NLP), a crucial area of artificial intelligence that enables computers to understand, interpret, and generate human language. By combining computational linguistics with machine learning, NLP is applied in various technologies, from chatbots and sentiment analysis to machine translation and speech recognition. The module introduces fundamental NLP tasks such as text classification, Named Entity Recognition (NER), and neural machine translation, showcasing how these applications shape real-world interactions with AI. Additionally, it highlights the complexities of teaching language to machines, including handling ambiguity, grammar, and cultural nuances. Through the course, you will gain hands-on experience and knowledge about key techniques like word representation and distributional semantics, preparing them to solve language-related challenges in modern AI systems.
涵盖的内容
4个视频19篇阅读材料2个作业1个应用程序项目
显示有关单元内容的信息
4个视频•总计6分钟
Course Introduction•3分钟
Meet Your Faculty•1分钟
Natural Language Processing (NLP)•1分钟
Representing the Meaning of a Word•1分钟
19篇阅读材料•总计154分钟
Course Introduction•2分钟
Syllabus - Applied Natural Language Processing in Engineering Part 1•10分钟
Academic Integrity•1分钟
Recommended Prior Knowledge•100分钟
Week 1 Introduction•2分钟
Introduction to NLP•5分钟
Example: Chatbots•2分钟
Example: Email Filtering•2分钟
Example: Sentiment Analysis•3分钟
Example: GPT - 3•3分钟
Example: ChatGPT Capabilities•5分钟
Natural Language Processing•1分钟
Funny Takes on Language Evolution•2分钟
How Do We Represent the Meaning of a Word?•2分钟
How Do We Have Usable Meaning in a Computer?•4分钟
Words as Discrete Symbols•5分钟
Representing Words by Their Context•2分钟
Word Vectors•2分钟
Final Thoughts on NLP•1分钟
2个作业•总计36分钟
Assess Your Learning: What is NLP?•18分钟
Assess Your Learning: Motivation•18分钟
1个应用程序项目•总计15分钟
Challenges of Teaching Language to AI•15分钟
Gradient Descent & Optimization Techniques
第 2 单元•小时 后完成
单元详情
This module focuses on optimization techniques critical for machine learning, particularly in natural language processing (NLP) tasks. It introduces Gradient Descent (GD), a fundamental algorithm used to minimize cost functions by iteratively adjusting model parameters. You’ll explore variants like Stochastic Gradient Descent (SGD) and Mini-Batch Gradient Descent to learn more about their efficiency in handling large datasets. Advanced methods such as Momentum and Adam are covered to give you insight on how to enhance convergence speed by smoothing updates and adapting learning rates. The module also covers second-order techniques like Newton’s Method and Quasi-Newton methods (e.g., BFGS), which leverage curvature information for more direct optimization, although they come with higher computational costs. Overall, this module emphasizes balancing efficiency, accuracy, and computational feasibility in optimizing machine learning models.
涵盖的内容
2个视频16篇阅读材料3个作业
显示有关单元内容的信息
2个视频•总计8分钟
Machine Learning and NLP•4分钟
Optimization Techniques•4分钟
16篇阅读材料•总计82分钟
Week 2 Overview•2分钟
Machine Learning•2分钟
Variations of Gradient Descent•2分钟
Types of ML in NLP•6分钟
What is a Model in NLP and How Does it Learn?•6分钟
Understanding Cost Functions•2分钟
Minimizing the Cost Function in NLP•10分钟
Why Optimization Techniques Matter•1分钟
Why SGD Works•10分钟
Jacobian Matrix & Hessian Matrix•5分钟
Momentum•10分钟
Newton's Methods•5分钟
Quasi-Newton Methods•5分钟
Root Mean Square Propagation (RMSProp)•5分钟
Adaptive Moment Estimation (Adam)•10分钟
Overall Challenges of Second-Order Optimization Techniques•1分钟
3个作业•总计81分钟
Assess Your Learning: ML in NLP•18分钟
Assess Your Learning: Optimization Techniques•18分钟
Module 2 Quiz•45分钟
Neural Networks & Cost Functions
第 3 单元•小时 后完成
单元详情
This module explores Named Entity Recognition (NER), a core task in Natural Language Processing (NLP) that identifies and classifies entities like people, locations, and organizations in text. We’ll begin by examining how logistic regression can be used to model NER as a binary classification problem, though this approach faces limitations with complexity and context capture. We’ll then transition to more advanced techniques, such as neural networks, which excel at handling the complex patterns and large-scale data that traditional models struggle with. Neural networks' ability to learn hierarchical features makes them ideal for NER tasks, as they can capture contextual information more effectively than simpler models. Throughout the module, we compare these methods and highlight how deep learning approaches such as Recurrent Neural Networks (RNNs) and transformers like BERT improve NER accuracy and scalability.
涵盖的内容
2个视频14篇阅读材料3个作业1个应用程序项目
显示有关单元内容的信息
2个视频•总计4分钟
Neural Networks Definitions•4分钟
Network Propagation•0分钟
14篇阅读材料•总计89分钟
Week 3 Overview•2分钟
Neural Networks•2分钟
Named Entity Recognition (NER)•5分钟
NER as a Binary Regression Problem•5分钟
Neural Network•5分钟
Neural Network Structure•5分钟
How Does a Neural Network Learn?•10分钟
Mathematical Representation•20分钟
Steps in Back Propagation Algorithm•5分钟
Stochastic Gradient•5分钟
Classification Tasks•5分钟
Sequence-to-Sequence Tasks•5分钟
Sequence Labeling Tasks•5分钟
Regression Tasks & Divergence Measures•10分钟
3个作业•总计81分钟
Assess Your Learning: NER & Neural Networks•18分钟
Assess Your Learning: Cost Functions•18分钟
Module 3 Quiz•45分钟
1个应用程序项目•总计10分钟
Some Common Activation Functions•10分钟
Embeddings, GloVe, Evaluation Techniques
第 4 单元•小时 后完成
单元详情
The Word2Vec and GloVe models are popular word embedding techniques in Natural Language Processing (NLP), each offering unique advantages. Word2Vec, developed by Google, operates via two key models: Continuous Bag of Words (CBOW) and Skip-gram, focusing on predicting a word based on its context or vice versa (Word2Vec). The GloVe model, on the other hand, created by Stanford, combines count-based and predictive approaches by leveraging word co-occurrence matrices to learn word vectors (GloVe). Both models represent words in a high-dimensional vector space and capture semantic relationships. Word2Vec focuses on local contexts, learning efficiently from large datasets, while GloVe emphasizes global word co-occurrence patterns across the entire corpus, revealing deeper word associations. These embeddings enable tasks like analogy-solving, semantic similarity, and other linguistic computations, making them central to modern NLP applications.
涵盖的内容
3个视频29篇阅读材料4个作业1个应用程序项目
显示有关单元内容的信息
3个视频•总计11分钟
GLoVe Training Process•5分钟
Word2Vec•4分钟
Skip-Gram•2分钟
29篇阅读材料•总计267分钟
Week 4 Overview•2分钟
Introduction to GLoVe•5分钟
Co-occurrence Matrix•5分钟
Objective: Ratio of Co-occurrences•5分钟
Calculating Probability Ratios•5分钟
Symmetry and Linearity in GloVe•5分钟
Minimizing the Cost Function and Optimizing Word Vectors•5分钟
Optimization Process•10分钟
Final Word Vectors•2分钟
Implicit Properties in GloVe•5分钟
GLoVe Introduction•2分钟
What is Language Modeling?•5分钟
Co-occurrence Matrix•5分钟
Vector Representations for Word•3分钟
Continuous Bag of Words (CBOW)•5分钟
Mathematical Objectives•10分钟
Mathematical Objectives 2•15分钟
Limitations of CBOW•1分钟
Skip-Gram•15分钟
Gradient Derivation•15分钟
The Challenge of Training Skip-Gram•10分钟
Binary Classification Perspective•10分钟
Gradient of Negative Sampling Objective•10分钟
Connecting Between Skip-Gram, Negative Sampling, and One Sampling•2分钟
Skip-Gram with Negative Sampling Across All Words•10分钟
Negative Sampling in Skip-Gram Model•10分钟
Word2Vec Example•30分钟
Word2Vec Worked Example •30分钟
Word2Vec Example 2•30分钟
4个作业•总计99分钟
Assess Your Learning: GLoVe•18分钟
Assess Your Learning: Word2Vec & CBOW•18分钟
Assess Your Learning: Skip-Gram & Negative Sampling•18分钟
Module 4 Quiz•45分钟
1个应用程序项目•总计3分钟
GloVe Training Process•3分钟
Evaluation Techniques
第 5 单元•小时 后完成
单元详情
This module delves into the evaluation techniques for Natural Language Processing (NLP) models, focusing on both intrinsic and extrinsic evaluation methods. Intrinsic evaluation assesses the model's performance based on internal criteria, such as word embedding quality, parsing accuracy, and language model perplexity. In contrast, extrinsic evaluation measures the model's effectiveness in real-world applications, including tasks like machine translation, sentiment analysis, and named entity recognition. You’ll also learn more about key differences between these evaluation types, and the importance of context and application in determining a model's utility. Additionally, you’ll review specific metrics like cross-entropy loss, perplexity, BLEU, and ROUGE scores, providing a comprehensive understanding of how to evaluate and improve NLP models.
涵盖的内容
9篇阅读材料2个作业1个应用程序项目
显示有关单元内容的信息
9篇阅读材料•总计99分钟
Week 5 Overview•2分钟
General Concept of Evaluation (in NLP)•15分钟
Key Differences Between Intrinsic and Extrinsic Evaluation•2分钟
Cross-Entropy Loss - Intrinsic•10分钟
Cross-Entropy and Learning from Incorrect Predictions•15分钟
Recall and Precision in Text Summarization or Translation•15分钟
Recall-Oriented Understudy for Gisting Evaluation (ROUGE) - Extrinsic•10分钟
2个作业•总计63分钟
Assess Your Learning: NLP Model Evaluation•18分钟
Module 5 Quiz•45分钟
1个应用程序项目•总计10分钟
Evaluation Techniques•10分钟
Topic Modeling
第 6 单元•小时 后完成
单元详情
This module explores various techniques for topic modeling in natural language processing (NLP), focusing on methods like Latent Semantic Analysis (LSA), Non-Negative Matrix Factorization (NMF), and Latent Dirichlet Allocation (LDA). It begins with an introduction to matrix factorization and the importance of transforming textual data into numerical representations. You’ll delve into the mechanics of LSA and NMF, paying attention to their use of TF-IDF and Singular Value Decomposition (SVD) to uncover latent semantic structures. Additionally, you’ll review LDA's probabilistic approach to topic modeling, explaining its reliance on Dirichlet distributions and Bayesian inference to identify hidden topics within a corpus. Through detailed examples and mathematical explanations, the module provides a comprehensive understanding of how these techniques can be applied to extract meaningful topics from large text datasets.
涵盖的内容
1个视频16篇阅读材料4个作业1个应用程序项目
显示有关单元内容的信息
1个视频•总计5分钟
Topic Modeling•5分钟
16篇阅读材料•总计133分钟
Week 6 Overview•2分钟
Matrix Factorization•1分钟
Latent Semantic Analysis (LSA)•15分钟
LSA Example•15分钟
Topic Modeling Using Latent Semantic Analysis (LSA)•5分钟
Dimensions and Applications•5分钟
Non-Negative Matrix Factorization (NMF)•5分钟
Operationalizing NMF•7分钟
Numerical Example of NMF•15分钟
Applications of NMF•2分钟
Latent Dirichlet Allocation (LDA)•5分钟
Defining the Problem and Key Assumptions•1分钟
Mathematical Model of LDA•10分钟
Steps in LDA: Mathematical Explanation•15分钟
Maximizing the Posterior Probability in LDA•15分钟
Full Example•15分钟
4个作业•总计99分钟
Assess Your Learning: Latent Semantic Analysis•18分钟
Assess Your Learning: Non-Negative Matrix Factorization•18分钟
Assess Your Learning: Latent Dirichlet Allocation•18分钟
Module 6 Quiz•45分钟
1个应用程序项目•总计10分钟
Recapping NMF & LDA•10分钟
Dependency Parsing
第 7 单元•小时 后完成
单元详情
This module delves into the essential techniques of syntactic and semantic parsing in natural language processing (NLP). You’ll begin with an exploration of linguistic structures, focusing on phrase structure and dependency structure, which are fundamental for understanding sentence syntax. Then you’ll review various parsing methods, including transition-based and graph-based dependency parsing, highlighting their respective advantages and challenges. Additionally, you’ll touch on neural transition-based parsing, which leverages neural networks for improved accuracy and efficiency. Finally, the module touches on semantic parsing, emphasizing its role in mapping sentences to formal representations of meaning, crucial for applications like dialogue systems and information extraction.
涵盖的内容
2个视频32篇阅读材料4个作业
显示有关单元内容的信息
2个视频•总计5分钟
Transition-Based and Graph Parsing Examples•2分钟
Neural Advancements in Parsing: Dependency and Semantics•3分钟
Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I purchase the Certificate?
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.