When will I have access to the lectures and assignments?

To access the course materials and assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. It also means that you will not be able to purchase a Certificate experience.

What will I get if I purchase the Certificate?

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your chosen learning program, you’ll find a link to apply on the description page.

Natural Language Processing

Instructor: BITS Pilani Instructors Group

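10 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

4 weeks to complete at 10 hours a week

Flexible schedule

Learn at your own pace

Build toward a degree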

What you'll learn

Understand and recall core concepts and techniques in Natural Language Processing (NLP).
Analyse and evaluate NLP methods for varied tasks, considering performance, context, and suitability.
Design and develop real-world NLP applications by integrating multiple techniques.

Details to know

Shareable certificate

Add to your LinkedIn profile

There are 10 modules in this course

Are you curious about how chatbots hold conversations or how ChatGPT generates human-like responses? This course in Natural Language Processing (NLP) is your gateway into the fascinating world where language meets AI. Designed for students and professionals alike, the course blends essential theory with hands-on experience to equip you with the skills needed to build intelligent language systems.

We start by unravelling what makes language so complex and why teaching machines to understand it is such a challenging task. You’ll explore the inner workings of Natural Language Understanding (NLU) and Generation (NLG), investigate real-world NLP applications, and dive into current trends like large language models (LLMs) and transformer-based systems.

From there, you’ll roll up your sleeves and learn core NLP techniques like tokenization, stemming, lemmatization, and sentence segmentation. You’ll master vector-based approaches like Bag of Words and TF-IDF, then progress to powerful word embeddings like Word2Vec (Skip-gram and CBOW) and GloVe. As you advance, you'll build language models, train simple neural networks, and explore cutting-edge tools in POS tagging, syntactic parsing, and semantic analysis. You’ll even touch the future with knowledge graphs and Word Sense Disambiguation. By the end, you’ll be ready to innovate in the fast-evolving NLP landscape.

Graduates of this NLP course can pursue roles such as NLP Engineer, Machine Learning Engineer, or Data Scientist with a focus on language technologies. Opportunities also exist in AI-driven fields like chatbots, voice assistants, sentiment analysis, and information retrieval. Advanced learners may explore careers in research, LLM fine-tuning, or knowledge graph development.

Are you ready to unlock the power of cutting-edge NLP skills? Join us on this exciting journey into the world of language, AI, and intelligent data processing!

Module details

This module introduces the fundamental concepts of Natural Language Processing (NLP). It begins with the definition of NLP and explores a variety of real-world applications. You will gain an understanding of Natural Language Understanding (NLU) and Natural Language Generation (NLG). The module also covers key evaluation metrics used to assess NLP systems. Additionally, a hands-on lab session will guide you through the implementation of basic NLP preprocessing techniques.

What's included

15 videos, 5 readings, 12 assignments

15 videos (82 minutes total)

Course Introduction (3 minutes)
Meet Your Instructor: Prof. Dr. Chetana Gavankar (2 minutes)
NLP Definition (3 minutes)
NLP Applications (5 minutes)
Why NLP is Hard? (10 minutes)
Natural Language Understanding (4 minutes)
Levels of Language Understanding (5 minutes)
Natural Language Generation (4 minutes)
Organisation of NLP System (6 minutes)
Intrinsic vs. Extrinsic Evaluation (4 minutes)
Challenges in Evaluation (4 minutes)
NLP Tools Overview (7 minutes)
Demo of NLP Tools (6 minutes)
Basic NLP Application Development Using NLP Tools (13 minutes)
Module Wrap-Up (6 minutes)

5 readings (70 minutes total)

Course Overview (10 minutes)
Recommended Reading: What is NLP? (15 minutes)
Recommended Reading: NLP Fundamentals (15 minutes)
Recommended Reading: Evaluation of NLP Systems (15 minutes)
Recommended Reading: NLP Tools Introduction (15 minutes)

12 assignments (45 minutes total)

NLP Definition (6 minutes)
NLP Applications (3 minutes)
Why NLP is a Hard Problem (3 minutes)
Natural Language Understanding (3 minutes)
Levels of Language Understanding (3 minutes)
Natural Language Generation (3 minutes)
Organisation of NLP System (3 minutes)
Intrinsic vs. Extrinsic Evaluation (6 minutes)
Challenges in Evaluation (3 minutes)
NLP Tools Overview (6 minutes)
Demo of NLP Tools (3 minutes)
Basic NLP Application Development Using NLP Tools (3 minutes)

This module introduces essential NLP preprocessing techniques. It begins with regular expressions for text pattern matching, followed by an overview of words and corpora as foundational data sources. Sentence segmentation and tokenization are then covered through practical demonstrations. Finally, the module explores normalization, lemmatization, and stemming as methods to standardise text, with a demo highlighting their differences and effects.
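As a rough illustration of the preprocessing steps named above, the sketch below uses only Python's standard re module. The suffix list and the helper names (split_sentences, tokenize, toy_stem) are assumptions made for this example and are not taken from the course, whose demos may rely on libraries such as NLTK or spaCy.

```python
import re

# Hypothetical suffix list for a toy stemmer; real stemmers (e.g. Porter) use many more rules.
SUFFIXES = ("ing", "ed", "es", "s")


def split_sentences(text):
    """Naive segmentation: split on whitespace that follows '.', '!' or '?'."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Lowercase (a simple normalization), then keep word runs and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence.lower())


def toy_stem(token):
    """Strip the first matching suffix, keeping at least three characters of stem."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


text = "The cats were running. They jumped over boxes!"
sentences = split_sentences(text)
tokens = [tokenize(s) for s in sentences]
stems = [[toy_stem(t) for t in sent] for sent in tokens]
print(stems)
```

The toy stemmer turns "running" into "runn", whereas a lemmatizer would return the dictionary form "run". That gap is one example of how stemming and lemmatization differ.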

What's included

14 videos, 4 readings, 14 assignments

14 videos (79 minutes total)

Regular Expressions (8 minutes)
Words and Corpora (5 minutes)
Sentence Segmentation (3 minutes)
Code Demo Segmentation (5 minutes)
Tokenization (5 minutes)
Tokenization Methods (7 minutes)
Code Demo Tokenization (14 minutes)
Normalization (4 minutes)
Code Demo Normalization (4 minutes)
Stemming (6 minutes)
Code Demo Stemming (5 minutes)
Lemmatization (3 minutes)
Code Demo Lemmatization (6 minutes)
Module Wrap-Up (4 minutes)

4 readings (115 minutes total)

Recommended Reading: Basic Text Preprocessing (35 minutes)
Recommended Reading: Segmentation and Tokenization (30 minutes)
Recommended Reading: Normalization (20 minutes)
Recommended Reading: Stemming and Lemmatization (30 minutes)

14 assignments (99 minutes total)

Graded Quiz: Week 1 and 2 (60 minutes)
Regular Expressions (3 minutes)
Words and Corpora (3 minutes)
Sentence Segmentation (3 minutes)
Code Demo Segmentation (3 minutes)
Tokenization (3 minutes)
Tokenization Methods (3 minutes)
Code Demo Tokenization (3 minutes)
Normalization (3 minutes)
Code Demo Normalization (3 minutes)
Stemming (3 minutes)
Code Demo Stemming (3 minutes)
Lemmatization (3 minutes)
Code Demo Lemmatization (3 minutes)

This module explores lexical and vector semantics, focusing on computational representations of word meaning. It covers word vectors, Bag of Words, and co-occurrence matrices to capture contextual relationships. Techniques such as TF-IDF are introduced to measure word importance, along with methods for computing word similarity. Practical examples and mathematical exercises on TF-IDF help reinforce these core NLP concepts.
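To make TF-IDF weighting and word-vector similarity concrete, here is a minimal sketch in plain Python. It assumes one common variant (raw term frequency multiplied by log10(N / df)); the course may define the weights differently. The example documents and function names are invented for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return (vocabulary, vectors) using raw term frequency and idf = log10(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({term for doc in tokenized for term in doc})
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in tokenized for term in set(doc))
    vectors = []
    for doc in tokenized:
        counts = Counter(doc)
        vectors.append([counts[t] * math.log10(n_docs / df[t]) for t in vocab])
    return vocab, vectors


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


docs = ["the cat sat on the mat", "the dog sat on the log", "the cat chased the dog"]
vocab, vectors = tf_idf_vectors(docs)
print(cosine(vectors[0], vectors[1]), cosine(vectors[0], vectors[2]))
```

Because "the" occurs in every document, its idf is log10(1) = 0 and it contributes nothing to the similarity. The first two documents share "sat" and "on", so they score higher against each other than the first does against the third.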

What's included

13 videos, 3 readings, 10 assignments

13 videos (72 minutes total)

Lexical Semantics (3 minutes)
Why Vectors? (7 minutes)
Word and Vectors (8 minutes)
Bag of Words (4 minutes)
Computing Word Similarity (3 minutes)
Cosine Similarity (4 minutes)
Cosine Similarity Example (7 minutes)
Term Frequency (4 minutes)
Inverse Document Frequency (11 minutes)
TF-IDF (7 minutes)
Demo of Words as Vectors (4 minutes)
Demo of TF-IDF (8 minutes)
Module Wrap-Up (4 minutes)

3 readings (45 minutes total)

Recommended Reading: Foundations of Lexical and Vector Semantics (15 minutes)
Recommended Reading: Representing Text Using Vectors (15 minutes)
Recommended Reading: Term and Inverse Document Frequency (15 minutes)

10 assignments (30 minutes total)

Lexical Semantics (3 minutes)
Why Vectors? (3 minutes)
Word and Vectors (3 minutes)
Bag of Words (3 minutes)
Computing Word Similarity (3 minutes)
Cosine Similarity (3 minutes)
Cosine Similarity Example (3 minutes)
Term Frequency (3 minutes)
Inverse Document Frequency (3 minutes)
TF-IDF (3 minutes)

This module introduces Word Embeddings, focusing on the transition from sparse to dense vector representations of words. It covers Word2Vec models, including Skip-gram and CBOW, explained with simple, intuitive examples. The module also explores GloVe embeddings, which capture global word co-occurrence statistics for improved semantic understanding. Learners will visualise word embeddings to gain insights into how words relate in vector space. Finally, the module highlights real-world applications of word embeddings in NLP tasks like sentiment analysis, machine translation, and question answering.
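The Skip-gram idea can be sketched in a few lines: each word is paired with the words in a small window around it, and the negative-sampling objective rewards a high dot product for real pairs and a low one for sampled noise words. The code below is an illustrative sketch only. The function names, the window size, and the two-dimensional vectors are assumptions, and no training loop is shown.

```python
import math


def skipgram_pairs(tokens, window=2):
    """Return (target, context) pairs for every context word within `window` positions."""
    pairs = []
    for i, target in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        pairs.extend((target, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sgns_loss(target_vec, context_vec, negative_vecs):
    """Negative-sampling log loss for one positive pair and its sampled negatives."""
    loss = -math.log(sigmoid(dot(target_vec, context_vec)))
    for neg in negative_vecs:
        loss -= math.log(sigmoid(-dot(target_vec, neg)))
    return loss


pairs = skipgram_pairs(["the", "quick", "brown", "fox"], window=1)
# Hypothetical 2-d vectors: the context is aligned with the target, the negative is opposed.
good = sgns_loss([1.0, 0.0], [1.0, 0.0], [[-1.0, 0.0]])
# Same target, but the context is opposed and the negative is aligned.
bad = sgns_loss([1.0, 0.0], [-1.0, 0.0], [[1.0, 0.0]])
print(pairs, good, bad)
```

The loss is lower when the true context vector points the same way as the target and the noise vector points away. Training moves the vectors in that direction.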

What's included

13 videos, 3 readings, 13 assignments

13 videos (79 minutes total)

Word2Vec (4 minutes)
Basic 1-Hot Word Representation (4 minutes)
Feature Based Word Representations (3 minutes)
Skip Gram Algorithm Introduction (6 minutes)
Skip Gram Probabilities (8 minutes)
Skip-Gram Negative Sampling (SGNS) Approach (7 minutes)
Skip-Gram Negative Training Data Example (7 minutes)
SGNS Log Loss Function (7 minutes)
Derivative of SGNS Loss Function (6 minutes)
SGNS Example Part 1 (12 minutes)
SGNS Example Part 2 (8 minutes)
Continuous Bag of Words (CBOW) (5 minutes)
Module Wrap Up (4 minutes)

3 readings (45 minutes total)

Recommended Reading: Basics of Word2Vec (15 minutes)
Recommended Reading: Skip-Gram Word Embedding (15 minutes)
Essential Reading Material: Other Word2Vec Approaches, CBOW and GloVe (15 minutes)

13 assignments (96 minutes total)

Graded Quiz: Week 3 and 4 (60 minutes)
Word2Vec (3 minutes)
Basic 1-Hot Word Representation (3 minutes)
Feature Based Word Representations (3 minutes)
Skip Gram Algorithm Introduction (3 minutes)
Skip Gram Probabilities (3 minutes)
Skip-Gram Negative Sampling (SGNS) Approach (3 minutes)
Skip-Gram Negative Training Data Example (3 minutes)
SGNS Log Loss Function (3 minutes)
Derivative of SGNS Loss Function (3 minutes)
SGNS Example Part 1 (3 minutes)
SGNS Example Part 2 (3 minutes)
Continuous Bag of Words (CBOW) (3 minutes)

This module introduces Language Modeling (LM) and its role in predicting word sequences in natural language. It explores practical applications of LMs and explains N-gram models, including challenges like generalization and handling zero probabilities. Techniques such as smoothing and stupid backoff are covered to improve model robustness. The module concludes with methods for evaluating language models using standard metrics.
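A small worked example may help show how zero probabilities arise and how smoothing removes them. The sketch below builds a bigram model with add-one (Laplace) smoothing and computes perplexity. The three-sentence corpus and all function names are assumptions made for illustration, and out-of-vocabulary words are deliberately not handled.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count histories and bigrams, padding each sentence with <s> and </s>."""
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        tokens = ["<s>"] + sent.split() + ["</s>"]
        unigrams.update(tokens[:-1])  # every token except </s> acts as a history
        bigrams.update(zip(tokens, tokens[1:]))
    # Words that can be predicted: all corpus words plus the end marker.
    vocab = {w for sent in sentences for w in sent.split()} | {"</s>"}
    return unigrams, bigrams, vocab


def laplace_prob(prev, word, unigrams, bigrams, vocab):
    """Add-one estimate: P(word | prev) = (c(prev, word) + 1) / (c(prev) + |V|)."""
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + len(vocab))


def perplexity(sentence, unigrams, bigrams, vocab):
    """Exponentiated average negative log-probability per predicted token."""
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    log_prob = sum(
        math.log(laplace_prob(p, w, unigrams, bigrams, vocab))
        for p, w in zip(tokens, tokens[1:])
    )
    return math.exp(-log_prob / (len(tokens) - 1))


corpus = ["i like nlp", "i like tea", "you like nlp"]
unigrams, bigrams, vocab = train_bigram(corpus)
print(laplace_prob("like", "nlp", unigrams, bigrams, vocab))
print(perplexity("i like nlp", unigrams, bigrams, vocab))
```

With smoothing, an unseen bigram such as ("you", "tea") receives a small non-zero probability instead of zero, so a sentence containing it still gets a finite perplexity.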

What's included

15 videos, 4 readings, 13 assignments

15 videos (96 minutes total)

What is Language Modeling? (3 minutes)
Language Modelling Applications (3 minutes)
How to Build a Language Model (5 minutes)
Markov Assumption (2 minutes)
N-gram Language Models (4 minutes)
Bi-gram Computation (10 minutes)
Raw Probabilities (10 minutes)
Perils of Overfitting (3 minutes)
Laplace Smoothing (14 minutes)
Interpolation & Backoff (10 minutes)
How Good is the Model? (3 minutes)
Extrinsic Evaluation (5 minutes)
Perplexity & Its Example (9 minutes)
Module Demo (10 minutes)
Module Wrap-Up (5 minutes)

4 readings (60 minutes total)

Recommended Reading: Language Modelling Introduction (15 minutes)
Recommended Reading: N-grams (15 minutes)
Recommended Reading: Smoothing (15 minutes)
Recommended Reading: Language Modelling Evaluation (15 minutes)

13 assignments (39 minutes total)

What is Language Modeling? (3 minutes)
Language Modelling Applications (3 minutes)
How to Build a Language Model (3 minutes)
Markov Assumption (3 minutes)
N-gram Language Models (3 minutes)
Bi-gram Computation (3 minutes)
Raw Probabilities (3 minutes)
Perils of Overfitting (3 minutes)
Laplace Smoothing (3 minutes)
Interpolation & Backoff (3 minutes)
How Good is the Model? (3 minutes)
Extrinsic Evaluation (3 minutes)
Perplexity & Its Example (3 minutes)

This module explores the use of Neural Networks in Language Modelling, starting with the fundamentals of Feed-Forward Neural Networks and their training process for language tasks. It introduces Neural Language Models, which capture complex patterns in text beyond traditional statistical methods. The module also provides a foundational understanding of Large Language Models (LLMs) and their capabilities. Finally, it introduces Prompt Engineering as a technique to effectively interact with and guide LLMs for various NLP applications.
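The forward pass of a small feed-forward network, ending in a softmax over output classes, can be sketched as follows. The layer sizes, the weights, and the choice of ReLU as the activation are assumptions for illustration only. Training by backpropagation is not shown.

```python
import math


def softmax(scores):
    """Turn raw scores into probabilities; subtracting the max keeps exp() stable."""
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def relu(values):
    return [max(0.0, v) for v in values]


def affine(weights, bias, inputs):
    """Compute W x + b for a weight matrix given as a list of rows."""
    return [sum(w * x for w, x in zip(row, inputs)) + b for row, b in zip(weights, bias)]


def forward(x, w1, b1, w2, b2):
    """One hidden layer with ReLU, followed by a softmax output layer."""
    hidden = relu(affine(w1, b1, x))
    return softmax(affine(w2, b2, hidden))


# Hypothetical weights: 2 inputs -> 2 hidden units -> 3 output classes.
w1, b1 = [[0.5, -0.5], [1.0, 1.0]], [0.0, 0.0]
w2, b2 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [0.0, 0.0, 0.0]
probs = forward([1.0, 2.0], w1, b1, w2, b2)
print(probs)
```

In a neural language model, the output layer would have one unit per vocabulary word, so the softmax gives a probability distribution over the next word.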

What's included

17 videos, 5 readings, 16 assignments

17 videos (98 minutes total)

Neural Network Unit (3 minutes)
Non-Linear Activation Functions (5 minutes)
Perceptron with Examples (4 minutes)
Multi-Layer Perceptron (8 minutes)
Softmax Function with Example (4 minutes)
Feed Connected Neural Network (4 minutes)
Feedforward Network (5 minutes)
Forward Algorithm (4 minutes)
Backpropagation Algorithm (5 minutes)
Training Neural Network (12 minutes)
Neural Language Modeling (6 minutes)
Training Neural Language Model (9 minutes)
N-gram Versus Neural Language Model (4 minutes)
Neural LM Demo (10 minutes)
What is LLM? (6 minutes)
LLM Use Cases (5 minutes)
Module Wrap Up (3 minutes)

5 readings (90 minutes total)

Recommended Reading: Introduction to Neural Network (15 minutes)
Recommended Reading: Feed Forward Neural Network (15 minutes)
Recommended Reading: Training Neural Network (15 minutes)
Recommended Reading: Neural Language Models (15 minutes)
Recommended Reading: Introduction to Large Language Models (30 minutes)

16 assignments (105 minutes total)

Graded Quiz: Week 5 and 6 (60 minutes)
Neural Network Unit (3 minutes)
Non-Linear Activation Functions (3 minutes)
Perceptron with Examples (3 minutes)
Multi-Layer Perceptron (3 minutes)
Softmax Function with Example (3 minutes)
Feed Connected Neural Network (3 minutes)
Feed Forward Network (3 minutes)
Forward Algorithm (3 minutes)
Backpropagation Algorithm (3 minutes)
Training Neural Network (3 minutes)
Neural Language Modeling (3 minutes)
Training Neural Language Model (3 minutes)
N-gram Versus Neural Language Model (3 minutes)
What is LLM? (3 minutes)
LLM Use Cases (3 minutes)

This module provides an introduction to Part-of-Speech (POS) Tagging, techniques to perform POS Tagging and their applications in NLP. POS tagging is a fundamental task in Natural Language Processing (NLP) that involves assigning grammatical categories (like noun, verb, adjective) to words in text. Starting from basic linguistic foundations and real-world applications, the module dives into the evolution of POS tagging techniques—from statistical models like Hidden Markov Models (HMMs) and Maximum Entropy classifiers, to modern deep learning approaches using Recurrent Neural Networks (RNNs). Learners will gain a strong theoretical understanding and insight into how POS tagging supports downstream tasks like parsing, named entity recognition, and machine translation. The module includes a hands-on coding demonstration for POS tagging.
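To illustrate how an HMM assigns tags, the sketch below runs Viterbi decoding over a two-tag toy model. All probabilities are made-up values chosen for the example, not estimates from a corpus, and the function signature is an assumption rather than any library's API.

```python
def viterbi(words, tags, start_p, trans_p, emit_p):
    """Return the most probable tag sequence for `words` under a first-order HMM."""
    # best[i][tag] = (probability of the best path ending in `tag` at position i, previous tag)
    best = [{t: (start_p[t] * emit_p[t].get(words[0], 0.0), None) for t in tags}]
    for word in words[1:]:
        column = {}
        for t in tags:
            column[t] = max(
                (best[-1][p][0] * trans_p[p][t] * emit_p[t].get(word, 0.0), p)
                for p in tags
            )
        best.append(column)
    # Backtrace from the most probable final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for column in reversed(best[1:]):
        tag = column[tag][1]
        path.append(tag)
    return path[::-1]


# Hypothetical toy parameters.
tags = ["NOUN", "VERB"]
start_p = {"NOUN": 0.7, "VERB": 0.3}
trans_p = {"NOUN": {"NOUN": 0.3, "VERB": 0.7}, "VERB": {"NOUN": 0.6, "VERB": 0.4}}
emit_p = {
    "NOUN": {"dogs": 0.4, "fish": 0.5, "sleep": 0.1},
    "VERB": {"fish": 0.4, "sleep": 0.6},
}
print(viterbi(["dogs", "sleep"], tags, start_p, trans_p, emit_p))
```

For long sentences a practical implementation would work with log probabilities to avoid numerical underflow. Plain products are kept here so the table values are easy to check by hand.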

What's included

13 videos, 5 readings, 11 assignments

13 videos (74 minutes total)

Outline of the Module (2 minutes)
What is POS Tagging? (6 minutes)
Challenges in POS Tagging (4 minutes)
POS Tagsets (6 minutes)
Markov Chain (5 minutes)
Hidden Markov Model (5 minutes)
Hidden Markov Model as POS Tagger (6 minutes)
Viterbi Algorithm (8 minutes)
Viterbi Algorithm - Example (8 minutes)
Logistic Regression - Overview (9 minutes)
Multinomial Logistic Regression - Overview (6 minutes)
Maximum Entropy Markov Models (MEMM) (7 minutes)
Module Wrap Up (2 minutes)

5 readings (110 minutes total)

Code Document: POS tagging using NLTK / spaCy (10 minutes)
Recommended Reading: Introduction to POS Tagging and Applications (30 minutes)
Code Document: Demonstrating HMM Based POS Tagger (10 minutes)
Recommended Reading: HMM for POS Tagging (30 minutes)
Recommended Reading: Maximum Entropy Markov Models (30 minutes)

11 assignments (33 minutes total)

What is POS Tagging? (3 minutes)
Challenges in POS Tagging (3 minutes)
POS Tagsets (3 minutes)
Markov Chain (3 minutes)
Hidden Markov Model (3 minutes)
Hidden Markov Model as POS Tagger (3 minutes)
Viterbi Algorithm (3 minutes)
Viterbi Algorithm - Example (3 minutes)
Logistic Regression - Overview (3 minutes)
Multinomial Logistic Regression - Overview (3 minutes)
Maximum Entropy Markov Models (MEMM) (3 minutes)

This module introduces students to the syntactic structure of natural language and its critical role in Natural Language Processing (NLP) applications. Parsing is the task of assigning a structured representation—typically a tree—to a sentence, revealing the grammatical relationships between its components. The module begins by revisiting Context-Free Grammars (CFGs) and how they form the foundation for syntactic parsing. We explore Constituent Parsing, introducing classical parsing techniques such as the CKY (Cocke-Kasami-Younger) algorithm. The module then transitions to modern span-based neural parsing approaches that use neural networks to score and predict parse trees. A significant portion of the module is dedicated to Dependency Parsing, where syntactic structure is represented through direct relationships between words rather than phrases. Students will study both transition-based and graph-based dependency parsers, gaining insight into their strengths, algorithmic designs, and practical performance. Throughout the module, we emphasise real-world NLP applications.
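The CKY algorithm mentioned above fills a table of spans bottom-up, combining adjacent sub-spans using binary rules of a grammar in Chomsky Normal Form. The following is a minimal recognizer sketch. The toy grammar and the dictionary-based rule encoding are assumptions for this example, and it only answers whether a sentence is derivable, without recovering the parse tree.

```python
from collections import defaultdict


def cky_recognize(words, lexical, binary, start="S"):
    """Return True if `words` can be derived from `start` under a CNF grammar."""
    n = len(words)
    # table[(i, j)] holds the non-terminals that can derive words[i:j].
    table = defaultdict(set)
    for i, word in enumerate(words):
        table[(i, i + 1)] = set(lexical.get(word, ()))
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):  # split point between the two sub-spans
                for left in table[(i, k)]:
                    for right in table[(k, j)]:
                        table[(i, j)] |= binary.get((left, right), set())
    return start in table[(0, n)]


# Hypothetical toy grammar in Chomsky Normal Form.
lexical = {"the": {"Det"}, "dog": {"N"}, "cat": {"N"}, "chased": {"V"}}
binary = {("Det", "N"): {"NP"}, ("V", "NP"): {"VP"}, ("NP", "VP"): {"S"}}

print(cky_recognize("the dog chased the cat".split(), lexical, binary))
```

Storing back-pointers alongside each non-terminal in the table would turn this recognizer into a parser, because the tree could then be read back from the top cell.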

What's included

18 videos, 4 readings, 17 assignments

18 videos (88 minutes total)

Outline of the Module (2 minutes)
Introduction to Context-Free Grammars (CFGs) (8 minutes)
Constituency and Phrase Structure (5 minutes)
Ambiguity in Grammar (4 minutes)
Chomsky Normal Form (CNF) and Grammar Normalisation (5 minutes)
Treebanks and Empirical Grammar (3 minutes)
CKY Algorithm (7 minutes)
CKY Algorithm - Walkthrough (8 minutes)
Parse Tree Recovery From CKY Table (5 minutes)
Neural Span-based Constituency Parsing (5 minutes)
What is Dependency Parsing? (5 minutes)
Dependency Formalism (5 minutes)
Universal Dependency Relations (4 minutes)
Transition-Based Dependency Parsing (6 minutes)
Transition-Based Dependency Parsing - Walkthrough (5 minutes)
Creating an Oracle (4 minutes)
Graph-Based Dependency Parsing (5 minutes)
Module Wrap Up (2 minutes)

4 readings (120 minutes total)

Recommended Reading: Review of Context-Free Grammars and Parsing in NLP (30 minutes)
Recommended Reading: Constituency Parsing and CKY Algorithm (30 minutes)
Recommended Reading: Dependency Parsing – Theory and Representations (30 minutes)
Recommended Reading: Dependency Parsing Algorithms and Modern Applications (30 minutes)

17 assignments (111 minutes total)

Graded Quiz: Week 7 and 8 (60 minutes)
Introduction to Context-Free Grammars (CFGs) (3 minutes)
Constituency and Phrase Structure (3 minutes)
Ambiguity in Grammar (3 minutes)
Chomsky Normal Form (CNF) and Grammar Normalisation (3 minutes)
Treebanks and Empirical Grammar (3 minutes)
CKY Algorithm (3 minutes)
CKY Algorithm - Walkthrough (3 minutes)
Parse Tree Recovery From CKY Table (3 minutes)
Neural Span-based Constituency Parsing (3 minutes)
What is Dependency Parsing? (3 minutes)
Dependency Formalism (3 minutes)
Universal Dependency Relations (3 minutes)
Transition-Based Dependency Parsing (3 minutes)
Transition-Based Dependency Parsing - Walkthrough (6 minutes)
Creating an Oracle (3 minutes)
Graph-Based Dependency Parsing (3 minutes)

This module explores the semantic dimension of natural language by covering both lexical semantics—including word senses, ambiguity, and disambiguation techniques—and the semantic web—a framework for enabling machine-readable, structured understanding of web data. The module starts with foundational concepts in lexical semantics and WordNet, then proceeds to classical and modern word sense disambiguation (WSD) methods. The second part focuses on Semantic Web technologies, covering ontologies, knowledge graphs, RDF/OWL, and their role in enabling intelligent systems and knowledge-driven NLP applications.
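Knowledge-based disambiguation in the style of the Lesk algorithm can be sketched by counting word overlap between a context and each sense's dictionary gloss. The example below is a simplified variant with a hand-written, hypothetical sense inventory and stopword list. A real system would take glosses from WordNet, for example through nltk.corpus.wordnet.

```python
# Hypothetical stopword list, kept short for the example.
STOPWORDS = {"a", "an", "the", "of", "in", "on", "to", "that", "and", "or", "is", "i", "we"}

# Hypothetical mini sense inventory; the glosses are written for this example.
SENSES = {
    "bank": {
        "bank.financial": "an institution that accepts deposits of money and makes loans",
        "bank.river": "sloping land beside a river or other body of water",
    }
}


def content_words(text):
    """Lowercased whitespace tokens with stopwords removed."""
    return {w for w in text.lower().split() if w not in STOPWORDS}


def simplified_lesk(word, context):
    """Pick the sense whose gloss shares the most content words with the context."""
    context_words = content_words(context) - {word}
    glosses = SENSES[word]
    return max(glosses, key=lambda sense: len(content_words(glosses[sense]) & context_words))


print(simplified_lesk("bank", "i went to the bank to deposit money"))
print(simplified_lesk("bank", "we sat on the bank of the river and watched the water"))
```

The overlap count compares exact word forms, so "deposit" does not match "deposits". Applying stemming or lemmatization first would usually increase the overlap. When two senses tie, this sketch simply returns the one listed first.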

What's included

17 videos, 5 readings, 14 assignments

17 videos (85 minutes total)

Outline of the Module (1 minute)
What is a Word Sense? (3 minutes)
Homonymy vs Polysemy (7 minutes)
Sense Relations (7 minutes)
Introduction to WordNet and Synsets (7 minutes)
Relations in WordNet (5 minutes)
Navigating WordNet Hierarchies and Graph Structures (5 minutes)
What is Word Sense Disambiguation? (4 minutes)
Supervised WSD (8 minutes)
Knowledge-Based WSD: Lesk Algorithm (5 minutes)
From Syntactic Web to Semantic Web: What's the Problem? (6 minutes)
Semantic Web Vision: Data Integration and Automation (3 minutes)
Ontologies (4 minutes)
Ontology Languages and Their Layers (9 minutes)
What is a Knowledge Graph? (3 minutes)
Applications in NLP (6 minutes)
Module Wrap Up (1 minute)

5 readings (130 minutes total)

Recommended Reading: Word Senses and Lexical Semantics (30 minutes)
Code Document: Querying WordNet in Python (using nltk.corpus.wordnet) (10 minutes)
Recommended Reading: WordNet and Semantic Lexicons (30 minutes)
Recommended Reading: Word Sense Disambiguation (WSD) (30 minutes)
Recommended Reading: Introduction to the Semantic Web and Ontologies (30 minutes)

14 assignments (42 minutes total)

What is a Word Sense? (3 minutes)
Homonymy vs Polysemy (3 minutes)
Sense Relations (3 minutes)
Introduction to WordNet and Synsets (3 minutes)
Relations in WordNet (3 minutes)
Navigating WordNet Hierarchies and Graph Structures (3 minutes)
What is Word Sense Disambiguation? (3 minutes)
Supervised WSD (3 minutes)
Knowledge-Based WSD: Lesk Algorithm (3 minutes)
Semantic Web Vision: Data Integration and Automation (3 minutes)
Ontologies (3 minutes)
Ontology Languages and Their Layers (3 minutes)
What is a Knowledge Graph? (3 minutes)
Applications in NLP (3 minutes)

This module introduces students to the evolution of neural network architectures in NLP, beginning with recurrent models (RNNs), progressing through attention mechanisms, and culminating in Transformer-based models that have revolutionised natural language processing. Through hands-on coding and application-driven lessons, students will explore how Transformers power state-of-the-art systems in sentiment analysis (text classification), machine translation, and question answering. The module emphasises both theoretical foundations and practical implementation using modern deep learning frameworks.