Programming Generative AI: Unit 3

This course is part of the Programming Generative AI Specialization

Instructor: Pearson

Included with Coursera Plus

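1 module

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

8 hours to complete

Flexible schedule

Learn at your own pace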

What you'll learn

Understand and implement multimodal models that integrate images and text for advanced AI applications.
Build and optimize semantic image search engines using contrastive language-image pre-training.
Master the principles and practicalities of latent diffusion and stable diffusion for text-to-image generation.
Adapt, fine-tune, and efficiently evaluate pre-trained generative models for new tasks, styles, and real-time performance.

Details to know

Shareable certificate

Add to your LinkedIn profile

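Build your subject-matter expertise

This course is part of the Programming Generative AI Specialization.

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate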

There is 1 module in this course

Unlock the full potential of generative AI with our advanced course module focused on state-of-the-art multimodal models. This course is designed for learners eager to bridge the gap between images and text, and to master the latest techniques in AI-driven content generation. You’ll begin by exploring the foundational concepts behind multimodal models, learning how contrastive language-image pre-training enables seamless integration of visual and textual data. Discover how these models power innovative applications like semantic image search, allowing you to query image content without manual labeling. Dive deeper into the mechanics of latent diffusion models and unravel the inner workings of stable diffusion, gaining the skills to transform text prompts into entirely new, never-before-seen images. The course also covers essential strategies for evaluating generative models and introduces efficient methods for fine-tuning and adapting pre-trained models to new styles and subjects. By the end, you’ll be equipped to build, adapt, and optimize cutting-edge text-to-image systems—ready to innovate in creative, research, or commercial settings.
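
As a rough illustration of the semantic image search idea described above, the sketch below ranks images by cosine similarity between a text-query embedding and a set of image embeddings. The three-dimensional vectors and file names are invented for illustration; in practice a CLIP-style model would produce the embeddings for both the text and the images in a shared space.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_embedding, image_embeddings, top_k=2):
    """Return the top_k (name, score) pairs, best match first."""
    scored = [
        (name, cosine_similarity(query_embedding, emb))
        for name, emb in image_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical toy embeddings; a real system would compute these with a model.
IMAGE_EMBEDDINGS = {
    "dog_on_beach.jpg": [0.9, 0.1, 0.2],
    "city_skyline.jpg": [0.1, 0.9, 0.3],
    "puppy_in_park.jpg": [0.8, 0.2, 0.1],
}
# Stands in for the embedding of a text query such as "a photo of a dog".
QUERY_EMBEDDING = [1.0, 0.0, 0.1]

results = search(QUERY_EMBEDDING, IMAGE_EMBEDDINGS)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

No image needs a manual label here: the ranking depends only on how close each image vector lies to the query vector.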

This module delves into multimodal generative AI, focusing on models that connect images and text. Learners explore contrastive language-image pre-training for semantic image search and uncover the workings of latent diffusion and stable diffusion for text-to-image generation. The module then covers evaluation of generative models, parameter-efficient fine-tuning, and techniques to teach pre-trained models new styles and subjects. It concludes with methods to optimize diffusion models for faster, near real-time image generation, equipping students with both conceptual understanding and practical skills in advanced multimodal AI systems.
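
The parameter-efficient fine-tuning mentioned above can be sketched with a low-rank adapter in the style of LoRA, which the module covers. This is a minimal sketch under stated assumptions, not the course's implementation: the frozen weight `W` is left untouched and only two small matrices `A` and `B` would be trained, giving an effective weight of `W + (alpha / r) * B @ A`. The matrix values, `alpha`, and the helper names are hypothetical.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where r is the adapter rank."""
    r = len(a)  # A has shape (r, d_in)
    scale = alpha / r
    delta = matmul(b, a)  # shape (d_out, d_in), same as W
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def trainable_params(d_out, d_in, r):
    """Number of adapter parameters: A is (r, d_in) and B is (d_out, r)."""
    return r * d_in + d_out * r


# Toy 4x4 frozen weight (identity) with a rank-1 adapter.
W = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
A = [[1.0, 2.0, 0.0, 0.0]]          # shape (1, 4)
B = [[0.5], [0.0], [0.0], [1.0]]    # shape (4, 1)

W_eff = lora_effective_weight(W, A, B, alpha=2.0)
print(W_eff[0])  # first row of the adapted weight
print(trainable_params(768, 768, 4), "adapter parameters vs", 768 * 768, "full")
```

For a 768 by 768 layer, a rank-4 adapter has 6,144 parameters against 589,824 for the full matrix, which is where the efficiency comes from.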

What's included

44 videos, 3 assignments

44 videos, 407 minutes total

Topics (0 minutes)
Components of a Multimodal Model (5 minutes)
Vision-Language Understanding (9 minutes)
Contrastive Language-Image Pretraining (6 minutes)
Embedding Text and Images with CLIP (14 minutes)
Zero-Shot Image Classification with CLIP (3 minutes)
Semantic Image Search with CLIP (10 minutes)
Conditional Generative Models (5 minutes)
Introduction to Latent Diffusion Models (8 minutes)
The Latent Diffusion Model Architecture (5 minutes)
Failure Modes and Additional Tools (6 minutes)
Stable Diffusion Deconstructed (11 minutes)
Writing Our Own Stable Diffusion Pipeline (11 minutes)
Decoding Images from the Stable Diffusion Latent Space (4 minutes)
Improving Generation with Guidance (9 minutes)
Playing with Prompts (30 minutes)
Topics (0 minutes)
Methods and Metrics for Evaluating Generative AI (7 minutes)
Manual Evaluation of Stable Diffusion with DrawBench (13 minutes)
Quantitative Evaluation of Diffusion Models with Human Preference Predictors (20 minutes)
Overview of Methods for Fine-Tuning Diffusion Models (9 minutes)
Sourcing and Preparing Image Datasets for Fine-Tuning (7 minutes)
Generating Automatic Captions with BLIP-2 (8 minutes)
Parameter Efficient Fine-Tuning with LoRA (11 minutes)
Inspecting the Results of Fine-Tuning (5 minutes)
Inference with LoRAs for Style-Specific Generation (12 minutes)
Conceptual Overview of Textual Inversion (8 minutes)
Subject-Specific Personalization with Dreambooth (7 minutes)
Dreambooth versus LoRA Fine-Tuning (6 minutes)
Dreambooth Fine-Tuning with Hugging Face (14 minutes)
Inference with Dreambooth to Create Personalized AI Avatars (14 minutes)
Adding Conditional Control to Text-to-Image Diffusion Models (4 minutes)
Creating Edge and Depth Maps for Conditioning (15 minutes)
Depth and Edge-Guided Stable Diffusion with ControlNet (17 minutes)
Understanding and Experimenting with ControlNet Parameters (8 minutes)
Generative Text Effects with Font Depth Maps (2 minutes)
Few Step Generation with Adversarial Diffusion Distillation (ADD) (7 minutes)
Reasons to Distill (6 minutes)
Comparing SDXL and SDXL Turbo (11 minutes)
Text-Guided Image-to-Image Translation (16 minutes)
Video-Driven Frame-by-Frame Generation with SDXL Turbo (13 minutes)
Near Real-Time Inference with PyTorch Performance Optimizations (11 minutes)
Programming Generative AI: Summary (1 minute)
Course Summary (1 minute)
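
The lesson on improving generation with guidance is listed above without detail. Assuming it refers to classifier-free guidance, a technique commonly used with Stable Diffusion, the step that combines the two noise predictions can be sketched as follows. The function name and the toy values are hypothetical; in a real pipeline the two predictions would come from running the denoising network with and without the text prompt.

```python
def apply_guidance(noise_uncond, noise_cond, guidance_scale):
    """Combine unconditional and text-conditioned noise predictions.

    A scale of 0 returns the unconditional prediction, a scale of 1 returns
    the conditional one, and larger scales push further in the direction
    that the prompt suggests.
    """
    return [
        u + guidance_scale * (c - u)
        for u, c in zip(noise_uncond, noise_cond)
    ]


# Toy two-element "noise predictions" standing in for full latent tensors.
uncond = [0.0, 1.0]
cond = [1.0, 1.0]

print(apply_guidance(uncond, cond, guidance_scale=1.0))
print(apply_guidance(uncond, cond, guidance_scale=7.5))
```

Only the elements where the two predictions disagree are changed by the scale, so the prompt's influence is amplified without altering what both predictions already share.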

3 assignments, 90 minutes total

Connecting Text and Images Quiz (30 minutes)
Post-Training Procedures for Diffusion Models Quiz (30 minutes)
End of Assessment Quiz (30 minutes)

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Pearson

268 courses, 9,891 learners

Offered by

Pearson

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.