"Unlock Multimodal Search" is an intermediate, hands-on course for developers and ML engineers ready to build the next generation of AI-powered search. Text-only search is no longer enough; this 90-minute course will teach you how to create applications that can search across different data types, such as finding text from an image. Using the open-source vector database Weaviate, you will move from theory to a functioning demonstration. This course requires basic skills in Docker, APIs, Python, and the command line (CLI), as well as familiarity with vector databases. Docker Desktop must be installed.
This course is focused on execution. You will learn to configure a Weaviate schema to handle both image and text embeddings for a single object, ingest multimodal data, and perform powerful cross-modal queries. Through a final, hands-on project that mirrors a real-world job task, you will not only build an image-to-text search demo but also learn how to measure its accuracy with precision metrics. By the end, you'll be equipped to architect and validate sophisticated, multimodal AI applications.
This module provides the foundation for multimodal search. You will learn how to configure a Weaviate instance and design a schema that can store both image and text embeddings for a single object, preparing your database for cross-modal queries.
What's included
1 video, 1 reading, 2 assignments
1 video • Total 6 minutes
How-To: Configure a Multimodal Schema • 6 minutes
1 reading • Total 5 minutes
How Weaviate Handles Multimodal Data • 5 minutes
2 assignments • Total 13 minutes
Untitled • 8 minutes
Knowledge Check: Multimodal Schema Concepts • 5 minutes
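To make the schema idea in this module more concrete, here is a minimal sketch of what a class definition for an object with both an image and a text field might look like. It assumes Weaviate's `multi2vec-clip` vectorizer module and the dictionary-style class definition format; the course itself may take a different approach. The class name `CaptionedImage`, the property names `caption` and `image`, and the helper functions are hypothetical, chosen only for illustration. The sketch builds and sanity-checks the definition locally and does not connect to a Weaviate instance.

```python
import json


def build_multimodal_class(class_name="CaptionedImage"):
    """Return a class definition with one text field and one image field.

    Assumption: the multi2vec-clip module is enabled on the Weaviate
    instance. Class and property names are hypothetical.
    """
    return {
        "class": class_name,
        "vectorizer": "multi2vec-clip",
        "moduleConfig": {
            "multi2vec-clip": {
                # Properties the module should embed, by modality.
                "imageFields": ["image"],
                "textFields": ["caption"],
            }
        },
        "properties": [
            {"name": "caption", "dataType": ["text"]},
            # Images are stored as base64-encoded blobs.
            {"name": "image", "dataType": ["blob"]},
        ],
    }


def check_vectorized_fields(class_def):
    """List mismatches between the module config and the properties."""
    props = {p["name"]: p["dataType"] for p in class_def["properties"]}
    cfg = class_def["moduleConfig"][class_def["vectorizer"]]
    problems = []
    for field in cfg.get("imageFields", []):
        if props.get(field) != ["blob"]:
            problems.append(f"image field {field!r} must be a blob property")
    for field in cfg.get("textFields", []):
        if props.get(field) != ["text"]:
            problems.append(f"text field {field!r} must be a text property")
    return problems


class_def = build_multimodal_class()
print(json.dumps(class_def, indent=2))
print("problems:", check_vectorized_fields(class_def))
```

Checking the definition before sending it to the database catches a common mistake early: naming a field in the module configuration that does not exist as a property, or that has the wrong data type.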
Cross-Modal Querying and Analysis
Module 2
Now that your database is configured, this module focuses on execution and validation. You will learn how to perform cross-modal queries to find text from an image and, critically, how to analyze the accuracy of your results using precision metrics.
What's included
2 videos, 1 reading, 2 assignments
2 videos • Total 11 minutes
The Power of Visual Search • 4 minutes
How-To: Run an Image-to-Text Search • 7 minutes
1 reading • Total 4 minutes
Cross-Modal Queries and Measuring Precision • 4 minutes
2 assignments • Total 38 minutes
Hands-On Learning: Execute a Cross-Modal Query • 8 minutes
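As an illustration of the precision analysis this module refers to, the sketch below computes precision@k for the results of a single image-to-text query. The course description does not say which precision metric it uses, so precision@k is an assumption here, and the function name, the result IDs, and the relevance labels are all made up for the example.

```python
def precision_at_k(retrieved, relevant, k):
    """Fraction of the top-k retrieved items that are relevant.

    retrieved: ranked list of result IDs, best match first.
    relevant:  set of IDs judged relevant for the query.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    top_k = retrieved[:k]
    hits = sum(1 for item in top_k if item in relevant)
    # Dividing by k (not len(top_k)) penalizes queries that
    # return fewer than k results.
    return hits / k


# Hypothetical ranked captions returned for one query image.
retrieved = ["cap_3", "cap_7", "cap_1", "cap_9", "cap_4"]
# Hypothetical ground-truth captions for that image.
relevant = {"cap_3", "cap_1", "cap_8"}

p_at_3 = precision_at_k(retrieved, relevant, 3)
p_at_5 = precision_at_k(retrieved, relevant, 5)
print(f"precision@3 = {p_at_3:.2f}")
print(f"precision@5 = {p_at_5:.2f}")
```

Here two of the top three results are relevant, and the same two are the only relevant ones in the top five, so precision drops as k grows. Averaging this value over many query images gives a single number for comparing schema or model choices.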