Benchmark & Optimize LLM App Performance

以 199 美元（原价 399 美元）购买一年 Coursera Plus，享受无限增长。立即节省

Benchmark & Optimize LLM App Performance

本课程是 Build Next-Gen LLM Apps with LangChain & LangGraph 专项课程的一部分

位教师：Starweaver

包含在中

了解更多

3个模块

深入了解一个主题并学习基础知识。

中级等级

推荐体验

4 小时完成

灵活的计划

自行安排学习进度

3个模块

深入了解一个主题并学习基础知识。

中级等级

推荐体验

4 小时完成

灵活的计划

自行安排学习进度

您将学到什么

Optimize LLM behavior using structured prompting and self-checks to reduce variance and errors.
Design scalable middleware to manage API requests, retries, caching, and token budgets for performance targets.
Build user-centered interfaces that collect feedback and improve LLM accuracy and user trust.

您将获得的技能

要了解的详细信息

可分享的证书

添加到您的领英档案

了解顶级公司的员工如何掌握热门技能

了解关于 Coursera for Business 的更多信息

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

积累特定领域的专业知识

本课程是 Build Next-Gen LLM Apps with LangChain & LangGraph 专项课程专项课程的一部分

在注册此课程时，您还会同时注册此专项课程。

向行业专家学习新概念
获得对主题或工具的基础理解
通过实践项目培养工作相关技能
获得可共享的职业证书

该课程共有3个模块

Benchmark & Optimize LLM App Performance is a hands-on journey from “it works” to “it flies.” You’ll start by treating speed and cost as product features-defining a baseline with the right metrics (p50/p95 latency, tokens/sec, throughput, determinism, cost per task) and building a lightweight benchmarking harness you can rerun on every change. Next, you’ll learn to hunt bottlenecks across the stack-network, model, prompt, and post-processing-using practical patterns that cut tokens without cutting quality, plus caching strategies for embeddings, RAG, and tool calls. Then you’ll run A/B/C experiments to compare models and prompts on the same dataset, interpret results with simple stats, and choose a winner confidently. Finally, you’ll harden for production with concurrency limits, queues, timeouts, fallbacks, and a 30-day optimization playbook. Expect reusable templates, clear checklists, and realistic demos designed for busy developers and product builders who want measurable gains-not hype.

This course is designed for machine learning engineers, AI developers, data scientists, and product engineers who want to optimize and scale LLM-based applications for production environments. It’s also ideal for backend engineers and DevOps professionals aiming to enhance system performance, reduce latency, and improve cost-efficiency in AI deployments. Additionally, product managers and technical leads overseeing AI-powered systems will benefit from the practical insights provided, helping them to drive improvements in app performance and ensure that their LLM models deliver reliable, high-quality results at scale. This course requires basic knowledge of Python or JavaScript, familiarity with REST APIs, and a high-level understanding of how Large Language Models (LLMs) function. These skills will help you effectively engage with the course content, optimize performance, and implement solutions. By the end of this course, you'll have the skills to optimize LLM performance, tackle real-world bottlenecks, and implement efficient, scalable AI systems. You'll be ready to apply these techniques confidently, making your AI solutions faster, more reliable, and production-ready!

This module establishes why performance is a product feature, not a backend afterthought. We connect latency, cost, and answer quality to user-perceived speed (p50 vs p95, jitter) and trust. You’ll define a minimal metric set-latency, throughput, tokens/sec, determinism, and win-rate-then build a lightweight benchmarking harness that runs a small eval set, logs prompts/outputs, and exports clean CSVs. By the end, you’ll have a reproducible baseline you can rerun on every change.

涵盖的内容

4个视频2篇阅读材料1次同伴评审

4个视频总计26分钟

Welcome to Benchmarking LLM Apps2分钟
Metrics That Matter: Latency, Throughput & Token Efficiency7分钟
Building a Minimal Benchmark Harness (Design Walkthrough)8分钟
Run Your First Baseline & Export the Data8分钟

2篇阅读材料总计10分钟

Welcome to the Course: Course Overview5分钟
Evaluation Best Practices (OpenAI Docs)5分钟

1次同伴评审总计25分钟

Hands-On-Learning: Baseline or Bust: Your First Reproducible Benchmark25分钟

In this module, you'll trace where time actually goes: network hops, model inference, prompt bloat, and post-processing. You’ll learn practical prompt patterns that cut tokens without cutting quality, plus schema-first I/O that improves stability and parsing. We’ll add caching strategies for embeddings, RAG retrievals, and tool calls, including cache keys and invalidation rules to avoid stale answers. Expect clear heuristics for cold vs warm paths and a simple checklist to shave seconds-not just milliseconds.

涵盖的内容

3个视频1篇阅读材料1次同伴评审

The final module turns tuning into a disciplined workflow. You’ll run A/B/C tests across model tiers and prompt variants on the same dataset to compare latency, cost per task, and quality with simple stats - then pick a winner. We’ll cover safe scaling: concurrency limits, queues, backpressure, retries, timeouts, and graceful degradation/fallbacks. You’ll leave with a 30-day optimization plan and a production playbook that keeps your app fast, affordable, and reliable after launch.

涵盖的内容

4个视频1篇阅读材料1个作业2次同伴评审

4个视频总计26分钟

Why Experiment Design Beats Guesswork7分钟
Shipping Safely: Canaries, Feature Flags & Rollbacks8分钟
Run an A/B/C Test & Pick a Winner7分钟
Course Wrap-up3分钟

1篇阅读材料总计5分钟

Working with Evals (OpenAI) - designing and running evals5分钟

1个作业总计20分钟

Benchmark & Optimize LLM App Performance20分钟

2次同伴评审总计85分钟

Hands-On-Learning: Experiment Orchestrator: From Data to Decision 25分钟
Project: Optimize & Ship Your LLM App v1.060分钟

获得职业证书

将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。

位教师

Starweaver

Coursera

474 门课程912,887 名学生

提供方

Coursera

从 Machine Learning 浏览更多内容

Coursera
Optimize & Interface LLM Apps Effectively
课程
Coursera
Build, Analyze, and Refactor LLM Workflows
课程
Coursera
Validate LLM Embeddings for Production Use
课程
Coursera
Deploy Resilient AI Microservices with LangChain
课程

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

自 2018开始学习的学生

''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情，我就可以学习。'

Jennifer J.

自 2020开始学习的学生

''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'

Larry W.

自 2021开始学习的学生

''如果我的大学不提供我需要的主题课程，Coursera 便是最好的去处之一。'

Chaitanya A.

''学习不仅仅是在工作中做的更好：它远不止于此。Coursera 让我无限制地学习。'

常见问题

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Benchmark & Optimize LLM App Performance

您将学到什么

您将获得的技能

要了解的详细信息

了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识

该课程共有3个模块

Foundations of LLM Performance & Benchmarks

涵盖的内容

4个视频总计26分钟

2篇阅读材料总计10分钟

1次同伴评审总计25分钟

Finding & Fixing Bottlenecks: Prompt, Model, and System

涵盖的内容

3个视频总计22分钟

1篇阅读材料总计5分钟

1次同伴评审总计25分钟

Experimentation at Scale & the Performance Playbook

涵盖的内容

4个视频总计26分钟

1篇阅读材料总计5分钟

1个作业总计20分钟

2次同伴评审总计85分钟

获得职业证书

位教师

提供方

从 Machine Learning 浏览更多内容

Optimize & Interface LLM Apps Effectively

Build, Analyze, and Refactor LLM Workflows

Validate LLM Embeddings for Production Use

Deploy Resilient AI Microservices with LangChain

人们为什么选择 Coursera 来帮助自己实现职业发展

持续精进成长，享受超值优惠。

推动业务发展，增强团队能力

常见问题

更多问题

Benchmark & Optimize LLM App Performance

您将学到什么

您将获得的技能

要了解的详细信息

了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识

该课程共有3个模块

Foundations of LLM Performance & Benchmarks

涵盖的内容

Finding & Fixing Bottlenecks: Prompt, Model, and System

涵盖的内容

Experimentation at Scale & the Performance Playbook

涵盖的内容

获得职业证书

位教师

提供方

从 Machine Learning 浏览更多内容

Optimize & Interface LLM Apps Effectively

Build, Analyze, and Refactor LLM Workflows

Validate LLM Embeddings for Production Use

Deploy Resilient AI Microservices with LangChain

人们为什么选择 Coursera 来帮助自己实现职业发展

持续精进成长，享受超值优惠。

推动业务发展，增强团队能力

常见问题

When will I have access to the lectures and assignments?

What will I get if I subscribe to this Specialization?

Is financial aid available?

更多问题