New course: Build AI agents that generate images and videos -- an under-explored frontier. A key to ...

Andrew Ng(@AndrewYNg)

Andrew Ng(@AndrewYNg)2026年5月20日

New course: Build AI agents that generate images and videos -- an under-explored frontier. A key to ...

8.5Score

TL;DR · AI 摘要

Andrew Ng announces a new short course on building AI agents for generating images and videos, emphasizing the importance of self-evaluation and iteration for improving output quality. The course, developed in collaboration with Google Cloud, is taught by Katie Nguyen and Wafae Bakkali and focuses on three evaluation techniques: image-text similarity scoring, LLM judging against custom criteria, and structured rubrics for detailed assessment.

核心要点

The course teaches how to build AI agents that generate images and videos, with a focus on self-evaluation and iteration to enhance quality.
Key evaluation techniques include image-text similarity scoring, LLM judging, and structured rubrics for detailed assessment.
Participants will learn image and video prompt engineering, and how to create agents that can turn brand guidelines into UI mockups and plan multi-scene explainers with synchronized audio.

结构提纲

按章节快速跳转。

§Introduction to the New Course
Andrew Ng introduces a new short course focused on building AI agents for generating images and videos, highlighting the under-explored nature of this frontier in AI.
·Key Performance Factor: Self-Evaluation
The course emphasizes the importance of having AI agents evaluate their own output and iterate to improve quality, a key factor in achieving better performance.
·Course Collaboration and Instructors
The course is built in collaboration with Google Cloud and is taught by Katie Nguyen and Wafae Bakkali, bringing together expertise in AI and cloud technologies.
·Evaluation Techniques Taught
Students will learn three main evaluation techniques: image-text similarity scoring, LLM judging against custom criteria, and structured rubrics for detailed assessment of AI-generated content.
·Skills Gained
Participants will gain skills in image and video prompt engineering, building agents that can create UI mockups from brand guidelines and plan and animate multi-scene explainers with synchronized audi
§Conclusion and Call to Action
The announcement concludes with an invitation to join the course to build AI agents that create images and videos, providing a link to the course page.

思维导图

用一张图看清主题之间的关系。

查看大纲文本（无障碍 / 无 JS 友好）

New AI Course for Image and Video Generation

金句 / Highlights

值得收藏与分享的关键句。

A key to performance is having the agent evaluate its own output, and iterate to improve quality.
— First paragraph
⬇︎ 下载 PNG 𝕏 分享到 X
You'll learn three evaluation techniques and combine them in an agent: image-text similarity scoring to check the output matches the prompt, an LLM judge that scores against custom criteria like brand
— Second paragraph
⬇︎ 下载 PNG 𝕏 分享到 X
Skills you'll gain: - Learn image and video prompt engineering - Build an image agent that turns brand guidelines into UI mockups - Build a video agent that plans multi-scene explainers and animates r
— Second paragraph
⬇︎ 下载 PNG 𝕏 分享到 X

#AI#Machine Learning#Image Generation#Video Generation#Self-Evaluation#Iteration#Google Cloud#Katie Nguyen#Wafae Bakkali

打开原文

新课程：构建生成图像和视频的AI代理——一个尚未充分探索的前沿领域。性能的关键在于让代理评估自己的输出，并通过迭代提高质量。这个短期课程由和共同开发，由Katie Nguyen和Wafae Bakkali授课。您将学习三种评估技术，并将它们结合在一个代理中：图像-文本相似度评分，以检查输出是否符合提示；LLM评估器，根据自定义标准（如品牌一致性）进行评分；以及结构化的评分标准，将提示分解为可验证的 yes/no 问题，例如“主体是否在画面中？”和“相机运动是否匹配？”您将获得以下技能：- 学习图像和视频提示工程- 构建一个图像代理，将品牌指南转化为UI草图- 构建一个视频代理，规划多场景解释器，并与同步音频一起动画参考帧加入我们，构建创建图像和视频的代理！deeplearning.ai/courses/ai-age