2025.11.28 | 潜在奖励模型提速降显存;画布多模态生成碾压SOTA
Update: 2025-11-28
Description
本期的 6 篇论文如下:
[00:19 ] 🎬 Video Generation Models Are Good Latent Reward Models(视频生成模型是优秀的潜在奖励模型)
[01:07 ] 🎨 Canvas-to-Image: Compositional Image Generation with Multimodal Controls(画布到图像:基于多模态控制的组合式图像生成)
[01:49 ] 🎨 MIRA: Multimodal Iterative Reasoning Agent for Image Editing(MIRA:多模态迭代推理代理用于图像编辑)
[02:30 ] 📊 Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following(多准则:多模态评估器在多元化标准遵循上的基准测试)
[03:12 ] 🧠 What does it mean to understand language?(理解语言意味着什么?)
[03:47 ] 🧠 Agentic Learner with Grow-and-Refine Multimodal Semantic Memory(具有生长与精炼多模态语义记忆的自主学习者)
<figure>
</figure>【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
Comments
In Channel






