Arxiv Papers

Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: <a href="https://podcasters.spotify.com/pod/show/arxiv-papers/support" rel="payment">https://podcasters.spotify.com/pod/show/arxiv-papers/support</a>

[QA] Transformers Struggle to Learn to Search

This study investigates transformers' search capabilities using graph connectivity, revealing that while they can learn to search, performance declines with larger graphs, unaffected by model size or in-context learning. https://arxiv.org/abs//2412.04703 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-09
07:41

Transformers Struggle to Learn to Search

This study investigates transformers' search capabilities using graph connectivity, revealing that while they can learn to search, performance declines with larger graphs, unaffected by model size or in-context learning. https://arxiv.org/abs//2412.04703 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-09
19:58

[QA] Navigation World Models

https://arxiv.org/abs//2412.03572 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-07
07:36

Navigation World Models

https://arxiv.org/abs//2412.03572 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-07
19:47

[QA] Motion Prompting: Controlling Video Generation with Motion Trajectories

This paper presents a video generation model using flexible motion prompts for enhanced control over dynamic actions, enabling detailed user interactions and showcasing emergent behaviors in video content creation. https://arxiv.org/abs//2412.02700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-07
07:21

Motion Prompting: Controlling Video Generation with Motion Trajectories

This paper presents a video generation model using flexible motion prompts for enhanced control over dynamic actions, enabling detailed user interactions and showcasing emergent behaviors in video content creation. https://arxiv.org/abs//2412.02700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-07
16:36

[QA] Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Infinity is a groundbreaking Bitwise Visual AutoRegressive Model that generates high-resolution images from text, outperforming existing models in speed and quality, with innovative scaling capabilities. https://arxiv.org/abs//2412.04431 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-06
08:01

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Infinity is a groundbreaking Bitwise Visual AutoRegressive Model that generates high-resolution images from text, outperforming existing models in speed and quality, with innovative scaling capabilities. https://arxiv.org/abs//2412.04431 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-06
21:32

[QA] NVILA: Efficient Frontier Visual Language Models

NVILA is an efficient visual language model that enhances accuracy while significantly reducing training costs, memory usage, and latency, outperforming many existing models across various benchmarks. https://arxiv.org/abs//2412.04468 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-06
07:38

NVILA: Efficient Frontier Visual Language Models

NVILA is an efficient visual language model that enhances accuracy while significantly reducing training costs, memory usage, and latency, outperforming many existing models across various benchmarks. https://arxiv.org/abs//2412.04468 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-06
23:11

[QA] o1-Coder: an o1 Replication for Coding

The report presents O1-CODER, a model for coding tasks using reinforcement learning and Monte Carlo Tree Search, focusing on System-2 thinking and standardized code testing. https://arxiv.org/abs//2412.00154 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-03
09:05

o1-Coder: an o1 Replication for Coding

The report presents O1-CODER, a model for coding tasks using reinforcement learning and Monte Carlo Tree Search, focusing on System-2 thinking and standardized code testing. https://arxiv.org/abs//2412.00154 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-03
19:21

[QA] Efficient Track Anything

https://arxiv.org/abs//2411.18933 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-03
07:55

Efficient Track Anything

https://arxiv.org/abs//2411.18933 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-03
19:04

[QA] Reverse Thinking Makes LLMs Stronger Reasoners

https://arxiv.org/abs//2411.19865 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
08:08

Reverse Thinking Makes LLMs Stronger Reasoners

https://arxiv.org/abs//2411.19865 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
16:36

[QA] Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM’s Reasoning Capability

This paper introduces cDPO, a method for identifying critical tokens in LLMs that lead to incorrect reasoning, enhancing model alignment through token-level rewards and improving performance on reasoning tasks. https://arxiv.org/abs//2411.19943 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
07:29

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM’s Reasoning Capability

This paper introduces cDPO, a method for identifying critical tokens in LLMs that lead to incorrect reasoning, enhancing model alignment through token-level rewards and improving performance on reasoning tasks. https://arxiv.org/abs//2411.19943 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
12:00

[QA] JetFormer: an autoregressive generative model of raw images and text

JetFormer is a novel autoregressive transformer that jointly models images and text, achieving competitive text-to-image generation without relying on separately pretrained components, enhancing both understanding and generation capabilities. https://arxiv.org/abs//2411.19722 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
07:29

JetFormer: an autoregressive generative model of raw images and text

JetFormer is a novel autoregressive transformer that jointly models images and text, achieving competitive text-to-image generation without relying on separately pretrained components, enhancing both understanding and generation capabilities. https://arxiv.org/abs//2411.19722 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

12-02
23:02

Recommend Channels