Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/s ...
…
continue reading
https://arxiv.org/abs//2410.01606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2410.01606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2410.01748 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2410.01748 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2409.19951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2409.19951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
This paper evaluates various model merging methods for compositional generalization in image classification, generation, and NLP, clarifying their merits, requirements, and computational costs in a shared experimental setting. https://arxiv.org/abs//2409.18314 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_paper…
…
continue reading
This paper evaluates various model merging methods for compositional generalization in image classification, generation, and NLP, clarifying their merits, requirements, and computational costs in a shared experimental setting. https://arxiv.org/abs//2409.18314 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_paper…
…
continue reading
Emu3 introduces a next-token prediction model for multimodal tasks, outperforming existing models and simplifying design by focusing on tokenization of images, text, and videos. https://arxiv.org/abs//2409.18869 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/p…
…
continue reading
Emu3 introduces a next-token prediction model for multimodal tasks, outperforming existing models and simplifying design by focusing on tokenization of images, text, and videos. https://arxiv.org/abs//2409.18869 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/p…
…
continue reading
MIO is a novel multimodal foundation model that excels in understanding and generating speech, text, images, and videos, outperforming existing models in any-to-any capabilities and diverse tasks. https://arxiv.org/abs//2409.17692 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
MIO is a novel multimodal foundation model that excels in understanding and generating speech, text, images, and videos, outperforming existing models in any-to-any capabilities and diverse tasks. https://arxiv.org/abs//2409.17692 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podc…
…
continue reading
The paper evaluates OpenAI's o1 model in medical scenarios, highlighting its enhanced reasoning and accuracy over GPT-4, while also identifying weaknesses and releasing data for further research. https://arxiv.org/abs//2409.15277 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podca…
…
continue reading
The paper evaluates OpenAI's o1 model in medical scenarios, highlighting its enhanced reasoning and accuracy over GPT-4, while also identifying weaknesses and releasing data for further research. https://arxiv.org/abs//2409.15277 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podca…
…
continue reading
1
[QA] Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models
8:44
The Logic-of-Thought (LoT) prompting method enhances logical reasoning in Large Language Models by integrating propositional logic, significantly improving performance across various reasoning tasks. https://arxiv.org/abs//2409.17539 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://p…
…
continue reading
The Logic-of-Thought (LoT) prompting method enhances logical reasoning in Large Language Models by integrating propositional logic, significantly improving performance across various reasoning tasks. https://arxiv.org/abs//2409.17539 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://p…
…
continue reading
We propose bge-en-icl, a model leveraging in-context learning in LLMs for high-quality text embeddings, achieving state-of-the-art performance on MTEB and AIR-Bench benchmarks. https://arxiv.org/abs//2409.15700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/po…
…
continue reading
We propose bge-en-icl, a model leveraging in-context learning in LLMs for high-quality text embeddings, achieving state-of-the-art performance on MTEB and AIR-Bench benchmarks. https://arxiv.org/abs//2409.15700 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/po…
…
continue reading
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks. https://arxiv.org/abs//2409.17115 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks. https://arxiv.org/abs//2409.17115 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
The FISER framework enhances AI's ability to follow ambiguous human instructions by inferring intentions, outperforming traditional methods in collaborative tasks, particularly on the HandMeThat benchmark. https://arxiv.org/abs//2409.18073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
The FISER framework enhances AI's ability to follow ambiguous human instructions by inferring intentions, outperforming traditional methods in collaborative tasks, particularly on the HandMeThat benchmark. https://arxiv.org/abs//2409.18073 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
This paper presents a learnable pruning method for Large Language Models, achieving efficient N:M sparsity, improved mask quality, and transferability across tasks, outperforming existing techniques in empirical evaluations. https://arxiv.org/abs//2409.17481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers …
…
continue reading
This paper presents a learnable pruning method for Large Language Models, achieving efficient N:M sparsity, improved mask quality, and transferability across tasks, outperforming existing techniques in empirical evaluations. https://arxiv.org/abs//2409.17481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers …
…
continue reading
This paper presents a method to enable large language models to perform counterfactual token generation, enhancing their capabilities without fine-tuning, and applying it for bias detection. https://arxiv.org/abs//2409.17027 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
…
continue reading
This paper presents a method to enable large language models to perform counterfactual token generation, enhancing their capabilities without fine-tuning, and applying it for bias detection. https://arxiv.org/abs//2409.17027 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
…
continue reading
The paper identifies stable regions in Transformers' residual streams, showing insensitivity to small changes but high sensitivity at boundaries, aligning with semantic distinctions and clustering similar prompts. https://arxiv.org/abs//2409.17113 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
The paper identifies stable regions in Transformers' residual streams, showing insensitivity to small changes but high sensitivity at boundaries, aligning with semantic distinctions and clustering similar prompts. https://arxiv.org/abs//2409.17113 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
We introduce Program Trace Prompting, enhancing chain of thought explanations with formal syntax, improving observability, and enabling analysis of reasoning errors across diverse tasks in the BIG-Bench Hard benchmark. https://arxiv.org/abs//2409.15359 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple …
…
continue reading
We introduce Program Trace Prompting, enhancing chain of thought explanations with formal syntax, improving observability, and enabling analysis of reasoning errors across diverse tasks in the BIG-Bench Hard benchmark. https://arxiv.org/abs//2409.15359 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple …
…
continue reading
This paper explores face pareidolia in computer vision, presenting a dataset of annotated images and analyzing the differences in face detection between humans and machines. https://arxiv.org/abs//2409.16143 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podca…
…
continue reading
This paper explores face pareidolia in computer vision, presenting a dataset of annotated images and analyzing the differences in face detection between humans and machines. https://arxiv.org/abs//2409.16143 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podca…
…
continue reading
1
[QA] Rule Extrapolation in Language Models: A Study of Compositional Generalization on OOD Prompts
8:20
The paper investigates out-of-distribution behavior in autoregressive LLMs through rule extrapolation in formal languages, analyzing various architectures and proposing a normative theory inspired by algorithmic information theory. https://arxiv.org/abs//2409.13728 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_…
…
continue reading
1
Rule Extrapolation in Language Models: A Study of Compositional Generalization on OOD Prompts
29:04
The paper investigates out-of-distribution behavior in autoregressive LLMs through rule extrapolation in formal languages, analyzing various architectures and proposing a normative theory inspired by algorithmic information theory. https://arxiv.org/abs//2409.13728 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_…
…
continue reading
This study evaluates the effectiveness of LLM-judge preferences in improving alignment, finding no correlation with concrete metrics and highlighting biases in LLM judgments. https://arxiv.org/abs//2409.15268 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podc…
…
continue reading
This study evaluates the effectiveness of LLM-judge preferences in improving alignment, finding no correlation with concrete metrics and highlighting biases in LLM judgments. https://arxiv.org/abs//2409.15268 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podc…
…
continue reading
This paper introduces LLM Surgery, a framework for efficiently modifying large language models to unlearn outdated information and integrate new knowledge without complete retraining, demonstrating significant performance improvements. https://arxiv.org/abs//2409.13054 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@ar…
…
continue reading
This paper introduces LLM Surgery, a framework for efficiently modifying large language models to unlearn outdated information and integrate new knowledge without complete retraining, demonstrating significant performance improvements. https://arxiv.org/abs//2409.13054 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@ar…
…
continue reading
This paper explores alternative geometries and softmax logits for language-image pre-training, finding that Euclidean CLIP (EuCLIP) performs as well as or better than the original CLIP. https://arxiv.org/abs//2409.13079 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.…
…
continue reading
This paper explores alternative geometries and softmax logits for language-image pre-training, finding that Euclidean CLIP (EuCLIP) performs as well as or better than the original CLIP. https://arxiv.org/abs//2409.13079 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.…
…
continue reading
The Kolmogorov–Arnold Transformer (KAT) enhances transformer performance by replacing MLP layers with Kolmogorov-Arnold Network layers, addressing key challenges and demonstrating superior results in various tasks. https://arxiv.org/abs//2409.10594 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podc…
…
continue reading
The Kolmogorov–Arnold Transformer (KAT) enhances transformer performance by replacing MLP layers with Kolmogorov-Arnold Network layers, addressing key challenges and demonstrating superior results in various tasks. https://arxiv.org/abs//2409.10594 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podc…
…
continue reading
This paper reveals a flaw in the inference pipeline of diffusion models for depth estimation, leading to a 2002#2 speed improvement and superior performance through end-to-end fine-tuning. https://arxiv.org/abs//2409.11355 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.app…
…
continue reading
This paper reveals a flaw in the inference pipeline of diffusion models for depth estimation, leading to a 2002#2 speed improvement and superior performance through end-to-end fine-tuning. https://arxiv.org/abs//2409.11355 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.app…
…
continue reading
1
[QA] Re-Introducing LayerNorm: Geometric Meaning, Irreversibility and a Comparative Study with RMSNorm
7:03
This paper explores the geometric implications of LayerNorm in transformers, revealing its irreversibility and redundancy, and advocates for RMSNorm as a more efficient alternative with similar performance. https://arxiv.org/abs//2409.12951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: ht…
…
continue reading
1
Re-Introducing LayerNorm: Geometric Meaning, Irreversibility and a Comparative Study with RMSNorm
12:28
This paper explores the geometric implications of LayerNorm in transformers, revealing its irreversibility and redundancy, and advocates for RMSNorm as a more efficient alternative with similar performance. https://arxiv.org/abs//2409.12951 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: ht…
…
continue reading
This paper enhances masked particle modeling (MPM) for high-energy physics, improving performance through better implementation and a powerful decoder, outperforming previous methods in various jet physics tasks. https://arxiv.org/abs//2409.12589 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcas…
…
continue reading
This paper enhances masked particle modeling (MPM) for high-energy physics, improving performance through better implementation and a powerful decoder, outperforming previous methods in various jet physics tasks. https://arxiv.org/abs//2409.12589 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcas…
…
continue reading
https://arxiv.org/abs//2409.12180 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2409.12180 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading