AI accuracy: New models and progress on hallucinations
Update: 2024-10-21
Description
What's the latest with artificial intelligence and model accuracy?
Three objectives:
- Compare and contrast traditional autoregressive LLMs (Gemini/ChatGPT/Claude/LLaMA) vs non-autoregressive AI (NotebookLM) vs chain-of-thought reasoning models (OpenAI o1, codenamed Strawberry), including the benefits, detriments, and tradeoffs of each; see the decoding and reasoning sketches after this list
- Error rates for NotebookLM vs traditional LLMs vs CoT reasoning models, focusing on the accuracy benefits of the smaller corpus of curated source material that users feed NotebookLM when prompting (vs an LLM trained on, and inferencing from, the entire web); see the grounding sketch after this list
- Which model (or combination of models) will be the future standard?
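To make the first comparison concrete, here is a minimal toy sketch of the two decoding regimes. The model is faked with seeded pseudo-random scores; `toy_scores`, `decode_autoregressive`, and `decode_non_autoregressive` are illustrative names, not from the papers below. Only the conditioning pattern matters.

```python
import random

VOCAB = ["the", "cat", "sat", "on", "mat", "<eos>"]

def toy_scores(conditioning):
    # Stand-in for a trained model: deterministic pseudo-random scores
    # over VOCAB, seeded by whatever the prediction conditions on.
    rng = random.Random(sum(conditioning) + len(conditioning))
    return [rng.random() for _ in VOCAB]

def argmax(scores):
    return scores.index(max(scores))

def decode_autoregressive(max_len=6):
    # Left-to-right, one token per forward pass; each step conditions on
    # every previously emitted token id, so an early mistake skews all
    # later distributions (the error compounding LeCun criticizes).
    ids = []
    while len(ids) < max_len:
        next_id = argmax(toy_scores(ids))
        if VOCAB[next_id] == "<eos>":
            break
        ids.append(next_id)
    return [VOCAB[i] for i in ids]

def decode_non_autoregressive(length=5):
    # All positions predicted in parallel, each from its position alone
    # (in the spirit of Gu et al. 2017): one pass instead of `length`
    # passes, but no token sees its neighbors -- the classic source of
    # NAR repetition/consistency errors.
    return [VOCAB[argmax(toy_scores([pos]))] for pos in range(length)]

print("AR :", decode_autoregressive())
print("NAR:", decode_non_autoregressive())
```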
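Chain-of-thought models such as o1 are still autoregressive underneath; the difference is that they spend extra tokens on intermediate reasoning before committing to an answer. A minimal prompting illustration follows; the prompt wording is an assumption for demonstration, not OpenAI's actual scaffolding.

```python
def build_prompts(question):
    # Direct answering vs chain-of-thought elicitation. The CoT cue is
    # the standard "Let's think step by step" trick; o1-style models
    # bake this behavior in via training rather than prompting.
    direct = f"Q: {question}\nA:"
    cot = f"Q: {question}\nA: Let's think step by step."
    return direct, cot

q = "A bat and a ball cost $1.10; the bat costs $1.00 more than the ball. Ball price?"
direct_prompt, cot_prompt = build_prompts(q)
print(cot_prompt)
# A CoT model first emits intermediate steps -- x + (x + 1.00) = 1.10,
# so x = 0.05 -- and only then the answer, trading extra tokens and
# latency for fewer reasoning slips than the direct prompt tends to get.
```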
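For the second objective, the hypothesized accuracy edge of NotebookLM-style grounding comes from restricting answers to a small, user-curated corpus. The retriever and prompt builder below are a toy sketch of that general pattern (word-overlap scoring, hypothetical `SOURCES`), not NotebookLM's actual pipeline.

```python
SOURCES = {
    "vaswani2017": "The Transformer architecture relies entirely on attention.",
    "gu2017": "Non-autoregressive translation emits all output tokens in parallel.",
}

def retrieve(question, sources, k=1):
    # Toy relevance score: word overlap between the question and each source.
    q_words = set(question.lower().split())
    ranked = sorted(sources.items(),
                    key=lambda kv: len(q_words & set(kv[1].lower().split())),
                    reverse=True)
    return ranked[:k]

def build_grounded_prompt(question, sources):
    # Restricting the model to cited excerpts (and allowing "not in my
    # sources" as an answer) is what narrows the answer distribution
    # relative to open-web inference.
    excerpts = "\n".join(f"[{sid}] {text}"
                         for sid, text in retrieve(question, sources))
    return ("Answer using ONLY the excerpts below and cite the [id]. "
            "If the answer is not in them, say so.\n"
            f"{excerpts}\nQuestion: {question}")

print(build_grounded_prompt("How does non-autoregressive translation work?", SOURCES))
```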
See also: 🧵 https://x.com/AnthPB/status/1848186962856865904
This podcast is AI-generated with NotebookLM, using the following sources, research, and analysis:
- An Investigation of Language Model Interpretability via Sentence Editing (OSU, Stevens, 2021.04)
- Are Auto-Regressive Large Language Models Here to Stay? (Medium, Bettencourt, 2023.12.28)
- Attention Is All You Need (Google Brain, Vaswani/Shazeer/Parmar/Uszkoreit/Jones/Gomez/Kaiser/Polosukhin, 2017.06.12)
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension (Facebook AI, Lewis/Liu/Goyal/Ghazvininejad/Mohamed/Levy/Stoyanov/Zettlemoyer, 2019.10.29)
- Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs (Zhang/Du/Pang/Liu/Gao/Lin, 2024.06.13)
- Contra LeCun on "Autoregressive LLMs are doomed" (LessWrong, rotatingpaguro, 2023.04.10)
- Do large language models need sensory grounding for meaning and understanding? (LeCun, 2023.03.24)
- Experimenting with Power Divergences for Language Modeling (Labeau/Cohen, 2019)
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (Raffel/Shazeer/Roberts/Lee/Narang/Matena/Zhou/Li/Liu, 2023.09.19)
- Improving Non-Autoregressive Translation Models Without Distillation (Huang/Perez/Volkovs, 2022.01.28)
- Non-Autoregressive Neural Machine Translation (Gu/Bradbury/Xiong/Li/Socher, 2017.11.27)
- On the Learning of Non-Autoregressive Transformers (Huang/Tao/Zhou/Li/Huang, 2022.06.13)
- Towards Better Chain-of-Thought Prompting Strategies: A Survey (Yu/He/Wu/Dai/Chen, 2023.10.08)
- XLNet: Generalized Autoregressive Pretraining for Language Understanding (Yang/Dai/Yang/Carbonell/Salakhutdinov/Le, 2020.01.22)
Not investment advice; do your own due diligence!
#tech #technology #machinelearning #ML