DiscoverSuper Data Science: ML & AI Podcast with Jon Krohn
Super Data Science: ML & AI Podcast with Jon Krohn
Claim Ownership

Super Data Science: ML & AI Podcast with Jon Krohn

Author: Jon Krohn

Subscribed: 12,667Played: 458,426
Share

Description

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.


Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.


We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.

871 Episodes
Reverse
In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews of limitless topics, what similar tools are on the market, and where Jon sees the tool as having real-world value including how he uses it daily. Additional materials: www.superdatascience.com/870 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI. Additional materials: www.superdatascience.com/869 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”. Additional materials: www.superdatascience.com/868 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
The realities of Agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Learning Models Book.  Additional materials: www.superdatascience.com/867 This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and woolly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction.  Additional materials: www.superdatascience.com/866 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers: The first is that they too-often enjoy being the gatekeepers of their work rather than educating their clients and coworkers as to the details of their projects and why it benefits the company. The second is that tech professionals don’t show vulnerability, whether that means not knowing a topic or not fully understanding how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (09:32) How to scale a successful data science consultancy (22:25) How Pandata navigates highly regulated environments  (27:59) How to tackle tech illiteracy in business  (36:32) What skills Cals looks for in new hires  (35:56) How to sell on a tech company  Additional materials: www.superdatascience.com/865
Jon Krohn investigates OpenAI’s new release, o3-mini, in this five-minute Friday, where he walks through the reasoning model’s capabilities and performance, cross-examining them against other major-league players, DeepSeek-R1, GPT-4o and Claude 3.5 Sonnet. Additional materials: www.superdatascience.com/864 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (05:57) All about the TabPFN architecture  (21:27) Use cases for Bayesian inference (35:07) On getting published in Nature (44:03) How TabPFN handles time series data (51:52) All about Prior Labs Additional materials: www.superdatascience.com/863
In this episode of “In Case You Missed It”, Jon Krohn shares his favorite clips from the last four weeks. He talks to Azeem Azhar, Florian Neukart, Kirill Eremenko, Hadelin de Ponteves, and Brooke Hopkins on what’s in store for AI in 2025, from quantum computing and customizable tools to handy checklists and how the mathematics of exponentials can help us keep our heads about the swift advancement of AI. Additional materials: www.superdatascience.com/862 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
How does a CrossFit winner, bobsledder and swimmer go on to have a glittering career in data analytics and engineering? Colleen Fotsch talks to Jon Krohn about transitioning into very different career paths, how sports gave her the competitive mindset she needed for success in data science, and seeing the niche role of analytics engineering as a bridge between data engineering and analysis. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (05:49) Colleen’s path from athlete to data analyst (1:14:41) About the data build tool (DBT) (1:22:51) Colleen’s work at CHG Healthcare (1:32:45) How Colleen and Tia-Clair got started with PRVN GO Additional materials: www.superdatascience.com/861
DeepSeek-curious? This Five-Minute Friday is for you! Jon Krohn investigates the overwhelming overnight success of this new LLM, the product of a Chinese hedge fund. DeepSeek is a market newcomer, and yet it runs shoulder to shoulder with behemoths from OpenAI, Anthropic and Google like it’s all in a day’s work. Additional materials: www.superdatascience.com/860 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this week’s guest interview, Vaibhav Gupta talks to Jon Krohn about creating a programming language, BAML, that helps companies save up to 30% on their AI costs. He explains how he started tailoring BAML to facilitate natural language generation interactions with AI models, how BAML helps companies optimize their outputs, and he also lets listeners into Boundary’s hiring process. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (04:53) What BAML stands for (14:33) Making a prompt engineering a serious practice (18:00) How BAML helps companies (23:30) Using retrieval-augmented generation (RAG) (43:09) How to get a job at Boundary Additional materials: www.superdatascience.com/859
Are you an Account Executive with experience in the technology sector? In this Five-Minute Friday, Jon Krohn tells listeners about an exciting new role that has opened up at The SuperDataScience Podcast. Additional materials: www.superdatascience.com/858 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Brooke Hopkins speaks to Jon Krohn about technology’s new frontiers in AI agents, how these agents will impact society, work and our creative enterprises, and what this might mean for our data-driven future. You will learn how Coval, a simulation and evaluation platform for AI voice and chat agents, helps companies balance precision and scalability while making few concessions on the way.  This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:49) What Coval does and how the platform works (21:16) Coval’s workflows (37:40) The future of AI agents  (46:28) The metrics to evaluate performance  (55:08) How close we are to achieving AI agent autonomy Additional materials: www.superdatascience.com/857
Get excited: The fastest-growing jobs in the US are AI Engineer and AI Consultant. In this Five-Minute Friday, Jon Krohn looks into the reports that reveal this job growth, and the trends any data scientist and AI professional will want to watch in 2025. Additional materials: www.superdatascience.com/856 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
How can we use AI to solve global problems like the environmental crisis, and how will future AI start to manage increasingly complex workflows? Famed futurist Azeem Azhar talks to Jon Krohn about the future of AI as a force for good, how we can stay mindful of an evolving job market, and Azeem’s favorite tools for automating his workflows. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (05:43) Azeem Azhar’s vision for AI’s future (14:16) How to prepare for technological shifts (20:35) How to be more like an AI-first company (38:46) The tools Azeem Azhar uses regularly (50:09) The benefits and risks of transitioning to renewable energy (1:09:28) Opportunities in the future workplace Additional materials: www.superdatascience.com/855
Join Jon Krohn as he unpacks Ray Kurzweil’s six epochs of intelligence evolution, a fascinating framework from The Singularity is Nearer. From the origins of atoms and molecules to the transformative future of brain-computer interfaces and cosmic intelligence, Jon explores how each stage builds upon the last. This quick yet profound journey reveals how humanity is shaping the Fifth Epoch—and hints at what’s next for intelligence in our universe. Additional materials: www.superdatascience.com/854 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Kirill Eremenko and Hadelin de Ponteves AI educators, whose courses have been taken by over 3 Million students, sit down with Jon Krohn to talk about how foundation models are transforming businesses. From real-world examples to clever customization techniques and powerful AWS tools, they cover it all. bravotech.ai - Partner with Kirill & Hadelin for GenAI implementation and training in your business. Mention the “SDS Podcast” in your inquiry to start with 3 complimentary hours of consulting. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:00) What are foundation models? (15:45) Overview of the foundation model lifecycle: 8 main steps. (29:11) Criteria for selecting the right foundation model for business use. (41:35) Exploring methods to customize foundation models. (53:04) Techniques to modify foundation models during deployment or inference. (01:11:00) Introduction to AWS generative AI tools like Amazon Q, Bedrock, and SageMaker. Additional materials: www.superdatascience.com/853
AI security, LLM engineering, how to choose the best LLM, and tech agnosticism: In our first “In Case You Missed It” of 2025, Jon Krohn starts the year with a round-up of our favorite recent interview moments. He selects from interviews with Andrew Ng, Ed Donner, Eiman Ebrahimi, Sadie St Lawrence, and Greg Epstein, covering the latest in AI development, touching on agentic workflows, promising new roles in AI, and what blew our minds last year. Additional materials: www.superdatascience.com/852 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Are our passwords safe, even with the increasing accessibility of quantum computing? Florian Neukart, Chief Product Officer at Terra Quantum AG, thinks so. In this episode, he outlines the three key elements of quantum-safe security. He speaks to Jon Krohn about the resourceful applications of quantum computing and workarounds for the demands of quantum computing on operational times and cooling systems. And if you’re interested in making the switch to quantum computing from machine learning, he also explores what you need (and don’t need) to make change happen. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (17:12) The real-world applications of quantum computing (23:35) The chips needed for quantum computing  (31:18) How quantum computing meets key business challenges (46:33) The ethical challenges of quantum technology (49:28) How to become proficient in quantum computing  (1:01:21) The future of quantum computing Additional materials: www.superdatascience.com/851
loading
Comments (30)

atefeh

professional 💖

Jan 27th
Reply

atefeh

thank you for this episode.

Mar 5th
Reply (1)

mrs rime

🔴💚Really Amazing ️You Can Try This💚WATCH💚ᗪOᗯᑎᒪOᗩᗪ👉https://co.fastmovies.org

Jan 16th
Reply

Priya Dharshini

🔴WATCH>>ᗪOᗯᑎᒪOᗩᗪ>>👉https://co.fastmovies.org

Jan 16th
Reply

Andrew Miller

I found this podcast really helpful for anyone who wants to better their knowledge of machine learning. I am especially interested in the data processing. If you want to deepen your knowledge of this topic, check this article https://techlogitic.net/categorization-and-data-labeling-for-supervised-machine-learning/. It has some pretty useful information and professional tips from experts in data annotation and tagging.

Apr 21st
Reply

Toben Nelson

a really nice and quick overview with just the right amount of detail.

Mar 3rd
Reply

Maryam Alizadeh

great thanks to you and your endeavors for this pod. I learnt a lot. welcome to Jon , wish you the best 👏👍

Jan 4th
Reply

Maryam Alizadeh

😢

Jan 4th
Reply

Masoud Fard

you are the best

Nov 19th
Reply

Nikhil Parmar

nice summarisation, Data Analyts looks at the past and data scientist looks at past and future

Oct 27th
Reply

Tough Nut

Great talk, very inspiring. thanks.

Aug 6th
Reply

Venkat M

Sleeps 3 hrs a day, not a good example for healthy person. sleep well and keep the brain more refreshed and healthy. #health

Mar 13th
Reply

Mehrdad Salimi

a lot of extra, unrelated stuff. Dude I appreciate your effort but you need to be specific and respect audiences' time.

Jan 19th
Reply

Maria Lacerda

Eu não conhecia Gabriela de Queiroz mas agora ouvindo esse podcast (já ouvi umas 5x) estou completamente encantada. Muito legal descobrir esse nível de profissional pelo mundo e ainda saber que trata-se de uma brasileira.

Dec 2nd
Reply

Natalia Zawadzka

Great job!👍 It's so interesting to listen your podcasts! thanks for sharing your knowledge and helping people to get into data business 🙌👍

Sep 10th
Reply

SriLatha K

Hi thanks for doing this podcast. Being a data engineer and who commutes a lot, I gain a lot from your podcasts. One suggestion that I would like to give is, it would be better if you do not interrupt the speaker until they complete their flow.

May 10th
Reply

Alberto Andrade

What amazing episode! Adrian rocks!! Congratulations!

Apr 25th
Reply

Simon SOUVANNARAT

Thanks for this advice !

Feb 6th
Reply

Troy Kirin

Great episode! I wish he touched on how to connect Sparklyr to data viz like Tableau!

Dec 10th
Reply

Richard Leyshon

thought this was one of my stoic podcast episodes! Great message.

Dec 1st
Reply