DiscoverData Skeptic
Data Skeptic
Claim Ownership

Data Skeptic

Author: Kyle Polich

Subscribed: 31,667Played: 571,124
Share

Description

The Data Skeptic Podcast features interviews and discussion of topics related to data science, statistics, machine learning, artificial intelligence and the like, all from the perspective of applying critical thinking and the scientific method to evaluate the veracity of claims and efficacy of approaches.
519 Episodes
Reverse
Optimal Foraging

Optimal Foraging

2024-02-2838:261

Claire Hemmingway, an assistant professor in the Department of Psychology and Ecology and Evolutionary Biology at the University of Tennessee in Knoxville, is our guest today. Her research is on decision-making in animal cognition, focusing on neotropical bats and bumblebees. Claire discussed how bumblebees make foraging decisions and how they communicate when foraging. She discussed how they set up experiments in the lab to address questions about bumblebees foraging. She also discussed some nuances between bees in the lab and those in the wild. Claire discussed factors that drive an animal's foraging decisions. She explained the foraging theory and how a colony works together to optimize its foraging. She also touched on some irrational foraging behaviors she observed in her study. Claire discussed some techniques bees use to learn from past behaviors. She discussed the effect of climate change on foraging bees' learning behavior. Claire discussed how bats respond to calling frogs when foraging. She also spoke about choice overload in that they make detrimental decisions when loaded with too many options.
Memory in Chess

Memory in Chess

2024-02-1248:39

On today’s show, we are joined by our co-host, Becky Hansis-O’Neil. Becky is a Ph.D. student at the University of Missouri, St Louis, where she studies bumblebees and tarantulas to understand their learning and cognitive work.   She joins us to discuss the paper: Perception in Chess. The paper aimed to understand how chess players perceive the positions of chess pieces on a chess board. She discussed the findings paper. She spoke about situations where grandmasters had better recall of chess positions than beginners and situations where they did not.   Becky and Kyle discussed the use of chess engines for cheating. They also discussed how chess players use chunking. Becky discussed some approaches to studying chess cognition, including eye tracking, EEG, and MRI.  ## Paper in Focus Perception in chess ## Resources Detecting Cheating in Chess with Ken Regan
OpenWorm

OpenWorm

2024-02-0534:13

On this episode, we are joined by Stephen Larson, the CEO of MetaCell and an affiliate of the OpenWorm foundation. Stephen discussed what the Openworm project is about. They hope to use a digital C. elegans nematode (C. elegans for short) to study the basics of life. Stephen discussed why C. elegans is an ideal organism for studying life in the lab. He also discussed the steps involved in simulating a digital organism. He mentioned the constraints on the cellular scale that informed their development of a digital C. elegans. Stephen discussed the validation process of the simulation. He discussed how they discovered the best parameters to capture the behavior of natural C. elegans. He also discussed how biologists embraced the project. Stephen discussed the computational requirements for improving the simulation parameters of the model and the kind of data they require to scale up. Stephen discussed some findings that the machine-learning communities can take away from the project. He also mentioned how students can get involved in the Openworm project. Rounding up, he shared future plans for the project.
What the Antlion Knows

What the Antlion Knows

2024-01-3041:44

Our guest is Becky Hansis-O’Neil, a Ph.D. student at the University of Missouri, St Louis, and our co-host for the new "Animal Intelligence" season. Becky shares her background on how she got into the field of behavioral intelligence and biology.
AI Roundtable

AI Roundtable

2024-01-1750:34

Kyle is joined by friends and former guests Pramit Choudhary and Frank Bell to have an open discussion of the impacts LLMs and machine learning have had in the past year on industry, and where things may go in the current year.
We are joined by Darren McKee, a Policy Advisor and the host of Reality Check — a critical thinking podcast. Darren gave a background about himself and how he got into the AI space. Darren shared his thoughts on AGI's achievements in the coming years. He defined AGI and discussed how to differentiate an AGI system. He also shared whether AI needs consciousness to be AGI. Darren discussed his worry about AI surpassing human understanding of the universe and potentially causing harm to humanity. He also shared examples of how AI is already used for nefarious purposes. He explored whether AI possesses inherently evil intentions and gave his thoughts on regulating AI.
I LLM and You Can Too

I LLM and You Can Too

2023-12-2323:52

It took a massive financial investment for the first large language models (LLMs) to be created.  Did their corporate backers lock these tools away for all but the richest?  No.  They provided comodity priced API options for using them.  Anyone can talk to Chat GPT or Bing.  What if you want to go a step beyond that and do something programatic?  Kyle explores your options in this episode.
Q&A with Kyle

Q&A with Kyle

2023-12-1940:23

We celebrate episode 1000000000 with some Q&A from host Kyle Polich.  We boil this episode down to four key questions: 1) How do you find guests 2) What is Data Skeptic all about? 3) What is Kyle all about? 4) What are Kyle's thoughts on AGI?   Thanks to our sponsorsdataannotation.tech/programmers https://www.webai.com/dataskeptic  
LLMs for Data Analysis

LLMs for Data Analysis

2023-12-1228:59

In this episode, we are joined by Amir Netz, a Technical Fellow at Microsoft and the CTO of Microsoft Fabric. He discusses how companies can use Microsoft's latest tools for business intelligence. Amir started by discussing how business intelligence has progressed in relevance over the years. Amir gave a brief introduction into what Power BI and Fabric are. He also discussed how Fabric distinguishes from other BI tools by building an end-to-end tool for the data journey. Amir spoke about the process of building and deploying machine learning models with Microsoft Fabric. He shared the difference between Software as a Service (SaaS) and Platform as a Service (PaaS). Amir discussed the benefits of Fabric's auto-integration and auto-optimization abilities. He also discussed the capabilities of Copilot in Fabric. He also discussed exciting future developments planned for Fabric. Amir shared techniques for limiting Copilot hallucination.
AI Platforms

AI Platforms

2023-12-0433:54

Our guest today is Eric Boyd, the Corporate Vice President of AI at Microsoft. Eric joins us to share how organizations can leverage AI for faster development. Eric shared the benefits of using natural language to build products. He discussed the future of version control and the level of AI background required to get started with Azure AI. He mentioned some foundational models in Azure AI and their capabilities. Follow Eric on LinkedIn to learn more about his work. Visit today's sponsor at https://webai.com/dataskeptic
Deploying LLMs

Deploying LLMs

2023-11-2735:00

We are excited to be joined by Aaron Reich and Priyanka Shah. Aaron is the CTO at Avanade, while Priyanka leads their AI/IoT offering for the SEA Region. Priyanka is also the MVP for Microsoft AI. They join us to discuss how LLMs are deployed in organizations.
In this episode, we are joined by Jenny Liang, a PhD student at Carnegie Mellon University, where she studies the usability of code generation tools. She discusses her recent survey on the usability of AI programming assistants. Jenny discussed the method she used to gather people to complete her survey. She also shared some questions in her survey alongside vital takeaways. She shared the major reasons for developers not wanting to us code-generation tools. She stressed that the code-generation tools might access the software developers' in-house code, which is intellectual property. Learn more about Jenny Liang via https://jennyliang.me/  
We are joined by Aman Madaan and Shuyan Zhou. They are both PhD students at the Language Technology Institute at Carnegie Mellon University. They join us to discuss their latest published paper, PAL: Program-aided Language Models. Aman and Shuyan started by sharing how the application of LLMs has evolved. They talked about the performance of LLMs on arithmetic tasks in contrast to coding tasks. Aman introduced their PAL model and how it helps LLMs improve at arithmetic tasks. He shared examples of the tasks PAL was tested on. Shuyan discussed how PAL’s performance was evaluated using Big Bench hard tasks. They discussed the kind of mistakes LLMs tend to make and how the PAL’s model circumvents these limitations. They also discussed how these developments in LLMS can improve kids learning. Rounding up, Aman discussed the CoCoGen project, a project that enables NLP tasks to be converted to graphs. Shuyan and Aman shared their next research steps. Follow Shuyan on Twitter @shuyanzhxyc. Follow Aman on @aman_madaan.
In this episode, we have Alessio Buscemi, a software engineer at Lifeware SA. Alessio was a post-doctoral researcher at the University of Luxembourg. He joins us to discuss his paper, A Comparative Study of Code Generation using ChatGPT 3.5 across 10 Programming Languages.  Alessio shared his thoughts on whether ChatGPT is a threat to software engineers. He discussed how LLMs can help software engineers become more efficient.
GraphText

GraphText

2023-10-3130:561

On the show today, we are joined by Jianan Zhao, a Computer Science student at Mila and the University of Montreal. His research focus is on graph databases and natural language processing. He joins us to discuss how to use graphs with LLMs efficiently.  
Today, we are joined by Rajiv Movva, a PhD student in Computer Science at Cornell Tech University. His research interest lies in the intersection of responsible AI and computational social science. He joins to discuss the findings of this work that analyzed LLM publication patterns. He shared the dataset he used for the survey. He also discussed the conditions for determining the papers to analyze. Rajiv shared some of the trends he observed from his analysis. For one, he observed there has been an increase in LLMs research. He also shared the proportions of papers published by universities, organizations, and industry leaders in LLMs such as OpenAI and Google. He mentioned the majority of the papers are centered on the social impact of LLMs. He also discussed other exciting application of LLMs such as in education.
We are excited to be joined by Josh Albrecht, the CTO of Imbue. Imbue is a research company whose mission is to create AI agents that are more robust, safer, and easier to use. He joins us to share findings of his work; Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety.  
On today’s show, we are joined by Thilo Hagendorff, a Research Group Leader of Ethics of Generative AI at the University of Stuttgart. He joins us to discuss his research, Deception Abilities Emerged in Large Language Models. Thilo discussed how machine psychology is useful in machine learning tasks. He shared examples of cognitive tasks that LLMs have improved at solving. He shared his thoughts on whether there’s a ceiling to the tasks ML can solve.
Nieves Montes, a Ph.D. student at the Artificial Intelligence Research Institute in Barcelona, Spain, joins us. Her PhD research revolves around value-based reasoning in relation to norms. She shares her latest study, Combining theory of mind and abductive reasoning in agent‑oriented programming.
LLMs for Evil

LLMs for Evil

2023-09-2526:13

We are joined by Maximilian Mozes, a PhD student at the University College, London. His PhD research focuses on Natural Language Processing (NLP), particularly the intersection of adversarial machine learning and NLP. He joins us to discuss his latest research, Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities.
loading
Comments (32)

mrs rime

🔴💚Really Amazing ️You Can Try This💚WATCH💚ᗪOᗯᑎᒪOᗩᗪ👉https://co.fastmovies.org

Jan 16th
Reply

Priya Dharshini

🔴WATCH>>ᗪOᗯᑎᒪOᗩᗪ>>👉https://co.fastmovies.org

Jan 16th
Reply

Tommy king

You would normally use a 3D rendering programme or game engine that supports rendering animations and sequences to render sequences like Maya. Here are some general guidelines to assist you in doing that have a peek here https://thewordpoint.com/services/translation-service/film-script-translation depending on the number of countries you intend to release your film in, you may need to hire film screenplay translation services in more than one language if you want to deliver your tale to a global audience.

Jun 21st
Reply

DemonDogs

incredibly awful audio for some reason

Apr 25th
Reply

tell tims

Now a days surveys are crucial to implement any type of instructions or development works. So, it's great thing to conduct the surveys. That's why we (Tim Hortons) are conducting the surveys at the official survey site https://telltims-ca.com. In return we have providing free validation codes to the survey participants to redeem on their further visit.

Mar 27th
Reply

Edward McBride

Hi friends. I agree, this book is very useful. But I'm not strong on marketing. So I enlisted the help of a proven marketing agency, https://saphira.agency/. They helped me promote my business and make my marketing loud, bright, unique and effective. I am sure that you should contact this agency if you need help in promoting your brand or business.

Dec 29th
Reply

John Skinner

Sponsored Social Media Posts: You can pay to turn one of your company's social media posts into an ad. This method allows you to select the target audience, region, and duration of the ad. Pay Per Click Ads. With this model, you pay every time someone clicks on your ad. Search engines such as Google offer this service where your ad appears at the top of the results page for your chosen keywords.

Dec 29th
Reply

Anthony Hall

Visuals are key. Investing in high quality visual content for your website and social media is a must. Many users rely on images to decide who to follow and what messages to interact with. Consider including photos or videos of your products, services, facilities, or staff.

Dec 28th
Reply

Hasters

In general, as for the affiliate program for designers, it is best to find a special offer. I already know from experience that here https://masterbundles.com/become-our-affiliate/ in the company Master Bundles you can get up to 15% commission if you start cooperation with this service. For me for example it helped me a lot, I hope it will help you too. Good luck with this business.

Dec 21st
Reply

Bill

Wow this is fascinating

Oct 1st
Reply

DemonDogs

the guest has classic psychological and groupthink issues in his research he needs to get out more

Aug 3rd
Reply (1)

Data Science LL

Hi guys in data skeptic. Thanks to share your valuable contents. May I acheive the text of your podcast? Wish luck.

Jul 20th
Reply

Vassili Savinov

great episode, cant wait to hear next one. Thanks!

Jun 28th
Reply

DemonDogs

Description?

Jan 23rd
Reply

ncooty

@6:00: The threshold for statistical significance does not "depend on the outcome." It raises a red flag even to hear someone say that, especially the host of a "data science" podcast. (Of course, if he knew what he was talking about, he'd be a "statistician" instead.) He might more accurately have said that any such estimate of the minimum sample size depends on the number of planned comparisons and the assumed effect size for each measured effect. Confusion about this should disqualify someone from hosting such a podcast.

Aug 23rd
Reply

ncooty

@2:19: Too much interpretation as if respondents were randomly sampled. Respondents self-selected.

Aug 23rd
Reply

Antonio Andrade

thanks so much for sharing the results

Aug 12th
Reply

ncooty

@1:03: It doesn't "beg the question"; it "raises the question." To "beg the question" is to commit a logical fallacy in which one assumes the conclusion.

Jun 15th
Reply

Benjamin Weckerle

Is the spin-off / journal club podcast on castbox?

Jun 2nd
Reply

Platte Gruber

KILLER intro, awesome work!

Jan 8th
Reply (1)
Download from Google Play
Download from App Store