DiscoverNext in TechData Integration for AI
Data Integration for AI

Data Integration for AI

Update: 2025-09-02
Share

Description

Can you have too much data for an AI application? In the mad dash to collect the raw material for AI applications, it can be tempting to pull in as much as you can. Product manager Emily Jasper returns to the podcast with a set of recommendations for more strategic use of data with host Eric Hanselman. Just as it might not be wise to load up on everything on a buffet, being strategic about using the data that best suits the goals of your project can improve outcomes and help to manage risk. By understanding the data that you’re putting to work, you can bound the universe of outcomes and simplify the process of bringing it into the AI application pipeline. At the same time, the process of data governance becomes clearer when the sources are better understood. 

Bringing an understanding of the set of data resources that an enterprise has is critical and has to be accompanied by knowledge of the quality of that data. The principles of library sciences are back in focus in AI, as organizations work to curate data characteristics and provenance. As in so much of AI, matching the ecosystem of tools, data providers, and capabilities to the use cases being built is fundamental to project success. Managing risk in AI has become a process of bringing the right data to the right problem.

More S&P Global Content:

For S&P Global Subscribers:

Credits:

  • Host/Author: Eric Hanselman
  • Guest: Emily Jasper
  • Producer/Editor: Adam Kovalsky
  • Published With Assistance From: Sophie Carr, Feranmi Adeoshun, Kyra Smith
Comments 
In Channel
Stablecoins

Stablecoins

2025-10-1429:39

Industrial Metaverse

Industrial Metaverse

2025-10-0724:56

HR Tech

HR Tech

2025-09-3025:54

Data Migration

Data Migration

2025-09-2326:47

AI Infrastructure

AI Infrastructure

2025-09-1623:48

Black Hat and DefCon

Black Hat and DefCon

2025-08-1930:12

FinTech Advances

FinTech Advances

2025-08-1229:16

The Creator Economy

The Creator Economy

2025-08-0529:10

Advertising and Tech

Advertising and Tech

2025-07-2927:08

Security for MCP

Security for MCP

2025-07-0826:42

Context Around MCP

Context Around MCP

2025-07-0128:44

Datacenter Slowdown?

Datacenter Slowdown?

2025-06-1032:36

Personal Data

Personal Data

2025-06-0330:03

loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Data Integration for AI

Data Integration for AI

S&P Global Market Intelligence