#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone

Update: 2024-08-12

Description

Perhaps the biggest complaint about generative AI is hallucination. If the text you want to generate involves facts, for example, a chatbot that answers questions, then hallucination is a problem. The solution to this is to make use of a technique called retrieval augmented generation, where you store facts in a vector database and retrieve the most appropriate ones to send to the large language model to help it give accurate responses. So, what goes into building vector databases and how do they improve LLM performance so much?

Ram Sriharsha is currently the CTO at Pinecone. Before this role, he was the Director of Engineering at Pinecone and previously served as Vice President of Engineering at Splunk. He also worked as a Product Manager at Databricks. With a long history in the software development industry, Ram has held positions as an architect, lead product developer, and senior software engineer at various companies. Ram is also a long time contributor to Apache Spark.

In the episode, Richie and Ram explore common use-cases for vector databases, RAG in chatbots, steps to create a chatbot, static vs dynamic data, testing chatbot success, handling dynamic data, choosing language models, knowledge graphs, implementing vector databases, innovations in vector data bases, the future of LLMs and much more.

Links Mentioned in the Show:

New to DataCamp?

Learn on the go using the DataCamp mobile app

Empower your business with world-class data and AI skills with DataCamp for business

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

#257 Can You Use AI-Driven Pricing Ethically? with Jose Mendoza, Academic Director & Clinical Associate Professor at NYU

2024-11-0144:58

#256 From Deep Learning to SuperIntelligence with Terry Sejnowski, Head of Computational Neurobiology at Salk Institute

2024-10-2850:23

#255 Not Only Vector Databases: Putting Databases at the Heart of AI, with Andi Gutmans, VP and GM of Databases at Google

2024-10-2446:15

#254 Career Skills for Data Professionals with Wes Kao, Co-Founder of Maven

2024-10-2145:44

#253 The Infrastructure Supporting the Data Revolution with Saad Siddiqui, General Partner at Titanium Ventures

2024-10-1738:17

#252 Is Big Data Dead? MotherDuck and the Small Data Manifesto with Ryan Boyd Co-Founder at MotherDuck

2024-10-1448:50

#251 The New Toolkit For CDOs with Adrian Estala, VP, Field Chief Data Officer at Starburst

2024-10-1148:29

#250 How Data and AI are Changing Data Management with Jamie Lerner, CEO, President & Chairman at Quantum

2024-10-0748:28

#249 Towards Self-Service Data Engineering with Taylor Brown, Co-Founder and COO at Fivetran

2024-10-0350:28

#248 Effective Product Management for AI with Marily Nika, Gen AI Product Lead at Google Assistant

2024-09-3041:36

#247 Aligning AI with Enterprise Strategy with Leon Gordon, CEO at Onyx Data

2024-09-2640:13

#246 AI and the Future of Art with Kent Keirsey, Founder & CEO at Invoke

2024-09-2346:32

#245 Can We Make Generative AI Cheaper? With Natalia Vassilieva, Senior VP & Field CTO & Andy Hock, VP, Product & Strategy at Cerebras Systems

2024-09-1946:05

#244 Using Data to Optimize Costs in Healthcare with Travis Dalton and Jocelyn Jiang President/CEO & VP of Data & Decision Science at MultiPlan

2024-09-1639:06

#243 No-Code LLMs In Practice with Birago Jones & Karthik Dinakar, CEO & CTO at Pienso

2024-09-1254:05

#242 Data Storytelling for Kids with Cole Nussbaumer Knaflic, Founder and CEO of Storytelling with Data

2024-09-0950:02

#241 Getting Generative AI Into Production with Lin Qiao, CEO and Co-Founder of Fireworks AI

2024-09-0544:10

#240 Generative AI in the Enterprise with Steve Holden, Senior Vice President and Head of Single-Family Analytics at Fannie Mae

2024-09-0239:07

#239 New Models for Digital Transformation with Alison McCauley Chief Advocacy Officer at Think with AI & Founder of Unblocked Future

2024-08-2951:20

#238 Data & AI for Improving Patient Outcomes with Terry Myerson, CEO at Truveta

2024-08-2639:49

00:00

#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone

#box-pro-ellipsis-173063849586484{-webkit-line-clamp:2;}#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone

#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone

DataCamp

#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone