Software Engineering Institute (SEI) Podcast Series
Evaluating Large Language Models for Cybersecurity Tasks: Challenges and Best Practices

Update: 2024-07-25

Description

How can we effectively use large language models (LLMs) for cybersecurity tasks? In this Carnegie Mellon University Software Engineering Institute podcast, Jeff Gennari and Sam Perl discuss applications for LLMs in cybersecurity, potential challenges, and recommendations for evaluating LLMs.