DiscoverCreativity Research Audio Journal (CRAJ)Ep.142. Creative Preference Optimization
Ep.142. Creative Preference Optimization

Ep.142. Creative Preference Optimization

Update: 2025-06-04
Share

Description

"Creative Preference Optimization" by Mete Ismayilzada, Antonio Laverghetta Jr., Simone A. Luchini, Reet Patel, Antoine Bosselut, Lonneke van der Plas, Roger Beaty


Summary

This document introduces Creative Preference Optimization (CRPO), a novel method designed to enhance the creativity of Large Language Models (LLMs). The authors argue that existing methods often focus too narrowly on single aspects of creativity, proposing CRPO as a modular approach that integrates signals from multiple creativity dimensions—novelty, diversity, surprise, and quality—into the preference optimization process. To train and evaluate their models, they also present MUCE, a new large-scale dataset of human creativity assessments. Their experiments show that models trained with CRPO outperform baseline LLMs, including strong commercial models, in generating content that is more novel, diverse, and surprising while maintaining high quality, suggesting that directly optimizing for creativity within preference frameworks is a promising direction.

Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Ep.142. Creative Preference Optimization

Ep.142. Creative Preference Optimization

Alog