DiscoverWhat is it about computational communication science?Observing Opinions: What is Pre-Processing?
Observing Opinions: What is Pre-Processing?

Observing Opinions: What is Pre-Processing?

Update: 2025-09-09
Share

Description

In this episode, Prof. Jamal Abdul Nasir from the University of Galway reveals why pre-processing is the backbone of all text analysis. He breaks down key steps like defining documents, tokenization, removing stop words, unification, and stemming vs. lemmatization. Jamal also explains unigrams vs. bigrams and how modern NLP techniques like byte-pair encoding are changing the game. Plus, he shares practical tips for making your pre-processing transparent and reproducible, helping your research stand strong and scale up.

Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Observing Opinions: What is Pre-Processing?

Observing Opinions: What is Pre-Processing?

Emese Domahidi & Mario Haim