Data Science #13 - Kolmogorov complexity paper review (1965) - Part 2
Description
In this episode we review the second part of Kolmogorov's seminal paper:
"Three approaches to the quantitative definition of information." Problems of Information Transmission 1.1 (1965): 1-7.
The paper introduces algorithmic complexity (or Kolmogorov complexity), which measures the amount of information in an object based on the length of the shortest program that can describe it.
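For reference, that definition can be written compactly as below; the notation (a fixed universal machine U and programs p) is our shorthand for the idea discussed in the episode, not quoted verbatim from the paper:

```latex
% Kolmogorov complexity of an object x relative to a universal machine U:
% the length of the shortest program p that makes U output x.
K_U(x) = \min \{\, \lvert p \rvert \;:\; U(p) = x \,\}
```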
This shifts focus from Shannon entropy, which measures uncertainty probabilistically, to understanding the complexity of structured objects.
Kolmogorov argues that systems like texts or biological data, which are governed by rules and patterns, are better analyzed by their compressibility (how efficiently they can be described) than by purely probabilistic models.
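As a rough illustration of that compressibility view (a minimal sketch, not code from the episode; zlib and the example strings are our stand-ins for "the shortest description"), the compressed size of a string gives a computable upper bound on its Kolmogorov complexity:

```python
import os
import zlib

def compressed_length(data: bytes) -> int:
    """zlib-compressed size of the data: a crude, computable proxy for K(x)."""
    return len(zlib.compress(data, 9))

structured = b"ab" * 500       # 1000 bytes with an obvious repeating pattern
random_ish = os.urandom(1000)  # 1000 bytes of pseudorandom noise

print(len(structured), compressed_length(structured))  # shrinks to a handful of bytes
print(len(random_ish), compressed_length(random_ish))  # stays close to 1000 bytes
```

The patterned string compresses dramatically while the random one barely compresses at all, mirroring the low-complexity versus high-complexity distinction in the paper.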
In modern data science and AI, these ideas are crucial. Machine learning models, like neural networks, aim to compress data into efficient representations to generalize and predict.
Kolmogorov complexity underpins the idea of minimizing model complexity while preserving key information, which is essential for preventing overfitting and improving generalization.
In AI, tasks such as text generation and data compression directly apply Kolmogorov's concept of finding the most compact representation, making his work foundational for building efficient, powerful models.
This is part 2 of 2 episodes covering this paper (part 1 is in Episode 12).