Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”

Update: 2025-06-03

Description

Interesting responses [WaybackSave/Archive] Tom Sydney Kerckhove on X: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!” and later

They all come down to

excuses for using LLM without any substantial result (most of the results come down to one having to become the tester and fixer of the generated code without newer generated code being improved: the opposite of coaching an apprentice)

become better at prompting (which is basically regarding the prompt as a new programming language: been there, done that)

[WaybackSave/Archive] One of the “become better at prompting” replies referred to a blog post disguising prompting as writing lots of unit tests: [Wayback/Archive] The Cline AI Assistant is Mesmerizing · mtlynch.io

I tried out the Cline AI assistant yesterday, and then I went into a trance for five hours where I couldn’t do anything but stare transfixed at Cline fixing bugs for me.

…

I should be able to just keep showing the AI assistant test cases with the behavior I want in, and it should be able to just keep editing the code until the test passes.

…

And that’s when I was hooked. I was so amazed that I could develop software this way. I just told the tool what I wanted, and it kept doing exactly what I asked.

Well, it is relatively easy to make unit tests pass no matter the programming environment without solving the actual problem. Examples for one are at [Wayback/Archive] java – How to make an unit test always pass? – Stack Overflow.

I responded to one of the longer reactions:

Shorter: LLMs can’t understand.

That’s their fundamental problem: they are just a bunch of statistics trained on ever dwindling levels of input.

and later added

Larger models don’t solve a fundamental problem: they are based on statistics instead of reasoning.

More training data fails another fundamental problem: the data pool is already poisoned by LLM generated mediocre code.

The pool level already was of average programmers at best.

plus

Unless brain-inspired machine learning ¹ takes off, LLMs can improve little as they exhausted the data lake already ² and the internet is already full of data poisoning new models ³.

1/

¹ https://pmc.ncbi.nlm.nih.gov/articles/PMC10630093/

² https://petapixel.com/2025/01/09/have-ai-companies-run-out-of-training-data-elon-musk-ilya-sutskever-thinks-so/

³ https://www.nature.com/articles/s41586-024-07566-y

and

What could help to a limited extent is human curation of the data pool. That is prohibitively expensive.

2/2

The only use case for me using LLM is to get a rough idea of a new field of work, but usually using the better search engines get me there just as quickly.

This response was epic: [Wayback/Archive] Catalin on X: “@kerckhove_ts “write better prompts” army coming.”

[Wayback/Archive] Gi7fb9PWIAAwQod.jpg (496×264)

[Wayback/Archive] video.twimg.com/tweet_video/Gi7fb9PWIAAwQod.mp4

[Wayback/Archive] Tweet JSON

--jeroen

My tweets:

[WaybackSave/Archive] Jeroen Wiert Pluimers @wiert@mastodon.social on X: “@DanielHoffmann_ @cssslinger @kerckhove_ts Shorter: LLMs can’t understand. That’s their fundamental problem: they are just a bunch of statistics trained on ever dwindling levels of input.”

[WaybackSave/Archive] Jeroen Wiert Pluimers @wiert@mastodon.social on X: “@DanielHoffmann_ @cssslinger @kerckhove_ts Larger models don’t solve a fundamental problem: they are based on statistics instead of reasoning. More training data fails another fundamental problem: the data pool is already poisoned by LLM generated mediocre code. The pool level already was of average programmers at best.”

[WaybackSave/Archive] Jeroen Wiert Pluimers @wiert@mastodon.social on X: “@ten_crowns @DanielHoffmann_ @cssslinger @kerckhove_ts Unless brain-inspired machine learning ¹ takes off, LLMs can improve little as they exhausted the data lake already ² and the internet is already full of data poisoning new models ³. 1/ ¹ … ² … ³ …”

[WaybackSave/Archive] Jeroen Wiert Pluimers @wiert@mastodon.social on X: “@ten_crowns @DanielHoffmann_ @cssslinger @kerckhove_ts What could help to a limited extent is human curation of the data pool. That is prohibitively expensive. 2/2”

Comments

In Channel

Draad van @Walrathis expert in psychotraumatologie on Thread Reader App – over crisisrespons tijdens Oktoberfest 2025

2025-10-13--:--

Rudimentary DaynaPORT packet driver to use WiFi from DOS using BlueSCSI: GitHub – cml37/daynaport-dos-packet-driver

2025-10-08--:--

HSTS Preload List Submission

2025-10-08--:--

Iemand een alternatief NPO Soul & Jazz-kanaal met zo min mogelijk geklets overdag, en elk uur een nieuwsbulletin?

2025-09-26--:--

Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”

2025-06-03--:--

Office suites trick I was unaware off: you can use images as background of shapes, then distort by moving the corner points

2025-05-26--:--

Cyber Gangsta’s Paradise | Prof. Merli ft. MC BlackHat [Parody Music Video] – YouTube

2025-05-16--:--

Monty Python and the Holy Grail turns 50 – Ars Technica

2025-05-02--:--

Windows: extracting CD-audio for a funeral: CDex, MP3Gain (a replaygain like implementation which modifies MP3 metadata) plus UI wrapper and audacity (for combining tracks)

2025-03-27--:--

Ben Dicken on X: “You asked for it, so here it is. Visualizing CPU cache speeds relative to RAM. Cache optimization is important too!”

2025-03-18--:--

Insightful video on arm movements via Chris Kavanagh on X: “Found this in Reddit seems like it might be useful for some people.”

2025-01-31--:--

Some notes on mini/micro Apple //e emulators

2025-01-30--:--

VideoLAN on Twitter: “VLC automatic subtitles generation and translation based on local and open source AI models running on your machine working offline, and supporting numerous languages! Demo can be found on our #CES2025 booth in Eureka Park.” (video)

2025-01-11--:--

Laurens on X: “Heel goed item van Lubach over erfbelasting #Avondshow”

2025-01-10--:--

Likely the best Xmas commercial this year: Bubbles | Deutsche Telekom | Christmas Ad 2024 – YouTube

2024-12-25--:--

Jeffrey | JKCTech on X: “Dit is echt 1 van de aller mooiste edge cases voor een licht sensor die ik ooit heb gezien… https://t.co/wkm8ztbHI9” / X

2024-11-26--:--

GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++

2024-11-20--:--

The codewali on Twitter: “How API works?”

2024-09-17--:--

Twitter: getting a tweet video URL

2024-09-09--:--

MokupiPogisho👁️ on Twitter: “How to find hidden cameras in AirBnB 👁”

2024-09-06--:--

00:00

Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”

#box-pro-ellipsis-176146244461522{-webkit-line-clamp:2;}Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”

Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”

jpluimers

Tom Sydney Kerckhove on Twitter: “I haven’t found any programming tasks that an LLM could do even barely correctly. What kind of code are you all writing?!”