
Repeating the LLM vs Advent of Code experiment

Last year I ran an experiment with ChatGPT and Advent of Code. I'm thinking of repeating it, and since I was criticized last year for my choice of model and prompt, I'm going to crowdsource both: which LLM should I use, i.e. which one is best at writing code? And what prompt should I give it?

The new Qwen 2.5 32B just dropped, and people are saying it's roughly as good as the newest Sonnet for coding. I don't know how easy it is to get access to Qwen, but it's Chinese and should be cheap, and it's open source...

Might be good to try it as a comparison, to see whether it's really that good or they've just been benchmark hacking. But Sonnet is the most obvious pick IMO.