sensible takes on using LLMs
"Attention sinks" fix this in recent models