Understanding AI

David J Higgs

May 31

Although I did solve it, I definitely do not consider the current ARC-AGI-3 puzzles anywhere near "mind-blowingly easy" lol. Of course I do struggle with vividly visualizing anything, sense of direction, detailed visual memory, etc., relative to other "intelligence" based tasks, so I suppose it could be some good old human jaggedness showing up

Tom

May 31

Some of the tasks are pretty tricky, but I definitely found this one the easiest, and its human baseline has the fewest moves.

What's really wild is that they couldn't even get the first level right (which by design is quite easy). Odds are decent that changes soon, but it shows that there are still some pretty big differences between LLM and human intelligence.

Oleg Alexandrov

May 28 (edited)

Math is really hard, and humans have pretty good intuition as is. Likely quite a few problems will be solved with variations of existing techniques, perhaps in new contexts, and with sheer thorough exploration of the strategy space.

So, it is quite likely AI will get at least as good as people soon enough. That will likely still leave out a lot of fiendishly hard problems. Will be fun to watch how it plays out.

I'm definitely excited to see how this plays out. I'm also excited to play around with models more with math and see what happens. We'll see if that ends up ever turning into a piece, or if it will always just be a hobby...

Mañana

Thanks for the fascinating write-up. Seems like a sophisticated example of remainder humanism.

What do you mean by remainder humanism?

Mañana

May 30

It's a defensive view set out by Leif Weatherby…a kind of ever-shifting line intended to preserve human competencies from machine ones. It's an interesting book.

Seth

Appreciate the write-up, and I thought you did a very good job! (Though even if you didn't, how would I know...?)

The main advantage I suspect humans will retain over AI is... for lack of a better term, people are better thing-wanters. People are voracious and discerning thing-wanters, who are always changing their minds about exactly what things are worth wanting, and are always looking over the horizon for the next thing to want. This gets us in trouble sometimes--like in this case, no mathematician *wanted* to pursue this proof strategy--but in general I think this is an asset.

Modern AIs, on the other hand, more or less want whatever you tell them to want. Right now that's their entire thing: they'll hammer away relentlessly at any dumb thing you come up with. They want to be steered. That's not to say it's *trivial* to steer AI models, but on the whole they seem much more steerable than humans.

This could always change! But I suspect it won't, at least as long as building frontier AI remains extremely capital intensive. I can't imagine anyone dumping billions of dollars into making their product *less* compliant and steerable.

Thank you! One of the things that I had to cut for length was a section called "what is the point of math?"

Even if AIs get better at mathematics, and at the type of volition you are talking about, there's a sense in which math is about getting humans to understand math. The whole game of proving theorems is rather like the whole game of solving homework problems: the understanding gained and transmitted is the point, rather than the result per se. (There are contexts where this isn't true.)

So one advantage humans will maintain is that the thing we want in mathematics is fundamentally human, too. That may not pay the bills though, as a really good recent Substack post puts it: https://davidbessis.substack.com/p/the-fall-of-the-theorem-economy

Chris

May 29

Thank you for this very clear explanation. I am confused about one thing--I agree that the OpenAI diagram shows the c^2=65 case. However, for a 16x16 grid layout, I think the maximal number of unit distance pairs happens for c^2=25 (976 pairs) not c^2=65 (912 pairs). Am I missing something or did they make the figure incorrectly? Thanks for your work in putting this together.

May 29

Hi Chris, thanks for pointing this out! I actually note in a footnote that the maximal number of unit distance pairs happens for c^2=25, so you’re correct about that.

One subtlety here I’m not sure I communicated clearly enough: with something like the grid Erdős constructed, mathematicians really care about the behavior as the number of points grows rather than what is exactly optimal for a fixed size. We already knew that a square grid was not going to be optimal; things which look closer to the screenshot I shared in the second section have more unit distances. So having c^2=25 or c^2=65 isn’t a huge deal in this case: both are illustrating the same main idea.

That being said, OpenAI’s diagram is confusing. I think they thought that c^2=65 would look better, and they might be right tbh.
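
For anyone who wants to check the two counts from Chris's comment, here is a minimal sketch. It assumes the points are the integer points of an n x n grid and that "unit distance" means squared distance c^2 before rescaling the grid by 1/c. The function name `count_pairs` is just an illustrative helper, not anything taken from the OpenAI figure.

```python
def count_pairs(n: int, d2: int) -> int:
    """Count unordered pairs of points in an n x n integer grid
    whose squared distance is exactly d2."""
    total = 0
    # Enumerate difference vectors (dx, dy) in a half-plane so that
    # each unordered pair is counted once, not twice.
    for dx in range(0, n):
        for dy in range(-(n - 1), n):
            if dx * dx + dy * dy != d2:
                continue
            if dx == 0 and dy <= 0:
                # (0, -dy) is the negation of (0, dy); skip the duplicate.
                continue
            # Number of positions where both endpoints stay inside the grid.
            total += (n - dx) * (n - abs(dy))
    return total


print(count_pairs(16, 25))  # 976
print(count_pairs(16, 65))  # 912
```

For c^2=25 the contributing vectors are (5,0), (0,5) and the four sign variants of (3,4) and (4,3); for c^2=65 they are the variants of (1,8) and (4,7). Calling the same function with a larger n shows how the count changes as the grid grows.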