612 points · 9 comments · 1 month ago · PunchTornado
deepmind.google
dang
takoid
I just tested the "generate an SVG of a pelican riding a bicycle" prompt and this is what I got: https://codepen.io/takoid/pen/wBWLOKj
The model thought for over 5 minutes to produce this. It's not quite photorealistic (some parts are clearly "off"), but this is definitely a significant leap in complexity.
MATTEHWHOU
Quick take: Gemini 3.1 Pro's long context is genuinely better now. I fed it a 200k-token codebase and it could reference files from the beginning without losing track. That was a real problem in 3.0.
For pure code generation though, Claude still edges it out on following complex multi-step instructions. Gemini tends to take shortcuts when the task has more than ~5 constraints.
The exciting thing is how close they all are. Competition is working exactly as it should.
lkt