GitHub - abagames/flipwalker-game-benchmark: A benchmark that gives the same puzzle-game prompt to multiple AI coding agents and compares the results.

A benchmark that gives the same puzzle-game prompt to multiple AI coding agents and compares the results. - abagames/flipwalker-game-benchmark