Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

WTF-Beta

  1. Home
  2. Categories
  3. Off Key - General Discussion
  4. Testing AI coding tools

Testing AI coding tools

Scheduled Pinned Locked Moved Off Key - General Discussion
2 Posts 2 Posters 15 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • wtgW Offline
    wtgW Offline
    wtg
    wrote last edited by
    #1

    The idea of using AI to help with computer programming has become a contentious issue. On the one hand, coding agents can make horrific mistakes that require a lot of inefficient human oversight to fix, leading many developers to lose trust in the concept altogether. On the other hand, some coders insist that AI coding agents can be powerful tools and that frontier models are quickly getting better at coding in ways that overcome some of the common problems of the past.

    To see how effective these modern AI coding tools are becoming, we decided to test four major models with a simple task: re-creating the classic Windows game Minesweeper. Since it’s relatively easy for pattern-matching systems like LLMs to play off of existing code to re-create famous games, we added in one novelty curveball as well.

    Our straightforward prompt:

    Make a full-featured web version of Minesweeper with sound effects that

    1. Replicates the standard Windows game and

    2. implements a surprise, fun gameplay feature.

    Include mobile touchscreen support.

    https://arstechnica.com/ai/2025/12/the-ars-technica-ai-coding-agent-test-minesweeper-edition/

    When the world wearies and society ceases to satisfy, there is always the garden - Minnie Aumônier

    ShiroKuroS 1 Reply Last reply
    • wtgW wtg

      The idea of using AI to help with computer programming has become a contentious issue. On the one hand, coding agents can make horrific mistakes that require a lot of inefficient human oversight to fix, leading many developers to lose trust in the concept altogether. On the other hand, some coders insist that AI coding agents can be powerful tools and that frontier models are quickly getting better at coding in ways that overcome some of the common problems of the past.

      To see how effective these modern AI coding tools are becoming, we decided to test four major models with a simple task: re-creating the classic Windows game Minesweeper. Since it’s relatively easy for pattern-matching systems like LLMs to play off of existing code to re-create famous games, we added in one novelty curveball as well.

      Our straightforward prompt:

      Make a full-featured web version of Minesweeper with sound effects that

      1. Replicates the standard Windows game and

      2. implements a surprise, fun gameplay feature.

      Include mobile touchscreen support.

      https://arstechnica.com/ai/2025/12/the-ars-technica-ai-coding-agent-test-minesweeper-edition/

      ShiroKuroS Offline
      ShiroKuroS Offline
      ShiroKuro
      wrote last edited by
      #2

      @wtg that was interesting, thanks for posting it!

      Not the same as agentic AI, I’ve spent a lot of time testing ChatGPT, Gemini and Copilot in voice mode in Japanese to see which chatbots are best as a conversation practice partner for learners. They all have problems, but ChatGPT (the OpenAI product) is by far the best (currently). It’s interesting how much better it is than the others (esp since Copilot shares some infrastructure with ChatGPT).

      1 Reply Last reply
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      Powered by NodeBB | Contributors
      • Login

      • Don't have an account? Register

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • Users
      • Groups