Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

WTF-Beta

  1. Home
  2. Categories
  3. Off Key - General Discussion
  4. Testing AI coding tools

Testing AI coding tools

Scheduled Pinned Locked Moved Off Key - General Discussion
4 Posts 3 Posters 31 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • wtgW Offline
    wtgW Offline
    wtg
    wrote last edited by
    #1

    The idea of using AI to help with computer programming has become a contentious issue. On the one hand, coding agents can make horrific mistakes that require a lot of inefficient human oversight to fix, leading many developers to lose trust in the concept altogether. On the other hand, some coders insist that AI coding agents can be powerful tools and that frontier models are quickly getting better at coding in ways that overcome some of the common problems of the past.

    To see how effective these modern AI coding tools are becoming, we decided to test four major models with a simple task: re-creating the classic Windows game Minesweeper. Since it’s relatively easy for pattern-matching systems like LLMs to play off of existing code to re-create famous games, we added in one novelty curveball as well.

    Our straightforward prompt:

    Make a full-featured web version of Minesweeper with sound effects that

    1. Replicates the standard Windows game and

    2. implements a surprise, fun gameplay feature.

    Include mobile touchscreen support.

    https://arstechnica.com/ai/2025/12/the-ars-technica-ai-coding-agent-test-minesweeper-edition/

    When the world wearies and society ceases to satisfy, there is always the garden - Minnie Aumônier

    ShiroKuroS 1 Reply Last reply
    • wtgW wtg

      The idea of using AI to help with computer programming has become a contentious issue. On the one hand, coding agents can make horrific mistakes that require a lot of inefficient human oversight to fix, leading many developers to lose trust in the concept altogether. On the other hand, some coders insist that AI coding agents can be powerful tools and that frontier models are quickly getting better at coding in ways that overcome some of the common problems of the past.

      To see how effective these modern AI coding tools are becoming, we decided to test four major models with a simple task: re-creating the classic Windows game Minesweeper. Since it’s relatively easy for pattern-matching systems like LLMs to play off of existing code to re-create famous games, we added in one novelty curveball as well.

      Our straightforward prompt:

      Make a full-featured web version of Minesweeper with sound effects that

      1. Replicates the standard Windows game and

      2. implements a surprise, fun gameplay feature.

      Include mobile touchscreen support.

      https://arstechnica.com/ai/2025/12/the-ars-technica-ai-coding-agent-test-minesweeper-edition/

      ShiroKuroS Offline
      ShiroKuroS Offline
      ShiroKuro
      wrote last edited by
      #2

      @wtg that was interesting, thanks for posting it!

      Not the same as agentic AI, I’ve spent a lot of time testing ChatGPT, Gemini and Copilot in voice mode in Japanese to see which chatbots are best as a conversation practice partner for learners. They all have problems, but ChatGPT (the OpenAI product) is by far the best (currently). It’s interesting how much better it is than the others (esp since Copilot shares some infrastructure with ChatGPT).

      1 Reply Last reply
      • JodiJ Offline
        JodiJ Offline
        Jodi
        wrote last edited by Jodi
        #3

        My adult kid uses a subscription AI all the time as a software engineer. If you know what you are doing it is game changing from a productivity standpoint. It’s a much faster way to get all the information you need to solve a problem. (Which is all coding is, solving problems). He says it’s like a mirror. The more of an expert you are about coding, the better it will work for you.

        ShiroKuroS 1 Reply Last reply
        👍
        • JodiJ Jodi

          My adult kid uses a subscription AI all the time as a software engineer. If you know what you are doing it is game changing from a productivity standpoint. It’s a much faster way to get all the information you need to solve a problem. (Which is all coding is, solving problems). He says it’s like a mirror. The more of an expert you are about coding, the better it will work for you.

          ShiroKuroS Offline
          ShiroKuroS Offline
          ShiroKuro
          wrote last edited by
          #4

          @Jodi said in Testing AI coding tools:

          The more of an expert you are about coding, the better it will work for you.

          I suspect this is the case for most AI tasks. If you are already an expert, then you can use AI in ways that boost productivity, and probably creativity as well.

          The problem is, if you are not already an expert, using AI is probably going to prevent you from ever becoming one.

          1 Reply Last reply
          Reply
          • Reply as topic
          Log in to reply
          • Oldest to Newest
          • Newest to Oldest
          • Most Votes


          Powered by NodeBB | Contributors
          • Login

          • Don't have an account? Register

          • Login or register to search.
          • First post
            Last post
          0
          • Categories
          • Recent
          • Tags
          • Popular
          • Users
          • Groups