• jsomae@lemmy.ml
    link
    fedilink
    English
    arrow-up
    8
    ·
    12 hours ago

    Using an LLM as a chess engine is like using a power tool as a table leg. Pretty funny honestly, but it’s obviously not going to be good at it, at least not without scaffolding.

  • NeilBrü@lemmy.world
    link
    fedilink
    English
    arrow-up
    55
    arrow-down
    1
    ·
    edit-2
    13 hours ago

    An LLM is a poor computational/predictive paradigm for playing chess.

    • Bleys@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      13 hours ago

      The underlying neural network tech is the same as what the best chess AIs (AlphaZero, Leela) use. The problem is, as you said, that ChatGPT is designed specifically as an LLM so it’s been optimized strictly to write semi-coherent text first, and then any problem solving beyond that is ancillary. Which should say a lot about how inconsistent ChatGPT is at solving problems, given that it’s not actually optimized for any specific use cases.

      • NeilBrü@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        45 minutes ago

        Yes, I agree wholeheartedly with your clarification.

        My career path, as I stated in a different comment in regards to neural networks, is focused on generative DNNs for CAD applications and parametric 3D modeling. Before that, I began as a researcher in cancerous tissue classification and object detection in medical diagnostic imaging.

        Thus, large language models are well out of my area of expertise in terms of the architecture of their models.

        However, fundamentally it boils down to the fact that the specific large language model used was designed to predict text and not necessarily solve problems/play games to “win”/“survive”.

        (I admit that I’m just parroting what you stated and maybe rehashing what I stated even before that, but I like repeating and refining in simple terms to practice explaining to laymen and, dare I say, clients. It helps me feel as if I don’t come off too pompously when talking about this subject to others; forgive my tedium.)

      • NeilBrü@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        15 hours ago

        I’m impressed, if that’s true! In general, an LLM’s training cost vs. an LSTM, RNN, or some other more appropriate DNN algorithm suitable for the ruleset is laughably high.

        • Takapapatapaka@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          14 hours ago

          Oh yes, cost of training are ofc a great loss here, it’s not optimized at all, and it’s stuck at an average level.

          Interestingly, i believe some people did research on it and found some parameters in the model that seemed to represent the state of the chess board (as in, they seem to reflect the current state of the board, and when artificially modified, the model takes modification into account in its playing). It was used by a french youtuber to show how LLMs can somehow have a kinda representation of the world. I can try to get the sources back if you’re interested.

          • NeilBrü@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            edit-2
            13 hours ago

            Absolutely interested. Thank you for your time to share that.

            My career path in neural networks began as a researcher for cancerous tissue object detection in medical diagnostic imaging. Now it is switched to generative models for CAD (architecture, product design, game assets, etc.). I don’t really mess about with fine-tuning LLMs.

            However, I do self-host my own LLMs as code assistants. Thus, I’m only tangentially involved with the current LLM craze.

            But it does interest me, nonetheless!

  • nednobbins@lemm.ee
    link
    fedilink
    English
    arrow-up
    36
    arrow-down
    1
    ·
    20 hours ago

    Sometimes it seems like most of these AI articles are written by AIs with bad prompts.

    Human journalists would hopefully do a little research. A quick search would reveal that researches have been publishing about this for over a year so there’s no need to sensationalize it. Perhaps the human journalist could have spent a little time talking about why LLMs are bad at chess and how researchers are approaching the problem.

    LLMs on the other hand, are very good at producing clickbait articles with low information content.

    • nova_ad_vitum@lemmy.ca
      link
      fedilink
      English
      arrow-up
      19
      ·
      19 hours ago

      Gotham chess has a video of making chatgpt play chess against stockfish. Spoiler: chatgpt does not do well. It plays okay for a few moves but then the moment it gets in trouble it straight up cheats. Telling it to follow the rules of chess doesn’t help.

      This sort of gets to the heart of LLM-based “AI”. That one example to me really shows that there’s no actual reasoning happening inside. It’s producing answers that statistically look like answers that might be given based on that input.

      For some things it even works. But calling this intelligence is dubious at best.

      • Ultraviolet@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        edit-2
        17 hours ago

        Because it doesn’t have any understanding of the rules of chess or even an internal model of the game state, it just has the text of chess games in its training data and can reproduce the notation, but nothing to prevent it from making illegal moves, trying to move or capture pieces that don’t exist, incorrectly declaring check/checkmate, or any number of nonsensical things.

      • JacksonLamb@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        15 hours ago

        ChatGPT versus Deepseek is hilarious. They both cheat like crazy and then one side jedi mind tricks the winner into losing.

      • interdimensionalmeme@lemmy.ml
        link
        fedilink
        English
        arrow-up
        1
        ·
        17 hours ago

        I think the biggest problem is it’s very low ability to “test time adaptability”. Even when combined with a reasonning model outputting into its context, the weights do not learn out of the immediate context.

        I think the solution might be to train a LoRa overlay on the fly against the weights and run inference with that AND the unmodified weights and then have an overseer model self evaluate and recompose the raw outputs.

        Like humans are way better at answering stuff when it’s a collaboration of more than one person. I suspect the same is true of LLMs.

        • nednobbins@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          15 hours ago

          Like humans are way better at answering stuff when it’s a collaboration of more than one person. I suspect the same is true of LLMs.

          It is.

          It’s really common for non-language implementations of neural networks. If you have an NN that’s right some percentage of the time, you can often run it through a bunch of copies of the NNs and take the average and that average is correct a higher percentage of the time.

          Aider is an open source AI coding assistant that lets you use one model to plan the coding and a second one to do the actual coding. It works better than doing it in a single pass, even if you assign the the same model to planing and coding.

    • Lovable Sidekick@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      16 hours ago

      In this case it’s not even bad prompts, it’s a problem domain ChatGPT wasn’t designed to be good at. It’s like saying modern medicine is clearly bullshit because a doctor loses a basketball game.

  • Halosheep@lemm.ee
    link
    fedilink
    English
    arrow-up
    42
    arrow-down
    8
    ·
    22 hours ago

    I swear every single article critical of current LLMs is like, “The square got BLASTED by the triangle shape when it completely FAILED to go through the triangle shaped hole.”

    • drspod@lemmy.ml
      link
      fedilink
      English
      arrow-up
      32
      arrow-down
      1
      ·
      21 hours ago

      It’s newsworthy when the sellers of squares are saying that nobody will ever need a triangle again, and the shape-sector of the stock market is hysterically pumping money into companies that make or use squares.

      • inconel@lemmy.ca
        link
        fedilink
        English
        arrow-up
        15
        ·
        21 hours ago

        It’s also from a company claiming they’re getting closer to create morphing shape that can match any hole.

  • finitebanjo@lemmy.world
    link
    fedilink
    English
    arrow-up
    12
    ·
    19 hours ago

    All these comments asking “why don’t they just have chatgpt go and look up the correct answer”.

    That’s not how it works, you buffoons, it trains off of datasets long before it releases. It doesn’t think. It doesn’t learn after release, it won’t remember things you try to teach it.

    Really lowering my faith in humanity when even the AI skeptics don’t understand that it generates statistical representations of an answer based on answers given in the past.

    • PushButton@lemmy.world
      link
      fedilink
      English
      arrow-up
      16
      ·
      21 hours ago

      And yet everybody is selling to write code.

      The last time I checked, coding was requiring logic.

      • jj4211@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        19 hours ago

        To be fair, a decent chunk of coding is stupid boilerplate/minutia that varies environment to environment, language to language, library to library.

        So LLM can do some code completion, filling out a bunch of boilerplate that is blatantly obvious, generating the redundant text mandated by certain patterns, and keeping straight details between languages like “does this language want join as a method on a list with a string argument, or vice versa?”

        Problem is this can be sometimes more annoying than it’s worth, as miscompletions are annoying.

        • PushButton@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          18 hours ago

          Fair point.

          I liked the “upgraded autocompletion”, you know, an completion based on the context, just before the time that they pushed it too much with 20 lines of non sense…

          Now I am thinking of a way of doing the thing, then I receive a 20 lines suggestion.

          So I am checking if that make sense, losing my momentum, only to realize the suggestion us calling shit that don’t exist…

          Screw that.

          • merdaverse@lemm.ee
            link
            fedilink
            English
            arrow-up
            2
            ·
            17 hours ago

            The amount of garbage it spits out in autocomplete is distracting. If it’s constantly making me 5-10% less productive the many times it’s wrong, it should save me a lot of time when it is right, and generally, I haven’t found it able to do that.

            Yesterday I tried to prompt it to change around 20 call sites for a function where I had changed the signature. Easy, boring and repetitive, something that a junior could easily do. And all the models were absolutely clueless about it (using copilot)

  • arc99@lemmy.world
    link
    fedilink
    English
    arrow-up
    19
    arrow-down
    2
    ·
    22 hours ago

    Hardly surprising. Llms aren’t -thinking- they’re just shitting out the next token for any given input of tokens.

  • AlecSadler@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    54
    arrow-down
    6
    ·
    1 day ago

    ChatGPT has been, hands down, the worst AI coding assistant I’ve ever used.

    It regularly suggests code that doesn’t compile or isn’t even for the language.

    It generally suggests AC of code that is just a copy of the lines I just wrote.

    Sometimes it likes to suggest setting the same property like 5 times.

    It is absolute garbage and I do not recommend it to anyone.

    • arc99@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      22 hours ago

      All AIs are the same. They’re just scraping content from GitHub, stackoverflow etc with a bunch of guardrails slapped on to spew out sentences that conform to their training data but there is no intelligence. They’re super handy for basic code snippets but anyone using them anything remotely complex or nuanced will regret it.

      • NateNate60@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        17 hours ago

        One of my mates generated an entire website using Gemini. It was a React web app that tracks inventory for trading card dealers. It actually did come out functional and well-polished. That being said, the AI really struggled with several aspects of the project that humans would not:

        • It left database secrets in the code
        • The design of the website meant that it was impossible to operate securely
        • The quality of the code itself was hot garbage—unreadable and undocumented nonsense that somehow still worked
        • It did not break the code into multiple files. It piled everything into a single file
      • AlecSadler@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        17 hours ago

        I’ve used agents for implementing entire APIs and front-ends from the ground up with my own customizations and nuances.

        I will say that, for my pedantic needs, it typically only gets about 80-90% of the way there so I still have to put fingers to code, but it definitely saves a boat load of time in those instances.

    • ILikeBoobies@lemmy.ca
      link
      fedilink
      English
      arrow-up
      2
      ·
      21 hours ago

      I’ve had success with splitting a function into 2 and planning out an overview, though that’s more like talking to myself

      I wouldn’t use it to generate stuff though

    • Mobiuthuselah@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 day ago

      I don’t use it for coding. I use it sparingly really, but want to learn to use it more efficiently. Are there any areas in which you think it excels? Are there others that you’d recommend instead?

  • FMT99@lemmy.world
    link
    fedilink
    English
    arrow-up
    216
    arrow-down
    17
    ·
    1 day ago

    Did the author thinks ChatGPT is in fact an AGI? It’s a chatbot. Why would it be good at chess? It’s like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.

      • FMT99@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        17 hours ago

        Hey I didn’t say anywhere that corporations don’t lie to promote their product did I?

    • snooggums@lemmy.world
      link
      fedilink
      English
      arrow-up
      181
      arrow-down
      6
      ·
      1 day ago

      AI including ChatGPT is being marketed as super awesome at everything, which is why that and similar AI is being forced into absolutely everything and being sold as a replacement for people.

      Something marketed as AGI should be treated as AGI when proving it isn’t AGI.

      • pelespirit@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        3
        ·
        1 day ago

        Not to help the AI companies, but why don’t they program them to look up math programs and outsource chess to other programs when they’re asked for that stuff? It’s obvious they’re shit at it, why do they answer anyway? It’s because they’re programmed by know-it-all programmers, isn’t it.

        • ImplyingImplications@lemmy.ca
          link
          fedilink
          English
          arrow-up
          17
          ·
          1 day ago

          why don’t they program them

          AI models aren’t programmed traditionally. They’re generated by machine learning. Essentially the model is given test prompts and then given a rating on its answer. The model’s calculations will be adjusted so that its answer to the test prompt will be closer to the expected answer. You repeat this a few billion times with a few billion prompts and you will have generated a model that scores very high on all test prompts.

          Then someone asks it how many R’s are in strawberry and it gets the wrong answer. The only way to fix this is to add that as a test prompt and redo the machine learning process which takes an enormous amount of time and computational power each time it’s done, only for people to once again quickly find some kind of prompt it doesn’t answer well.

          There are already AI models that play chess incredibly well. Using machine learning to solve a complexe problem isn’t the issue. It’s trying to get one model to be good at absolutely everything.

        • CileTheSane@lemmy.ca
          link
          fedilink
          English
          arrow-up
          2
          ·
          19 hours ago

          why don’t they program them to look up math programs and outsource chess to other programs when they’re asked for that stuff?

          Because the AI doesn’t know what it’s being asked, it’s just a algorithm guessing what the next word in a reply is. It has no understanding of what the words mean.

          “Why doesn’t the man in the Chinese room just use a calculator for math questions?”

        • rebelsimile@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          25
          arrow-down
          2
          ·
          1 day ago

          Because they’re fucking terrible at designing tools to solve problems, they are obviously less and less good at pretending this is an omnitool that can do everything with perfect coherency (and if it isn’t working right it’s because you’re not believing or paying hard enough)

          • MrJgyFly@lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            ·
            1 day ago

            Or they keep telling you that you just have to wait it out. It’s going to get better and better!

        • NoiseColor @lemmy.world
          link
          fedilink
          English
          arrow-up
          7
          arrow-down
          1
          ·
          1 day ago

          …or a simple counter to count the r in strawberry. Because that’s more difficult than one might think and they are starting to do this now.

        • four@lemmy.zip
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 day ago

          I think they’re trying to do that. But AI can still fail at that lol

        • MajorasMaskForever@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 day ago

          From a technology standpoint, nothing is stopping them. From a business standpoint: hubris.

          To put time and effort into creating traditional logic based algorithms to compensate for this generic math model would be to admit what mathematicians and scientists have known for centuries. That models are good at finding patterns but they do not explain why a relationship exists (if it exists at all). The technology is fundamentally flawed for the use cases that OpenAI is trying to claim it can be used in, and programming around it would be to acknowledge that.

      • NoiseColor @lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        arrow-down
        25
        ·
        1 day ago

        I don’t think ai is being marketed as awesome at everything. It’s got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It’s a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.

        • vinnymac@lemmy.world
          link
          fedilink
          English
          arrow-up
          27
          arrow-down
          2
          ·
          1 day ago

          What the tech is being marketed as and what it’s capable of are not the same, and likely never will be. In fact all things are very rarely marketed how they truly behave, intentionally.

          Everyone is still trying to figure out what these Large Reasoning Models and Large Language Models are even capable of; Apple, one of the largest companies in the world just released a white paper this past week describing the “illusion of reasoning”. If it takes a scientific paper to understand what these models are and are not capable of, I assure you they’ll be selling snake oil for years after we fully understand every nuance of their capabilities.

          TL;DR Rich folks want them to be everything, so they’ll be sold as capable of everything until we repeatedly refute they are able to do so.

          • NoiseColor @lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            7
            ·
            1 day ago

            I think in many cases people intentionally or unintentionally disregard the time component here. Ai is in development. I think what is being marketed here, just like in the stock market, is a piece of the future. I don’t expect the models I use to be perfect and not make mistakes, so I use them accordingly. They are useful for what I use them for and I wouldn’t use them for chess. I don’t expect that laundry detergent to be just as perfect in the commercial either.

        • BassTurd@lemmy.world
          link
          fedilink
          English
          arrow-up
          16
          arrow-down
          2
          ·
          1 day ago

          Marketing does not mean functionality. AI is absolutely being sold to the public and enterprises as something that can solve everything. Obviously it can’t, but it’s being sold that way. I would bet the average person would be surprised by this headline solely on what they’ve heard about the capabilities of AI.

          • NoiseColor @lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            15
            ·
            1 day ago

            I don’t think anyone is so stupid to believe current ai can solve everything.

            And honestly, I didn’t see any marketing material that would claim that.

            • BassTurd@lemmy.world
              link
              fedilink
              English
              arrow-up
              14
              arrow-down
              2
              ·
              1 day ago

              You are both completely over estimating the intelligence level of “anyone” and not living in the same AI marketed universe as the rest of us. People are stupid. Really stupid.

              • NoiseColor @lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 day ago

                I don’t understand why this is so important, marketing is all about exaggerating, why expect something different here.

                • BassTurd@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  ·
                  1 day ago

                  It’s not important. You said AI isn’t being marketed to be able to do everything. I said yes it is. That’s it.

            • petrol_sniff_king@lemmy.blahaj.zone
              link
              fedilink
              English
              arrow-up
              2
              arrow-down
              1
              ·
              1 day ago

              The Zoom CEO, that is the video calling software, wanted to train AIs on your work emails and chat messages to create AI personalities you could send to the meetings you’re paid to sit through while you drink Corona on the beach and receive a “summary” later.

              The Zoom CEO, that is the video calling software, seems like a pretty stupid guy?

              Yeah. Yeah, he really does. Really… fuckin’… dumb.

              • jubilationtcornpone@sh.itjust.works
                link
                fedilink
                English
                arrow-up
                7
                ·
                1 day ago

                Same genius who forced all his own employees back into the office. An incomprehensibly stupid maneuver by an organization that literally owes its success to people working from home.

        • 4am@lemm.ee
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          2
          ·
          1 day ago

          Really then why are they cramming AI into every app and every device and replacing jobs with it and claiming they’re saving so much time and money and they’re the best now the hardest working most efficient company and this is the future and they have a director of AI vision that’s right a director of AI vision a true visionary to lead us into the promised land where we will make money automatically please bro just let this be the automatic money cheat oh god I’m about to

          • NoiseColor @lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            arrow-down
            4
            ·
            1 day ago

            Those are two different things.

            1. they are craming ai everywhere because nobody wants to miss the boat and because it plays well in the stock market.

            2. the people claiming it’s awesome and that they are doing I don’t know what with it, replacing people are mostly influencers and a few deluded people.

            Ai can help people in many different roles today, so it makes sense to use it. Even in roles that is not particularly useful, it makes sense to prepare for when it is.

    • iAvicenna@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      1 day ago

      well so much hype has been generated around chatgpt being close to AGI that now it makes sense to ask questions like “can chatgpt prove the Riemann hypothesis”

    • suburban_hillbilly@lemmy.ml
      link
      fedilink
      English
      arrow-up
      22
      ·
      1 day ago

      Most people do. It’s just called AI in the media everywhere and marketing works. I think online folks forget that something as simple as getting a Lemmy account by yourself puts you into the top quintile of tech literacy.

    • Broken@lemmy.ml
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      2
      ·
      1 day ago

      I agree with your general statement, but in theory since all ChatGPT does is regurgitate information back and a lot of chess is memorization of historical games and types, it might actually perform well. No, it can’t think, but it can remember everything so at some point that might tip the results in it’s favor.

      • FMT99@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        22 hours ago

        I mean it may be possible but the complexity would be so many orders of magnitude greater. It’d be like learning chess by just memorizing all the moves great players made but without any context or understanding of the underlying strategy.

      • Eagle0110@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        2
        ·
        edit-2
        1 day ago

        Regurgitating an impression of, not regurgitating verbatim, that’s the problem here.

        Chess is 100% deterministic, so it falls flat.

        • Raltoid@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          1 day ago

          I’m guessing it’s not even hard to get it to “confidently” violate the rules.

    • TowardsTheFuture@lemmy.zip
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      1 day ago

      I think that’s generally the point is most people thing chat GPT is this sentient thing that knows everything and… no.

      • NoiseColor @lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 day ago

        Do they though? No one I talked to, not my coworkers that use it for work, not my friends, not my 72 year old mother think they are sentient.

        • TowardsTheFuture@lemmy.zip
          link
          fedilink
          English
          arrow-up
          1
          ·
          19 hours ago

          Okay I maybe exaggerated a bit, but a lot of people think it actually knows things, or is actually smart. Which… it’s not… at all. It’s just pattern recognition. Which was I assume the point of showing it can’t even beat the goddamn Atari because it cannot think or reason, it’s all just copy pasta and pattern recognition.

    • adhdplantdev@lemm.ee
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      2
      ·
      1 day ago

      Articles like this are good because it exposes the flaws with the ai and that it can’t be trusted with complex multi step tasks.

      Helps people see that think AI is close to a human that its not and its missing critical functionality

      • FMT99@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        22 hours ago

        The problem is though that this perpetuates the idea that ChatGPT is actually an AI.

        • adhdplantdev@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          18 hours ago

          People already think chatGPT is a general AI. We need more articles like this showing is ineffectiveness at being intelligent. Besides it helps find a limitations of this technology so that we can hopefully use it to argue against every single place

    • x00z@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 day ago

      In all fairness. Machine learning in chess engines is actually pretty strong.

      AlphaZero was developed by the artificial intelligence and research company DeepMind, which was acquired by Google. It is a computer program that reached a virtually unthinkable level of play using only reinforcement learning and self-play in order to train its neural networks. In other words, it was only given the rules of the game and then played against itself many millions of times (44 million games in the first nine hours, according to DeepMind).

      https://www.chess.com/terms/alphazero-chess-engine

      • jeeva@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        20 hours ago

        Sure, but machine learning like that is very different to how LLMs are trained and their output.

      • FMT99@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        22 hours ago

        Oh absolutely you can apply machine learning to game strategy. But you can’t expect a generalized chatbot to do well at strategic decision making for a specific game.

    • saltesc@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      2
      ·
      1 day ago

      I like referring to LLMs as VI (Virtual Intelligence from Mass Effect) since they merely give the impression of intelligence but are little more than search engines. In the end all one is doing is displaying expected results based on a popularity algorithm. However they do this inconsistently due to bad data in and limited caching.

  • OBJECTION!@lemmy.ml
    link
    fedilink
    English
    arrow-up
    50
    arrow-down
    2
    ·
    1 day ago

    Tbf, the article should probably mention the fact that machine learning programs designed to play chess blow everything else out of the water.

    • andallthat@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      ·
      edit-2
      20 hours ago

      Machine learning has existed for many years, now. The issue is with these funding-hungry new companies taking their LLMs, repackaging them as “AI” and attributing every ML win ever to “AI”.

      ML programs designed and trained specifically to identify tumors in medical imaging have become good diagnostic tools. But if you read in news that “AI helps cure cancer”, it makes it sound like it was a lone researcher who spent a few minutes engineering the right prompt for Copilot.

      Yes a specifically-designed and finely tuned ML program can now beat the best human chess player, but calling it “AI” and bundling it together with the latest Gemini or Claude iteration’s “reasoning capabilities” is intentionally misleading. That’s why articles like this one are needed. ML is a useful tool but far from the “super-human general intelligence” that is meant to replace half of human workers by the power of wishful prompting

    • Zenith@lemm.ee
      link
      fedilink
      English
      arrow-up
      10
      ·
      1 day ago

      I forgot which airline it is but one of the onboard games in the back of a headrest TV was a game called “Beginners Chess” which was notoriously difficult to beat so it was tested against other chess engines and it ranked in like the top five most powerful chess engines ever

  • Furbag@lemmy.world
    link
    fedilink
    English
    arrow-up
    21
    ·
    1 day ago

    Can ChatGPT actually play chess now? Last I checked, it couldn’t remember more than 5 moves of history so it wouldn’t be able to see the true board state and would make illegal moves, take it’s own pieces, materialize pieces out of thin air, etc.

    • skisnow@lemmy.ca
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      1 day ago

      It can’t, but that didn’t stop a bunch of gushing articles a while back about how it had an ELO of 2400 and other such nonsense. Turns out you could get it to have an ELO of 2400 under a very very specific set of circumstances, that include correcting it every time it hallucinated pieces or attempted to make illegal moves.

  • floofloof@lemmy.ca
    link
    fedilink
    English
    arrow-up
    38
    arrow-down
    4
    ·
    edit-2
    1 day ago

    I suppose it’s an interesting experiment, but it’s not that surprising that a word prediction machine can’t play chess.

    • otp@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      15
      arrow-down
      9
      ·
      1 day ago

      Because people want to feel superior because they don’t know how to use a ChatBot can count the number of "r"s in the word “strawberry”, lol

      • electricyarn@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        arrow-down
        2
        ·
        1 day ago

        Yeah, just because I can’t count the number of r’s in the word strawberry doesn’t mean I shouldn’t be put in charge of the US nuclear arsenal!

        • otp@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          1
          ·
          1 day ago

          That is more a failure of the person who made that decision than a failing of ChatBots, lol

          • wewbull@feddit.uk
            link
            fedilink
            English
            arrow-up
            2
            ·
            22 hours ago

            Agreed, which is why it’s important to have articles out in the wild that show the shortcomings of AI. If all people read is all the positive crap coming out of companies like OpenAI then they will make stupid decisions.