• xylogx@lemmy.world
    link
    fedilink
    English
    arrow-up
    114
    arrow-down
    4
    ·
    10 days ago

    So you’re saying the ad driven internet will die? And we will be left with what? Wikipedia and Lemmy? I for one welcome our AI overlords!

    • venusaur@lemmy.world
      link
      fedilink
      English
      arrow-up
      39
      arrow-down
      2
      ·
      edit-2
      10 days ago

      Nah, it’s saying that ad and AI-driven internet will prevail. People only use Google to find an answer and don’t dig deeper, and if they do, it’s often because the links are sponsored. People using GPT’s are even less likely to click a link. Currently no ads, but just wait.

      Apologies if you were joking.

    • jonathan7luke@lemmy.ml
      link
      fedilink
      English
      arrow-up
      21
      ·
      edit-2
      9 days ago

      This is part of the larger problem that AI tools are trained on (and profit off of) content that is produced and hosted by others who are now seeing their traffic change from humans to bots. For content sources that pay for hosting with ads, this means a loss in revenue to pay for hosting. For content sources like Wikipedia, they are seeing their hosting costs increase significantly due to the increase in bot traffic. Even if you want every website that depends on ad revenue to fail (which I don’t entirety agree with), AI is still damaging the open web in other ways. Websites like Wikipedia for example may soon be forced to lock content behind logins or leverage aggressive captchas just to fight the bot traffic, which makes things worse for those of us that still prefer to use actual websites over AI summaries.

      • pinkapple@lemmy.ml
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        9 days ago

        Nobody is scraping wikipedia over and over to create datasets for AIs, there are already open datasets and API deals. But wiki in particular has always had a data dump of the entire db bimonthly.

        https://dumps.wikimedia.org/

        • TheOneCurly@lemm.ee
          link
          fedilink
          English
          arrow-up
          13
          arrow-down
          1
          ·
          9 days ago

          You clearly haven’t run a website recently. Until I set up anubis last week I was getting constant requests from dozens of various bot scrapers 24/7. That included the big ones.

          • pinkapple@lemmy.ml
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            7
            ·
            9 days ago

            Kay, and that has nothing to do with what i said. Scrapers, bots =/= AI. It’s not even the same companies that make the unfree datasets. The scrapers and bots that hit your website are not some random “AI” feeding on data lol. This is what some models are trained on, it’s already free so it’s doesn’t need to be individually rescraped and it’s mostly garbage quality data: https://commoncrawl.org/ Nobody wastes resources rescraping all this SEO infested dump.

            Your issue has everything to do with SEO than anything else. Btw before you diss common crawl, it’s used in research quite a lot so it’s not some evil thing that threatens people’s websites. Add robots.txt maybe.

            • TheOneCurly@lemm.ee
              link
              fedilink
              English
              arrow-up
              13
              arrow-down
              1
              ·
              9 days ago

              Oh ok I’ll just ignore the constant requests from GPTBot, ByteSpider, and the hundreds of others who very plainly, sometimes in their useragent, tell you that they’re grabbing content for training data. Robots.txt is nice and all but manually adding every single up and coming AI company is impossible. Like I said Anubis is the first time I’ve gotten them all to even remotely calm down.

              • pinkapple@lemmy.ml
                link
                fedilink
                English
                arrow-up
                1
                ·
                7 days ago

                Bots only identify themselves and their organization in the user agent, they don’t tell you specifically what they do with the data so stop your fairytales. They do give you a really handy url though with user agents and even IPs jn json if you want to fully block the crawlers but not the search bots sent by user prompts.

                Your ad revenue money can be secured.

                https://platform.openai.com/docs/bots/

                If for some reason you can’t be bothered to edit your own robots.txt (because it’s hard to tell which bots are search bots for muh ad money) then maybe hire someone.

                • TheOneCurly@lemm.ee
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  7 days ago

                  Lmao you linked to the same page I did where this text appears:

                  GPTBot is used to make our generative AI foundation models more useful and safe. It is used to crawl content that may be used in training our generative AI foundation models.

                  Also you’re so capitalism brained you assume anyone running a website must be doing so for profit. My hobby projects (personal homepage and personal git forge) were getting slammed by bots while I just paid the bills. I could have locked them both behind an auth portal but then I might as well just take them off the internet and run everything on my LAN.

        • jonathan7luke@lemmy.ml
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          9 days ago

          But with the rise of AI, the dynamic is changing: We are observing a significant increase in request volume, with most of this traffic being driven by scraping bots collecting training data for large language models (LLMs) and other use cases. Automated requests for our content have grown exponentially, alongside the broader technology economy, via mechanisms including scraping, APIs, and bulk downloads. This expansion happened largely without sufficient attribution, which is key to drive new users to participate in the movement, and is causing a significant load on the underlying infrastructure that keeps our sites available for everyone.

          - https://diff.wikimedia.org/2025/04/01/how-crawlers-impact-the-operations-of-the-wikimedia-projects/

          • pinkapple@lemmy.ml
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            7 days ago

            via mechanisms including scraping, APIs, and bulk downloads.

            Omg exactly! Thanks. Yet nothing about having to use logins to stop bots because that kinda isn’t a thing when you already provide data dumps and an API to wikimedia commons.

            While undergoing a migration of our systems, we noticed that only a fraction of the expensive traffic hitting our core datacenters was behaving how web browsers would usually do, interpreting javascript code. When we took a closer look, we found out that at least 65% of this resource-consuming traffic we get for the website is coming from bots, a disproportionate amount given the overall pageviews from bots are about 35% of the total.

            Source for traffic being scraping data for training models: they’re blocking javascript therefore bots therefore crawlers, just trust me bro.

  • 𝕸𝖔𝖘𝖘@infosec.pub
    link
    fedilink
    English
    arrow-up
    50
    ·
    9 days ago

    Don’t take this the wrong way, but fuck your business model. The internet was supposed to be open and be ours, and you stole it for profit.

    • aidan@lemmy.world
      link
      fedilink
      English
      arrow-up
      26
      arrow-down
      3
      ·
      9 days ago

      To be honest: you can still make your own website, and in many ways big companies are actually making it easier through open-source projects and stuff like Let’s Encrypt. The web industry is remarkably open compared to what big companies do in other industries. A lot of the standards meetings and stuff you can just go to and give your opinion. Or ignore the standards and fork it yourself. This alarmism I fear will make people not take the actually alarming things like encryption bans or ID requirements seriously.

      • T156@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        ·
        9 days ago

        Only for some things, though. If you host your own e-mail these days, chances are, you’re going to have a very difficult time sending them anywhere without risking them being deleted, or automatically thrown into spam folders.

  • pinball_wizard@lemmy.zip
    link
    fedilink
    English
    arrow-up
    35
    ·
    9 days ago

    Letting Google break the law for years with illegal anti-competitive practices is now hurting everyone else’s ability to earn money.

    I wonder if we have the combined will to do anything about it, or if we will wait and hope the invisible hand of the market will fix it…

    • InternetCitizen2@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      8 days ago

      if we will wait and hope the invisible hand of the market will fix it…

      Have we lost faith in our handsome businessman? /s

    • muusemuuse@lemm.ee
      link
      fedilink
      English
      arrow-up
      37
      arrow-down
      1
      ·
      9 days ago

      That’s not what will happen. We will have to pay AND be tracked. They are not going to give anything up.

    • Lovable Sidekick@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      ·
      edit-2
      9 days ago

      When Orwell predicted universal surveillance he never anticipated that the people themselves would install the cameras, let alone pay a subscription.

    • CalipherJones@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      9 days ago

      This sounds more like “everyone is on TikTok and Instagram and will only ever be using TikTok and Instagram”.

    • cardfire@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      9 days ago

      The internet was founded on the sponsorship model where content was free and ads were ubiquitous. while I completely agree with you that I would rather pay for the product instead of being the product, at this informs every single sign up I make on the internet, I think it’s self deluding to think there’s any great again to go back to. The philosophy was always there, the execution just wasn’t possible until they had finished building their walled gardens

  • db2@lemmy.world
    link
    fedilink
    English
    arrow-up
    46
    arrow-down
    18
    ·
    10 days ago

    The web doesn’t have a business model, cloudflair, you do. And nobody cares because you suck.

    • sugar_in_your_tea@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      63
      arrow-down
      4
      ·
      10 days ago

      Eh, Cloudflare provides a pretty good service for a very reasonable price.

      But yeah, the web doesn’t have a business model in the same way a town square doesn’t, yet you can make a business work in both areas. Make a compelling product and people will pay you for it.

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        3
        ·
        9 days ago

        You mean product that literally makes web unusable for many and tracks your every single step with extremely invasive fingerprinting techniques? That product?

        • sugar_in_your_tea@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          2
          ·
          9 days ago

          That’s a big reason why I don’t use their security layer, mostly just their domain registrar. They have a ton of products that don’t involve tracking your users.

      • ThirdConsul@lemmy.ml
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        6
        ·
        9 days ago

        Cloudflare provides a pretty good service for a very reasonable price.

        You mean selling fingerprinted user data to advertisers?

  • devfuuu@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    10 days ago

    It needs to get even nastier so that it affects all the big players in a huge way so they get to do something about it. While it only affects the indie web we are all just gonna keep suffering.

  • morrowind@lemmy.ml
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    1
    ·
    10 days ago

    Yeah I think we’re going to be grappling with this issue for at least the next decade. The traditional web model falls apart under AI

    • thejml@lemm.ee
      link
      fedilink
      English
      arrow-up
      27
      arrow-down
      1
      ·
      10 days ago

      To be fair, the traditional web models were falling apart prior to AI as well. We’ve gone so far past “ad driven” that Everything has to be full of ads and clickbait to drive revenue just to run the infrastructure, let alone pay for the pages creation and upkeep. Journalists and developers, services and goods are all using adword soup to try to get anything close to a useful revenue stream and it’ll just keep getting worse until we figure out a better business model. We’re going to increasingly see paywalls to try to make up for that, but a large part of people on the internet won’t want to spend money on quality sources when they use to be able to get it for free. It’s been a race to the bottom for a while and it’s at a point that isn’t sustainable long term. AI just accelerates that to the next level.

      • feannag@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        4
        ·
        9 days ago

        What’s challenging about paywalls and not wanting to spend money is not necessarily not wanting to spend, but convenience and cost. If it costs me 10 cents for each blog or tutorial or github page I look at while working on a project, or 1 cent for every funny video, that adds up. And do I have to put my credit card in for every site? Hope that every site has good enough security to prevent payment information leaks?

        And I don’t think anyone is interested in a Netflix-style internet that fractures into 6 different subscriptions to get every site you need on the web.

        • morrowind@lemmy.ml
          link
          fedilink
          English
          arrow-up
          2
          ·
          9 days ago

          Some sort of universal microtransaction layer is the dream. I believe there’s also a proposed web standard for it.

          Scroll was also making it work before they got bought by Twitter

    • doodledup@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      9 days ago

      The traditional web was long gone anyways. There are like a dozent sites you find for any Google query. It’s so hard to find small hidden treasure on the internet.

  • AllHailTheSheep@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    13
    ·
    9 days ago

    maybe their business model. trust me. they’ll find a way to monetize the zero click internet too. then it’s back to square one

    • e461h@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      8 days ago

      I believe this is why tech execs and investors are so hot on pushing AI into everything. They’ll control everyone’s digital experience and you can 100% count on being force fed ads and paid propaganda. Embrace, extend, extinguish

      • futatorius@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        6 days ago

        Yeah, so much for all those promises of disintermediation being a benefit of the web.

      • gradual@lemmings.world
        link
        fedilink
        English
        arrow-up
        5
        ·
        8 days ago

        Yep. They have direct control over the flow of information.

        Honestly, Metal Gear Solid 2 was on fucking point.

        And so was 4.

        • conditional_soup@lemm.ee
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          8 days ago

          I tried playing 2 again recently because I had the same thought, and I had to stop because my wife would not stop laughing at Rose’s dialogue. God, I wish Kojima had ever met a woman.

          • gradual@lemmings.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            8 days ago

            What parts of Rose’s dialogue made her laugh so much?

            I could tell that it was commentary on the lonely and reserved lives that was stereotypical of gamers in the early to mid 2000s. So much of that game was meant to be directed towards the player, I wonder if it fell on deaf ears for her because she’s not really the target audience.

          • okmko@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            edit-2
            8 days ago

            And she’s 100% justified. The older you get the more appearant it becomes that he’s bad at writing dialogue and story. He’s a tendency of using controvencies to create drama and it often falls flat, if not into eye-roll territory.

            I could not stop cringing during Death Stranding. I had to fast forward the ending. I imagine Margaret Qualley being completely bewildered when they were capturing her character’s twin soul melding scene.

            • conditional_soup@lemm.ee
              link
              fedilink
              English
              arrow-up
              2
              ·
              8 days ago

              I don’t necessarily think he’s bad at writing dialogue and story, I think he’s mostly just bad at writing women. As I’ve gotten older, I went from taking Metal Gear Solid super seriously to treating it like nuclear/techno Evil Dead

              • okmko@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                edit-2
                8 days ago

                Yeah, we’re just less experienced and have fewer expectations when we’re young. We were much more impressionable then.

                Guys in general are bad at portraying women, as I understand. That’s on top of being bad in general for Kojima, I think. There’s a funny interview with the MGS2 English translator Agness Kaku where she comments that the writing at times is high school fanfic level.

  • ssfckdt@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    1
    ·
    9 days ago

    Can someone check in with the inventor of the web and ask him what the web’s business model is?