• MangoCats@feddit.it
    English
    arrow-up
    3
    ·
    24 hours ago

    no one was thinking about using it for data back then

    Everyone with any foresight whatsoever has been thinking about using every source of data since the Babylonians were taking censuses 6000 years ago.

      • MangoCats@feddit.it
        English
        arrow-up
        1
        ·
        21 hours ago

        Before LLMs there were all manner of systems “trained on data”, going back to the “expert systems” of the 1990s and earlier.

        Having direct access to all the code definitely gave Microsoft business data about which languages were most popular, how they were being used, and by whom.

        • ExLisper@lemmy.curiana.net
          English
          arrow-up
          1
          arrow-down
          1
          ·
          20 hours ago

          And you think MS dropped $7.5B to get the data Stack Overflow publishes every year for free?

          Of course, owning data from the most popular development platform was useful to them, but they didn’t buy it to get data to train “expert systems” or LLMs. They wanted direct contact with huge numbers of developers so they could sell them their products.