• macniel
      112 days ago

      Yeah, I don’t understand why they don’t have a Codeberg or similar that they host themselves.

      • @[email protected]
        42 days ago

        How would that help? If you release something as GPL code, you cannot prevent it from being used to train a model, no matter where it’s hosted.

        • @[email protected]
          32 days ago

          There’s a difference between handing something to someone and leaving it somewhere they happen to be able to take it from.

        • @[email protected]
          12 days ago

          I’m personally waiting for a massive lawsuit. Legally, companies cannot train AI on GPL code (at least I don’t believe so).

          • @[email protected]
            32 days ago

            There’s nothing in the GPL that would forbid it. Only distribution without publishing the source code is forbidden.

            • macniel
              22 days ago

              Mhm, and how would distribution inside an LLM work? Do the code snippets that Copilot et al. produce come with dedicated license sections?

              And regarding how self-hosting the code would help: it wouldn’t be on the GitHub servers owned by Microsoft, which owns/operates Copilot. Pushing it to their servers is akin to feeding the LLM directly.