• @filister@lemmy.world
    link
    fedilink
    English
    581 month ago

    What is amazing in this case is that they achieved spending a fraction of the inference cost that OpenAI is paying.

    Plus they are a lot cheaper too. But I am pretty sure that the American government will ban them in no time, citing national security concerns, etc.

    Nevertheless, I think we need more open source models.

    Not to mention that NVIDIA also needs to be brought to earth.

    • @demesisx@infosec.pub
      link
      fedilink
      English
      221 month ago

      Even if they get banned, any startup could replicate their work if it is truly open source. The best thing about their solution is that it breaks the CUDA monopoly that NVDA has enjoyed. Buy your puts when NVDA bounces because that stock is GOING DOWN. There’s no world where a company that makes GPU’s is worth more than both Apple and Microsoft. It’s inevitable.

      • @toffi@feddit.org
        link
        fedilink
        English
        191 month ago

        Never forget kids the market can stay irrational much longer than you can stay solvent.

      • Eager Eagle
        link
        fedilink
        English
        31 month ago

        I wish that was true, but this doesn’t threaten any monopoly

        • @demesisx@infosec.pub
          link
          fedilink
          English
          12
          edit-2
          1 month ago

          It certainly does.

          Until last week, you absolutely NEEDED an NVidia GPU equipped with CUDA to run all AI models.

          Today, that is simply not true. (watch the video at the end of this comment)

          I watched this video and my initial reaction to this news was validated and then some: this video made me even more bearish on NVDA.

          Edit: corrected and redacted.

          • Eager Eagle
            link
            fedilink
            English
            6
            edit-2
            1 month ago

            mate, that means they are using PTX directly. If anything, they are more dependent to NVIDIA and the CUDA platform than anyone else.

            to simplify: they are bypassing the CUDA API, not the NVIDIA instruction set architecture and not CUDA as a platform.