Meta is reportedly scrambling ‘war rooms’ of engineers to figure out how DeepSeek’s AI is beating everyone else at a fraction of the price (fortune.com)
Posted by ☆ Yσɠƚԋσʂ ☆ to [email protected] • 1 month ago • 19 comments
melroy • 1 month ago
I see, OK. I only want to add that DeepSeek is neither the first nor the only model to use mixture-of-experts (MoE).
☆ Yσɠƚԋσʂ ☆ (OP) • 1 month ago
OK, but it is clearly the first one to use this approach to such effect.