Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

@[email protected] · 21 hours ago

Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

@[email protected] · edit-2 20 hours ago

Fuck it, I use local LLMs enough, will give this a crack.

Edit: it’s doing 6 paragraphs in 8.2 seconds, the last model I used was doing like 1 paragraph in 12 seconds. Crazy fast in my experience.

yeehaw · 20 hours ago

How are they to run, how useful are they, and any you can recommend?

@[email protected] · 20 hours ago

Dead simple to run, I use Ollama to run local models and it’s like 3 words to setup from the command line.

Useful is entirely relative. I use mine personally and somewhat professionally, but I only use it to draft text and manually alter it. AI is amazing, but it’s also crap. You gotta work it a bit.

Umm this model from what I can see, I’m using the 8b model and it’s fast to generate, time will tell how good the quality is but I’m impressed after a few minutes play.

chiisana · 16 hours ago

8B parameter tag is the distilled llama 3.1 model, which should be great for general writing. 7B is distilled qwen 2.5 math, and 14B is distilled qwen 2.5 (general purpose but good at coding). They have the entire table called out on their huggingface page, which is handy to know which one to use for specific purposes.

The full model is 671B and unfortunately not going to work on most consumer hardwares, so it is still tethered to the cloud for most people.

Also, it being a made in China model, there are some degree of censorship mandated. So depending on use case, this may be a point of consideration, too.

Overall, it’s super cool to see something at this level to be generally available, especially with all the technical details out in the open. Hopefully we’ll see more models with this level of capability become available so there are even more choices and competition.

@[email protected] · 16 hours ago

Also, the release of R1 under the MIT license means that in principle anyone can use R1 to generate synthetic training sets for improving other (non-reasoning) models. This may be a real game changer.

The one fly in the ointment is that Deepseek didn’t deign to share details of their synthetic data generation procedure. But they are already way more transparent than any other non-academic AI lab, so it’s hard to get mad at them over this.

elgordino · 18 hours ago

If you want a really simple way to run a variety of local models with a nice UI take a look at https://jan.ai/

@[email protected] · 16 hours ago

so what of its reasoning? can it deduce? can it follow specific logic/equations in mathematical notation or in plain language?

@[email protected] · 16 hours ago

Try it out for yourself: https://chat.deepseek.com/

It can understand LaTeX as well as outputting it. In my limited testing on sample physics problems, it performs pretty well. It also scored 100% on the 2023 A Level maths exam.

@[email protected] · 15 hours ago

interesting, so i guess it can answer questions from any exam

19 hours ago

Does it deny Tiananmen square?

@[email protected] · edit-2 17 hours ago

Using the 7bn parameter variant:

@[email protected] · 16 hours ago

I’m not sure if this is funny or just sad.

12 hours ago

Both

12 hours ago

Hahah fuck that’s the funniest most depressing thing ever. Please repost this image I recon in would be a good post.

@[email protected] · edit-2 16 hours ago

It’s MIT licensed, so anyone is free to go about decensoring it. There are already “abliterated” (decensored) variants uploaded to huggingface, at least for the distilled models.

This procedure also decensors stuff that western models routinely censor. So ironically these Chinese open source models are giving us the most free speech friendly LLMs around.

13 hours ago

I use a dolphin fine tuned meta llama model myself but I will have to compare it to this one.

@[email protected] · 12 hours ago

Have you tried a Tuna Tuned Obama Llama instead?

@[email protected] · 18 hours ago

Asked very plainly, it refuses to answer questions related to it, but it requires very little convincing to talk about it. Much softer censorship than most of the other available models.

blargbluuk · 15 hours ago

How did you convince it? Just curious