Open-R1: A Fully Open Reproduction of DeepSeek-R1

Hey there! This article is an introduction to the project, not a claim that we’ve reproduced R1 yet. We’re building in the open, so as soon as we have evaluation numbers, we’ll share them. You can follow our progress on Hugging Face and GitHub.

True, but it looks like there’s nothing to be evaluated as of right now. I assume the ultimate objective is to train a new reasoning model and then use the same evaluation metrics as o1 and DeepSeek-R1.

Well, there should be at least some sanity check and validation to make sure the model was trained properly.

Oh yes, if you are talking about the evaluation numbers for DeepSeek’s model, they’re coming really soon!

As mentioned in the article, there is no model called Open-R1 to test at all … not yet anyhow. This is a blog post explaining that Hugging Face will take the DeepSeek-R1 model, work out how it was built from what is described in the paper and from what they released, and then replicate that process.

In reality this is basically how science works … A develops a technique, discovery or invention, and it is checked by B, C and D to see if it is reproducible. That’s been the foundation of research for a couple of centuries now.

This blog is not stating they have already done so … It’s a post outlining an intent to start training a model like R1 and calling it Open-R1.

Also, DeepSeek-R1 was only released recently, and even in their paper they laid out the compute hours needed. While those are low compute hours for a SOTA model, that doesn’t mean you can train said model in a week. I’d personally love to be able to train a transformer model in a week, but we may have to wait a while for that level of compute innovation.
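For a rough sense of what those compute hours mean in wall-clock time, here’s a back-of-the-envelope sketch; the GPU-hour and cluster-size figures are the ones reported in the DeepSeek-V3 technical report (R1’s own RL compute isn’t broken out like this), and the arithmetic is just mine:

```python
# Back-of-the-envelope: reported GPU hours -> wall-clock training time.
# Figures are from the DeepSeek-V3 tech report; treat them as ballpark only.
gpu_hours = 2_788_000    # total H800 GPU hours reported for V3
cluster_size = 2_048     # H800 GPUs in the reported training cluster

wall_clock_days = gpu_hours / cluster_size / 24
print(f"~{wall_clock_days:.0f} days on {cluster_size} GPUs")   # ~57 days

# The same budget on a hypothetical 8-GPU home rig:
years_on_8_gpus = gpu_hours / 8 / 24 / 365
print(f"~{years_on_8_gpus:.0f} years on 8 GPUs")               # ~40 years
```

Which is why “low for SOTA” and “trainable in a week at home” are very different claims.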

So there are no benchmarks for a model that has not been built yet, right? As outlined in the blog, and again in reply to your question.

But fear not, there is already a GitHub repo and contributors (hell, I may join myself), some prelim work done, and a plan of attack. A great starting position.

@edbeeching has evaluated the released models already

(src: https://x.com/edwardbeeching/status/1884273209136275742)

R1 just trained on o1 outputs, so it’s joint work … /s. This is what the new AI czars are saying.

Hi! This article is an introduction to the project, not a claim that we’ve reproduced R1 yet. We will absolutely share the missing pieces when we have them; you can expect the models and datasets to be uploaded in this Hugging Face org and the code to be in this GitHub repo.

That’s great, and it’s crucial for understanding this tremendous buzz, which lacks technical understanding and explanation. Science is about reproduction, and if they claim to be open, let them fulfill the open part.

Please do publish the training cost.

We will!

Hi @bojan2501, thanks, we will definitely be striving to make sure this training recipe can work for small language models on consumer hardware, since not everybody has a cluster of H100s at home :-) The tool we used for the images was Excalidraw! https://excalidraw.com

Looking forward to it!

should be a joke

It’s really cool to see how the entire open source community comes together!

Oops …

5.5M is the number reported in the DeepSeek-V3 tech report (just the training, not the experiments afaik); for R1 it’s hard to estimate tbh, but much less than 5.5M imo

Historically, they have never released code or datasets of their LLM training, so I wouldn’t expect this time to be different. If they did release it, that would be amazing of course!


The code for the models is inside the model repositories, e.g. for V3: https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/modeling_deepseek.py

Hello team, I’m Ray Bernard, the author and creator of EQUATOR. My research group will be working on a paper focused on reproducing certain parts of DeepSeek R1. Our goal is to recreate the cold start and supply your team with a dataset that includes CoT and other techniques to support these efforts. We’d like to contribute our work to help. Please let me know if you find this helpful. Best, Ray Bernard https://www.facebook.com/groups/1186310571520299/

Where are the evaluation numbers? Without them you can’t call it a reproduction.


That’s rather fascinating. I was asking myself why the questions the author raised here are not being asked by others. I believe the work they have done is remarkable, but at the same time I wonder why they wouldn’t publish these missing pieces if they are supposed to be completely open.
Why, even without reproduction and understanding of the development, could they impact the market so much in this way?


Interesting read, and it is great that we see more effort in this direction: more optimization and less brute force.
Also wondering what tool the author used for creating the step diagram.


Excalidraw

I’m so glad that efforts like this already exist, I’m gonna try to contribute :-)

Eagerly looking forward to it!

Such a racist article


WTF are you talking about?

Awesome to have this open reproduction started!

For Step #1 check out https://github.com/open-thoughts/open-thoughts!

https://x.com/ryanmart3n/status/1884284101265612856

Let’s do this thing!


Does anyone know the actual training cost of R1? I can’t find it in the paper or the announcement post. Is the 6M cost reported by media simply the number taken from V3’s training cost?
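For what it’s worth, the V3 tech report does contain enough to reconstruct the widely quoted figure; a quick sketch (the GPU-hour breakdown and the $2/GPU-hour rental assumption are from that report, and they cover V3’s final training run only, not R1 or any ablations):

```python
# Reconstructing the ~$5.6M training-cost figure for DeepSeek-V3.
# All numbers below are taken from the V3 tech report's cost table.
pre_training = 2_664_000   # H800 GPU hours, pre-training
context_ext = 119_000      # H800 GPU hours, context-length extension
post_training = 5_000      # H800 GPU hours, SFT + RL post-training
rate_usd = 2.0             # assumed rental price per H800 GPU hour

total_hours = pre_training + context_ext + post_training
cost_usd = total_hours * rate_usd
print(f"{total_hours:,} GPU hours -> ${cost_usd / 1e6:.3f}M")
# 2,788,000 GPU hours -> $5.576M
```

So if a ~6M figure is being attached to R1, it most likely comes from rounding this V3 number; as far as I can tell, R1’s own training cost isn’t published.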


Has anyone asked the DeepSeek team to release their training data and code, or at least share them privately with an independent replication project like this one? Have they declined such a request?

A faithful replication depends on using the same dataset and hyperparameters. Otherwise, any major discrepancies with the published benchmarks would be hard to pin down: due to training data differences, or to the replication method itself.


In the meantime we have to make best-guess estimates and see if we can get there ourselves.

You provide an excellent replication process for DeepSeek’s reasoning training. I will try something similar.

This is really great info. Can we fine-tune it for a specific use case when the code is available?


Yes of course!

Please consider removing biased, contaminated or unaligned training data, and make an effort to remove copyrighted works from the crawl. This will make the model more usable. If you reused Anthropic’s curation checks, this may also help; removing obviously biased data will likely add a lot of value. We don’t want another tainted, unaligned open source model, right? And no corporation would ever use DeepSeek or a model that reuses it, right?
We appreciate your work for the benefit of mankind, we hope.
Miike C from NJ


So basically you’re asking to replace existing censorship with another flavour of censorship?

Can’t wait! Hopefully the model will be uncensored, but whatever you can do is alright! Love seeing open source building itself up. I’m not smart enough to actually help, but I can contribute moral support lol

Hello guys, I am just trying to find the code for DeepSeek-V2, in order to fully understand multi-head latent attention. You don’t appear to have code on Hugging Face even for that. Or am I missing something? I don’t see anything in src/transformers/models. MLA is not properly explained in their paper, so it would be very important to have code for this.
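In the meantime, the core idea of MLA (cache one small shared latent per token and reconstruct per-head keys and values from it, instead of caching full K and V) can be sketched in a few lines of NumPy. This is a deliberately simplified toy with made-up dimensions and random weights; DeepSeek’s real MLA additionally uses a decoupled RoPE component and also compresses the queries, so treat this as the shape of the idea, not their implementation:

```python
import numpy as np

# Toy sketch of multi-head latent attention's KV compression.
# Dimensions are invented for illustration; nothing here matches DeepSeek's config.
rng = np.random.default_rng(0)
d_model, d_latent, n_heads, d_head, seq = 64, 16, 4, 16, 8

# Down-projection compresses each token's hidden state into a small latent;
# only this (seq, d_latent) matrix needs to be cached, not full K and V.
W_down = rng.normal(size=(d_model, d_latent)) / np.sqrt(d_model)
# Up-projections reconstruct per-head keys and values from the latent.
W_up_k = rng.normal(size=(d_latent, n_heads * d_head)) / np.sqrt(d_latent)
W_up_v = rng.normal(size=(d_latent, n_heads * d_head)) / np.sqrt(d_latent)
W_q = rng.normal(size=(d_model, n_heads * d_head)) / np.sqrt(d_model)

h = rng.normal(size=(seq, d_model))            # token hidden states
latent = h @ W_down                            # the KV-cache entry: (seq, d_latent)
k = (latent @ W_up_k).reshape(seq, n_heads, d_head)
v = (latent @ W_up_v).reshape(seq, n_heads, d_head)
q = (h @ W_q).reshape(seq, n_heads, d_head)

# Standard scaled dot-product attention per head, on the reconstructed K/V.
scores = np.einsum("qhd,khd->hqk", q, k) / np.sqrt(d_head)
weights = np.exp(scores - scores.max(-1, keepdims=True))
weights /= weights.sum(-1, keepdims=True)
out = np.einsum("hqk,khd->qhd", weights, v).reshape(seq, n_heads * d_head)

print(latent.shape, out.shape)  # (8, 16) (8, 64)
```

Here the per-token cache is d_latent = 16 numbers versus 2 * n_heads * d_head = 128 for a plain KV cache, which is where the memory saving comes from.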