Article: https://proton.me/blog/deepseek
Calls it “Deepsneak”, failing to make it clear that the reason people love Deepseek is that you can download and it run it securely on any of your own private devices or servers - unlike most of the competing SOTA AIs.
I can’t speak for Proton, but the last couple weeks are showing some very clear biases coming out.
How is this Open Source? The official repository https://github.com/deepseek-ai/DeepSeek-R1 contains images only, a PDF file, and links to download the model. I don’t see any code. What exactly is Open Source here? And if so, where to get the source code?
Open-Source in AI usually posted to HuggingFace instead of GitHub: https://huggingface.co/deepseek-ai/DeepSeek-R1
In deep learning generally open source doesn’t include actual training or inference code. Rather it means they publish the model weights and parameters (necessary to run it locally/on your own hardware) and publish academic papers explaining how the model was trained. I’m sure Stallman disagrees but from the standpoint of deep learning research DeepSeek definitely qualifies as an “open source model”
Just because they call it Open Source does not make it. DeepSeek is not Open Source, it only provides model weights and parameters, not any source code and training data. I still don’t know whats in the model and we only get “binary” data, not any source code. This is not Libre software.
There is a nice (even if by now already a bit outdated) analysis about the openness of different “open source” generative AI projects in the following article: Liesenfeld, Andreas, and Mark Dingemanse. “Rethinking open source generative AI: open washing and the EU AI Act.” The 2024 ACM Conference on Fairness, Accountability, and Transparency. 2024.
So “Open Source” to AI is just releasing a .psd file used to export a jpeg, and you need some other proprietary software like Photoshop in order to use it.
What other proprietary software is necessary to use model weights?