767
OpenAI be like
(lemmy.zip)
Post memes here.
A meme is an idea, behavior, or style that spreads by means of imitation from person to person within a culture and often carries symbolic meaning representing a particular phenomenon or theme.
An Internet meme or meme, is a cultural item that is spread via the Internet, often through social media platforms. The name is by the concept of memes proposed by Richard Dawkins in 1972. Internet memes can take various forms, such as images, videos, GIFs, and various other viral sensations.
Laittakaa meemejä tänne.
Here's a list of open source models: open-llms
Models are only open source if the weights are freely available along with the code used to generate them.
I would argue to be truly open source the training data needs to be as well.
I really appreciate that! I was asking more for the information of it, I doubt I could do anything with the link. Lol. I don't understand thing 1 about this stuff. I don't even know wtf a weight is in this context lol
In this context "weight" is a mathematical term. Have you ever heard the term "weighted average"? Basically it means calculating an average where some elements are more "influent/important" than others, the number that indicates the importance of an element is called a weight.
One oversimplification of how any neural network work could be this:
Training an AI means finding the weights that give the best results, and thus, for an AI to be open-source, we need both the weights and the training code that generated them.
Personally, I feel that we should also have the original training data itself to call it open source, not just weights and code.
What does the open source training data include? I've read a few open source training data that is also tested for biased but I haven't really looked at them.
Absolutely agree that to be called open source the training data should also be open. It would also pretty much mean that true open source models would be ethically trained.
Yeah, good call. Training data should be available as well.
Thank you!
And yeah, it really does seem like the training data should be open. Like, not even just to be considered open source, just to be allowed to do this at all, ethically, the training data should be known, at least to some degree. Like, there's so much shit out there, knowing what they trained on would help make some kind of ethical choice in using it
And as I understand it these Chinese "open source" models are only the weights? No way to "compile" your own version.
I'm not sure what you mean about Chinese models, but you can find the code used for training. Open Llama, for example, gives you the weights, the data, and the code used for training. You can do everything yourself, if you wanted to. The hardest part is getting the appropriate hardware.