- cross-posted to:
- linux@lemmy.ml
- cross-posted to:
- linux@lemmy.ml
Fully open and accessible: Fully open-source release of model weights, training hyperparameters, datasets, and code, fostering innovation and collaboration within the AI community.
That’s actually pretty good. Seems to be open source as the OSI defines it, rather than the much more common “this model is open source, but the dataset is a secret”.
I see all these graphs about how much better this LLM is than another, but do those graphs actually translate to real world usefulness?
I think more of the issue is what constitutes actual open source. This is actually open source, and it performs well. If you’re familiar with the space, then it’s a big deal.
I see, thank you.
Damn, they even chose a dataset with a open license.
Is it really or is it just a binary release like everything else?
Edit: It is actually Foss
Everything is explained and linked in the project, so…
Yeah I noticed that after writing this. Really cool stuff
I have yet to see a 3B model that’s not dumb.
Thank god for this. Setting up deepseek to utilize my AMD GPU through llama was near impossible
Smart people, I beg of thee, explain! What can it do?
Edit: looks to be another text based one, not image generation right?
It’s language only, hence, LM
To be fair, I didn’t know if that language included programming language, and thus maybe still consider image based AI to be included in LLM. Is there a different designation for the type of AI that does image generation?
Nice, thanks!
Got it up and running on a Debian distrobox… now I need to figure out how to train it. Will be my first steps into this type of thing – so prob will take me a bit to figure out how it all works
It knows everything about everything you ever received by mail from your local grocery store.
Can it learn my local database of PDF books I illegally downloaded years ago? No!
That’s right! Isn’t it great?
Huh?