AMD Announces "Instella" Fully Open-Source 3B Language Models

just_another_person@lemmy.world · 4 months ago

TheGrandNagus@lemmy.world · 4 months ago

Fully open and accessible: Fully open-source release of model weights, training hyperparameters, datasets, and code, fostering innovation and collaboration within the AI community.

That’s actually pretty good. Seems to be open source as the OSI defines it, rather than the much more common “this model is open source, but the dataset is a secret”.

HappyFrog@lemmy.blahaj.zone · 4 months ago

I see all these graphs about how much better this LLM is than another, but do those graphs actually translate to real world usefulness?

just_another_person@lemmy.world · 4 months ago

I think more of the issue is what constitutes actual open source. This is actually open source, and it performs well. If you’re familiar with the space, then it’s a big deal.

HappyFrog@lemmy.blahaj.zone · edit-2 4 months ago

I see, thank you.

Damn, they even chose a dataset with an open license.

Possibly linux@lemmy.zip · edit-2 4 months ago

Is it really or is it just a binary release like everything else?

Edit: It is actually FOSS.

just_another_person@lemmy.world · 4 months ago

Everything is explained and linked in the project, so…

Possibly linux@lemmy.zip · 4 months ago

Yeah I noticed that after writing this. Really cool stuff

oldfart@lemm.ee · 4 months ago

I have yet to see a 3B model that’s not dumb.

penquin@lemm.ee · 4 months ago

Thank god for this. Setting up DeepSeek to utilize my AMD GPU through llama was near impossible.

GaMEChld@lemmy.world · edit-2 4 months ago

Smart people, I beg of thee, explain! What can it do?

Edit: Looks to be another text-based one, not image generation, right?

just_another_person@lemmy.world · 4 months ago

It’s language only, hence “LM”.

GaMEChld@lemmy.world · 4 months ago

To be fair, I didn’t know whether “language” included programming languages, or whether image-based AI might still count as an LLM. Is there a different designation for the type of AI that does image generation?

just_another_person@lemmy.world · 4 months ago

Yes: https://www.hachi-x.com/en/single-post/differences-between-llm-vlm-lvm-lmm-mllm-generative-ai-and-foundation-models

GaMEChld@lemmy.world · 4 months ago

Nice, thanks!

Rando@sh.itjust.works · 4 months ago

Got it up and running on a Debian distrobox… now I need to figure out how to train it. Will be my first steps into this type of thing – so prob will take me a bit to figure out how it all works

werefreeatlast@lemmy.world · 4 months ago

It knows everything about everything you ever received by mail from your local grocery store.

Can it learn my local database of PDF books I illegally downloaded years ago? No!

That’s right! Isn’t it great?

just_another_person@lemmy.world · 4 months ago

Huh?
