AMD Announces "Instella" Fully Open-Source 3B Language Models (www.phoronix.com)
Posted by just_another_person@lemmy.world to Linux@lemmy.world · English · 10 days ago · 17 comments · cross-posted to: linux@lemmy.ml
HappyFrog@lemmy.blahaj.zone (10 days ago): I see all these graphs about how much better this LLM is than another, but do those graphs actually translate to real world usefulness?
just_another_person@lemmy.world (OP, 10 days ago): I think more of the issue is what constitutes actual open source. This is actually open source, and it performs well. If you’re familiar with the space, then it’s a big deal.
HappyFrog@lemmy.blahaj.zone (10 days ago, edited): I see, thank you. Damn, they even chose a dataset with an open license.
Possibly linux@lemmy.zip (10 days ago, edited): Is it really, or is it just a binary release like everything else? Edit: It is actually FOSS.
just_another_person@lemmy.world (OP, 10 days ago): Everything is explained and linked in the project, so…
Possibly linux@lemmy.zip (10 days ago): Yeah I noticed that after writing this. Really cool stuff
oldfart@lemm.ee (8 days ago): I have yet to see a 3B model that’s not dumb.