Llama 3.1 Megathread

Blaed@lemmy.world · 4 months ago

Llama 3.1 Megathread

Blaed@lemmy.world · 10 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Blaed@lemmy.world · 10 months ago

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Blaed@lemmy.world · edit-2 10 months ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Blaed@lemmy.world · 10 months ago

Develop Alongside Local LLMs w/ Open Interpreter

Blaed@lemmy.world · 10 months ago

What open-source LLMs are you using in 2024?

Blaed@lemmy.world · 10 months ago

FOSAI 2024

Blaed@lemmy.world · 1 year ago

HyperTech News Report #0003 - Expanding Horizons

Blaed@lemmy.world · 1 year ago

Mistral seems to be the popular choice. I think it’s the most open-source friendly out of the bunch. I will keep function calling in mind as I design some of our models! Thanks for bringing that up.

Blaed@lemmy.world · edit-2 1 year ago

We're building FOSAI models! Cast your votes and pick your tunings.

Blaed@lemmy.world · 1 year ago

HyperTech News Report #0002 - A New Challenger Approaches!

Blaed@lemmy.world · 1 year ago

HyperTech News Report #0001 - Happy FOSAI Friday!

Blaed@lemmy.world · 1 year ago

LM Studio - A new tool to discover, download, and run local LLMs

Blaed@lemmy.world · 1 year ago

CodeLlama-34B - the First Open-Source Model Beating GPT-4 on HumanEvals

Blaed@lemmy.world · 1 year ago

Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

Blaed@lemmy.world · 1 year ago

Introducing Stable-Diffusion.cpp (Inference in Pure C/C++)

Blaed@lemmy.world · 1 year ago

Cheetor - A New Multi-Modal LLM Strategy Empowered by Controllable Knowledge Re-Injection

Blaed@lemmy.world · edit-2 1 year ago

Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data

Blaed@lemmy.world · 1 year ago

Vicuna v1.5 Has Been Released!

Blaed@lemmy.world · 1 year ago

I am actively testing this out. It’s hard to say at the moment. There’s a lot to figure out when deploying a model into a live environment, but I think there’s real value in using them for technical tasks - especially as models mature and improve over time.

At the moment, though, performance is closer to GPT 3.5 than GPT 4, but I wouldn’t be surprised if this is no longer the case within the next year or so.

Blaed@lemmy.world · 1 year ago

After finally having a chance to test some of the new Llama-2 models, I think you’re right. There’s still some work to be done to get them tuned up… I’m going to dust off some of my notes and get a new index of those other popular gen-1 models out there later this week.

I’m very curious to try out some of these docker images, too. Thanks for sharing those! I’ll check them when I can. I could also make a post about them if you feel like featuring some of your work. Just let me know!

Blaed@lemmy.world · 1 year ago

Free Open-Source AI LLM Guide

Blaed@lemmy.world · 1 year ago

Assuming everything from the papers translates to current platforms, yes! A rather significant one at that. Time will tell us the true results as people begin tinkering with this new approach in the near future.

Blaed@lemmy.world · 1 year ago

Thanks for reading! I’m glad you enjoy the content. I find this tech beyond fascinating.

Who knows, over time you might even begin to pick up on some of the nuance you describe.

We’re all learning this together!

Blaed@lemmy.world · edit-2 1 year ago

OpenAI has launched a new initiative, Superalignment, aimed at guiding and controlling ultra-intelligent AI systems. Recognizing the imminent arrival of AI that surpasses human intellect, the project will dedicate significant resources to ensure these advanced systems act in accordance with human intent. It’s a crucial step in managing the transformative and potentially dangerous impact of superintelligent AI.

I like to think this starts to explore interesting philosophical questions like human intent, consciousness, and the projection of will into systems that are far beyond our capabilities in raw processing power and input/output. What may happen from this intended alignment is yet to be seen, but I think we can all agree the last thing we want is for these emerging intelligent machines to do things we don’t want them to do.

‘Superalignment’ is OpenAI’s answer to how to put up these safeguards. Whether or not this is the best method is yet to be determined.

Blaed@lemmy.world · 1 year ago

All of these are great thoughts and ponderings! Totally correct in the right circumstances, too.

Massive context lengths that can retain coherent memory and attention over long periods of time would enable all sorts of breakthroughs in LLM technology. At this point, you would be held back by performance, compute, and datasets, rather than LLM context windows and short-term memory. In that scenario, our focus would shift toward optimizing attention or improving speed and accuracy.

Let’s say you had hundreds of pages of a digital journal and felt like feeding this to a local LLM (where your data stays private). If the model were running at sufficiently high quality, you could have an AI assistant, coach, partner, or tutor that was caught up to speed with your project’s goals, your personal aspirations, and your daily life within a matter of a few hours (or a few weeks, depending on hardware capabilities).

Missing areas of expertise you want your AI to have? Upload and feed it more datasets, Matrix style. Any text-based information that humanity has shared online is available to the model.
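To put rough numbers on the journal example, here is a minimal Python sketch that estimates how many tokens a journal takes up and how many context-window-sized chunks it would have to be split into. The figures are assumptions for illustration, not measurements: about 3,000 characters per page, about 4 characters per token (a common rule of thumb for English text that varies by tokenizer), and 512 tokens reserved for the model's reply. The function names are hypothetical.

```python
import math

# Assumptions for illustration only; real values depend on your text and tokenizer.
CHARS_PER_PAGE = 3000   # roughly 500 words per page
CHARS_PER_TOKEN = 4     # rough rule of thumb for English text


def estimate_tokens(pages, chars_per_page=CHARS_PER_PAGE,
                    chars_per_token=CHARS_PER_TOKEN):
    """Roughly estimate how many tokens a journal of `pages` pages uses."""
    return math.ceil(pages * chars_per_page / chars_per_token)


def chunks_needed(total_tokens, context_window, reserved_for_reply=512):
    """Number of pieces the text must be split into to fit the window."""
    usable = context_window - reserved_for_reply
    if usable <= 0:
        raise ValueError("context window is too small for the reserved reply")
    return math.ceil(total_tokens / usable)


if __name__ == "__main__":
    journal_tokens = estimate_tokens(300)  # a hypothetical 300-page journal
    for window in (4096, 32768, 1_000_000):
        print(f"{window:>9} token window -> "
              f"{chunks_needed(journal_tokens, window)} chunk(s)")
```

Under these assumptions, a 300-page journal only fits in a single pass once the window reaches the hundreds of thousands of tokens. With smaller windows you are stuck chunking, summarizing, or retrieving pieces on demand.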

From here, you could further finetune and give your LLM a persona, having an assistant and personal operating system that breaks down your life with you, or you could simply ‘chat’ with your life, those pages you fed it, and reflect upon your thoughts and memories, tuned to a super intelligence beyond your own.

Poses some fascinating questions, doesn’t it? About consciousness? Thought? You? This is the sort of stuff that keeps me up at night… If you trained a private LLM on your own notes, thoughts, reflections and introspection, wouldn’t you be imposing a level of consciousness into a system far beyond your own mental capacities? I have already started to use LLMs on the daily. In the right conditions, I would absolutely utilize a tool like this. We’re not at super intelligence yet, but an unlimited context window for a model of that caliber would be groundbreaking.

Information of any kind could be digitized and formatted into datasets (at massive lengths), enabling this assistant or personal database to grow over time with innovations of a project, you, your life, learning and discovering things alongside the intention and desire for it to function. At that point, we’re starting to get into augmented human capabilities.

What this means over the course of many years and breakthroughs in models and training methods would be a fascinating thought experiment to consider for a society where everyone is using massive context length LLMs regularly.

Sci-fi is quickly becoming a reality. How exciting! I’m here for it, that’s for sure. Let’s hope the technology stays free, open, and accessible for all of us to participate in its marvels.

Blaed@lemmy.world · edit-2 1 year ago

You are correct in thinking this will demand a lot of compute. Hardware will need to scale to match these context lengths, but that is becoming increasingly possible with things like NVIDIA’s Grace Hopper architecture and AMD’s recent commitment to expanding their hardware selection for emerging AI markets and demand.
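As a back-of-the-envelope illustration of why longer contexts get expensive, here is a small Python sketch of how much memory the attention score matrix takes if it is fully materialized. It assumes a naive implementation that stores one score per pair of tokens, per head, for a single layer, with 32 heads and 2-byte (fp16) values. Those numbers and the function name are assumptions for illustration. Memory-efficient attention kernels avoid storing the full matrix, so treat this as an upper-bound intuition rather than what any particular model actually uses.

```python
def attention_scores_bytes(context_len, num_heads=32, bytes_per_value=2):
    """Bytes needed to hold a full context_len x context_len score matrix
    for every head in one layer (naive attention, no memory-saving tricks)."""
    return context_len * context_len * num_heads * bytes_per_value


if __name__ == "__main__":
    for n in (4096, 32768):
        gib = attention_scores_bytes(n) / 2**30
        print(f"context {n:>6}: {gib:.0f} GiB of scores per layer")
```

The point is the shape of the curve: in this naive setup, making the context 8x longer makes the score matrix 64x larger, because the cost grows with the square of the context length.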

There are also some really interesting frameworks and hardware developments being made at TinyCorp & TinyGrad that aim to run these emerging technologies efficiently and accessibly. George Hotz, the founder, talks about this in detail in his podcast with Lex Fridman, a great watch if you’re interested in this sort of stuff.

It is an exciting time for technology and innovation. We have already started to hit exaflops of compute…

Blaed@lemmy.world · 1 year ago

Great question. I ponder this too, which is why I started /c/FOSAI. We have to do everything we can to make sure our future stays open for all. Our faith cannot be put into the hands of a select few, but rather into those of the many.

Time will tell who truly supports this. I’m hopeful OpenAI is the good guy we want them to be, but other businesses keep me from jumping to that conclusion. I like what they are doing alongside Microsoft, but we need more players in the game. Fresh minds to shake things up a little.

If you’re reading this, support FOSS, support FOSAI, and support the Fediverse. It’s the only way we can take back the internet, one server at a time.

Blaed@lemmy.world · 1 year ago

That’s okay! I hope you find what you’re looking for. If not, I’m sure someone will create a community for you soon. There are a lot of new users migrating, so it’s only a matter of time before more content starts filling up the empty spaces!

Blaed@lemmy.world · 1 year ago

If you’re interested in free, open-source artificial intelligence news, breakthroughs, and developments, you should head over and subscribe to /c/FOSAI. I’d love to have you! Say hi anytime. I do my best to avoid spam, sensationalism, and clickbait.
