daredevil

joined 1 year ago
[–] daredevil@kbin.social 3 points 8 months ago

Get well soon, and thanks for the update.

 
 

Terminal Trove showcases the best of the terminal. Discover a collection of CLI, TUI, and other developer tools at Terminal Trove.

 

On Monday, Mistral AI announced a new AI language model called Mixtral 8x7B, a "mixture of experts" (MoE) model with open weights that reportedly matches OpenAI's GPT-3.5 in performance—an achievement that has been claimed by others in the past but is being taken seriously by AI heavyweights such as OpenAI's Andrej Karpathy and Jim Fan. That means we're closer to having a ChatGPT-3.5-level AI assistant that can run freely and locally on our devices, given the right implementation.

Mistral, based in Paris and founded by Arthur Mensch, Guillaume Lample, and Timothée Lacroix, has seen a rapid rise in the AI space recently. It has been quickly raising venture capital to become a sort of French anti-OpenAI, championing smaller models with eye-catching performance. Most notably, Mistral's models run locally with open weights that can be downloaded and used with fewer restrictions than closed AI models from OpenAI, Anthropic, or Google. (In this context "weights" are the computer files that represent a trained neural network.)

Mixtral 8x7B can process a 32K token context window and works in French, German, Spanish, Italian, and English. It works much like ChatGPT in that it can assist with compositional tasks, analyze data, troubleshoot software, and write programs. Mistral claims that it outperforms Meta's much larger LLaMA 2 70B (70 billion parameter) large language model and that it matches or exceeds OpenAI's GPT-3.5 on certain benchmarks, as seen in the chart below.
A chart of Mixtral 8x7B performance vs. LLaMA 2 70B and GPT-3.5, provided by Mistral.

The speed at which open-weights AI models have caught up with OpenAI's top offering a year ago has taken many by surprise. Pietro Schirano, the founder of EverArt, wrote on X, "Just incredible. I am running Mistral 8x7B instruct at 27 tokens per second, completely locally thanks to @LMStudioAI. A model that scores better than GPT-3.5, locally. Imagine where we will be 1 year from now."

LexicaArt founder Sharif Shameem tweeted, "The Mixtral MoE model genuinely feels like an inflection point — a true GPT-3.5 level model that can run at 30 tokens/sec on an M1. Imagine all the products now possible when inference is 100% free and your data stays on your device." To which Andrej Karpathy replied, "Agree. It feels like the capability / reasoning power has made major strides, lagging behind is more the UI/UX of the whole thing, maybe some tool use finetuning, maybe some RAG databases, etc."

Mixture of experts

So what does mixture of experts mean? As this excellent Hugging Face guide explains, it refers to a machine-learning model architecture where a gate network routes input data to different specialized neural network components, known as "experts," for processing. The advantage of this is that it enables more efficient and scalable model training and inference, as only a subset of experts are activated for each input, reducing the computational load compared to monolithic models with equivalent parameter counts.

In layperson's terms, a MoE is like having a team of specialized workers (the "experts") in a factory, where a smart system (the "gate network") decides which worker is best suited to handle each specific task. This setup makes the whole process more efficient and faster, as each task is done by an expert in that area, and not every worker needs to be involved in every task, unlike in a traditional factory where every worker might have to do a bit of everything.
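
For a more concrete picture, here is a minimal, illustrative sketch of such a layer in PyTorch. This is not Mixtral's actual implementation, and the class name and dimensions are made up for the example; it only shows the basic idea of a gate network scoring every token and running just the top-2 experts it selects:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    """Toy mixture-of-experts layer: a gate network picks the top-k experts per token."""

    def __init__(self, dim: int, num_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(dim, num_experts, bias=False)   # the "gate network"
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
             for _ in range(num_experts)]                      # the "experts"
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, dim). Score every expert, but only run the top-k per token.
        scores = self.gate(x)                                  # (tokens, num_experts)
        weights, chosen = scores.topk(self.top_k, dim=-1)      # which experts each token goes to
        weights = F.softmax(weights, dim=-1)                   # mixing weights for the chosen experts

        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = chosen[:, slot] == e                    # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

# Usage: route a batch of 4 token vectors through the layer.
layer = MoELayer(dim=16)
print(layer(torch.randn(4, 16)).shape)  # torch.Size([4, 16])
```

Because only two of the eight expert networks run for any given token, the compute per token is a fraction of what a dense model with the same total parameter count would need.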

OpenAI has been rumored to use a MoE system with GPT-4, accounting for some of its performance. In the case of Mixtral 8x7B, the name implies that the model is a mixture of eight 7 billion-parameter neural networks, but as Karpathy pointed out in a tweet, the name is slightly misleading because, "it is not all 7B params that are being 8x'd, only the FeedForward blocks in the Transformer are 8x'd, everything else stays the same. Hence also why total number of params is not 56B but only 46.7B."
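
As a rough sanity check on Karpathy's figure, here is a back-of-the-envelope sketch in Python. The configuration values (hidden size, layer count, feed-forward width, and so on) come from Mixtral's widely reported configuration rather than from the article above, so treat them as assumptions:

```python
# Back-of-the-envelope parameter count, assuming the widely reported Mixtral config:
# hidden 4096, 32 layers, SwiGLU FFN width 14336, 8 experts, grouped-query attention
# with 32 query / 8 key-value heads, 32k vocab. These values are assumptions.
hidden, layers, ffn, experts, vocab = 4096, 32, 14336, 8, 32_000
head_dim, q_heads, kv_heads = 128, 32, 8

expert_ffn = 3 * hidden * ffn                        # SwiGLU: gate, up, and down projections
attention = hidden * (q_heads * head_dim) * 2 \
          + hidden * (kv_heads * head_dim) * 2       # q, o, k, v projections
per_layer = experts * expert_ffn + attention + hidden * experts  # plus a tiny router
embeddings = 2 * vocab * hidden                      # input embeddings + output head

total = layers * per_layer + embeddings
active = layers * (2 * expert_ffn + attention) + embeddings      # only 2 experts run per token

print(f"total  ~ {total / 1e9:.1f}B parameters")     # ~46.7B, not 8 x 7B = 56B
print(f"active ~ {active / 1e9:.1f}B per token")     # ~12.9B
```

Only the expert feed-forward weights are replicated eight times; attention, embeddings, and norms are shared, which is why the total lands well below 56 billion and only roughly 13 billion parameters are active for each token.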

Mixtral is not the first "open" mixture of experts model, but it is notable for its relatively small size in parameter count and performance. It's out now, available on Hugging Face and BitTorrent under the Apache 2.0 license. People have been running it locally using an app called LM Studio. Also, Mistral began offering beta access to an API for three levels of Mistral models on Monday.
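
For those who want to try the open weights directly rather than through LM Studio, a minimal sketch using the Hugging Face transformers library might look like the following. The repository name is the commonly cited one and is an assumption here, and the full-precision model is far too large for most consumer machines, so quantized builds are the more practical local option:

```python
# A minimal sketch of loading the open weights with Hugging Face transformers.
# The repo id below is an assumption; requires the transformers and accelerate packages
# and substantial RAM/VRAM at full precision.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("Explain mixture of experts in one sentence.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```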

 

Discovery helps reveal how a long-lost empire used multiculturalism to achieve political stability

[–] daredevil@kbin.social 1 points 1 year ago (4 children)

I've taken care of it. 🙂

 

@Ernest has pushed an update which allows users to request ownership/moderation of abandoned magazines. Ghost/abandoned magazines were fairly prevalent after the initial wave of hype due to users either squatting magazine names or becoming inactive for other reasons. Now is your chance to get involved, if you were waiting to do so.

To request ownership/moderator privileges, scroll down to where it says "MODERATORS" in the sidebar. There will be an icon of a hand pointing upwards that you can click on, then make the request. Cheers, and thank you for your hard work Ernest, as well as future mods.

[–] daredevil@kbin.social 7 points 1 year ago

My advice would be to take things gradually. This endeavor can be a bit overwhelming if you're one to hyperfixate.

[–] daredevil@kbin.social 1 points 1 year ago

Bangs are awesome, and so are the Vim keybinds

 

Title: Let the Battles Begin!
Name: Final Fantasy VII
Year Released: 1997
Composer: Nobuo Uematsu
Developer: Square
Platform: PlayStation

 

Title: Green Hill Zone
Game Name: Sonic the Hedgehog
Year Released: 1991
Composer: Masato Nakamura
Developer: Sonic Team
Platform: Sega Genesis

 

Composer: Junichi Masuda
Game: Pokémon Red and Blue (Pokémon Red and Green in Japan)
Year Released: 1996
Platform: Game Boy

[–] daredevil@kbin.social 2 points 1 year ago* (last edited 1 year ago) (1 children)

I would also love it if we could prevent microblogs from federating to a magazine, as banning currently does nothing to prevent this issue and seems a bit counterintuitive. Similar to the concerns OP raised RE: commenting, this seems like it could be another vector for bad actors to attack from.

[–] daredevil@kbin.social 4 points 1 year ago (11 children)

Is Ubuntu a requirement, or am I misunderstanding?

 

• Game: Mega Man 3 (Capcom, 1990, NES)
• ReMixer(s): Disco Dan
• Composer(s): Harumi Fujita, Yasuaki Fujita
• Song(s): 'Title'
• Posted: 2001-11-22, evaluated by djpretzel

[–] daredevil@kbin.social 5 points 1 year ago

This is not the emoji you're looking for. *hand wave*

[–] daredevil@kbin.social 18 points 1 year ago

This does not spark joy.

[–] daredevil@kbin.social 1 points 1 year ago* (last edited 1 year ago)

Contributed 50 translations to Japanese yesterday :) I'll continue contributing how I can

[–] daredevil@kbin.social 4 points 1 year ago* (last edited 1 year ago)

I understand wanting to help the platform grow, but I don't think invalidating the opinions and contributions of a-man-from-earth is a good way to approach it. The holier-than-thou attitude might also have the opposite effect of what your original post is attempting to achieve. The lack of active moderators is certainly an issue, and spam and various federation issues are problems as well. I get that these things take time, so I'm being patient. That said, I still enjoy kbin and contribute how and when I can.

[–] daredevil@kbin.social 2 points 1 year ago

Didn't realize there was a bot in place for mirroring content, as bot posts apparently don't federate to kbin. @Arorus, you're free to delete my posts, or if others also don't want to see them, I'll take them down.

 

"...Euphrasie, three days ago, one of your journalists secretly followed a suspect all the way from the Court of Fontaine to Romaritime Harbor, and almost ended up being tied up and thrown into the sea by a gang of criminals. Whether or not there's any truth in the notion that 'nearer to the action is closer to the truth,' surely Miss Charlotte doesn't value her reports more than she does her own life?"
— Yet another exasperated exchange between Captain Chevreuse of the Special Security and Surveillance Patrol and Euphrasie, Editor-in-Chief of The Steambird

◆ Name: Charlotte
◆ Title: Lens of Verity
◆ Reporter of The Steambird
◆ Vision: Cryo
◆ Constellation: Hualina Veritas

Fontaine's famous newspaper The Steambird has a veritable legion of reporters it can call upon, each with their own area of expertise. Some specialize in celebrity gossip, others follow the word on the street, while others still focus on political affairs...

But among them all, there is one that stands head and shoulders above the rest thanks to her seemingly boundless reserve of energy and perseverance — the inimitable Charlotte.

Unswervingly committed to the principle that "nearer to the action is closer to the truth," Charlotte has a habit of popping up literally anywhere and everywhere in Fontaine — from its widest avenues to its narrowest back alleys, its highest vantage points to its lowest subterranean vaults, even its tallest mountains to its deepest undersea caverns. She captures the "truth" with her Kamera, records it in her articles, and finally unveils it for all to see.

And when the "truth" comes out, she's met with a variety of reactions, ranging from applause to embarrassment to outright fury. There are even some who would resort to any means necessary to make a particular article connected to themselves disappear. Or alternatively, just make Charlotte disappear.

For this reason, the newspaper's Editor-in-Chief Euphrasie has on numerous occasions felt the need to distance Charlotte from the Court of Fontaine by sending her off on faraway "field reporting" jobs, only recalling her once the Maison Gardiennage or Special Security and Surveillance Patrol had finally managed to clear things up.

But despite all this, neither the toil of the job itself nor the pressure of external denunciations and threats has ever fazed Charlotte in the slightest.

With her trusty companion Monsieur Verite by her side, she invariably carries out her journalistic duties with unfaltering fervor, rushing about in pursuit of all the "truths" out there just waiting to be discovered.
