Firefox

20135 readers

21 users here now

A place to discuss the news and latest developments on the open-source browser Firefox

founded 5 years ago

MODERATORS

k_o_t@lemmy.ml

212

Firefox introduces AI as experimental feature (lemmy.fish)

submitted 7 months ago by potentiallynotfelix@lemmy.fish to c/firefox@lemmy.ml

99 comments fedilink hide all child comments

They support Claude, ChatGPT, Gemini, HuggingChat, and Mistral.

you are viewing a single comment's thread
view the rest of the comments

[–] ocassionallyaduck@lemmy.world 4 points 7 months ago (1 children)

Try again. Simplified models take the large ones and pare them down in terms of memory requirements, and can be run off the CPU even. The "smol" model I mentioned is real, and hyperfast.

Llama 3.2 is pretty solid as well.

[–] Lojcs@lemm.ee 3 points 7 months ago* (last edited 7 months ago) (1 children)

These are the answers they gave the first time.

Qwencoder is persistent after 6 rerolls.

Anyways, how do I make these use my gpu? ollama logs say the model will fit into vram / offloaing all layers but gpu usage doesn't change and cpu gets the load. And regardless of the model size vram usage never changes and ram only goes up by couple hundred megabytes. Any advice? (Linux / Nvidia) Edit: it didn't have cuda enabled apparently, fixed now

[–] ocassionallyaduck@lemmy.world 5 points 7 months ago

Nice.

Yea I don't trust any AI models for facts, period. They all just lie. Confidently. The smol model there at least tried and got it right at first... Before confusing the sentence context.

Qwen is a good model too. But if you wanted something to run home automation or do text summaroes, smol is solid enough. I'm using CPU so it's good enough.