this post was submitted on 23 May 2024
917 points (97.4% liked)

Technology

59342 readers
5242 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Update: https://www.bleepingcomputer.com/news/microsoft/microsoft-outage-affects-bing-copilot-duckduckgo-and-chatgpt-internet-search/

It's also important to note that ChatGPT internet search and DuckDuckGo are experiencing similar issues because they use the Bing API.

UPDATE 2

20240523_210619

you are viewing a single comment's thread
view the rest of the comments
[–] joneskind@lemmy.world 2 points 5 months ago* (last edited 5 months ago) (1 children)

Most of 7b-8b models run just fine in 4bits quant and won’t use more than 4 or 5 GB of VRAM.

The only important metric is the amount of VRAM as the model must be loaded in VRAM for fast inference.

You could use CPU and RAM but it is really painfully slow.

If you got an Apple Silicon Mac it could be even simpler.

[–] veniasilente@lemm.ee 2 points 5 months ago (1 children)

I have an Intel Celeron Mobile laptop with iGPU and, I think, 256MB VRAM. How many bs does that get me for the LLM?

~~Only half-joking. That's my still functional old daily driver now serving as homelab~~

[–] joneskind@lemmy.world 2 points 5 months ago

Well, I got a good news and a bad news.

The bad news is you won't do shit with that my dear friend.

The good news is that you won't need it because the duck is back.