Microblog Memes


A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

Rules:

Please put at least one word relevant to the post in the post title.
Be nice.
No advertising, brand promotion or guerilla marketing.
Posters are encouraged to link to the toot or tweet etc in the description of posts.

founded 2 years ago

MODERATORS

ReadyUser31@lemmy.world

aeronmelon@lemmy.world

needanke@feddit.org

1451

firefox also isn't immune (lemmy.blahaj.zone)

submitted 17 hours ago by not_IO@lemmy.blahaj.zone to c/microblogmemes@lemmy.world

https://mastodon.social/@gwynnion/114541537909461004

[–] brucethemoose@lemmy.world 2 points 12 hours ago* (last edited 12 hours ago)

some bits related to its training data

AKA ANY details about its training data, and its training hyperparameters, and literally any other details about its training. An 'open' secret among LLM tinkerers is that the Chinese companies seem to have particularly strong English/Chinese training data (not so much other languages though), and I'll give you one guess on how.

DeepSeek is unusual in that they are open sourcing the general techniques they used and even some (not all) of the software frameworks they use.

Don't get me wrong, I think any level of openness should be encouraged (unlike OpenAI being as closed as physically possible), but they are still very closed, unlike, say, IBM's Granite models, which should be reproducible.