magic_lobster_party@kbin.run 3 points 3 months ago (last edited 3 months ago)

The explanation is not that simple. Some model configurations work well. Others don’t. Not all continuous and differentiable models cut it.

It’s not a given that a model can generalize the problem so well. It can simply memorize the training data and then completely fail on any new data it hasn’t seen.
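
As a rough illustration of that memorization failure mode (a hypothetical sketch using scikit-learn, not anything specific to the models being discussed): an unconstrained classifier can fit purely random labels perfectly on its training set, yet score at chance level on held-out data, because there were never any generalizable features to learn.

```python
# Minimal sketch: memorization without generalization.
# Random features and random labels mean there is nothing to generalize,
# but an unconstrained decision tree can still memorize the training set.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 20))      # random features
y = rng.integers(0, 2, size=1000)    # random binary labels

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = DecisionTreeClassifier()     # no depth limit: free to memorize
model.fit(X_train, y_train)

print("train accuracy:", model.score(X_train, y_train))  # ~1.0 (memorized)
print("test accuracy:", model.score(X_test, y_test))     # ~0.5 (chance level)
```

The interesting question is why some models trained on real data land on features that transfer to unseen examples instead of ending up in this regime.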

What makes a model able to see a picture of a cat it has never seen before and respond with “ah yes, that’s a cat”? What kind of “cat-like” features has it managed to generalize? Why do these features work well?

When I ask ChatGPT to translate a script from Java to Python, how is it able to interpret the instruction and execute it? What features has it managed to generalize to be able to perform this task?

Just saying “why wouldn’t it work” isn’t a valid explanation.