this post was submitted on 25 Oct 2023
9 points (80.0% liked)

Hacker News

4123 readers
3 users here now

This community serves to share top posts on Hacker News with the wider fediverse.

Rules0. Keep it legal

  1. Keep it civil and SFW
  2. Keep it safe for members of marginalised groups

founded 1 year ago
MODERATORS
 

There is a discussion on Hacker News, but feel free to comment here as well.

you are viewing a single comment's thread
view the rest of the comments
[–] lvxferre@lemmy.ml 3 points 1 year ago* (last edited 1 year ago)

Systematic generalisation, in a nutshell, works like this:

  • one apple, two apples
  • one ball, two balls
  • one rose, two roses
  • one ___, two ___s

It's an actual feature of language, and it operates on both the morphological and syntactical layers.

And IMO a good start, but not enough. As machine text generation moves away from LLMs and their "ooga booga, bash token on token" approach, eventually you'll need to deal with the fact that the morpheme (aka token) itself don't matter that much, it's just an interface for a semantic layer. And that you need that semantic layer if you want anything past "potatoes are active, oranges are passive".