this post was submitted on 28 Nov 2023
6 points (87.5% liked)

Self-Hosted Main

515 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

For Example

We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.

Useful Lists

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] crysisnotaverted@alien.top 2 points 11 months ago (1 children)

I might be too dumb for this one, boys. I can't wrap my head around what "Open source AI Agent evaluations for web tasks" means...

Other than me being stupid, that is one well designed github repo, lol.

[โ€“] asim-shrestha@alien.top 1 points 11 months ago

๐Ÿ˜‚ a bit opaque if you're not super familiar with the space i suppose

โ€‹

ELI5: Theres a lot of work being done with LLMs to take actions on websites. This open source repo provides static versions of these websites along with some evaluation criteria to measure the performance of your LLM "agents". Its quite a pain to reliably test these agents otherwise. (An agent being some system of code that will take a goal like "travel to xyz on this page" and use an llm to translate that into actual actions)