hersh

joined 1 year ago
[–] hersh 5 points 7 months ago (5 children)

If you click the Chat button on a DDG search page, it says:

DuckDuckGo AI Chat is a private AI-powered chat service that currently supports OpenAI’s GPT-3.5 and Anthropic’s Claude chat models.

So at minimum they are sharing data with one additional third party, either OpenAI or Anthropic depending on which model you choose.

OpenAI and Anthropic have similar terms and conditions for enterprise customers. They are not completely transparent and any given enterprise could have their own custom license terms, but my understanding is that they generally will not store queries or use them for training purposes. You'd better seek clarification from DDG. I was not able to find information on this in DDG's privacy policy.

Obviously, this is not legal advice, and I do not speak for any of these companies. This is just my understanding based on the last time I looked over the OpenAI and Anthropic privacy policies, which was a few months ago.

[–] hersh 62 points 7 months ago* (last edited 7 months ago) (1 children)

I've been using Kagi for a while, so I'll post a few quick thoughts I had after reading the article, linked blog, and mastodon thread.

The one thing in the blog post I strongly disagree with is her statement that the summarizer is "the same old AI bullshit". I think they just assumed that without actually testing it. The summarizer is fantastic, and is a great example of the right way to use LLMs. Its output comes entirely from the URL or file you specify. It does not hallucinate. You can ask it follow-up questions about the document, and again, its replies are limited in scope to what's actually in that document. If you ask it something out of scope it'll tell you that it can't find that information in the document. This is great because it's using the LLM for what LLMs are actually good for — complex language parsing — and not for what they're bad for, like reasoning or information storage/retrieval. It's actually quite difficult to misuse the summarizer. It's straightforward and effective. This is Kagi's killer feature, IMO.

I can't speak as highly of its search-integrated AI features like FastGPT. They mostly take information from the contents of the first few search results, but they also seem to "fill in the blanks" more than I'd like. Sometimes it gives me information that is simply not in the sources that it cites. It's not as bad as using ChatGPT (which hallucinates all day every day, and if it ever tries to cite sources it hallucinates those, too), but it needs improvement.

That said, Kagi doesn't shove the AI down your throat like you might think reading the blog post. These are separate features that need to be explicitly activated. Plain searches don't return results from the LLMs. If you want that, you need to explicitly go to the assistant or trigger the "quick answer" feature on the search results page. EDIT: I just realized that this is not totally true as of a few weeks ago, when they added a feature to automatically trigger the "quick answer" blurb for search queries ending in a question mark. Perhaps that's why Lori felt like it was over-emphasized.

Anyway, back to the summarizer, here is an example of it in action. I gave it the URL of the Mastodon post. I think this is an excellent result. I also asked it an unrelated followup question so you can see that it limits itself to the information in the URL. It will not make shit up!

The summarizer lets me download conversations in markdown form, so I'll just paste it right in here so you can see.

Exploring https://hackers.town/@lori/112255132348604770

Assistant:
Key points:

  • The author wrote a blog post criticizing the search engine Kagi, which then prompted the CEO of Kagi, Vlad, to email the author directly to argue against the criticism. [1]
  • The author woke up to an email from Vlad, the CEO of Kagi, who was upset about the blog post the author had written. [1]
  • The author decided to publicly share the email exchange with Vlad, as they felt it was important for people to see his unhinged behavior. [1]
  • The author refused to engage in a private debate with Vlad, as they did not want to be taken "to Crime Scene Number Two" without any accountability. [1]
  • The author believes Vlad is unable to accept criticism or disagreement, and that it "eats him alive" when someone doesn't agree with him or his company. [1]
  • The author states that no other search engine CEO has emailed them directly to "mansplain their company" in this manner. [1]
  • The author sees Vlad's behavior as "petty and sad" and believes he has no understanding of boundaries. [1]
  • Other users in the thread share similar experiences of Vlad's inability to accept criticism and his tendency to simply restate his own opinions rather than address counterpoints. [1]
  • The author and others characterize Vlad's behavior as that of a narcissist who cannot tolerate disagreement. [1]
  • The author and others are amazed by Vlad's decision to set up a separate company in another country just to print and give away t-shirts, rather than focus on improving Kagi's core search functionality.

[1] lori (@lori@hackers.town)

Query:
What's the capital of North Dakota?

Assistant:
The knowledge provided does not contain any information about the capital of North Dakota. The context is focused on an email exchange between the author and the CEO of Kagi search engine regarding criticism of the company.

[–] hersh 5 points 9 months ago (1 children)

Because it's not the same class of device. The PS Portal is very niche. It's a $200 device that basically just runs the PS Remote Play app.

I've used PS Remote Play on my phone and laptop, and it's just not good in the cases I actually want to use it: when traveling away from home. Even with a good Internet connection it's only "okay". It's utterly useless when in transit (trains, planes, etc.), and 99% useless in any public place (e.g. cafe or library WiFi).

These are all cases where the Switch, Deck, and similar devices excel. The PS Portal addresses a much smaller market.

[–] hersh 6 points 9 months ago

Oh yes, definitely. I think this is why Mozilla has not made this the default behavior in Firefox; there will always be the risk of false positives breaking copied links, so it's important that people know that there's some kind of mutation happening.

ClearURLs uses a JSON file with site-specific regex patterns and rules. In theory I could customize this for myself, or better yet submit a pull request on their GitHub. If I have time I'll look into it.
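
To illustrate the general idea of regex-driven rules (this is NOT ClearURLs' actual schema or rule set; the `RULES` table, its patterns, and the `strip_tracking` name are all made up for the example), a rough Python sketch might look like this:

```python
import re
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Hypothetical rule table: host regex -> regexes for query-parameter
# names to drop. These patterns are illustrative assumptions only.
RULES = {
    r"(^|\.)amazon\.com$": [r"qid", r"sr"],
}

def strip_tracking(url, rules=RULES):
    """Return url with query parameters matching the host's rules removed."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    # Collect the parameter patterns for every host rule that matches.
    patterns = [
        param_re
        for host_re, param_res in rules.items()
        if re.search(host_re, host)
        for param_re in param_res
    ]
    # Keep only the parameters whose names match none of the patterns.
    kept = [
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if not any(re.fullmatch(p, name) for p in patterns)
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))
```

Note that `urlencode` may re-encode some characters in the parameters it keeps, so a real tool would probably want to be more careful there. And the false-positive risk is exactly what you'd expect: any rule that's too broad will silently break links.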

[–] hersh 60 points 9 months ago (6 children)

Personally, I have found this feature to be too limited. I still use the ClearURLs extension, which is more effective in my experience.

However, neither one is a silver bullet. Here's an example I just took from Amazon (I blocked out some values with X's):

Original URL:
https://www.amazon.com/Hydro-Flask-Around-Tumbler-Trillium/dp/B0C353845H/ref=XXXX?qid=XXXXXXXXXX&refinements=p_XXXXXXXXXXXXX&rps=1&s=sporting-goods&sr=XXX

Using Firefox's "copy link without site tracking" feature:
https://www.amazon.com/Hydro-Flask-Around-Tumbler-Trillium/dp/B0C353845H/ref=XXXX?qid=XXXXXXXXXX&refinements=p_XXXXXXXXXXXXX&rps=1&s=sporting-goods

Using ClearURLs:
https://www.amazon.com/Hydro-Flask-Around-Tumbler-Trillium/dp/B0C353845H?refinements=p_XXXXXXXXXXXXX&rps=1

The ideal, canonical URL, which no tools I'm familiar with will reliably generate:
https://www.amazon.com/dp/B0C353845H

Longer but still fully de-personalized URL:
https://www.amazon.com/Hydro-Flask-Around-Tumbler-Trillium/dp/B0C353845H
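
For Amazon specifically, getting to the short form can be sketched in a few lines of Python. This assumes the product ID is always a 10-character uppercase alphanumeric string following `/dp/` in the path, which is my guess from URLs like the ones above and not anything documented; `canonical_amazon_url` is just a name I made up:

```python
import re
from urllib.parse import urlsplit

# Assumption: the product ID is 10 uppercase letters/digits after "/dp/".
ASIN_RE = re.compile(r"/dp/([A-Z0-9]{10})(?:/|$)")

def canonical_amazon_url(url):
    """Return the short /dp/<id> form of a product URL, or None if no ID is found."""
    parts = urlsplit(url)
    match = ASIN_RE.search(parts.path)
    if match is None:
        return None
    # Drop the title slug, the ref= path segment, and the whole query string.
    return f"{parts.scheme}://{parts.netloc}/dp/{match.group(1)}"
```

Of course this only handles one site and one URL shape, which is the whole problem.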

If anybody knows a better solution that works with a wide variety of sites, please share!

[–] hersh 21 points 9 months ago (7 children)

Not sure if you're referring to the graphics or to the shitty bench design. If the latter...it's a real thing. :(

They're called "leaning benches" or "lean bars". This bench design is sort of "futuristic" in the sense that adoption has only recently started taking off around the world. They are a user-hostile design made specifically to prevent people (specifically homeless people) from lying down, sleeping, or otherwise, y'know, using it as a goddamn bench. Because removing the ability for anyone to sit down is apparently, in the eyes of authorities, a small price to pay to make homeless people's lives that much harder.

The Wikipedia article for "Leaning bench" redirects to hostile architecture, where you can read more about this and similar efforts, if you are in the mood to be enraged at the sheer malice of bureaucrats.

I've seen them in several cities across America. NYC started rolling them out within the past decade and you'll see them in any recently renovated station. See https://www.nydailynews.com/2017/09/11/subway-riders-slam-brooklyn-stations-new-leaning-bars-as-incredibly-unwelcoming/ (scroll through the image slideshow to see the new bars).

Not sure if the image embed will work here but I'll try:

[–] hersh 13 points 10 months ago (1 children)

Correct. This is also why Apple switched to zsh as the default shell over bash. They still ship Bash 3.2 in macOS, because from 4.0 on, Bash started using GPLv3 instead of GPLv2.

I'm not against the idea of creating proprietary software out of open-source software, if the license allows that. However, I am always against this practice of "closing the door behind you".

[–] hersh 226 points 10 months ago (14 children)

Apple: builds their entire software ecosystem on free, open-source foundations.

Also Apple: better have a million euros if you want to even start distributing software.

The best use case for an external app store is free open-source software, like we have on the Android side with F-Droid. Apple stopped that before it even started. Jeez.

[–] hersh 6 points 10 months ago (1 children)

Yeah, I wouldn't be too confident in Facebook's implementation, and I certainly don't believe that their interests are aligned with their users'.

That said, it seems like we're reaching a turning point for big tech, where having access to private user data becomes more of a liability than an asset. Having access to the data means that they will be required by law to provide that data to governments in various circumstances. They might have other legal obligations in how they handle, store, and process that data. All of this comes with costs in terms of person-hours and infrastructure. Google specifically cited this as a reason they are moving Android location history on-device; they don't want to deal with law enforcement constantly asking them to spy on people. It's not because they give a shit about user privacy; it's because they're tired of providing law enforcement with free labor.

I suspect it also helps them comply with some of the recent privacy protection laws in the EU, though I'm not 100% sure on that. Again, this is a liability issue for them, not a user-privacy issue.

Also, how much valuable information were they getting from private messages in the first place? Considering how much people willingly put out in the open, and how much can be inferred simply by the metadata they still have access to (e.g. the social graph), it seems likely that the actual message data was largely redundant or superfluous. Facebook is certainly in position to measure this objectively.

The social graph is powerful, and if you really care about privacy, you need to worry about it. If you're a journalist, whistleblower, or political dissident, you absolutely do not want Facebook (and by extension governments) to know who you talk to or when. It doesn't matter if they don't know what you're saying; the association alone is enough to blow your cover.

The metadata problem is common to a lot of platforms. Even Signal cannot use E2EE for metadata; they need to know who you're communicating with in order to deliver your messages to them. Signal doesn't retain that metadata, but ultimately you need to take their word on that.

[–] hersh 1 points 10 months ago

Any Safari extensions installed that might be interfering with this behavior? That's the best I can figure.

[–] hersh 13 points 10 months ago (4 children)

Bell Riots are coming this year. The Second American Civil War starts in 2026, which leads directly into WWIII.

From there, everything is pretty much terrible until warp drive is invented.

[–] hersh 7 points 10 months ago (12 children)

The lengths people will go to just to avoid learning how to properly cook vegetables...
