this post was submitted on 19 Apr 2024
292 points (95.9% liked)

Technology

59235 readers
3429 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] ptz@dubvee.org 95 points 6 months ago (1 children)

Also Microsoft.....

Microsoft warns deepfake election subversion is disturbingly easy

I know the genie's out of the bottle, but goddamn.

[–] Etterra@lemmy.world 18 points 6 months ago

Microsoft: I know this will only be used for evil, but I'll be damned if I'm gonna pass up on the hype-boost to my market share.

Every other big corp: same!

[–] slaacaa@lemmy.world 85 points 6 months ago

“At long last, we have created the Torment Nexus from classic sci-fi novel Don't Create The Torment Nexus”

[–] brown567@sh.itjust.works 47 points 6 months ago (2 children)

Can we maybe stop making these? XD

[–] NoRodent@lemmy.world 60 points 6 months ago (1 children)
[–] ThePantser@lemmy.world 24 points 6 months ago (1 children)

This coming from the guy who turned himself into a fly for fun

[–] SatansMaggotyCumFart@lemmy.world 11 points 6 months ago (1 children)

It’s not his fault earth girls are easy.

load more comments (1 replies)
[–] MysticKetchup@lemmy.world 21 points 6 months ago (6 children)

Like what even is a legitimate use case for these? It just seems tailor made for either misinformation or pointless memes, neither of which seem like a good sales pitch

[–] Deceptichum@sh.itjust.works 7 points 6 months ago* (last edited 6 months ago) (3 children)

I could see a few uses, but the biggest would probably be advertising. Tailored ads that look like they’re coming from a real person.

Imagine Jake from State Farm addressing you personally about your insurance in an ad.

Not that I endorse advertising, I’d like to see it all banned.

I think it could be useful to humanise some things though and talking to a “person” AI in a video call might be more comfortable for some people wanting to do tasks such as say navigate my mobile phone carriers shitty AI help system.

Really any sort of AI assistant device could benefit from a human imprint.

[–] SatansMaggotyCumFart@lemmy.world 11 points 6 months ago

Imagine your dead relative selling you extended warranty for your vehicle.

load more comments (2 replies)
[–] chiisana@lemmy.chiisana.net 4 points 6 months ago (4 children)

Say you’re a movie studio director making the next big movie with some big name celebs. Filming is in progress, and one of the actor dies in the most on brand way possible. Everyone decides that the film must be finished to honor the actor’s legacy, but how can you film someone who is dead? This technology would enable you to create footage the VFX team can use to lay over top of stand-in actor’s face and provide a better experience for your audience.

I’m sure there are other uses, but this one pops to mind as a very legitimate use case that could’ve benefited from the technology.

[–] catloaf@lemm.ee 11 points 6 months ago (1 children)

how can you film someone who is dead?

Hot take: don't? They're dead, leave them dead. Rewrite and reshoot if you really have to.

[–] chiisana@lemmy.chiisana.net 6 points 6 months ago

Sure that’s an entirely valid option; but not the one the producing team and the deceased’s family opted for… and they had a much larger say in it than you and I combined.

[–] MysticKetchup@lemmy.world 5 points 6 months ago (3 children)

We've already recreated dead actors or older actors whole cloth with VFX. Plus it still seems like a niche use case for something that can be done by VFX artists that can also do way more

load more comments (3 replies)
[–] Pheonixdown@lemm.ee 4 points 6 months ago

Gotta crank up that dystopia meter.

This is slowly moving toward having Content On Demand. Imagine being able to prompt your content app for a movie/series you want to watch, and it just makes it and streams it to you.

[–] ramirezmike@programming.dev 4 points 6 months ago

this is so dystopian. Imagine spending your career honing your skill as an actor, dying and then having a computer replace you with just a photograph as a source. How is that honoring an actor??

An actual, practical example is generating video for VR chats like Apple has somewhat tried to do with their headset. Rather than using the cameras/sensors to generate and animate a 3d model based on you, it could do something more like this, albeit 2d.

[–] frezik@midwest.social 3 points 6 months ago

Maybe a historical biopic in the style of photos of the time. Like take pictures of Lincoln, Grant, Lee, etc., use voice actors plus modern reenactors for background characters, and build it into a whole movie.

I dunno, I'm probably reaching.

load more comments (2 replies)
[–] dhork@lemmy.world 41 points 6 months ago (3 children)

Vasa? Like, the Swedish ship that sank 10 minutes after it was launched? Who named that project?

[–] Jimmycakes@lemmy.world 10 points 6 months ago

They developed an ai to name all future ai. Ironically it is unnamed.

[–] dumbass@leminal.space 5 points 6 months ago

There are a lot of flying vehicles named after birds who famously plummet to the ground at breakneck speeds.

[–] hakunawazo@lemmy.world 3 points 6 months ago

No, like the crispbread.

[–] redcalcium@lemmy.institute 31 points 6 months ago (4 children)

Combine this with an LLM with speech-to-text input and we could create a talking paintings like in harry potter movies. Heck, hang it on a door and hook it with smart lock to recreate the dorm doors in harry potter and see if people can trick it to open the door.

[–] NotMyOldRedditName@lemmy.world 29 points 6 months ago (1 children)

Any sufficiently advanced technology is indistinguishable from magic.

Harry Potter wasn't a fantasy movie, it was a SciFi and we just didn't know it.

[–] venoft@lemmy.world 14 points 6 months ago (1 children)

It was midichlorians all along.

[–] dumbass@leminal.space 12 points 6 months ago (1 children)
[–] popekingjoe@lemmy.world 11 points 6 months ago
[–] FlyingSquid@lemmy.world 4 points 6 months ago

I like your optimism where this doesn't result in making everything worse.

load more comments (2 replies)
[–] ArmoredThirteen@lemmy.ml 30 points 6 months ago (2 children)

These vids are just off enough that I think doing a bunch of mushrooms and watching them would be a deeply haunting experience

[–] Dozzi92@lemmy.world 5 points 6 months ago* (last edited 6 months ago) (1 children)
load more comments (1 replies)
load more comments (1 replies)
[–] Maeve@kbin.social 25 points 6 months ago (3 children)

A long time ago, someone from a not free country wrote a white paper on why we should care about privacy, because written words can be edited to level false accusations (charges) with false evidence. This chills me to the bone.

[–] Kiosade@lemmy.ca 7 points 6 months ago

This is turning into some Mistborn shit. “Don’t trust writing not written on metal”

[–] Sanctus@lemmy.dbzer0.com 6 points 6 months ago

"You shot that man, citizen. Here is video evidence. Put your hands against the wall." - and more coming to you soon!

load more comments (1 replies)
[–] MeekerThanBeaker@lemmy.world 24 points 6 months ago (1 children)

This is why I don't post my picture online and I never talk to anyone ever, while hiding my head inside a nylon stocking (unrelated).

[–] werefreeatlast@lemmy.world 19 points 6 months ago

Freddie, this is your mom. Look all I want for my birthday is for you to please start using teams new. It's so much better than teams classic. I alread... Microsoft already installed it for you. Okay honey? And could you also start using a microsoft.com account so you can get financially hooked like all the Gmail users? It's pretty smart. Don't you want to be smart like Jonny? Tata!

[–] antlion@lemmy.dbzer0.com 14 points 6 months ago

Since it’s trained on celebrities, can it do ugly people or would it try to make them prettier in animation?

The teeth change sizes, which is kinda weird, but probably fixable.

It’s not too hard to notice for an up close face shot, but if it was farther away it might be hard - the intonation and facial expressions are spot on. They should use this to re-do all the digital faces in Star Wars.

[–] venusaur@lemmy.world 12 points 6 months ago (1 children)

One photo? That’s incredible.

[–] clark@midwest.social 16 points 6 months ago (1 children)

Yeah. Incredibly horrific.

[–] T00l_shed@lemmy.world 3 points 6 months ago

Yes I hate what AI is becoming capable of. Last year everyone was laughing at the shitty fingers, but were quickly moving past that. I'm concerned that in the near future it will be hard to tell truth from fiction.

[–] BetaDoggo_@lemmy.world 10 points 6 months ago (1 children)

The "why would they make this" people don't understand how important this type of research is. It's important to show what's possible so that we can be ready for it. There are many bad actors already pursuing similar tools if they don't have them already. The worst case is being blindsided by something not seen before.

load more comments (1 replies)
[–] AnAnonymous@lemm.ee 9 points 6 months ago

Paranoia vibes starting in 3, 2, 1..

[–] I_Miss_Daniel@lemmy.world 5 points 6 months ago

Feed it Microsoft Merlin. What will happen?

[–] simplejack@lemmy.world 5 points 6 months ago

Microsoft’s research teams always makes some pretty crazy stuff. The problem with Microsoft is that they absolutely suck at translating their lab work into consumer products. Their labs publications are an amazing archive of shit that MS couldn’t get out the door properly or on time. Example - multitouch gesture UIs.

As interesting as this is, I’ll bet MS just ends up using some tech that Open AI launches before MS’s bureaucratic product team can get their shit together.

[–] thefartographer@lemm.ee 5 points 6 months ago (1 children)

The pores don't stretch, but the teeth and irises sure do!

[–] T00l_shed@lemmy.world 2 points 6 months ago

I'm sure they will fix that before you know it.

[–] Dasus@lemmy.world 2 points 6 months ago* (last edited 6 months ago) (1 children)

One use of this I'm in favour of is recreating Majel Barret's voice as an AI for computer systems.

[–] kromem@lemmy.world 6 points 6 months ago (2 children)

This project doesn't recreate or simulate voices at all.

It takes a still photograph and created a lip synched video of that person saying the paired full audio clip.

There's other projects that simulate voices.

load more comments (2 replies)
[–] autotldr@lemmings.world 2 points 6 months ago

This is the best summary I could come up with:


On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track.

In the future, it could power virtual avatars that render locally and don't require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

To show off the model, Microsoft created a VASA-1 research page featuring many sample videos of the tool in action, including people singing and speaking in sync with pre-recorded audio tracks.

The examples also include some more fanciful generations, such as Mona Lisa rapping to an audio track of Anne Hathaway performing a "Paparazzi" song on Conan O'Brien.

While the Microsoft researchers tout potential positive applications like enhancing educational equity, improving accessibility, and providing therapeutic companionship, the technology could also easily be misused.

"We are opposed to any behavior to create misleading or harmful contents of real persons, and are interested in applying our technique for advancing forgery detection," write the researchers.


The original article contains 797 words, the summary contains 183 words. Saved 77%. I'm a bot and I'm open source!

load more comments
view more: next ›