AI expert Sasha Luccioni rates artificial intelligence scenes in "Iron Man," "Terminator 2: Judgment Day," and "Westworld."
Transcript
00:00I definitely laughed really hard when they started smashing all their
00:11equipment because the vast majority of AI models nowadays don't actually run
00:15on your device. My name is Sasha Luccioni and I'm an AI researcher at
00:20Hugging Face, a global startup that works on responsible AI. My research is really
00:24around evaluating the ethical and environmental impacts of AI models.
00:29Today we're going to be looking at AI scenes in movies and TV shows and judging
00:34how real they are. Jarvis, you there? At your service. Engage heads-up display. Check.
00:40Chatbots right now are more like, you know, Siri can set an alarm for you or
00:44Alexa can add, I don't know, chocolate milk to your grocery list, but we are far, far
00:49from Jarvis. Sadly, that would be really great though. I think the real challenge
00:54of that would be to take all these different sources of information, so
00:58Tony Stark's history and what he likes and his habits, and if you start
01:02integrating different sources of information, it's really hard to
01:05essentially weave them together because currently AI is not very good at taking
01:10different sources of input and stitching it together and responding to it in real
01:14time. There are still terabytes of calculations needed before an actual
01:18flight is... Jarvis, sometimes you got to run before you can walk. So the way it
01:23usually works now for AI and aviation, it's really responding to, for example,
01:28GPS coordinates, altitude, actually wind speeds and wind directions as well. You
01:34would have to pre-train the system, so just, you know, going out for a test
01:37flight like Tony Stark did would be pretty dangerous because if there's no
01:41data about that location, if there's no, like, topology data of the city of Los
01:45Angeles, you can be in big trouble.
01:53Autopilots and planes use a lot of artificial intelligence, especially when
01:58they're cruising, especially, you know, when things are pretty stable.
02:07If there's, like, really a life-or-death decision, like is the case here, ice
02:11buildup, we want to make sure that we're listening to the AI instead of just, like,
02:14yoloing our way through the sky. Most of the AI that's used on planes is human in
02:18the loop. Human in the loop is a type of AI system where humans are really
02:22involved in the decision-making, and it essentially makes sure that no
02:26particularly damaging or particularly high-stakes decisions get made without a
02:31human getting involved in the process somehow. It's really important to have a
02:35human pilot call the shots even when the sensors are freezing up, and that's why
02:39we always have, you know, two pilots in planes because AI isn't really good with
02:44new situations it's never been trained with, so you always want to have someone
02:47to pick up, you know, in case of a mistake. I would give it a four because the
02:52elements are there, so we definitely have AI assistance, we definitely have AI
02:55autopilot, but it's not at the level of Iron Man.
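The human-in-the-loop idea described above can be illustrated with a minimal Python sketch. The `Action` class, the `risk` score, and the `risk_threshold` value are all hypothetical names chosen for illustration; real avionics software is far more involved than this.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Action:
    name: str
    risk: float  # hypothetical risk score between 0 and 1


def decide(action: Action,
           approve: Callable[[Action], bool],
           risk_threshold: float = 0.5) -> str:
    """Run low-risk actions automatically; hand high-risk ones to a human."""
    if action.risk < risk_threshold:
        return "auto-executed"
    # High-stakes decision: the human reviewer has the final say.
    if approve(action):
        return "executed after human approval"
    return "rejected by human"


# Stand-in for a pilot who declines every risky request.
def pilot_says_no(action: Action) -> bool:
    return False


print(decide(Action("hold altitude", 0.1), pilot_says_no))
print(decide(Action("climb through icing", 0.9), pilot_says_no))
```

The point of the design is that the automated path never reaches a high-stakes action on its own: anything at or above the threshold has to pass through the `approve` callback first.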
03:05That's actually something that's done right now. I mean, every time you go
03:10through a toll booth, for example, in the United States or Canada, it will read
03:13your license plate. It could also, like, figure out what the make of your car
03:18is, and we've gotten really good at object recognition, especially in terms
03:22of, like, cars and street scenes, mostly because of autonomous vehicles.
03:34Estimating someone's height or, like, approximate weight is definitely
03:39something that people use AI for, like, I would say, like, in, like, CCTV
03:43surveillance, like, especially in, you know, if you're trying to identify
03:47someone, like, a perpetrator, this is something that AI is trained to do.
03:52Whether it does it super well will depend, essentially, like, for example, how, like,
03:57if the person's wearing baggy clothing, whether you have objects around the
04:02person, you're trying to figure out, like, how tall someone is. It's really hard if
04:05they're just, like, in a bare room where you have no objects that you can use to
04:09compare and to triangulate, especially if you have a person who's beside a car or,
04:13like, beside a pool table, then you can really say, well, it's probably, like, this
04:17height. It would be really hard for AI to do hand-to-hand combat in a way that's
04:27really, like, reactive. There's an element of trying to predict what someone will
04:31do and reacting quite quickly, and AI is typically not very good at predicting
04:35what people will do, because we're so unpredictable and we're so spontaneous.
04:38You know, as human beings, we're actually, we do this almost subconsciously, like,
04:41if you're, if you had to, you know, throw something at someone who was running, you
04:45would tend to kind of throw it at where you think they would be in a second or
04:49two, but for an AI, they would have to, like, predict the trajectory of someone's
04:53movement and then grab them at that time. I would rate this clip an 8. This movie
04:56is definitely ahead of its time, and I think it actually came to shape a lot of
05:00the AI research that was done in the next decades.
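The "throw it at where you think they would be" idea mentioned above is, in its simplest form, a linear extrapolation. Here is a rough Python sketch that assumes the target keeps a constant velocity, which is exactly the assumption that breaks down with unpredictable humans. The function name and the numbers are made up for illustration.

```python
def predict_position(pos, velocity, seconds_ahead):
    """Linear extrapolation: assume the target keeps its current velocity."""
    x, y = pos
    vx, vy = velocity
    return (x + vx * seconds_ahead, y + vy * seconds_ahead)


runner_pos = (0.0, 0.0)       # metres
runner_velocity = (3.0, 0.0)  # metres per second
aim_point = predict_position(runner_pos, runner_velocity, 1.5)
print(aim_point)  # (4.5, 0.0)
```

If the runner changes direction during those 1.5 seconds, the aim point is simply wrong, which is why reactive, real-time prediction of people is so much harder than this sketch suggests.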
05:03Where am I going? Come on, come on.
05:04Standby, standby!
05:05I'm losing picture again!
05:07Looking for another. The entity is knocking off satellites faster than I can hack into them.
05:12So the villain in the recent Mission Impossible movie is an omnipotent,
05:16omniscient entity that's an AI algorithm that can hack satellites, that can
05:21predict human behavior, that can clone voices. AI can definitely knock out
05:25satellites if it's being used by a human, so it's kind of like a computer virus,
05:30right? Once you plant the virus, it can do all sorts of things, and actually, AI
05:34is being used in hacking, it's being used in kind of, like, cyber attacks, but it's
05:38not, like, once again, the agency doesn't come from the AI, it comes from the
05:42person who's gonna choose the place to deploy it, or, like, the type of satellite
05:46to target.
05:48Benji, I don't see her. Where is she?
05:50Down the narrow alley and turn left.
05:51The voice cloning was spot-on. That can definitely be done already right now.
05:56So, essentially, how AI voice generation works is that if you have enough audio of
06:01someone talking, it will learn, like, the actual voice frequencies, the actual, like,
06:05audio frequencies of someone's voice and the way they talk. And in this case, it
06:09really seemed like it was taking words that Benji already said and just, like,
06:12shuffling them around or just playing them back without actually doing the
06:16modeling part.
06:17Ethan, our comms have been breached. You're talking to the entity.
06:20Turn right. Take the bridge to your left.
06:22Ethan, that is not me!
06:24AI voice duplication has gotten so good. I remember we had a case, late 2023, where
06:30someone imitated the voice of the mayor of London saying some, like, super racist
06:35remarks. People reached out to us saying, like, can you verify if this is really the
06:40mayor of London speaking or someone spoofing his voice? And we were listening to
06:44those clips obsessively, and it's actually super, super hard to tell. And especially
06:47for a public figure like a mayor, there's enough of his voice data out there that
06:52you don't even need to, like, hack into anything. You can just use the clips that
06:56are out there, use his speeches.
07:01I definitely laughed really hard when they started smashing all their equipment
07:05because the vast majority of AI models nowadays don't actually run on your
07:11device, on your laptop, on your phone.
07:13I give this clip a three because of the agency issue. So AI is only a tool and it
07:18can be used for good, it could be used for bad, but by itself, it's not going to
07:22wake up and start hacking satellites.
07:32Having so many drones in one spot and being able to, like, make them all function
07:37somehow together for me is really hard to believe because, I mean, currently drones
07:42usually work either, like, by themselves, kind of high in the sky in terms of
07:47warfare. Or, you know, sometimes they will have synchronized drones, but more like
07:51the tiny little ones for, like, the firework type displays. But actually having, like,
07:55military drones in such close quarters and not, like, shooting at each other, I think
08:01it's a really big technical challenge.
08:12Tony Stark's glasses allow anyone using them to control the AI systems that he
08:16created. That includes the defense satellites and the combat drones that you see in
08:21this clip. You could use smart glasses to control drones if you were talking to your
08:25glasses, but, you know, with all the noise that is in the clip,
08:29like, you have to be really sure that, you know, that's what he said and that's the
08:32instruction that he gave because there's always some amount of interference.
08:35I feel like tech companies have been trying to make smart glasses a thing for, like, the
08:39last decade. But currently they're so clunky and not particularly user friendly.
08:44So, you know, once again, if you wanted someone to be able to control satellites or
08:47drones with them, like, they would need to be hooked up to your brain.
08:50And we do not have that level of connection between, you know, neural links and
08:55smart wearables right now.
08:57Like, we're not there yet.
08:59I would rate this clip like a two for realism.
09:01If you knew the trouble I had getting an AI to read and duplicate facial expressions.
09:07So I turned on every microphone and camera across the entire planet.
09:11Getting everyone's cell phone data would be a really hard hack to do.
09:16So I would hope that our data is, well, better protected than
09:21that, let's say. But of course, like, I mean, you do hear of cases of microphones being
09:26switched on when someone's phone is on.
09:29What's your favorite color?
09:31Red.
09:32Why?
09:33Then what is my favorite color?
09:35I don't know.
09:36So there are AIs that are purported to detect if people are lying and to detect their
09:41emotions. The only ones I would tend to trust to a certain
09:47extent would be things like lie detector tests, which already exist, but that really are
09:52based on heartbeats and like and how stressed people are getting really in the
09:56physiological sense, like how stressed our body is becoming.
09:59But anything that's based on just video, for example, of someone talking into a camera,
10:04I really wouldn't trust that.
10:05For example, if it was trained on Caleb's data, it could tell
10:08that Caleb is lying, but it couldn't read his mind, essentially.
10:12And in that case, you'd need, like, direct access to someone's brain.
10:15I would rate this a four because the whole consciousness aspect and the aspect
10:20of detecting a lie just based on, you know, like a single word answer
10:26is really hard to believe.
10:29I've run a brothel for 10 years, and if there's one thing I know, it's when I'm being...
10:33Westworld is pretty realistic in terms of AI.
10:37And here they're using a dialogue tree, which isn't really used that much anymore.
10:41You could like program chatbots to actually follow essentially like a decision tree
10:45based on, you know, inputs.
10:47But they're really brittle.
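To make the brittleness concrete, here is a minimal Python sketch of a dialogue tree. The tree contents, node names, and the exact-match rule are invented for illustration and are not taken from the show or from any real chatbot product.

```python
# Each node has a prompt and a set of keyword branches to other nodes.
DIALOGUE_TREE = {
    "start": {
        "prompt": "How can I help you?",
        "branches": {"lost card": "lost_card", "find atm": "find_atm"},
    },
    "lost_card": {"prompt": "I have blocked your card.", "branches": {}},
    "find_atm": {"prompt": "The nearest ATM is on Main Street.", "branches": {}},
}


def respond(node, user_input):
    """Follow the branch whose keyword matches the input exactly, else fail."""
    branches = DIALOGUE_TREE[node]["branches"]
    next_node = branches.get(user_input.strip().lower())
    if next_node is None:
        return "Sorry, I did not understand that."
    return DIALOGUE_TREE[next_node]["prompt"]


print(respond("start", "lost card"))           # expected phrasing works
print(respond("start", "my card is missing"))  # a paraphrase breaks it
```

The second call fails even though a person would treat the two requests as identical, which is the kind of breakage that comes from hard-coding every path in advance.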
10:48Even back then, this was like IBM Watson days, like there were limitations, even in
10:52specific cases. We were working on like chatbots for when you lost your credit card, and
10:56they would break quite quickly.
10:57So they were already being phased out.
10:59And nowadays, with the modern day chatbots that are based on large language models, they
11:03don't use this kind of like schematic, deterministic way
11:07of planning dialogue at all.
11:09They're actually based on probabilities and predicting next words.
11:12If ChatGPT and other dialogue systems sound or look so realistic,
11:17it's because they're trained on essentially billions and billions of words.
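As a toy illustration of "probabilities and predicting next words", here is a tiny bigram model in Python. It is only a sketch: large language models use neural networks trained on vastly more text, and the three-sentence corpus and function names below are made up for the example.

```python
import random
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every word, which words follow it and how often."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def next_word_probabilities(counts, word):
    """Turn the follower counts for `word` into probabilities."""
    followers = counts.get(word, Counter())
    total = sum(followers.values())
    return {w: c / total for w, c in followers.items()}


def sample_next(counts, word, rng):
    """Pick a next word at random, weighted by how often it was seen.

    Raises IndexError if `word` never appeared in the training text.
    """
    probs = next_word_probabilities(counts, word)
    return rng.choices(list(probs), weights=list(probs.values()))[0]


corpus = "i lost my card . i lost my phone . i found my card ."
model = train_bigrams(corpus)
print(next_word_probabilities(model, "my"))
print(sample_next(model, "my", random.Random(0)))
```

Nothing here follows a scripted path: the reply depends on which continuations were common in the training text, so "card" comes out more often than "phone" after "my".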
11:20So it's kind of cute because it's like taking like an OG AI technique and
11:24using it in like a very, very forward thinking, very futuristic context.
11:30You can't, you can't, I can't.
11:36AI can definitely improvise in the sense that it's no longer hard coded like the
11:41dialogue tree that Maeve had.
11:42So it can improvise in the realm of what it was trained on.
11:47You train an AI model on customer service logs and you have people losing their credit
11:52cards and, you know, asking where the nearest ATM is.
11:55Like it will do fairly well, even if you use words that weren't exactly the same ones as
11:59what it was trained on. But if you start asking it, like, what is the meaning of life?
12:02They will answer things that make no sense.
12:05Essentially, that's when you know that your system is malfunctioning.
12:15I think that the chances that AI drones will mess up are definitely too high for them to
12:20be used in warfare.
12:21If I was in charge, we wouldn't be using AI drones because I've seen the way AI
12:26technologies fail spectacularly.
12:28And also for me, using AI to actually like make the decision of who to kill is
12:35completely unacceptable from a moral and ethical perspective.
12:44Probably what the drone is doing is some form of facial recognition, which drones are
12:49able to do. Once again, like if there's another person moving around or also like I
12:54don't know how people didn't notice it because like it was obviously flying around.
12:58I feel like an AI sniper could do a good job from far away, but in like real time,
13:03right, like detecting a target and shooting the target.
13:05But doing that kind of like reconnaissance with a drone and then following up with a
13:09sniper, like I feel that that's not very realistic.
13:12It's really hard for AI to predict the future in a meaningful way.
13:16So, for example, once you have that drone that like mapped out where people were, say
13:20that they're moving around in that space.
13:22So for me, like the plausibility is like how would the bullet or the missile go from
13:27where the drone saw the person to be five minutes ago to where the person is if they
13:32move? But there's like some piece of the puzzle that's missing for me for those like
13:36two steps. First, the reconnaissance and then the shooting to take place.
13:39The first part is very plausible in terms of technology.
13:42So dialogue trees are definitely a thing and were definitely used.
13:46But the second part of like the drone controlled bullets for me is just out of this
13:51realm, out of our galaxy, honestly.
13:53So I guess on an average, it would be a six.
14:02Typically, we say swarm intelligence when it's multiple robots, it can be any kind of
14:08technology that will coordinate together and will be able to cover more ground or scan
14:14more items. And there's a branch of AI called planning.
14:17And so, for example, if you have one robot, you know, you have to do all of the corners
14:21of a room or a building.
14:22But if you have multiple robots, then you can like use planning in order to dispatch
14:26them in the most optimal way possible so that you spend less time searching.
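The dispatching idea can be sketched in Python with a simple greedy heuristic: always hand the longest remaining room to the robot with the least work so far. This is an assumption-laden toy, not how any real planner is confirmed to work; the room names and search times are invented, and the greedy rule is not guaranteed to find the best possible plan in every case.

```python
import heapq


def dispatch(room_search_minutes, robot_count):
    """Greedy plan: give the next-longest room to the least-loaded robot."""
    # Heap entries are (total minutes assigned so far, robot index).
    loads = [(0, robot) for robot in range(robot_count)]
    heapq.heapify(loads)
    plan = {robot: [] for robot in range(robot_count)}
    rooms_longest_first = sorted(
        room_search_minutes.items(), key=lambda item: -item[1]
    )
    for room, minutes in rooms_longest_first:
        load, robot = heapq.heappop(loads)
        plan[robot].append(room)
        heapq.heappush(loads, (load + minutes, robot))
    # The search is over when the busiest robot finishes.
    finish_time = max(load for load, _ in loads)
    return plan, finish_time


rooms = {"lobby": 8, "lab": 6, "office": 4, "storage": 2}
print(dispatch(rooms, 1))  # one robot has to cover everything
print(dispatch(rooms, 2))  # two robots split the work
```

With these made-up numbers, one robot needs 20 minutes while two robots finish in 10, which is the "spend less time searching" effect described above.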
14:33I got an ID. It's not him.
14:35I think that Minority Report planted a seed in people's brain that you can predict
14:40crime. There are lots of people who are working on predictive policing, for example,
14:45and these systems get sold to precincts across the world as a way of actually, you know,
14:51anticipating crimes before they happen.
14:54Everything from robots to actual like allocations of police resources to certain areas
14:59because an AI model told them that that's where the crime was going to happen.
15:03And commercial predictive policing systems were like one percent accurate
15:08when they're applied in the real world.
15:11I would give this one a seven.
15:13Keep walking.
15:16And stop. I figured you were hungry.
15:19Her is definitely a projection of where I think a lot of people would want AI to be.
15:24And I think that's why many people love that movie so much.
15:26And actually, you know, it keeps coming back as an example of like an AI girlfriend or
15:33companion that's able to read emotions and anticipate needs definitely in a way that's
15:39currently impossible, like especially, for example, like, oh, I figured you were hungry.
15:45I'm wondering how that kind of anticipation can be done, because even as a human, you
15:48know, I don't know if we're able to anticipate someone's hunger just by looking at them.
15:53His name is Alan Watts.
15:54Do you know him? He was a philosopher.
15:56He died in the 1970s and they input all of his writing and everything they ever knew
16:00about him into an OS and created an artificially hyper intelligent version of him.
16:04Very nice to meet you, Theodore.
16:06That definitely sounds like something complex to do.
16:08Like you could definitely make a limited version of someone who's passed away based
16:15on, for example, interviews with that person or writing from that person.
16:19So, for example, like Abraham Lincoln or Ruth Bader Ginsburg.
16:22But once again, it would be like a really limited representation of them.
16:26It would be like the legal texts they wrote or like whatever the laws that they put
16:31forth, putting any kind of consent issues aside.
16:34But making any kind of like virtual clone of someone is never going to be like their full
16:37persona. It's always going to be like a little sliver of who they actually are.
16:41Scraping data from the Internet, right, like is used for training all these AI language
16:46models. And to what extent that's ethical or to what extent consent even is part of
16:52the picture for living people is still under debate.
16:54So I feel that for dead people it's like a next level that we haven't even gotten to yet.
16:59I would rate this around a five.
17:02So I think it's halfway there in terms of the capabilities.
17:06But of course, they're jacked up.
17:08We'd have to cut his higher brain functions without disturbing the purely automatic and
17:14regulatory systems.
17:15Currently, the way that AI systems are made, you can't really distinguish between different
17:20functionalities.
17:21We can't really meaningfully know what part of an AI model is doing what thing, its
17:27ability to generate poetry and not like answer questions like that's all done under the same
17:33hood.
17:39There are applications of AI that try to do lip reading, especially for assistance more
17:45than surveillance.
17:46From what I've seen, you really need to be facing the camera.
17:49You need to be speaking relatively slowly.
17:52Anytime there's like a beard or, you know, a face mask, for example, then
17:57it's obviously impossible.
17:59And so I think that in this case, because they're speaking from the side, it would be
18:02really hard to use any of the techniques that we currently have for lip reading because
18:05you don't really see them speaking at the camera.
18:08I'm afraid my mind is going.
18:13The fact that at the end he's like unplugging the discs or whatever, like we don't do that
18:18anymore. I mean, I guess it's not implausible as much as very
18:21old fashioned. There's different levels of self-awareness when it comes to a machine or a
18:26computer. HAL can definitely be aware of its drives being unplugged, like the physical
18:33absence of a drive that was connected before that's being disconnected now.
18:36So that's definitely a form of awareness.
18:39It's a form of, you know, interpreting physical knowledge and acting upon that.
18:43But whether an AI will be able to associate a disc being unplugged to death or not
18:50existing anymore, that's really more like the metaphysical property that is a little bit
18:54less clear to me.
18:56I would rate this clip like a two because we're not there yet in terms of lip reading and
19:02we're definitely not there yet in terms of self-awareness.
19:05My favorite scene was the dialogue tree scene from Westworld because we see real AI
19:11techniques with the names as well, which is really rare in movies that you actually see
19:15something being done and then an explanation of what that exact technique is.