YTread Logo
YTread Logo

AI Generated Videos Just Changed Forever

Mar 06, 2024
- That's hot. (bright, upbeat music) - Okay, this is really awesome and really scary at the same time, and it's hitting me in a way I really wasn't expecting. Do you remember Will Smith eating spaghetti? Do you remember when this is what AI-

generated

videos

looked like? Remember when we said, "Okay, this AI thing is cool and all," but there's clearly a long way to go "before there's any need to worry." Well, welcome to the future because this is also an AI

generated

video. And so is this, completely synthesized from nothing by computers. This one too, this is not real.
ai generated videos just changed forever
It's absolutely ridiculous how far we've come in literally one year. This looks like another ChatGPT, DALL.E moment for AI. And maybe I'm exaggerating because, well, I'm a video creator, so an AI that's actually doing my job maybe feels a little more threatening, so I'm particularly impressed by that. But this is also really good. So today, Sam Altman and OpenAI announced a new model called Sora and it can generate entire video clips up to one minute

just

by entering text. In the same way that DALL.E was able to understand our text input and turn it into a photorealistic or stylized image or whatever you want, same with Sora but now since they are

videos

it also needs to understand how all these things like reflections and Textures, materials and physics interact with each other over time to create a reasonable looking video.
ai generated videos just changed forever

More Interesting Facts About,

ai generated videos just changed forever...

And of course, right away, there are a ton of examples on their website that are crazy. Now, before I show you this, I

just

need you to keep this in mind: You're about to watch a ton of AI-generated videos, and you know you're about to watch a ton of AI-generated content. So your brain is already looking for these things and it's not perfect, you'll find imperfections, but not everyone who sees AI-generated content on the internet knows they're looking for that. So keep that in mind too. This is also the worst this technology will be from now on.
ai generated videos just changed forever
Well, here you have one of the videos. None of these clips have audio, but the message of this one is an elegant woman walking down a Tokyo street filled with warm, bright, neon, animated city signs. She is wearing a black leather jacket, a long red dress and black boots. This video is already way ahead of where we were. It has precise lighting, it has materials, it has skin tones, movements, it even has reflections everywhere. Now, of course, if you look at it for more than 10 seconds, very closely, there are many clues. Like this guy in the background seems to glide in a strange way.
ai generated videos just changed forever
The frame rates and reflections in the water are, for some reason, lower than the rest of the video. The camera movement in general is a little inconsistent and, I don't know, feels a little out of place. But of course, this is where we were a year ago. So keep it in the back of your head for all of this. Well, how about this one? This is another one that has a long message about a camera following behind a white vintage SUV with a black roof rack as it speeds down a steep dirt road. This is also, again, very good.
It looks a little more video game due to how solid the drone footage is, but clearly very usable. Here's another one, a litter of golden retriever puppies playing in the snow. Their heads go in and out of the snow cover, it's so good. It feels like the physics of the fur and the ears and everything with the snow flying in slow motion is incredible. I went through all the sample videos on the OpenAI website and clearly these are the top picks they chose to share, where they just put some text in and then got a video and didn't modify it.
But there are some really impressive things there. Some have humans, others don't. Some of them are more realistic, like the truck that drives them, but others are more video game or more stylized. A lot of it is slow motion, I just have to say how incredibly fast these models are improving, like that's the shocking part. I don't remember even many months ago, DALL-E 3, really high-end, and you could always find something strange about it. Especially if you ask for something like a photorealistic image of a human being, something like hands or ears would always be a little off, no matter the physics.
But even this video here seems crazy at first glance. The message of this AI-generated video is a young man in his 20s sitting on a piece of cloud in the sky reading a book. This seems like 90% of the way to me. Like it's beyond the uncanny valley of Apple characters, which are actually based on humans. This is a made-up person. I mean, his eyes are a little strange and the movement of the pages of the book is a little strange. And yes, obviously he's in a cloud and that gives him away, but the lighting and the shadows and the skin tones and then all the realism of the textures on the shirt and the way the shirt moves and the pants and the hair, They are all really impressive.
And then for this one, they wrote a movie trailer that showed the 30-year-old astronaut's adventures in a red wool knitted motorcycle helmet, blue sky, salt desert, cinematic style, shot on 35 millimeter film. And the close-ups of his face, the fabrics on the helmet, the grain of the film in every shot, and the cinematic style, this is one of the most convincing AI-generated videos I've ever seen, minus perhaps the strange physics of that guy walking. in fast motion. So Sam Altman, if you follow him on Twitter, you'll see a lot more requests from similar people and he'll post a lot more generated videos.
And if you want to see his profile, you can see them. But here's the thing with these AI-generated videos now: no matter how good they have gotten to this point, they can and will pass as real videos to people who aren't looking for AI-generated videos. That's obviously incredibly sketchy during a US election year, and also scary for a lot of other Internet-related reasons, but it's also perfect for stock footage. Like there are already all kinds of presentations and ads and then PowerPoints that need weirdly specific stock videos. And these AI-generated videos are already good enough to be 100% approved for that purpose.
Look at this one, this one with the waves in Big Sur, this drone shot. Honestly, if I saw this on Twitter I wouldn't think twice. I'd be like, "Oh, nice drone shot, dude." You wouldn't even think about AI if you weren't looking at pixels like it's the way water moves. As if this were a totally usable video in an ad for some California-based product. And that has all kinds of implications for the drone pilot who no longer needs to be hired, for all the photographers and videographers whose images no longer need a license to appear in that advertisement that is being made.
It's already that good. There are other things like this wall of TVs, which would be something totally expensive and difficult to film with a camera and all these old expensive props, but if you can generate it this well with reflections and the environment and everything around it, I mean , why do it any other way? It is also very capable of recording historical-themed images. This was supposedly California during the gold rush. It's AI-generated, but it could pass for the opening scene of an old Western with the right music. How long will it be until an entire ad, every shot, is generated entirely with AI?
Or what about a full YouTube video or full movie? I'm tempted to say we're a long way from that because, you know, this thing is still clearly flawed and there's no sound, and there's a long way to go with rapid engineering to fix these things. But of course, the spaghetti was just like it was a year ago. Now, in fact, OpenAI, on their website, also shows some of the disadvantages of this particular model. And because who would know better than the people who have been using it? By the way, this is a very private tool at the moment.
It has super limited access, so it's in the hands of red team members, which basically means people testing it, pushing the limits, trying to break it, and some trusted creators. But they have found many strange things. Like this clip here of a group of gray wolf pups that look normal at first, but then it's pretty clear that something is a little wrong with the way they appear out of nowhere and pass each other. That's a little weird. Or this clip of a guy running on a treadmill, which I mean, I don't really have to say much more about why this one is weird.
But this is my favorite, once again, so try putting yourself in the mind of someone who doesn't expect AI. You're just browsing Facebook or Twitter or something, right? So you just watched this video. First, I just want you to watch this clip as if it were a stock video you found of a grandmother celebrating her birthday. And she just tries to think, I wonder what birthday she's celebrating, right? I don't know, how old do you think she is? 60? Sixty-five? Maybe it's the great '70s. He seems to really like that cake. Now, did you see it?
Did you catch that? I'll play it again, but this time, watch the video knowing that AI-generated photos and videos have trouble rendering hands accurately. I'll play it again. And now it feels super obvious, like every time you look at it, you look at a different pair of hands, it gets weirder and weirder. You can watch it like five times and there is evidence after evidence, not to mention the strange inconsistencies with the wind direction in the sails. But even as I say all that, even as it comes out of my mouth, I can't help but remember that 12 months ago we were criticizing this. (Will laughs) So what does it all mean?
Well, I mean, that's what it means now and what it means for the future. Now, Sora, this thing you've created is clearly a really impressive video generation AI tool that will fool people and also be very useful. There is also a watermark in the bottom corner of every video generated by it. So if you watch one of those videos and ideally it hasn't been cropped, then that's at least a pretty clear indicator that it was AI-generated. It's a Sora video. But I also think they will have to be very careful with this, they will have to take into account a lot of security things.
I think they'll probably have to be even safer than DALL.E. Like you shouldn't be able to generate images of people. You shouldn't be able to make a politician look like they're doing something on video, especially this year. You probably won't be able to make Will Smith eat spaghetti, but that also means that stock video generation will definitely impact video licensing. I can basically guarantee that. Logistically, why would someone who earns something pay for images of a cliff house when he can generate one for free or for a small subscription fee? That's the really scary part of what this tool entails.
But in the future, everything becomes quite existential. I mean, okay, if this applies to every video that's ever been made by humans, then surely it can't be innovative or creative in ways that humans haven't already been, right? I don't know. Either way, I'll have all the links below for all the Sora stuff, for the OpenAI stuff, and I guess I'll talk to you next year when we look back and say, "Remember that first version of Sora" and how bad . Did those wolf cubs look like "when they appeared out of nowhere?" Just remember, this is the worst this technology will be from now on.
Thanks for watching. See you next time, peace. (bright and happy music)

If you have any copyright issue, please Contact