Open AI Quietly Released a Better ChatGPT Version Surprising Users

  • last month
Open AI recently upgraded ChatGPT with a new version called GPT-4o latest making It faster and more accurate,though they announced it quietly.
Transcript
00:00If you've been feeling like your AI buddy's been acting a bit different lately, maybe
00:06quicker, sharper, and just a tad smarter, you're not alone.
00:10OpenAI has been sneaky, rolling out some major changes without a big announcement.
00:15But don't worry, I've got all the details you need to know right here.
00:18So let's talk about it.
00:20Last week, I started noticing that ChatGPT felt different.
00:23It was like the responses were more on point, faster, and just generally better.
00:28I wasn't the only one either.
00:30People all over social media were talking about how ChatGPT seemed to be upgraded.
00:34But here's the thing, OpenAI didn't say a word about it at first.
00:38It was all very hush-hush until they finally dropped a little bombshell on us.
00:42OpenAI took to X to casually mention that they'd slipped in a new version of their
00:48GPT 4.0 model into ChatGPT.
00:51So they just updated the model we've all been using without making a big deal about
00:55it.
00:56The message was simple.
00:57We've had the GPT 4.0 model out in ChatGPT since last week.
01:00Hope you all are enjoying it and check it out if you haven't.
01:03We think you'll like it.
01:04That's it.
01:05No fancy press release.
01:06No grand unveiling.
01:07Just a tweet.
01:08Typical OpenAI, right?
01:09Now, if you're wondering what's so special about this new model, let's break it down.
01:13The updated version of GPT 4.0, which they're calling ChatGPT 4.0 Latest, is essentially
01:19a fine-tuned and optimized version of what we had before.
01:23But here's where it gets interesting.
01:24While OpenAI hasn't spilled all the beans, there's a lot of speculation about what
01:28this new model actually is.
01:31Some people out there are thinking this might be part of a bigger strategy by OpenAI to
01:36release different sized models, kind of like what Google and Anthropic are doing.
01:41There's been talk about a GPT 4.0 Large and some think this latest update could be
01:47a stepping stone in that direction.
01:49But I'm not totally sold on that idea because let's be real, if it were a brand new model,
01:53they probably would have hyped it up a lot more.
01:55So what can this new model do?
01:57Well, from what I've seen and what others have reported, it's performing better on tasks
02:01that require complex reasoning and creativity.
02:04Like if you've been asking ChatGPT to help with coding or solve tricky problems, you
02:08might've noticed it's just a little bit sharper now.
02:11It's also faster, which is a nice bonus, but of course it's not perfect.
02:15There are still some weird quirks.
02:17For example, in one test, the model was asked to stack a book, nine eggs, a laptop, a bottle,
02:23and a nail in a stable manner.
02:25The solution, it suggested putting nine eggs on top of a bottle.
02:28I mean, come on, who does that?
02:30And then when it was asked how many R's are in the word strawberry, it came back with
02:35two, which is definitely wrong.
02:37So yeah, there are still some bugs to work out, but overall the update is a step in the
02:41right direction.
02:42Now, talking about strawberry, let's talk about something that's been generating a lot
02:46of hype, Project Strawberry.
02:48The idea behind Project Strawberry is that it could be a new post-training method that
02:52boosts the model's reasoning skills.
02:54Some people are even saying that the improvements we're seeing in ChatGPT might be the first
02:59signs of this mysterious project in action.
03:01One of the coolest things about the new ChatGPT 4.0 latest model is how it handles multi-step
03:06reasoning.
03:08This basically means the AI isn't just jumping to conclusions, it's thinking things through
03:12step-by-step before it gives you an answer.
03:15That's a pretty big deal because it leads to more accurate and thoughtful responses,
03:19which is something we all want, right?
03:22The new model has already made waves in the AI community, especially in something called
03:26the LMSYS leaderboard.
03:28Now, if you're not familiar with it, the LMSYS leaderboard is like the Olympics for AI models.
03:34They put different models head-to-head in all sorts of tasks.
03:37And the new ChatGP 4.0 latest model just crushed it.
03:40It scored a whopping 1314 points, which is the highest score ever recorded on that leaderboard.
03:45This means it's outperforming some of the biggest names in the game, like Google, Anthropic
03:50and Meta.
03:51Now, if you're thinking, how do I get my hands on this new model?
03:54Well, it's super easy.
03:56OpenAI has already swapped out the old GPT 4.0 with the new version in both the ChatGPT
04:01website and app.
04:02So all you have to do is fire up ChatGPT and you're good to go.
04:06If you're on the free plan, you might hit some message limits, but for those of you
04:10who are on the plus plan, you can push the model to the limit and really see what it
04:14can do.
04:15Also, if you're not ready to shell out the $20 a month for the plus plan, you can still
04:19get a good feel for the new model before you hit those limits.
04:23And then if you run out of messages, you can switch over to GPT 4.0 mini.
04:27It's not quite the same, but it's still pretty powerful.
04:30Also, one more really interesting thing is how OpenAI has been testing these updates.
04:35They've been sneaking experimental models into places like the LMSYS chatbot arena under
04:40random names.
04:41So people don't even realize they're testing new tech.
04:43The ChatGPT 4.0 latest model, for example, was tested under the name Anonymous Chatbot,
04:49and it got over 11,000 votes from users.
04:52That's a lot of people unknowingly helping out with the testing, which just goes to show
04:56how clever OpenAI's approach is.
04:58So what's next?
04:59Well, if this update is anything to go by, we can expect OpenAI to keep refining and
05:02improving ChatGPT.
05:04They're clearly focused on making it better at reasoning, creativity, and all those tasks
05:08that require a bit more brainpower.
05:10And who knows?
05:11Maybe we'll see even more of Project Strawberry in the future.
05:13All right, now I also want to talk about a new AI model that just came out, but it didn't
05:18really get the attention it deserves.
05:20This model, called Falcon Mamba 7B, was released by the Technology Innovation Institute, TII,
05:26in Abu Dhabi.
05:27TII is known for working on cutting-edge technologies like AI, quantum computing, and robotics,
05:33and now they've dropped this new model.
05:36It's available on Hugging Face, and it's an open-source model, which is pretty cool.
05:40But what really sets it apart is the new architecture it's using.
05:43Most of us are familiar with transformer models, which have been dominating the AI scene for
05:48a while now, but Falcon Mamba 7B uses something different, called the Mamba State Space Language
05:54Model, SLM architecture.
05:56This new approach is quickly becoming a solid alternative to those traditional transformer
06:01models.
06:02Now, why is this important?
06:03Well, transformers are great, but they have some issues, especially when it comes to handling
06:07longer pieces of text.
06:09You see, transformers use an attention mechanism that looks at every word in a text, and compares
06:15it to every other word to understand the context.
06:17But as the text gets longer, this process demands more and more computing power and
06:22memory.
06:23If you don't have the resources to keep up, the model slows down and struggles with
06:26longer texts.
06:28This is where SSLM comes in.
06:30Unlike transformers, SSLM doesn't just rely on comparing words to each other, instead
06:35it continuously updates a state as it processes the text.
06:39This means it can handle much longer sequences of text without needing a ton of extra memory
06:44or computing power.
06:45Now, Falcon Mamba 7B uses this SSLM architecture, which was originally developed by researchers
06:51at Carnegie Mellon and Princeton Universities.
06:53What's cool about this model is that it can dynamically adjust its parameters based on
06:57the input so it knows when to focus on certain parts of the text and when to ignore others.
07:02So, how does Falcon Mamba 7B stack up against the big players like Metas Llama 3.8b, Llama
07:083.18b, and Mistral 7B?
07:11TII ran some tests, and the results are pretty impressive.
07:14In terms of how much text the model can handle, Falcon Mamba 7B can fit larger sequences than
07:20the transformer models using just a single 24GB A10 GPU.
07:26This means it can theoretically handle infinite context length if you process the text token
07:31by token or in chunks.
07:33And again, Falcon Mamba 7B came out on top.
07:35It beat Mistral 7B's sliding window attention architecture by generating all tokens at a
07:40constant speed without any increase in memory usage.
07:43That's a big deal for anyone working with large-scale AI tasks because it means the
07:46model is both fast and efficient.
07:49Even when it comes to standard industry benchmarks, Falcon Mamba 7B holds its own.
07:54In tests like ARC, TruthfulQA, and GSM 8K, it outperformed or matched the top transformer
08:00models.
08:01There were a couple of benchmarks, like MMLU and Heliswag, where it didn't quite take
08:05the lead, but it was still right up there with the best of them.
08:09But here's the thing.
08:10This is just the beginning for Falcon Mamba 7B.
08:13Tiichu has big plans to keep optimizing the model and expanding its capabilities.
08:18They're not just stopping at SSLM, they're also pushing the limits of transformer models
08:22to keep driving innovation in AI.
08:24So if you're into AI or just curious about what the future holds, keep an eye on Falcon
08:29Mamba 7B.
08:30It's already making a name for itself, and with TII's continued efforts, it's only going
08:35to get better.
08:36Plus, with over 45 million downloads of their Falcon models, TII is proving that they're
08:41a major player in the AI world.
08:43Alright, if you found this interesting, make sure to hit that like button, subscribe, and
08:48stay tuned for more AI insights.
08:50Thanks for watching, and I'll catch you in the next one.

Recommended