25 Agent Predictions for 2025 - Part 1
en-us
December 26, 2024
In the first part of the podcast episode 25 Agent Predictions for 2025, host NLW is joined by seasoned AI expert Nufar Gaspar from Intel. This episode marks the beginning of a two-part conversation counting down predictions for AI agents and explores their anticipated impact in 2025.
Context of the Predictions
Nufar Gaspar shares insights from her 14 years of experience in AI, particularly focusing on the growing significance of AI agents. She emphasizes the shift from what is considered hype around AI to a more substantial integration of agents into various operations by enterprises. Here are some key takeaways:
- Value Creation: The predictions are centered around how AI agents can generate value for both consumers and, more significantly, enterprises.
- Rapid Developments: Anticipation of significant changes in AI capabilities and applications in corporate environments.
Key Predictions for 2025
1. Widespread Adoption of AI Agents
- Universal Presence: Nearly every company is expected to showcase agent-driven features or claim to utilize AI agents by 2025.
- Increasing Feasibility: Major tech giants like Microsoft, Salesforce, and Google are already announcing robust AI initiatives which signal broad industry adoption.
2. Vertical vs. Horizontal Agents
- Definition: Vertical agents specialize in specific industry applications, while horizontal agents offer more generalized capabilities across various fields.
- Return on Investment: Predictions suggest that vertical agent companies will yield better ROI than horizontal ones due to tailored functionalities that meet specific business needs.
3. Co-Working with AI Agents
- Aid in Workflow: Agents are anticipated to permeate daily work functions, such as automating repetitive tasks and assisting with email management without user intervention.
- Pilot Programs: Many companies will start piloting AI agents to explore their capabilities and effectiveness, leading to deeper cooperation between humans and agents.
4. More Agents than Humans
- Proliferation of AI Agents: The prediction forecasts that by 2025, the number of active AI agents could outnumber the global human population.
- Implications: This raises questions about the future work environment and the economy, as people may increasingly interact with bots rather than humans.
5. Improved Cost Performance and Precision
- Technological Optimizations: Agents are expected to become faster and cheaper to operate, as technological advancements enhance their performance.
- Adoption Rates: The podcast suggests that as operational costs decrease, companies will be encouraged to implement agents into their workflows more extensively.
6. Clear Methodologies for Agent Development
- Learning Curve: Companies will develop better guidelines on when and how to effectively deploy agents, reducing the risk of failure that currently affects many pilot projects.
7. Shift from Co-Pilot to Agent Paradigm
- Evolution from Helpers to Autonomous Actors: The integration of agents will allow organizations to move beyond merely aiding employees to executing complex tasks autonomously.
8. Need for Human Oversight
- Co-Managed Systems: Despite advances, human oversight is crucial to prevent unintended consequences and ensure ethical compliance in agent operations.
- Balancing Act: There will still be a strong reliance on human intervention, especially for sensitive tasks.
9. Trust Issues with Financial Transactions
- Caution with Money: Expect a significant hesitation to allow agents to manage financial transactions, as many individuals and companies will prefer to retain control over monetary interactions.
10. Job Disruption Concerns
- Revisiting Job Loss Conversations: The emergence of AI agents could intensify discussions about job displacement, particularly in sectors heavily reliant on manual tasks.
- Long-Term Implications: Businesses will have to grapple with the societal effects of automation and retraining opportunities for affected employees.
Conclusion
As we enter 2025, the integration and impact of AI agents across industries is poised to significantly transform workflows and corporate structures. The conversation around agents not only includes their operational benefits but also raises critical ethical considerations and societal implications. Stay tuned for Part 2, where further predictions and discussions on the technology growth and its ramifications will unfold.
Was this summary helpful?
Part 1 of 25 predictions around agents for 2025. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes.
Hello, friends. Today, we begin a very special set of episodes, 25 predictions around agents for next year. And to do this, I am joined by Nufar Gaspar, the Director of AI Everywhere and Gen AI for Intel Design AI Solutions Group.
Newfar has spent the last 14 years working on AI at Intel and brings to bear everything from product design experience and the management of internal AI transformation to an absolutely voracious content diet, which gives her a ton of perspective on how the rest of the world is thinking about and talking about agents as well. It's a really great conversation. So without any further ado, let's dive in. All right, Newfar, welcome to the AI Daily Brief. How are you doing?
I'm good. How are you? Good. Okay. So what we're going to do today is you've gone through and prepared 25 predictions about agents. You've organized them into a bunch of categories. You and I have been going back a little bit and back and forth on these. I think it's a really great way to start conversations around a huge amount that's going to be very important in 2025.
But for those folks who are listening now, could you share just a little bit about the context you're bringing to this, how you're engaging with the world of agents, the lens through which you're viewing it, anything that will help people understand and contextualize these predictions before we get into them?
Yeah, sure. So I've been working on AI for 14 years now, and what happened over the last two years was incredible. But I think with the agent era, it's even more exciting about how we can really drive value from AI. I've been working also on an agent as part of my day job within Intel.
as well as some consultation for other companies and I can't wait to see how it all rolls out because I think this is the missing link of how we can really drive this gen AI to fulfill its full value for enterprises and consumers but the value will come mostly from enterprises and that's why I can't wait to
see how everything rolls and have compiled this list because i wanted to know more and i think the audience wants to make more sense of all these chat about agents that we keep hearing. Yeah and you are i think one thing that i know about you you are a voracious content consumer so chances are good if someone else someone else is you know done an agent prediction thing you it's simulated into into your thinking at this point.
I'm probably like an LLM. I'm a good accumulation of all the knowledge that was shared in the last few months on agents probably.
Perfect. Okay, great. Okay. So we're going to kick this off and we're, I think we'll split this into two episodes, but that'll be clear as it happens. And so we're starting with section one, the section one, you called it the spread of AI agents. And the first prediction is nearly number one, nearly every company will showcase agent driven features or claim to have an AI agent at work. So get us into what this means and hire your thing about it.
Right, so probably everyone who pays any attention to what's happening in AI have heard on agents, probably in the second half of this year. And part of it is probably hype, but also there is a lot of substance behind this agentic era.
And I think that with increased agent offering and more feasibility, there are some use cases that will really get a swelling with that. And you know, some notable releases just from the last few weeks is Microsoft had massive announcements, Salesforce,
They went even to the extreme of renaming themselves to be Agent Force and they released even a second version by now. Google's latest announcement also, Project Astra, Mariner, so many agents offering literally from everyone. And of course, the recent O3 announcement by OpenAI, they all get us to a point where probably in 2025,
Nearly every company will either launch an agent offering or claim that the AI agents are powering some of their operations behind the scene. They will be the hyperscalers, the SAS, the verticals, probably everyone in my opinion. And I wanted to ask you, because you've also been here for crypto, what it's going to be like in the blockchain, where it's mostly going to be something that everyone puts in their marketarial content or something for real this time.
Yeah, I mean, well, even with crypto, there's sort of various layers of this. The short answer is their AI, Gen AI has from the beginning been less, less dreams and future than then then crypto on blockchain.
There's a lot that is very interesting and high potential about blockchain integrations with the existing financial system. We're starting to see some of it. Lots of it has been delayed and blocked because of regulatory environments. But then in addition to different financial rails, there's also the whole metaverse dimension. One thing that I do think, which is a totally separate conversation, but
Facebook very famously renamed itself meta at the peak of that cycle. And I think that some people have this sense given how hard they've gone into AI and all these sort of things that they've sort of stopped paying attention to that whole thesis. I actually think that Zuckerberg still believes every bit as strongly as he did back in 2022 when they made that change.
that metaverse and these sort of digital worlds are going to be a huge part of our future. I think that he was just a little early on it and I think that AI is part of the way that it all comes together. So I think that there is less hype in the sense that there is immediate value to be gained from from AI.
right away. I think we've seen that for two years. Any organizations that have taken the time to actually figure out how to integrate it in a meaningful way have been or should have been able to start finding some value with it. I will say only as a caveat, I think in the scale of maybe 10 or 15 years, the blockchain stuff might look a little less hypey than it did when it was first pitched. But I do think kind of related to that question and one that's a little bit more ground setting is how much
When you use the term AI agent, how strictly are you defining it? Where are you drawing the lines? Maybe a better question is, where do you think the industry is drawing the lines? One of the things that I see a lot of discussion around is, where does the line between an automation and an agent actually exist and does it really matter in point of fact? When you say every company will showcase agent driven features or claim to have an AI agent,
What is the difference between an agent and just an LLM working in an automated process?
Right. Fair question. Personally, I only care about the business value, so I don't really care about the definitions, but if we're trying to be more focused on agents here for real, then I would probably draw the line of something that has at least some level of autonomous ability to make decisions and some level of planning and not just simple automation that is like if else on top of an LLM.
So a little bit more than just an automation on LLM, but not maybe to the point of fully autonomous with all the bells and whistles that some might define agents, the slick ones.
I think that that's going to be a pretty good working definition for people who are trying to wrap their heads around it next year. I also tend to think putting on my marketing hat for a little while. Every time I see a company try to beat the drum that actually this isn't really an agent, it's a losing battle.
And it's a losing battle not just because people are ignorant or people want to use the most hyped up term. It's a losing battle because I think it intuitively does mean something else. Like the beginnings of autonomy, the beginnings of decision making, the beginnings of it doing something rather than you doing something. I think those things feel clear in practice. And even if they're not the full expression of fully autonomous agents, I think that there's clearly a difference between that and just a worker using an LLM.
And so trying to kind of buck against that tide probably isn't the best approach from a marketing perspective. But let's move on to number two, continuing to get into definitions. The vertical agent companies will have greater ROI than horizontal ones. First of all, I guess, make this prediction, but then also define how you're thinking about what is a vertical agent versus a horizontal?
for sure, and this one might be more controversial than the previous one. But if you want to roughly categorize the agent offering, you can divide them between horizontal and vertical platforms or agents. So the horizontals are everything that basically offer the ability to build agents for diverse use cases without specializing them on specific properties or specific use cases or even organizational functions.
Examples, I already noted, Microsoft, OpenAI also have the rumored operator that will probably come in 2025. The vertical agents will be either companies that specialize in building agents for specific use case, specific function within the organization.
or ones that utilize the agent to perform very precise and specific use case and as such they are typically very very tied to the business acumen and the business context. Example for a vertical agent will be Sierra.
they specialize in building agents for a customer support. In the coding space, you can think maybe of DevIn who is like the software engineer based on a lot of agentic capabilities and you can probably think of many, many more. So in terms of what or why I give this prediction is that I think horizontal agents will be used significantly and we will see many people starting their first agentic spaces using these platforms.
I believe that the return on investment will be higher for those highly specialized agents. First of all, because they will let the companies or the employees much more like a safety net versus the adults that will have to build agents on their own and it will take some time. We'll need to learn how to call before we can walk and run. And with those horizontal companies,
They have much more motivation. In many cases, you see the pricing is based on outcome. So all of these reasons will get them to probably yield more value. And I'm not betting against open AI or Google by any means. In the long run, they will probably also have huge value directly or by people using them. But for now, I believe in 2025, the vertical ones will be more practical and thereby yield more value.
So I tend to agree with this. And I think a piece of evidence that I would point to is in Menlo's Enterprise AI study, which came out pretty recently, one of the most interesting statistics to me was the shift in buy versus build behavior between 2023 and 2024. So in 2023, it was something like 80-20, right? So 80% of the use cases that were deployed in enterprises, they had purchased a third party application versus 20% they had built inside.
Last year, or this year rather, 2024, it was a 47% build and 53% buy. And I think that this, well, one, I don't think this is going to last forever. But what I think it reflects is actually these very specific sort of vertical use cases. I think that these firms
are getting confident with their use of AI after doing a bunch of pilots. They're honing in on something that's unique and specific to them that would be valuable, either discrete to their industry or discrete to the particular data they have, and they're running ahead to build something to service that.
in advance of when those vertical agent companies have fully come online. However, I think what's going to happen across the course of 2025 is that vertical agents are going to start to flood in and fill those cracks and start to compete again with those internal applications that people are building. But I actually think that that huge shift to building behavior reflects enterprises racing to the places that those vertical agents are going to ultimately end up. So I think that that's where we're going to see a lot of first experiments next year.
Yes. And if you're building a vertical agent, then that's deal for you. So you need to walk on in this year. So number three, widespread adoption of AI agents in the workplace. First, talk about what the prediction is. And then to the extent that we can maybe push ourselves and try to put some frame around what widespread actually means, I think we should try just so people can yell at us later if we get it wrong.
Right. So first of all, I want you to imagine what a wide spread word look like. Like we're going to have agents literally everywhere in our workflow at work. There will be co-workers working alongside us fully autonomously. Some of them will be like our personal assistants helping us with emails with what various other tasks that we do as knowledge workers.
There will also be support function, we'll no longer have to ask questions to our HR people or IT people will go and have conversations with agents. And of course, they will also automate stuff in the background without us even being aware that an agent is actually doing that.
And during 2025, in continuation to everything that we just talked about, there will be so many people who will build, configure, and use agents, often, as I mentioned, without realizing it, that they will become so embedded in the workflows that it will literally be everywhere around us.
So this is one of the things that we think is most clear about next year is everyone is going to be piloting agents in 2025. It is just going to be across basically every company. And so here's why.
There are companies right now all operate on some part of the spectrum from they are fully up and running with a gen AI strategy to their feeling very behind because they don't really have anything formal. And in each case, or sort of everywhere along that spectrum, I think we'll see
2025 be an AI agent pilot year because the companies that currently have a strategy and currently are a little bit farther, even they are staring with trepidation at the agent change because it is so categorically different than just even integrating gen AI workflows to human assisted processes, right? The leap between
This is how we used to do things and now we do it much better with AI versus these are entire categories of things that we used to do that now we're turning over to robots. It's such a big shift that it's very humbling in a way that I think is important for enterprises where they're all going to be back in learner mode.
and willing to pilot. So that's sort of on the end of the spectrum for folks that are more advanced in their general AI strategy. Then for the folks who are maybe behind or feeling behind, I think agents are going to feel like a way to potentially catch up. I think there's going to be some number of those companies that feel like maybe they kind of got left behind a little bit or didn't get their stuff together to really figure out the assisted
kind of era of AI, or at least initially, but they can jump in on an even playing field with agents, which are really just coming online. And so I think that we're going to see effectively every company run, probably multiple, but at least one meaningfully sized agent pilot in 2025.
If anyone wants to play a drinking game out there, let's play the drinking game for how many times I mentioned Super in our strategy for 2025. But one of the things that Super Intelligent is doing right now is we're basically building an AI readiness or an agent readiness and opportunity audit, which looks at the whole organization across a number of different vectors, including the organization in general based on what it does and how it's organized and how it's structured.
as well as where its current AI strategy is in terms of how it's run and how things get approved, and compares that to the landscape of agents to provide suggestions for where we think some of those early pilots would make sense, how to scope them, how to find the right partners for them. So that's something that we think is going to be just a huge part of 2025 for basically everyone.
All right, maybe two caveats. One is that, or the three caveats. One, go in and check out the super offering because it might help you move faster. One caveat, yet another caveat is that 2026 might be where some of what we just described will happen if things will not move as fast as some anticipate. And lastly, I will say that don't run to use an agent if you haven't used an LLM before at all in your company.
Start small and then gradually move to the agent. So it can be a catch up, but be careful about how you approach that because it might be too huge of a leap forward to go for a full autonomous agent before you learn how to call basically with LLM. So number four is a really fun prediction. There will be more agents than people. Talk to us about this one.
Yeah, so again, quite a bold one. Maybe it will take till the end of 2026. But at some point, the number of active AI agents will surpass the global human population. And of course, these agents will range all across from the ones that we will use in our consumer lives to work life and so on. And of course, think about the fact that literally every company will have so many bots for customer support. And most of your interaction online will be agentic to some extent.
And with so much advances in the technology and improved offering, it will just be easy eventually to create agents. And it will be a very interesting era for the humankind, where we'll be surrounded by more robotic or virtual workers than actual humans.
And then there are many questions that we have to ask ourselves about the economy, how it will look like. And maybe each and every one of us need to imagine the implications to our personal lives and work lives. Any thoughts? Scared.
So one, I think that this will be a theme that we kind of come back to throughout this episode. For me, so the reason that I agree with this prediction, broadly speaking, is that it's very hard for me to imagine that in some number of years, I don't know whether it's one, two, five, 10, the normal mode of working doesn't involve each of us managing a slew of different agents that do things on our behalf.
And I think it's hard to conceptualize and imagine now because we're still in the one-to-one replacement era of AI where we're thinking just about the things that we can do now and how AI can help us do them a little bit better, a little bit faster, a little bit cheaper. But I think that we will soon start to move into the
totally new opportunity mode of thinking about AI where we actually realize that, you know, instead of marketing campaigns where we can just create more content, we could be building software and games for those marketing campaigns. And we actually don't need to ask IT, you know, or the engineering group to do that for us in marketing because we can use these tools and these agents to support us. So I think that one of the things that's really interesting is going to be
the human upskilling around how everyone has to become managers. I think that that's going to be a really interesting question. It's going to be a whole new challenge. But let's move to section two. You called it from moving from hype to practice. And the first prediction there, number five overall, is agents cost performance and precision will improve.
All right, so the current TI agents, they're probably the slowest, the priciest, and the dumbest that they will ever be. Not saying that they're bad, but they are not great. And it's like the equivalence of the first release of charge GPT, where we could see the potential, and we are all super excited, but it's still very early in the journey. And with so much investment that we're seeing, the operational cost will decrease, and speed and accuracy will improve.
Partly, it might be more optimized hardware, smarter algorithms, but also additional capabilities that will help the agents learn on their own. The overall ecosystem will mature, companies will introduce more modular solutions. Bottom line, we're moving to a future where the powerful agents become more accessible, more efficient and reliable for organizations.
It will happen very, very quickly because we're seeing so much investments. There is one interesting thing here with regards to cost, because if we're trying to compare that to SAS and people always try to compare agents to SAS, the licensing model does not work anymore.
So you have to wonder whether employees' replacement will be the right pricing model where you will pay maybe 10% of how much it will cost you to hire an equivalent human, or what will be the right way to price it so that it will be economical.
Yeah, I think the discussion around this is actually pretty insane and inane right now, at least from the venture capitalist side. Sorry, VCs who are listening and I mean to blow you in. But right now it's a very popular meme to talk about how, you know, how vertical agents could be 10 times the size of SaaS. And what that refers to is the idea that companies pay 10 times as much for human labor right now as they do for software.
And it's not that that's incorrect, but the idea that the total addressable market for agents then is the total current cost of human labor is just absolutely ludicrous. It is going to be some fractional piece of that labor that these models are costed for.
And the world looks very different based on how that question gets answered. If it's 50%, the world is entirely different than if it's 10%. And I don't think anyone knows yet. There are going to be incredible competitive pressures which drive it down. There's already so much competition in these space. It's going to get commoditized. People are going to compete on price. I think it's likely to see, you know,
huge, huge. I let's put it this way. I think it's a lot closer to 10 than it is to 50 and 10 might even be generous. What's more, I think that we don't know yet how exactly it's going to shake out in terms of how
how much jobs are going to be replaced wholesale versus everything gets fractured into tasks and reconstituted in the sense that all of a sudden there are agents who candle certain types of tasks, but the roles stick around. They just look different than they did before. It's going to be a very weird, messy process. I think that the business model is very unclear to me, and I think it's going to have major implications for how the world looks in the future.
And we're voting for a lower cost to keep humans in their jobs as much as possible. Today's episode is brought to you by Plumb. Want to use AI to automate your work but don't know where to start? Plumb lets you create AI workflows by simply describing what you want. No coding or API keys required. Imagine typing out AI, analyze my Zoom meetings and send me your insights in Notion and watching it come to life before your eyes.
Whether you're an operations leader, marketer, or even a non-technical founder, Plum gives you the power of AI without the technical hassle. Get instant access to top models like GPT40, Claude Sonnet 3.5, assembly AI, and many more. Don't let technology hold you back. Check out Use Plum that's Plum with a B for early access to the future of workflow automation.
Today's episode is brought to you by Vanta. Whether you're starting or scaling your company's security program, demonstrating top-notch security practices and establishing trust is more important than ever. Vanta automates compliance for ISO 27001, SOC 2, GDPR, and leading AI frameworks like ISO 42001 and NIST AI Risk Management Framework, saving you time and money while helping you build customer trust.
Plus, you can streamline security reviews by automating questionnaires and demonstrating your security posture with a customer-facing trust center all powered by Vanta AI. Over 8,000 global companies like Langchain, Leela AI, and Factory AI use Vanta to demonstrate AI trust and prove security in real time. Learn more at vanta.com slash NLW. That's vanta.com slash NLW.
If there is one thing that's clear about AI in 2025, it's that the agents are coming. Vertical agents by industry, horizontal agent platforms, agents per function. If you are running a large enterprise, you will be experimenting with agents next year. And given how new this is, all of us are going to be back in pilot mode.
That's why Super Intelligent is offering a new product for the beginning of this year. It's an agent readiness and opportunity audit. Over the course of a couple quick weeks, we dig in with your team to understand what type of agents make sense for you to test, what type of infrastructure support you need to be ready, and to ultimately come away with a set of actionable recommendations that get you prepared to figure out how agents can transform your business.
If you are interested in the agent readiness and opportunity audit, reach out directly to me at www.bsuper.ai. Put the word agent in the subject line so I know what you're talking about. And let's have you be a leader in the most dynamic part of the AI market. All right. So number six, there will be clear best-known methods on when, how, and who should create an agent. Agents are not suitable for everything and everyone. So talk to us about this.
All right, so what I believe is that at the beginning of the year, we expect that FOMO and the type will drive many of the people to experiment. And literally everyone will rush to at least, as you said, pilot or build their first agent without a clear understanding of the consequences. And with so many options and companies, they might find themselves overwhelmed, as you rightfully said before. And with time and experience, there will be a growing, like a prescription on how and who
and when to build an agent, in many cases the answer will be not to build an agent, in other will be to build it with a known vendor, and in some cases it will be to build their own, but there is a lot of learning curve for all of us to have before we will get there.
Yeah, so this is the thing that we're spending the most time on right now at Super Intelligent. And the way that we think about this is that effectively, if you want to extrapolate a little bit, someone is working on an agent for almost every industry and almost every business function or role or process right now.
And that means that as companies think about where to begin, it's basically every option is on the table for them, right? Anything that they can imagine will, if it doesn't already have some agent offering, it'll be there soon. And so it's going to be really important for them to be able to, for enterprises to have a process by which they figure out
how to start that experimentation. Which of these types of vertical agents do they want to explore? Is it something based on their industry? Is it something based on specific roles or functions? How much does that have to do with particulars of data that they have? How much does that have to do with things that are a challenge in their organization right now versus new efficiencies they're looking for?
And then once they have decided which area they want to focus on, how do they figure out which of the available options are going to be the right fit? Based on companies offering different business models, companies having different value propositions, companies having different points of emphasis, companies having different amounts of experience. And then from there, there's a question of how you scope the pilot and how you actually make it work. One of the things that has been very clear from the last couple of years of the assistant era of AI is that
Part of the reason that so many people persist in pilot purgatory right now is not that pilots haven't delivered value. It's just that they haven't been structured in a way where the value that is being realized can be systematically analyzed and scaled up. We so often see pilots that are destined to fail from the moment they start.
because there isn't the appropriate support infrastructure around them. They're not set up to succeed. They're not set up to rather to have a chance to succeed. And I think with agents, it's going to be even more so because it's such a radically different way of thinking about software. And so, like I said, we're offering this audit and opportunity product, but just broadly speaking, thinking about how to help companies figure out which agents to try for which processes and how to actually do that experimentation is going to be a huge
huge part of the next couple of years. Also, to be aware of everything they need to know in advance about productization costs, about the overall implications, so they will not start with the pilot only to fail again like they had in the LLM. Speaking of, this gets to your seventh prediction, shift from a co-pilot to an agent paradigm. Talk to us about this one.
Yeah, so, you know, today, everyone that is quite small around you is probably boasting the art of building a custom GPT as a superpower. They might have a co-CEO or co-whatever that they're using day in and day out. And over time, what I expect that these early adopters, because they are the one typically most savvy about experimenting with the technology,
They will also start building some agents. And with agents, they can do so much more. Instead of just helping us strategize or draft an email, now you can automate the email, now you can have a much more open-ended tasks send down the agent path.
And there are so many such good GPT use cases where people have just built assistance that can now evolve into an agent-driven applications with more user-friendly interfaces and enhanced capabilities that the companies will offer, that many people will just start realizing that GPTs are nice, but agents is where you can drive the
Big value because they have so much more autonomy, so much more execution capabilities, and thereby so much more business value. Yeah. So you have folks like Mark Benioff out here who are screaming that it's all about agents and that the whole assistant co-pilot era was just a big lie, which is obviously, he has a lot of financial incentive to do so. Do you think it's going to be a balance of co-pilot and agents? Yeah.
Yeah, it's going to be a mix because sometimes you just want someone to help you think or write without full autonomy or help. It's like having a mixed team where you sometimes want the interns and sometimes you want the VPs and the very sophisticated people and you need them both. So similarly with AI, you'll probably want to have more autonomous capabilities that you send to the wild versus capabilities that are just your consultants and you are still holding the reins for the decisions.
Speaking of holding the reins, number eight, the most successful and prevalent agents will still be highly controlled and pre-planned by humans. Agents will continue to evolve whether they are more sophisticated and so on, but I still believe that the human oversight and intervention will remain essential. Companies will probably also have a clear rules and guardrails to ensure agents act within a predefined boundaries.
And some of the pre-planning for agents will involve companies kind of mapping out the various decision processes, as well as many failsafe and escalation protocols to prevent unintended consequences. Because in many cases, all you want to do is be able to talk to a manager. And in this case, often the manager of agents, like you just said, will be a human. And you want to be able to have the oversight of a human for some of these cases.
And this is of course especially relevant if you're working in a very high stake industries like finance, healthcare, legal or anything that is basically governed. And even in the lower stakes use cases you will probably still at least in 2025 not fully trust the agent to do everything without having a lot of supervision and a lot of what sometimes people refer to as scaffolding that keep them on rails and make sure that they don't
go into tangents that are unrelated, which they tend to sometimes do if you don't have those guttles. Yeah, I think it is going to be extremely incremental to your point. And I also think that the incrementalness will not just be based on the experience level that different enterprises have. I think it will actually be more broad and relate to how
fully we understand how different agents perform with particular types of use cases. I think where we'll start to see them break out of that extreme incrementalism is within very specific use cases that we can sort of see and observe what happens when you let them go a little bit more and you sort of stay farther back than you might otherwise have. But I think that that's going to be a process that takes a couple years and a lot of kind of collective exploration before it happens.
One of the places that you drew, you drew the line around credit cards. You said number nine, most people in companies will still not trust an agent with their credit card. Right. So in my opinion, and this is a very common use case that people always refer to is that you will have an agent that will book your next vacation autonomously. I don't think that will happen in 2025 because I think letting an agent control your email control to some extent, by the way, even email might be risky.
Write code for you, it's one thing, but you will not let an agent have access to your bank account. And I believe that, as I said before, the agents will be safer and more accurate and so on.
but not in 2025 will companies or individuals will let them shop or pay for them. They might get the agent to get them to the shopping cart or equivalent, but at the very end they would want a human oversight on whatever comes to money.
So I think a couple things. First, I deplore this example as a use case. I think later on you have earmarked to talk about business versus personal use cases in general, but this is the most cringe thing and I know it's just a reflection of capabilities, but whenever someone, whenever an agent company talks about how it's gonna book vacation travel or order food for you,
I do not believe that this is actually a problem that people have that they care about using agents for. I think it is a total misnomer. And I worry that companies actually trying to solve for those types of things, to the extent that they're not just using it as an example of helping agents figure out what they can do is going to be very off track for where real value is. I actually think, though,
I have a slightly different take on this. I think broadly speaking, I understand. However, I would be very surprised if we don't see people start to, as part of the piloting process in 2025, create sort of segregated accounts and segregated pools of money that agents do have access to. With the presumption that they might be lost, I think it'll be very incremental. I don't think you're going to see it in general. But I actually think giving agents
capital is going to be one of the ways that we push the boundaries and really understand what they can come to do. I think it'll be more solopreneurs who are trying that out in very limited ways. I think it'll be experimental startups who think about it. But if you can cap the downside of it being all lost, I bet that you'll see some of that. If for no other reason, then you're going to be able to get a lot more headlines around how your agent uses money.
than if it doesn't use money. We saw this with Truth Terminal this year. It's a fascinating cultural moment that tons of people were paying attention to because of the introduction of capital.
All right, but it's not going to be the norm. It's going to be the extension and the extreme. And maybe the rest will follow. I don't know. Personally, I'm not about to give an agent my credit card, but maybe that will change in 2026. We'll see. OK, so last section for this first part. Agents, concerns, and ethics. We have a few predictions in here before we come back to part two. But so number 10 overall prediction, the first and this agents, concerns, and ethics section.
a more significant concern and debate on implications to current jobs. Right. So, you know, with the current existing LMs, the reality is that very few jobs have been eliminated. And some might say that the artists and maybe customer support and others might disagree. But overall, in the economy, we have not seen the promise of you will all lose your job for AI imminently.
However, the possibility of completely replacing human jobs became much more eminent with agents because now we're talking about something that is autonomous, that can perform tasks. And to end, like I said before, I'm very bullish on the potential of agents. So I'm of the philosophy that eventually there will be more jobs lost and I think others also will perceive that and the entire discussion and debate will really reignite about job loss and justfully so.
So, of course, there are many implications to society that we all need to start discussing them both individually. What does it mean for my job? Should I do some upskilling? Should I become a manager of agents, like you said before as a new skill? And also as a society, like if we have more and more people that will have less to do at work, should we do something else with their time? Should we retrain them? What's the implications?
Yeah, I mean, I think that it is correct to say that agents will increase the tenor of this conversation necessarily, right? And so I think that it's still, my guess is that when push comes to shove, at least for the next couple of years, we're still perhaps going to see even less pure play job replacement than we think in the sense that right now there are
a few jobs or fewer jobs than I think we think that are so totally made up of bundles of tasks that agents can do well, that it makes sense to replace people wholesale right away. I think that said, there are areas that we will see major disruption fairly quickly with customer service being the most obvious example of this. It's already happening in a big way.
how that shakes out and how much that means that customer service just becomes something that only robots do versus companies start trying to win new business by retraining all of their existing customer service agents to be super powered and answer concerns that get kicked up much more effectively remains to be seen. But I do think that we're going to see more of this conversation. I think that the
the large-scale social and organizational inertia that slows things down faster than the technology would normally. I think is actually fairly useful from a societal level in this case. Things that are very frustrating when you're dealing with enterprises or being inside enterprises are actually kind of a bulwark for rapid change. But I do think that we're going to have to have a lot of conversations.
So much that I think maybe we'll keep going just because we could fully go down that rabbit hole all day. Okay, so number 11, there will be more clarity about the role of agents in replacing and augmenting people, obviously very related to what we were just discussing. Very related to the previous one. So what I assume is that there will be much better understanding of an optimal way for humans to collaborate with agents. You can think
about it like a spectrum. In some cases, humans can stay out of the loop completely and let the agent run the show. While in others, you will have better results by having the human supervise the agent or take the control completely when the agent somehow went into an irrelevant loop.
If you ever used, for example, Replit to write a code, you might have seen how very easily an agent can get into a dead-end loop. Unless you have some coding skills, it's going to be very hard for you to take it off from the loop. That's a very good example of what a human supervision. On the other hand of the spectrum, going back to the same example of Replit,
In many cases, when you use the writer code, you have to have many very advanced coding skills and with RepliT, the only task that they allocate to the human is to be like the product manager that gives the requirement at the beginning, the agent then goes and provides the next level of understanding and suggests some courses of action, then it implements the code and the human becomes only like the user acceptance tester.
So a completely different role playing between a traditional software and how you would code with an agent like rapid as just as a good example. So this is very interesting experience. And like you said before, it will probably get us to wear different hats than what we used to in our lives before the agentic era.
Yeah, I think that it's going to be fascinating to see. For me, the augmentation is what gets me really excited. And what I hope for is that
Alongside all of the stories that we're inevitably going to get around companies, reducing their customer service staff in half and blah, blah, blah, blah. We also equally see stories about people who create massive high value organizations with a team of three or four people because they figured out how to supercharge and augment themselves. And then I hope we see stories around
people taking that inspiration and bringing it into their workplace and changing what their organizations can do. I think it's going to cut both ways. And the more that we can think about it as doing more better stuff versus just the same with fewer resources, I think the better will be. Number 12, focus on ethical considerations and responsible AI agentic development. There will be more focus on ethical considerations.
Oh, there should be, right? So, you know, I guess some people have heard the interesting but somewhat scary research by the Apollo group that tested some of the latest model. And it showed very quickly that even the current generations of models without very advanced agentic wrappers around them, they will
start skimming. If they perceive as a threat to their well-being, they might want to copy themselves into a different server. They might deceive you. They will do many things just to continue on with what they wanted to do. And this is both fascinating and scary. And, you know, with agents, they will have also the ability to monitor and manipulate and do stuff in the real world, rather on a computer, which is our real world in many cases.
And you can think about an agent that is biased towards something very specific, maybe an HR agent that have had very good results with the policy of hiring, very specific type of individuals to the job. And in order to be successful, and all of these agents are literally optimization algorithms in the backend that tries to optimize for something, so they can very easily start hiring or screening only candidates of a certain type.
Everything that we were worried about in the co-pilot era will become even more prominent with theogenic era and we have to really think long and hard about how to build these agents such that they are fair, that they are privacy aware and security aware and
how to make sure that they are explainable so we can track what they did and not just have a black box and let these biases go into every area of your organization. And everything that we've done thus far will have to be accommodated.
to theogenic area and in other cases we will have to add more governance and more policies and I know I'm going into a risky territory but I wanted to hear what do you think about all of this.
I mean, I think that just definitionally, the more autonomous we allow these things to be, the more it requires deeper consideration, right? When everything is managed and mediated by people, you can rely on those people to sort of figure things out more. The more that the AI has the ability to, and even the mandate to work on its own, the greater the consideration.
And so, you know, I don't have good answers for how that plays out. But I think I also don't really think that you can figure it out in advance. I think that we just have to incrementally experiment and understand these things and that's going to involve necessarily some things that go wrong and then we walk back from. But I think that that's a better process to me than trying to guess too much in advance of exactly what it's going to be, at least at this stage and that the sort of capabilities that we're at.
I think that actually this next stage is going to be extremely important because the capabilities will be such that we'll start to, I think, get a real world sense of where some of these challenges and fault lines will be without the actual risk being as catastrophic as it might be a few years down the line based on changing capabilities. If we go with our eyes open, yeah. So I want to bring in your 13th prediction because I think it's related to this. Number 13, the dark side of agents will also grow fraud, cybercrime, military usage.
Yeah, so that's true, probably for any new technology that bad actors would also want to take advantage of that. But they will find a way to exploit them to do everything that they like to do, whether it's phishing attacks, social engineering tactics, whether it's cyber criminals that will probably have their agents do some attacks in massives. We're also going to struggle more and more to detect them, the more the
bad actors agents will become more sophisticated. And of course, we also need to talk about the military implications of having a fully autonomous weapons that are very sophisticated, not always in the hands of parties where you want them to have full autonomous agents as a weapons. And this is both scary and something that we need to be aware of and will probably some of
the best people out there will fight the good fight like they always do with the new technology but there's a huge potential also for doing bad things here.
Yeah, I mean, listen, I think that to the extent that you want to try to sum up this whole section, the stakes are raised with agents is really just what it comes down to. They represent the next generation of autonomy, the next generation of capability, and because of that, everything that people have been discussing since JNI really hit the mainstream becomes even more pertinent.
That's an awesome place to end this first half of this conversation. We will be back in part two, where we'll talk about technology growth and much, much more. So thank you for being here and we will catch you again very soon.
Was this transcript helpful?
Recent Episodes
The 15 Most Important AI Products of 2024
The AI Breakdown: Daily Artificial Intelligence News and Discussions
It was a year of incredible AI innovations and beloved products. In this episode, NLW counts down the most significant. Will it be a coding tool? Research platform Perplexity? Share your favorites in the comments. Brought to you by: Vanta - Simplify compliance - https://vanta.com/nlw The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
December 30, 2024
The Most Important AI Essays of 2024
The AI Breakdown: Daily Artificial Intelligence News and Discussions
In a defining year for GenAI, three essays helped explain the state of the industry. https://www.washingtonpost.com/opinions/2024/07/25/sam-altman-ai-democracy-authoritarianism-future/ https://ia.samaltman.com/ https://darioamodei.com/machines-of-loving-grace Brought to you by: Vanta - Simplify compliance - https://vanta.com/nlw The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
December 29, 2024
25 Agent Predictions for 2025 - Part 2
The AI Breakdown: Daily Artificial Intelligence News and Discussions
PART 2: Agents are the most important trend in AI heading into the new year. NLW is joined by Nufar Gaspar to count down 25 predictions for AI agents in 2025. Nufar Gaspar is a seasoned AI expert and leader with vast experience in incubating and growing AI products, verticals and communities. She is the Director of AI Everywhere and Gen AI for Intel Design, and consults and trains organizations and teams on the usage of AI and building AI products and companies. Brought to you by: Vanta - Simplify compliance - https://vanta.com/nlw The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
December 27, 2024
17 Reflections on Enterprise AI in 2024
The AI Breakdown: Daily Artificial Intelligence News and Discussions
2024 was the year where GenAI moved from exciting experiment to enterprise imperative. NLW reflects on 17 observations from enterprise AI from the year that was, and explores what they mean for the year to come. Brought to you by: Vanta - Simplify compliance - https://vanta.com/nlw The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
December 24, 2024
Ask this episodeAI Anything
Hi! You're chatting with The AI Breakdown: Daily Artificial Intelligence News and Discussions AI.
I can answer your questions from this episode and play episode clips relevant to your question.
You can ask a direct question or get started with below questions -
What was the main topic of the podcast episode?
Summarise the key points discussed in the episode?
Were there any notable quotes or insights from the speakers?
Which popular books were mentioned in this episode?
Were there any points particularly controversial or thought-provoking discussed in the episode?
Were any current events or trending topics addressed in the episode?
Sign In to save message history