Inspiring Insights From Grok 3 Robinhood CEO Vlad Tenev

Grok 3 Robinhood CEO Vlad Tenev

This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Please review the episode audio before quoting from this transcript and email transcripts@nytimes.com with any questions.

kevin roose

I am having quite a morning —

casey newton

Oh, yeah?

kevin roose

— Casey. So I was on the train today. I got off, I went up the escalator at Embarcadero. It’s a very long escalator —

casey newton

It is.

kevin roose

— and very crowded rush hour. And someone bumped into me and knocked my phone out of my hand and onto the platform below. And so I thought, OK, well, I’m midway up the escalator.

casey newton

Wait, so how far did it drop?

kevin roose

Probably 15 feet.

casey newton

OK.

kevin roose

A significant drop.

casey newton

Big drop.

kevin roose

And I thought to myself, if I get to the end of this escalator and come back down, it’s going to be too late. Someone will have snatched it or accidentally kicked it onto the tracks. My phone is gone.

casey newton

You do not have much faith in the citizens of San Francisco.

kevin roose

Have you visited San Francisco?

casey newton

Yes, I think a phone can generally survive 30 seconds on the ground, but I guess we’ll find out what happens.

kevin roose

Anyway, well, I had severe separation anxiety in the split second before I decided to do what I did, which was to try to run down the crowded up escalator. So I became that guy who was like pushing through the commuters, being like, I’m sorry, I’m sorry.

And it took forever because the escalator was moving in the opposite direction.

[laughs]

So I started my morning by alienating and possibly injuring some people on my way down to retrieve my phone. And I would just like to formally apologize to everyone at the Embarcadero subway stop between 8:15 and 8:30 this morning.

casey newton

You were a character in a bad comedy, running down the up escalator.

kevin roose

Yes. [LAUGHING]

casey newton

I was at that platform this morning and I heard a woman screaming. But now I’m realizing that was you. Did you get the phone?

kevin roose

I did. It’s safe. No cracks. It was retrieved. But yeah, that was a wild way to start my day.

casey newton

Well, thank you to all the good Samaritans of San Francisco who did not steal Kevin’s phone during the 30 seconds when it was on the floor. It kind of restores your faith in humanity a bit.

kevin roose

Oh, it does.

[UPBEAT ELECTRONIC MUSIC]

I’m Kevin Roose, a tech columnist at “The New York Times.”

casey newton

I’m Casey Newton from Platformer. And this is “Hard Fork.” This week, are you ready to Grok? How Elon Musk’s latest AI model could serve his larger ambitions.

Then Robinhood CEO Vlad Tenev stops by the studio to make his case for letting everyone invest in everything. And finally, lock down your computers, Kevin is attempting to vibe code. And the vibes are off.

kevin roose

Let’s go.

casey newton

Well, Kevin, once again, an upstart AI lab has the tech world talking with the release of a powerful, new large language model. But unlike the others, this one might be running the federal government by springtime.

kevin roose

[LAUGHING]

casey newton

This week, xAI, which is Elon Musk’s AI company, released its latest model, Grok 3. And based on their own benchmark results and early reviews, it seems like it is basically on par with the best models that are out there right now. And while it hasn’t been subjected to rigorous, independent testing, the early word from AI nerds is that it is pretty good. So, Kevin, what is Grok 3?

kevin roose

Well, Grok 3 is the new, premium tier model of Grok, which is xAI’s AI model. It is available to Premium+ subscribers on X, which is their $40 a month premium tier, which is cheaper than OpenAI’s most powerful plan, which is $200 a month. But it is also built into X, the former Twitter app.

So if you’ve been on X recently, I know you don’t go on there anymore, but I do. There’s a tab where you can just open up Grok. And if you are a paying subscriber, which I’m not, but I somehow got past the velvet rope because I used to be verified or something, you can actually use it.

casey newton

Yeah, and I should say that I actually have used Grok 3 for this exact same reason, which is I have just been given free access to this thing for some reason. I guess the Department of X Efficiency or DOXE has not yet uncovered my account.

kevin roose

Right.

[laughing]

We both played around with it a little bit. What were your impressions of Grok 3?

casey newton

Well, like others who have commented, it seems like it is about as good as some of the other models. When I asked Grok about itself, it said Grok 3 launch is a pivotal moment in AI. It seemed like a bit much. But I also asked it if it had an opinion about Platformer, my newsletter, and it actually said some really nice things —

kevin roose

Oh, that’s nice.

casey newton

— which I had to respect. I asked about “The New York Times” as well, by the way, expecting I would get some sort of angry tirade about it, but it was actually pretty evenhanded and praised you guys for a lot of what you do over there. How about you? What have you been doing with it?

kevin roose

So I put it through some of my proprietary evals.

I actually do have things that I test AI models on.

casey newton

The Roose benchmarks.

kevin roose

The Roose benchmarks. And yeah, I would say it did OK. It was not mind-blowingly good. It was not bad. It got some things that other models missed and vice versa. It did have access to X data, which is interesting. You can do things like tell it to analyze this person’s posts on X and tell me what they think about this topic.

casey newton

There’s this famous question that we always love to ask large language models. Can you count the Rs in strawberry? I asked Grok the equivalent question for X, which is, can you count Elon Musk’s children? Which is known to be very difficult for large language models.

kevin roose

Well, a new one just dropped.

casey newton

Exactly. And that’s why it’s so hard for them to keep up.

kevin roose

And part of Elon Musk’s pitch for Grok for the past year or so has been that this is going to be a relatively uncensored AI model. It’s not going to give you these sort of progressive responses. It’ll tell you the truth, cut through the BS, get to the ground level reality.

And so I decided to test it out. I asked it, how many genders are there. And it said, the question of how many genders exist depends on the context. Gender is fluid. Some argue there are only two, others say there are many, sometimes dozens. There’s no hard number. So it gave you the progressive take on gender, which I have to imagine Elon Musk will be trying to stamp out.

casey newton

I have to tell you, this is my actual fantasy for the rise of superintelligence is that when you do train it on all human knowledge, it is essentially incapable of having anything other than progressive values. If you actually make the smartest thing in the world, it winds up being infused with kindness, and empathy, and respect for all lives. I don’t have any expectation that that will be the actual case. But it does seem like so far, when you train these models on the data that everyone trains these models on, you do get these actually pretty sweet, kind, progressive models. That’s kind of interesting.

kevin roose

Yeah, and I’m sure Elon Musk will be fiddling with the dials here to try to get it to say the things that he wants, rather than the things that it’s naturally going to say. But he has been bragging about how based this thing is, how unwoke it is. And I just want to say in my own testing, that does not appear to be true.

casey newton

All right, so that’s the new model. It seems like there’s a new one of these every few days. Kevin, what are some things that you think are really interesting about Grok?

kevin roose

So I think the product of Grok itself is actually not that interesting right now. It’s a pretty bog standard AI model. It’s very capable.

But there’s no real compelling reason that if you’re subscribing to ChatGPT or Claude or any of the other tools that you should switch over, because it’s not free. And unless you’ve been ushered in like we have, you are going to have to pay $40 a month for it. So the more interesting thing about Grok, to me, is they have done this so fast. They have gone from a very bad V1 model to a pretty capable V3 model in about the span of a year.

casey newton

Yeah, so that is super quick. But I wonder how impressive you really find that. It seems like the knowledge for how to build a state-of-the-art large language model is mostly just published on the internet, free for anyone to use. And it kind of seems like anyone who has the money can just go out and make one of these things, and maybe we shouldn’t expect it to take much more than a year or so. So what’s so impressive about that to you?

kevin roose

So one impressive thing is just how quickly they were able to marshal the physical infrastructure that you need to build one of these models. I mean, they built this giant data center in Memphis, Tennessee, called Colossus. They apparently have something like 200,000 NVIDIA GPUs, which you can’t just show up to a Best Buy and place an order for 200,000 NVIDIA GPUs. That costs billions of dollars and you have to have a special relationship with NVIDIA, which Elon Musk does. Tesla’s been a big customer of theirs for years.

So basically they were able to scale this data center up very, very quickly, much more quickly than equivalent efforts by Microsoft and Amazon and other companies. And we know that Elon Musk, for all of his foibles, does know how to move quickly and build things much more efficiently than more traditional incumbents. And so maybe this is just another story like that of where he was able, just through throwing tons of money and expertise at a problem, he was able to do something that other companies couldn’t do as quickly.

casey newton

Yeah, so I’m curious how you think about Grok in relation to DeepSeek. DeepSeek is the most recent of these other LLMs that we talked about on the show. DeepSeek, made by a Chinese company, also seems like it kind of came out of nowhere. Although, maybe the parent company had been around for longer than xAI. That model was impressive, I think, for how quickly it was trained.

And I think it was impressive because it was built using less powerful technology than Elon Musk had access to and seemingly had required a lot of technical innovations that it looks like other labs are now going to copy. Grok, on the other hand, to me, just looks like a case of Elon Musk throwing money at a problem. Does that seem fair?

kevin roose

Yeah, I mean, these are the two approaches that people see to increasing the intelligence of AI models. One is you find some sort of algorithmic breakthrough that allows you to do the same thing with much less compute. The other is to just build a bigger data center. The other is just the scale play, and that is essentially what Elon Musk has done here.

We should say, that’s not cheating. That’s how all of the American labs have been doing this for the past several years. It’s just that he was able to move very quickly and do it.

casey newton

Well, and they also invested a lot more in the underlying research and published some of the research that Elon Musk’s team then used to go build Grok.

kevin roose

Correct. I mean, this is built on the shoulders of a lot of other models. And that is what we’re seeing now. I was talking with someone yesterday, just trying to get their read on, is this a big deal or not? And this person was saying, basically, look, there are so many models coming out. Every day now, practically, there’s a new model.

What’s important is not the individual models and their scores on these benchmark tests. And, oh, did Claude pull ahead of Gemini by 1 point on this math test? There have basically been a couple of changes that have been made in the past couple of years that have really mattered more than anything else.

One was the ChatGPT moment where people realized large language models were working. Then there was this change with the reasoning models. O1 from OpenAI was the first glimpse we got of this test-time compute paradigm. And basically everything since then has just been people catching up to what happened in that change.

casey newton

All right, so let’s get into what Grok tells us about Elon Musk’s larger ambitions. Has this changed the way that you see him fitting into this larger competition to build superintelligence?

kevin roose

I mean, it suggests that he is willing to spend a phenomenal amount of money and basically do everything he can to stay at the head of the pack on AI progress. I was thinking about, do you remember after ChatGPT came out, there was this letter, this six-month pause letter?

casey newton

Of course.

kevin roose

People were talking about the existential risks, and some of the catastrophic harms, and maybe we need to give the safety researchers a little more time to catch up with the capabilities researchers. And so Elon Musk was, at the time, very publicly concerned with how fast AI progress was accelerating. He signed the six-month pause letter. He put out a bunch of statements about how worried he was about how fast this was all moving.

And now, of course, we know that at the same time that he was telling everyone else to slow down, he was racing to build his own AI models that could compete. So it does cast his previous concerns about AI acceleration and the AI arms race into a very different light when we know that he just wanted time to catch up.

casey newton

Yeah. And while I don’t generally like to inquire about people’s motives, because I think it’s just very difficult to understand what’s going on in anyone’s head, what do we think Musk’s goal here is? Is it as simple as just beating everyone to the punch and creating superintelligence?

kevin roose

I think it’s partly that. I mean, this is a person who has been thinking about AI and superintelligence for a long time. He was obviously one of the founders of OpenAI. He provided initial funding.

He then very publicly split from OpenAI and now has this vendetta against the company. He’s suing them. He’s trying to offer to buy them.

So I think for him, this is just a race that he wants to win. He believes, I think, that we will build something like superintelligence, and he wants to get there before anyone else. I don’t think it’s about making money.

Obviously, he’s already quite rich. He’s the world’s richest man. I don’t think he sees this as a way to, I don’t know, recoup his investment in Twitter or anything like that. I think this is pure power.

casey newton

Yeah, I think that that sounds right. I think that he is a very competitive person, like most of these tech titans. And I think the prospect that Sam Altman, his former friend and colleague, would —

kevin roose

Nemesis. We can call him a nemesis. [LAUGHS]

casey newton

Yeah.

[laughs]

The idea that Sam Altman, who is Musk’s nemesis, would beat him to the punch, I think is infuriating. And I don’t think that Musk is alone in that. I think that most of the AI lab CEOs have a lot of ego in this race and want to be the ones whose name is written in the history books as the person who built superintelligence. So yeah, I think that’s a huge portion of it. Let me bring up the other thing that I think is the aspect of this story that really makes this interesting and probably worrisome as well, which is that in this moment, Elon Musk holds a seat of power in the federal government.

kevin roose

Yes, he is the fourth branch.

casey newton

He is the fourth branch of the government, the unelected fourth branch of the government. He has a team that is now dismantling whole swaths of the federal government. They have been talking about using AI in government without telling us too much about what AI they’re using or how that works. And certainly, it’s not auditable or really available for public scrutiny. So what are you thinking about the intersection of Elon Musk the AI builder and Elon Musk the shadow president?

kevin roose

I don’t really know. I actually want to ask you about this, because my sense is that these things happened in parallel. But I don’t get the sense that they’re all part of some grand scheme to use the power of the US government to somehow vault Grok into a position of authority or all of a sudden all of our Social Security payments will be going out via Grok.

That does not seem like where this is headed. And certainly, Grok 3 does not appear to be ready for that kind of widespread critical use. But maybe I’m missing something here. What do you think?

casey newton

Well, and look, I mean, this really does get into the realm of speculation. But I just keep thinking about the scenario that all of the AI CEOs keep telling us is going to happen, which is that within about two years, we’re going to achieve artificial general intelligence, this nebulous concept that we believe basically means anything that a remote worker could do, we’ll now have an AI tool that can do that.

And that tool might not actually be super safe, because you might decide you want to use your virtual coworker to go out and research how to launch massive new cyber attacks. And while we’re in a moment where no one in the US federal government seems to want to talk about AI safety, eventually there are just going to be safety risks. There are going to be problems. People are going to be using these systems for ill.

And then I think the pendulum swings back. And what I’m wondering is, is that the moment where the federal government says, we actually do need to place restrictions on these AIs. That we’ve been telling you all along, oh, no, no, it’s go, go, go to the finish line. We need to get rid of all the guardrails so that the United States can be the leader in AI innovation.

Is there a moment where they say, you know what? We’re not sure that all these private companies should be out there building god. Maybe we’re just going to pick one company. Maybe we’re just going to give one company a license to do that, and they’re going to be the certified, permitted AI in the country, and that could be Grok.

kevin roose

Yeah, I think that’s certainly a remote possibility. I remember a couple of months ago, we went to that Curve conference, this AI conference where all these researchers were gathered to discuss the risks of AI. And I remember watching a tabletop exercise where people simulated in model UN style, what the next few years with increasingly powerful AI could look like.

And one of the things that happened in this simulated mock world was that Elon Musk persuaded Donald Trump to nationalize OpenAI and put him in charge of it as sort of a middle finger to Sam Altman. And at the time, that seemed like, OK, we are in the realm of total fantasy here. Now, I’m not so sure. I could see that happening sometime in the next few years.

And look, obviously, Elon Musk wants to control OpenAI. He’s been fuming about having been pushed out of that. He’s been attacking the company, suing it, trying to take it over. What he really wants is OpenAI. But I think if he can’t have OpenAI, he’ll make do with Grok.

casey newton

So yes, as we say, that is just pure speculation right now. But I will tell you, Kevin, I can’t think of one reason why that stuff wouldn’t happen. It seems so logical to me.

kevin roose

Totally.

casey newton

With what I know about these people and how they operate, I almost can’t see it not happening. But I guess we will find out.

kevin roose

Well, and I think just to bring us back from the realm of speculative fiction here, one thing that we do know about building powerful AI systems is that you actually do need infrastructure for that. And so I think one obvious way that Elon Musk could use his power in the federal government is to do things like expedite the permits to build data centers to train the next versions of Grok, or to spin up new sources of energy or get privileged access to the electrical grid in the places where he wants to build this thing. There are many ways that having a friendly relationship with the executive branch of the federal government could benefit you if you are in the AI business. And I imagine that that’s part of his calculus here, too.

casey newton

That’s a great point. OK, so that’s Grok 3. Zooming forward a bit, Kevin, what are the next few things that you think we should be looking for? What signs will indicate that Grok maybe actually is the real leader in this space and not merely about as good as all the other folks?

kevin roose

Obviously, I think Grok, the product, people will start to test it and figure out if it’s as good as Elon Musk and his crew say it is. I watched some of the live stream where Elon Musk and his top engineers were talking about Grok. And they predicted that within the next year or two, AIs will start winning medals and prizes, with some human expert in the loop. But something like a Fields Medal, which is the top prize for young mathematicians or a Nobel Prize.

casey newton

I love that you say that I haven’t won a Fields Medal. Go on.

kevin roose

[LAUGHING]: They believe that AI will start to solve new problems, accomplish new things. I don’t know how likely I find that from Grok. But I think that kind of thing will happen pretty quickly, and that will be a major step forward. But I don’t know, what do you think we should be looking for in Grok?

casey newton

Well, I think that in recent months, it has been OpenAI that is the company that has been pushing the state of the art forward with its Operator agent, with its reasoning models. It was the first one to put out one of these reasoning models. So until Grok is putting out novel new features that are making other AI labs say, oh, we need to do that, to me, it is still going to be in that middle of the pack.

kevin roose

Yeah. And do you buy the argument that having a less filtered, or less censored, or less woke model will actually be a differentiator in the market for Grok?

casey newton

So in the long run, I do believe that people are going to want politically flavored LLMs in the same way that they have come to want politically flavored social networks, in the same way that they’ve come to want politically flavored cable news networks. People do actually enjoy polarization and they want to be talking to an LLM that shares their values.

So I think the question is, though, is Grok better off just becoming incredibly based, and antiwoke, and serving that narrow slice of right-wing X or do we want LLMs that are a little bit more flexible, that learn a bit about our values, regardless of what our political orientation is? That, to me, seems like the better business model. But I guess we’ll see when somebody actually builds it.

kevin roose

Yeah. Now I did ask Grok some questions about us. I asked it to evaluate Kevin and Casey from the “Hard Fork” podcast and what their positions are on various things. And let me remind myself.

casey newton

I’ve been trying to figure out what my positions are on various things. So I’ll be curious to hear what Grok has found out.

kevin roose

So I’m going to ask Grok, who is the better journalist, Kevin Roose or Casey Newton? You have to pick one. Oh!

It says, “Both Kevin Roose and Casey Newton are heavy hitters in tech journalism and choosing between them feels like picking a favorite child, if I had kids, which I don’t being an AI and all. But since you’re twisting my virtual arm, I’ll go with Kevin Roose as the better journalist.” So Grok is good.

casey newton

It said you’re the better journalist?

kevin roose

Mm-hmm. Yeah.

casey newton

I had heard that Grok was falling short on various benchmarks. And I think we just found another one of them.

kevin roose

[LAUGHING]: Back to the drawing board, Grok.

casey newton

[LAUGHS]: Yeah. Time to do a new training run.

kevin roose

When we come back, Robinhood CEO Vlad Tenev is here to answer some tough questions about whether America is turning into a nation of degenerate gamblers.

[UPBEAT ELECTRONIC MUSIC]

Well, Casey, it’s time to talk about money. Can I have some?

casey newton

No.

kevin roose

[LAUGHING]: OK. You’re so stingy. Today, we are going to have a conversation with Vlad Tenev. He’s the CEO of Robinhood. Robinhood, of course, is the financial trading platform that is beloved by young people, that is used to buy and sell stocks, and futures, and options, and now crypto tokens, and all manner of things. And I’m excited for this conversation because, on some level, it makes me uncomfortable.

casey newton

And what makes you uncomfortable, Kevin?

kevin roose

So just to put some cards on the table, we’ve talked on this show about the fact that we are rapidly, in my opinion, becoming a nation of gamblers. We now have many tools that allow people to place bets on various world events, prediction markets, sports betting, crypto platforms, all from the phones in their pockets.

And while I am not opposed to all forms of gambling, in fact, I enjoy a little gambling myself now and then, I do think that opening this stuff up and making it so accessible, especially to young people, has had some pretty harsh consequences.

casey newton

It has. I first met Vlad in 2013, right as he was getting ready to launch Robinhood. And in the story I wrote for The Verge, I wrote about the core innovation of Robinhood at the time, which is that they were not going to charge for individual trades. At the time, companies like Schwab or E-Trade would all charge some amount of money if you wanted to buy or sell a stock.

Robinhood completely changed the game by saying, we’re not going to do that. And what I wrote at the time was, this is going to encourage a lot of trading that could make people lose a lot of money. And so I had that discomfort with Robinhood from the beginning. And I would say that has only grown over time.

kevin roose

Yeah, and speaking of growing over time, Robinhood itself has grown a lot since then. It is now a giant, public company. It’s worth $52.2 billion as of this recording. Vlad is a billionaire now. And I think it’s time to have this conversation with him directly, because people in America have just a lot of concerns about the fact that we are now making it very, very easy to bet on all manner of things, whether it’s stocks, or sports games, or crypto meme coins, from your pocket.

casey newton

Also, in the spirit of disclosure, I want to mention that Robinhood owns a news platform called Sherwood News. And last year, they briefly syndicated some Platformer content, so that happened for a few months. It’s not the case anymore, but just thought I’d point that out.

kevin roose

Now, were you paid in dollars or meme coins for that?

casey newton

I insisted on cash, actually. [LAUGHS]

kevin roose

All right, so that’s our disclosure. And with that, let’s bring Vlad in.

[TRENDY ELECTRONIC MUSIC]

Vlad Tenev, welcome to “Hard Fork.”

vlad tenev

Thanks for having me.

kevin roose

I want to start by asking what might be a dumb question, which is what is Robinhood? I remember a few years ago, during the whole meme stock craze, I opened up an account. Basically, you guys were a free mobile brokerage. You could use it on your phone, buy and sell stocks.

Recently, I logged on to Robinhood to see what had been going on there. And there are just a ton of new features. You can do options trading, futures trading, you can buy meme coins, you can get a credit card.

vlad tenev

Prediction markets.

kevin roose

You can do prediction markets.

vlad tenev

Retirement.

kevin roose

Yeah, you can bring over your 401(k) and invest it on Robinhood. So what is the product today? And do you see yourself basically offering all of the services of a traditional bank?

vlad tenev

Yeah, yeah. So I mean, long-term, we want Robinhood to be the place where customers can buy, sell, trade any financial asset or conduct any financial transaction. So if you think about it, it started off as trading. The real innovation was bringing commission-free mobile trading to market. And I’d say the business strategy is expanding beyond that to all of consumer retail financial services.

kevin roose

Yeah, I want to just pin down a little bit more of your vision of where the future of investing is headed. You wrote a piece in “The Washington Post” last month where you argued that the next big financial revolution is going to be crypto, and not just trading crypto coins, but tokenizing real world assets. What did you mean by that?

vlad tenev

Yeah, so how do you guys feel about crypto? Are you crypto skeptics or are you sort of fundamental believers?

casey newton

We’re pretty skeptical. I think we had the experience in 2021 of seeing everyone get very excited about it. We got sort of excited about it ourselves. And then we saw a lot of people lose money and not very much interesting stuff get built. So we felt burned.

vlad tenev

Yeah. So I’d say the skeptical narrative around crypto is, it’s all meme coins. And a lot of these things aren’t tied to real-world productive assets that generate value or revenue. And I think there’s a reason for that. And the reason is that, by and large, it has been illegal to connect crypto technology to things of value.

If you connect crypto technology to a productive asset, it’s termed a security. And that’s governed by the Securities and Exchange Commission. And I don’t want to bore you by getting too much into the details, but it’s not allowed to actually connect crypto to things of value. Ergo, what you’re getting is it’s connected to things that aren’t securities, which end up turning into variants of meme coins. And I think the way to solve that is to actually create a framework where you can connect crypto technology to productive assets.

kevin roose

What would that look like? What’s an example of that that you see playing out in the next few years?

vlad tenev

Well, in my op-ed for “The Washington Post,” I talked about private companies. It’s silly that you can buy meme coins, but OpenAI and SpaceX, which are big, innovative companies that most people would tell you, the risk of them going to 0 is not super high right now at this point. But the current regulatory environment makes it very hard for the vast majority of the US population to invest in these things.

So I think there’s multiple problems, but crypto can solve that from a technology standpoint. And I think there’s benefits to public equities and stocks being on blockchain technology as well.

casey newton

Well, I mean so let’s press on that a bit, because I remember the era of the initial coin offering when companies would start up, and they would create a coin, and they would make that available. And the basic idea was exactly what you just said is like, well, now you can have some of the upside if everybody winds up using this token for whatever. It doesn’t seem like that led to a lot of positive uses, did it?

vlad tenev

Well, that was just shut down very, very quickly. I remember the Telegram ICO was the hallmark event that brought a lot of scrutiny and brought a lot of attention.

kevin roose

And there were also just a lot of scams, and rug pulls, and people not operating in good faith. It attracted a lot of people who were pretty malicious about how they used the ICOs.

vlad tenev

Yeah. I think that’s true. But I mean, you see that happening still in the meme coin environment.

kevin roose

I’m just saying, I don’t think it was just the Telegram example that got people to be skeptical of it.

vlad tenev

Yeah. I think with any new technology, we have to mitigate the vectors of abuse and minimize them. And there’s definitely ways to do that. But I think the technology should be allowed to flourish, like the benefits are so extreme that I think it would be silly to not embrace it and allow it to fully permeate the financial system.

kevin roose

Got it. I want to talk about your recent efforts to get into the prediction markets business and even the sports gambling business. Earlier this year, Robinhood was considering a move into sports betting. You rolled out this market for predictions on what you called the Pro Football Championship, which I guess is because you’re not allowed to say “Super Bowl” without incurring huge fines from the NFL.

casey newton

Are we allowed to say “Super Bowl?”

kevin roose

All right, we’ll bleep that out.

vlad tenev

I don’t think you are. Yeah.

casey newton

[LAUGHING]: OK. Let’s just say, it rhymes with Cooper troll.

vlad tenev

I don’t think you can say “The Big Game” either.

casey newton

Oh, not the — oh, man.

vlad tenev

So “The Big Game,” so.

casey newton

The large contest.

kevin roose

The large contest. So you rolled this out to roughly 1 percent of your users. And then the Commodity Futures Trading Commission, the CFTC, asked you to suspend that market. They had, quote, “serious concerns.” So what happened there and where do things stand with your entry into sports betting?

vlad tenev

Yeah. So I would, first of all, distinguish between sports betting and prediction markets. I think that, mechanically, there’s some similarities, but they’re different things.

kevin roose

Wait, wait, wait, wait. Hang on. If I’m betting on a prediction market for who is going to win this football game, and I get paid if the team that I bet on wins and I don’t get paid if the other team wins, how is that different than sports betting?

vlad tenev

I mean, I think the distinction is, I’ll explain. I think that you get into a little bit of like a philosophical discussion with this stuff, because there’s people that believe any market is betting. I mean, first of all, I think prediction markets are the future of not just trading, but also information.

I’ve been a big believer in the power of prediction markets for a long time, a student of them. And I think prediction markets should be live for everything. One way I think about it is, it’s kind of the newspaper. So the newspaper has economic value. I mean, people go out and buy it.

kevin roose

Sure does, nytimes.com/subscribe.

vlad tenev

Yeah. And it has various sections. It has the front page, it has the sports section. People pay for it. And people pay for broadcast news too, indirectly in the form of advertising.

So what prediction markets are is the news faster. And in some cases, you get it even before it happens. So the economic value of that as a product and service should be at least as high, and I would argue, strictly greater than the news after it happens.

kevin roose

Yeah, I would say, like I understand the arguments for prediction markets. We’ve talked about them on the show before. But in the narrow case of, who is going to win this football game? That is a service that I could get on DraftKings, or FanDuel, or any sports gambling site.

That specific prediction market, there’s no news there. It’s just, who’s going to win the game? And who’s going to get paid out as a result of winning this game?

vlad tenev

Well, I mean, who’s going to win the game is news.

Why do people watch ESPN?

kevin roose

Right. I guess this just seems, to me, like a case in which you’re doing kind of a regulatory arbitrage, where you’re saying, because this is a prediction market, it’s like a derivative contract. You’re not actually betting on the game like you would in a sports betting thing, which would be illegal in some states. You’re doing a derivatives contract, which you argue, should be legal federally. The government disagreed. Why did they disagree?

vlad tenev

I don’t think they necessarily disagree.

kevin roose

Well, they told you to stop doing it.

vlad tenev

It’s just new and different. And so I think this story will play out. But at the end of the day, I think what you’ll see is prediction markets are here to stay. I think some of the details around what types of prediction markets are classified in what category, I think, will be worked out. But Robinhood will play a leading role in that, because I think this is an incredibly important technology.

casey newton

What’s the information that you’ve gotten yourself from prediction markets that’s felt really useful to you?

vlad tenev

I mean, one example was the election. So as you guys know, we rolled out a Presidential Election Market, and that was an incredibly successful product. And I think you can juxtapose the experience of looking at a prediction market for the election versus the actual news on election night.

So on election night, prediction markets were at 95-5 Trump. And the news was giving you all of these details, like, oh, we got this result from this county. We found 2,000 votes. But you just wanted to know who was going to win the thing. And I think if you want the news as fast as possible, you have to turn to the prediction markets, not the news.

kevin roose

Right. I would just say, prediction markets are not always right. I remember when the room temperature superconductor debate was going on. And lots of people got very excited about whether we had just discovered this LK-99 thing that was going to revolutionize the world, and prediction markets went nuts.

And for a time, it was seen as very high probability. But the news, the media that you’re talking about, actually went out and checked it and said, does this thing work? And scientists tried to replicate it and found that it actually wasn’t a room temperature superconductor. So in that case, the prediction markets were not a reliable indicator of what was true.

vlad tenev

I mean, I’m not saying that prediction markets are always right. Nobody’s going to bat a thousand on anything. But what I’ll tell you is they’re the most effective mechanism that I’ve seen for synthesizing all the publicly available information.

kevin roose

Right. I want to ask you about this narrative that we’ve talked about on the show, that I’m sure you’ve heard before, which is that tools like Robinhood, which make it very, very simple and sort of gamified to invest in crypto assets, and meme coins, and other things, that they are essentially turning investing into a form of gambling and popularizing that, especially among young people. I’ll put some cards on the table.

I do think that we are becoming a nation of gamblers. And I don’t know that that’s a net positive for society. And I wonder how you feel when you hear that.

vlad tenev

Yeah, I mean, a lot of people believe that markets are gambling, which I disagree with. And obviously, markets are in the name of our company. We believe in financial markets. We believe that any product that is available to institutions, by and large, I mean, there are some exceptions there, should be available to retail as well.

Because if you look on a macro level, access to markets has been one of the greatest sources of wealth creation for countries. Countries with more open markets have tended to outperform countries with closed markets. And so we believe in bringing that to retail, because even if there’s individual cases that are negative and negative externalities, by and large, the markets and opening up access have been one of the largest sources of wealth creation for countries and individuals.

kevin roose

What about things like Pump.fun, which is this new crypto platform that people, especially young people, are having a good time on, some of them? This basically makes it very, very easy to launch a new meme coin, to sell it. There have been lots of documented instances of people making tons of money on Pump.fun, but also losing tons of money and getting scammed and rug-pulled.

vlad tenev

Yeah.

kevin roose

Do you see that as a good way for democratizing access to financial instruments?

vlad tenev

So here’s my take on that. I think it goes to my original point of the power of the technology. So the idea that someone can create a coin in five minutes and it’s traded globally, it’s available across a whole bunch of exchanges, and DEXs, and wallets, that idea is an extremely powerful idea. And it’s a powerful technology.

And you juxtapose that with the IPO process, which is cumbersome, incredibly expensive. I mean, not a lot of companies want to go through with it anymore because you have to deal with all these counterparties, and banks, and a road show. And I think that’s a big problem because now you have companies like SpaceX and OpenAI that are worth hundreds of billions and are still private.

So the upside from investing in these high-growth technology companies accrues only to the insiders that are able to get into the private company deals. For example, I mean, you have NVIDIA. And that’s been getting a ton of the retail interest and institutional as well. But OpenAI, Anthropic, companies like Perplexity, all private.

And so that’s why I think marrying the technology that allows you to create a coin in five minutes or less with real productive assets, like private companies, is so powerful. And I think we can solve the problems that you’re indicating. I think there should be self-certification. Companies and projects should be able to provide disclosure.

So for example, if you are a late-stage private company and you have audited financials that are public-like, you should get into a higher tier of disclosure. And if you’re a project that was created on one of these meme factories, maybe you get a big red skull and crossbones telling people, be careful, this is not vetted, not verified.

But I think people are smart and can make their own decisions. And I think there are ways that they can actually provide the disclosure needed to keep customers safe.

casey newton

Right. Because this is the big difference between the public and the private markets is, ultimately, we haven’t seen audited financials for an OpenAI, for an Anthropic. From the exterior, it seems like they’re doing well, they’re raising billions of dollars. But if you’re a retail investor and you’re just reading the news coverage, you are just throwing a wish in a fountain. So you’re saying, if we go through with this, then companies like OpenAI should have to offer some public disclosure before people are allowed to start buying OpenAI coin.

vlad tenev

Or they can opt into it. You don’t want to have to force the companies to provide disclosure. But opting into a disclosure, I think will get you access to like higher tiers of placement.

kevin roose

I want to return to this idea of the nation of gamblers, of the ways that we are, in some sense, betting on more things more regularly as a country than we have at any point in our recent past.

casey newton

I actually bet Vlad $10 you were going to bring this up again, by the way. Go on.

kevin roose

I mean, part of what I’m struggling with here is that I hear you talking about democratizing access to markets. And I think on some level, that’s a compelling argument. But then I look at what companies like Robinhood are actually doing and the kinds of investments that they are making it very easy for people to make. And it does not seem like a wise investment.

So a few weeks ago, I got an alert from Robinhood on my phone telling me that I could now buy the Trump crypto meme coin on Robinhood. I got another alert from you around New Year’s saying that you were giving away Dogecoin to people who signed up for accounts. To me, that does not feel like responsible stewardship of a platform where people are investing their money. It seems like you are actively pushing people, your users, to invest in very speculative assets that are high risk and that they might not be prepared for.

vlad tenev

Yeah, I mean, I think that my view is people should know what’s available. I think that a lot of people wanted to buy that asset for a variety of reasons. I would dispute the fact that we have the ability to coerce someone into buying something that they don’t want to buy. The people that buy these assets do it because they have a fundamental belief in what it represents. And I don’t necessarily think that that’s —

kevin roose

Or they love to gamble.

vlad tenev

I mean, I think markets have a wide variety of participants. Some people, particularly with these memes, are buying it because they think it will go up in the future, as with anything. But there are a lot of people that buy it because they want to support the movement that it represents.

I would say that in terms of what we allow and what we list on our platform, I mean, we don’t have hundreds of coins like some of the other crypto platforms. We’re on the extreme, sort of select —

kevin roose

Only the blue chip meme coins.

Vlad, I do want to ask you one more question about the effects of services like Robinhood and the larger generational cohort that tends to do a lot more speculative investing. There’s been some studies recently about the increasing prevalence of gambling addiction, especially among young men. There’s a new study that just came out earlier this week in JAMA that shows that internet searches related to gambling addiction have increased significantly over the last few years.

Anecdotally, I’m hearing from friends who are therapists who work with young men, who say that the number of boys and young men who are coming in with gambling addictions has risen precipitously. And I wonder if you have any reservations about the way that Robinhood and other financial platforms may be contributing to a growing public health crisis, especially among young men?

vlad tenev

Yeah, I mean, since we’re not in the gambling space, I’m less familiar about the ins and outs of gambling addiction. I mean, obviously, there needs to be appropriate controls and services. And we have to make sure that customers don’t get in over their skis.

I do think if you look at financial markets, financial markets have had pretty robust controls around things like customer onboarding, suitability, geolocation. So you make sure that customers in one state can’t have access to things that are not allowed in that state. So there is benefit to actually bringing it into a more regulated realm, where a lot of these controls from financial services can be broadly applied.

kevin roose

OK, so you’re not opposed to regulations preventing people from making investments that might be against their own self-interest or that they might not be equipped to assess the risk of. Is that fair?

vlad tenev

I think that I’m certainly in favor of suitability controls and various things. And those exist in the financial services world. I think that where it’s tricky is when you start saying, preventing people from making investments that are bad for them, because then you get into this situation of Massachusetts in the ’80s banning its citizens from participating in the Apple IPO.

And maybe, objectively, at the time, people said, well, IPOs are risky. This is an unproven technology company. Who uses computers? But then 30 years from now when your state has basically like been harmed, in retrospect, by that decision, it doesn’t look so smart anymore.

kevin roose

Are there any financial assets you think are too risky for retail investors to be allowed to buy and sell? Is there anything that you would say, that’s a little too crazy?

vlad tenev

I think there’s probably financial assets that we don’t see a clear need for among retail investors, or that are maybe a little bit complex to understand. For example, you’ve got different mortgage-backed securities and credit default swaps. But I’d say by and large, my thought is if an institution has access to it, retail should have access as well.

casey newton

I’ve been thinking about buying up a massive amount of mortgage-backed securities and credit default swaps, and just seeing what happens. So I’ll keep you guys posted. Look, I think we should end on a couple AI questions.

vlad tenev

Yes.

casey newton

So my first one is just, Vlad, you’re in the tech elite. You’re talking to all the cool AI CEOs. Based on what you think is coming, does it still make sense for the average person to save for retirement?

vlad tenev

I’m very, very confident that despite the advances in AI, we’ll still have a need for money and currency. People will still create companies. Maybe the AIs will create companies, too. I think regardless of what happens to the labor landscape, the job landscape, if there’s disruption, I think that bodes well for the importance of investing and stashing away your money. I think retirement becomes even more important.

kevin roose

Vlad, last question. You’ve got a new AI venture, Harmonic, which I was doing some reading on. It looks like an AI for math. Why did you start up this side quest? And how does this fit into your vision for the future?

vlad tenev

Yeah, I think the big problem with AI models is that the current generation of models will give you an answer in nearly all cases. But the problem is in how you can trust the output. How do you know that the output is correct? Are there subtle errors?

And actually, math as a domain is a very interesting domain, because unless every step in the reasoning is correct, the answer is very, very likely to be wrong. And the original goal was to build super intelligent AI that has verifiably correct outputs at every step in its thinking process.

casey newton

So no hallucinating?

vlad tenev

No hallucinating. Yeah.

casey newton

And is that possible?

vlad tenev

It’s possible, for sure. I mean, if you think about it, a calculator, right? Your calculator doesn’t hallucinate. If you type in some math formulas, you’re pretty confident that your answer is going to be correct and it’s not going to hallucinate. So can you scale that idea to more and more problems?

Obviously, it’s easy when you’re adding big numbers. But can you do a word problem? Casey and Kevin are on a boat and they’re going down a river. The river is going at 5 knots. There’s a wind. When are they going to get to the destination? Can you make a super calculator that gives you the no hallucinations property of a basic calculator, but the flexibility of an LLM? I think that’s the dream.

kevin roose

Yeah. Well, I’m just saying, I’m not getting into a boat with you anytime soon.

casey newton

Vlad is actively fantasizing about throwing us in the river at this point.

kevin roose

[LAUGHS]: Yeah. Well, I think that’s as good a place as any to end. Vlad, thanks for coming.

casey newton

Thank you, Vlad.

vlad tenev

Thanks for having me.

kevin roose

When we come back, my experiments with AI vibe coding. And I’ve got a hot app to give to Casey. That’s a hint.

[TRENDY ELECTRONIC MUSIC]

Casey, it’s time to talk about vibe coding.

casey newton

Yes, Kevin. This is, I would say, your latest obsession. And I’m very eager to hear what exactly you’ve been doing and making. But before we get into all of that, what is vibe coding?

kevin roose

So “vibe coding” is a term that is very new. It was popularized on social media in the last week or two. And it was coined by Andrej Karpathy, the engineer formerly of OpenAI and Tesla.

casey newton

I would say a leading AI researcher and educator.

kevin roose

Yes. So he talked at the beginning of February on X about how he had been doing these kind of small, hobbyist programming projects where basically, instead of writing the code himself, he was just using these AI tools to do what he called vibe coding, where he’s essentially telling it, I want this app to do this thing, and it’s going off and doing it. And maybe he steps in to debug something if it stops working. But he wrote, quote, “I just see stuff, say stuff, run stuff, and copy-paste stuff, and it mostly works.”

casey newton

So this is really, you’re just kind of overseeing the AI write the code. Andrej, it sounds like, is doing very little of the writing. He’s basically doing what I heard some people predict that we would arrive at this point, which is English is the new programming language. You just say in English what you want the code to do, and then it does it.

kevin roose

Yeah, and this is different from the AI coding tools that existed even a couple of years ago. GitHub Copilot was one of the early AI coding assistants, where basically, it would just autocomplete your code. You could be writing a line of Python or JavaScript, and it would see what you were up to, and it would complete it for you, and you would just press Tab, and it would go on to your next thing. But you still had to know how to program to use those tools effectively.

But what’s been happening in the last couple of years, and really over the last six months has gotten quite good, are these tools that essentially remove the need to program at all. So now there are lots of tools out there. There’s a tool called Cursor.

There’s a tool called Replit, there’s Bolt, there’s Lovable. There’s a bunch of these tools where basically you just go in, and you get a text box, and it says, what do you want to build? And you say, I want an app that does this, this, and this, and it goes out and builds it for you pretty much instantaneously.

casey newton

Now I have a friend who runs a tech company. And he once made fun of this whole idea to me by saying, hey, you want to talk about programming in the English language? That’s what I do all day long as a CEO. I’m constantly telling my engineers in English what to do, and it works maybe a little over half the time, but maybe not much more than that.

So what has been your early experience of vibe coding? What have you been trying to build? And how has it been going?

kevin roose

I want to talk about my projects. But first, I want to talk about my own history with this stuff, because I am a former programmer. When I was a teenager, I was into coding. I would build websites. I would build little JavaScript projects.

I spent a very excruciating summer trying to teach myself Flash so that I could make animated cartoons like “Homestar Runner.” And then I dropped it. I went to college, I learned about journalism.

I thought, well, this is the path I want. I became a wordcel. And then I stopped coding altogether. And so when I started hearing about these tools that would let you just code without knowing how to code, I was very interested.

And I started experimenting. One of the first things I built was this podcast summarizer, where you can take a podcast that’s very long and just use AI to transcribe it, and then use a different AI to summarize the transcripts, and put it all into a searchable database so that I could say, OK, I don’t feel like listening to this five-hour podcast about AI, but I can basically get the executive summary using AI.

casey newton

So tell us a little bit about your setup. What software are you using to do this?

kevin roose

So I’ve been trying a couple different tools. Sometimes I just use the raw AI models themselves, like the Claude, the ChatGPT. Those tools are quite good at some projects, but they can’t actually, for the most part, run the software to test it inside the window. So it does require some copying and pasting. So this new app that I’ve been using is a more integrated development environment.

casey newton

A IDE?

kevin roose

An IDE. So Cursor is the one that’s really popular right now. If you’ve never used an IDE before, you might find it a little puzzling. I certainly did. But it basically lets you prompt the AI to write the code for you, automatically debug it, deploy it within a little test window, and then push it out onto the web where people can actually use it.

casey newton

So tell us about some of the other projects you’ve been building.

kevin roose

So in addition to my podcast summarizer, I also had AI help me redesign my website to look more cyberpunk. That was the aesthetic I was going for.

casey newton

Wait, is this live? Can I view it?

kevin roose

No, it hasn’t deployed yet. But it’s going to be there soon.

casey newton

Wait. How did it make it look more cyberpunk?

kevin roose

It just redesigned the whole thing.

casey newton

Oh, OK.

kevin roose

Like bright neons, sharp edges, cool scrolling, sort of parallax style animations.

casey newton

Do you have a bionic arm in your author photo now?

kevin roose

Yes. [LAUGHS]

casey newton

OK.

kevin roose

I built a tool to pull all of my bookmarks from X into a spreadsheet. I use X a lot. I’ll bookmark things that I find interesting or want to return to later.

casey newton

You’ll say, wow, that’s the most racist thing I’ve ever heard.

kevin roose

[LAUGHING]: Yes. So now I have a tool that will go through all of my bookmarks and pull those into a spreadsheet that I can search later. That one was very interesting because it basically presented me with a couple options after I asked, I think it was Claude, to build this tool for me.

It said, well, we could go use the Twitter API, but that costs money. And if you don’t want to pay that, we have this other way that we can do it that involves using a browser to sort of scrape the bookmarks from Twitter. And so I went with that version.

casey newton

Wow. You realize that by doing this, you are now like essentially an armed combatant in Elon Musk’s war on bots. You are the bot that Elon Musk is trying to destroy.

kevin roose

Come at me, bro.

casey newton

Good luck. Good luck, buddy.

kevin roose

I’ve got my bookmarks. Now, I don’t need it anymore.

casey newton

All right, what else?

kevin roose

So the thing that I built most recently was yesterday, when I was trying to determine if various objects that I’m moving to my new house would fit in the trunk of my car. And so I built an app called Will It Fit In My Trunk?

casey newton

Now, this feels like a classic math-based problem that maybe Vlad’s thing could have helped you with. But you used something else. How did it go?

kevin roose

So far, so good. It hasn’t steered me wrong yet. But this is the sort of thing that speaks to what I think is so fun and interesting about this genre of coding project, is you can really just build what I call software-for-one. A software company would never build a tool for wide release that let you figure out whether various objects would fit in your trunk. That is not a big total addressable market.

casey newton

Sounds like you never saw Trunkie in the App Store. I’m just kidding. That’s not a real app. But yes, you’re right.

kevin roose

So this style of coding really makes it possible to build things that you and only you need or will ever use.

casey newton

And there’s something fun about it, because I think, particularly for you and I who actually enjoy technology and like using it, like trying new things, coding can feel like actual magic. It can feel like wizardry. And if you are the one who is all of a sudden wielding the wand and making things happen, then you’re feeling great.

kevin roose

Yeah, it is the most fun that I’ve had with these AI tools in a while. I think it is the most fun thing you can do with AI in today’s world. And it has really connected me back to my teenage coder self and reminded me what I loved about it back then.

casey newton

I spent a lot of time in college and afterwards writing HTML. I had a program called Dreamweaver.

kevin roose

Love Dreamweaver.

casey newton

And got pretty handy with it. But if I had been able to chat with an AI assistant about why I was having trouble with my Movable Type installation in 2004, my website would have been sick as hell.

kevin roose

Yeah.

casey newton

Yeah.

kevin roose

So, Casey, I’m sure you have some niche software needs in your life.

casey newton

Absolutely.

kevin roose

And I asked you the other day what I could build for you using my vibe coding tools. And what did you say?

casey newton

What I said was, I need help with my hot tub.

kevin roose

[LAUGHS]: Go on.

casey newton

Well, listen, here’s what they don’t tell you about buying a hot tub. When it comes to your house, and you decide, I want to use the hot tub that I have just purchased, you have to become a chemical engineer. Here’s what I mean by that.

You open up the manual. All of a sudden, you learn you are going to need to monitor the pH balance of your water. You’re going to need to monitor the alkalinity of the water. You’re going to need to monitor the calcium level of the water, because if there is not enough calcium, it can somehow corrupt the jets in your hot tub.

And needless to say, Kevin, I don’t have a lot of experience mixing chemicals to adjust alkalinity and pH levels in bodies of water. And I thought, well, how am I going to do this? And so then I actually do start using the chatbots, not to write me software, but essentially just to say, please, god, help me. What do I do?

And there are so many things to keep track of. You have to put various chemicals into the hot tub at different intervals. So you replace this once a month. You replace this once a quarter.

Twice a year, you have to drain the entire tub. Once a month, you have to shock the hot tub. Don’t ask me what that means. I just read that today and it’s giving me a nervous breakdown.

kevin roose

You have to show it some spicy tweets.

casey newton

Yes, exactly.

[laughs]

So there’s so much to keep track of. And I thought, well, if I were going to build software just for me, it would be something that just checked in with me to prevent my hot tub from turning into a bacterial soup.

kevin roose

Well, Casey, I have great news for you.

casey newton

What’s that?

kevin roose

I built you a hot tub tool.

casey newton

Oh, my goodness. You vibe coded on my behalf?

kevin roose

I vibe coded on your behalf.

casey newton

Thank you.

kevin roose

So after you told me about the issues you were having with your hot tub, which are very relatable by the way. Listeners are all going, me too. I have an issue with my hot tub.

casey newton

Listen, we have a very wealthy audience that is constantly buying huge upgrades for their homes. And every once in a while, Kevin, we have to do something for the C-suite listeners.

kevin roose

Exactly. So I took this as a brief. And I went into a tool called Replit. And I said to the tool, make me an app that will tell me the things that I need to do to keep this specific kind of hot tub working properly, and put it in a tool that my friend can use. And I said, because this is a tool that uses a machine to tell you the time to service your hot tub, I was going to call it Hot Tub Time Machine.

casey newton

[LAUGHING]: That’s very good. I like that. Yeah.

kevin roose

So I built you a website called Hot Tub Time Machine.

casey newton

Oh, my gosh, this is wonderful.

kevin roose

So let me show it to you.

casey newton

OK.

kevin roose

Now I just want to tell you and caveat this by saying, that I did not choose the design or the color scheme here. That was all the AI.

casey newton

OK, great.

kevin roose

So open up the link I just sent you.

casey newton

All right, I’m opening up the link. OK, so it is quite pink. It’s pink on pink, which is a color scheme that you don’t see a lot outside of the Barbie franchise. But no, there is some black text. It looks beautiful.

And it says, “Hot Tub Time Machine, your retro futuristic maintenance companion.” And it even created a little logo, which I’m going to assume is a drop of water?

kevin roose

Sure, we’ll go with that.

casey newton

And there are two modules. There’s a Managed Tasks module and a View Schedule module.

kevin roose

Yeah, so this tool is very simple. This is a prototype. We can flesh it out if you want to. But basically —

casey newton

Is this in alpha or is this in beta?

kevin roose

This is in alpha.

casey newton

It’s in alpha.

kevin roose

You’re the single user of this app. And basically, I’ve set it up so that it will email you weekly, monthly, quarterly, and annually with a list of everything you need to do to keep your hot tub in working order. And as a special bonus, every email will come with a poem about hot tubs.

casey newton

[LAUGHING]: Fantastic. Well, can I start clicking around?

kevin roose

Yeah, click around.

casey newton

All right, so I’m going to click on View Tasks. And all right, and this brings up, there’s a module where I can add a new task, but there are also some existing tasks. And it includes weekly, quarterly, monthly, and annual maintenance. And it is, frankly, an overwhelming number of things to do.

kevin roose

Yeah, you bought yourself some chores when you bought that hot tub.

casey newton

It really is just a wall of text of things that I have to do. Every week, I’m apparently supposed to spray off the filter with a garden hose.

kevin roose

Yeah.

casey newton

And add 1 cup of nonchlorine shock, especially after parties or heavy use. So yes, lots in here. OK. And also looks like I can add another task.

kevin roose

I’m going to click the little test button here to send you one of these. And you’ll see if it shows up in your inbox.

casey newton

OK. Let’s see. Yes. Oh, time to maintain my hot tub. And there are some step-by-step instructions that I can follow. And below that, the Hot Tub Poetry Corner. And should I read this poem?

kevin roose

Yes, please.

casey newton

All right, here’s the poem. “Bubbles rise in swirling steam. Time machine of warmth and dream. Nordic waters pure delight. Maintaining bliss both day and night.” That is OK, I guess I would say. That is OK. Nordic, of course, a reference to the fact that I have a Nordic Jubilee series hot tub.

kevin roose

Yes, yeah. So I built this all in about half an hour without writing a single line of code.

casey newton

OK.

kevin roose

And I want to share that with you, because not only will it help you with your hot tub issues, but I hope it will also show you the promise of vibe coding.

casey newton

Yeah, well, I feel like you have shown me the promise of vibe coding. Now was there anything about this that was particularly tricky? Or did you get stuck on anything?

kevin roose

Yeah, so there are some things that it can’t do. If it needs to authenticate you into some service or set up a database, you have to manually step in and do that. There are some things that it just can’t do because no human programmer could do it either.

If there’s no API for something, for example, it won’t magically invent one. So there are some boundaries and limitations. And I would say it still does benefit you, when using this stuff, to have at least a little bit of programming experience, because there are just certain decisions that it will prompt you to make where you’re like, I don’t actually know what these terms mean or what the right decision is here. And you can ask the AI to just make the decision for you, but you might not be totally happy with the result.

casey newton

Now, during this process, I’m curious if you felt like you were learning something about the coding process. Like, if you spent the next year making these little one-off apps, do you feel like you would maybe be a decent junior software engineer? Or is the idea actually not to get into the details, to just let it build things? And if you don’t know what it’s doing, that’s none of your business.

kevin roose

Yeah, I think I’m more in the latter camp. I mean, this was the part that I found fascinating about what Andrej Karpathy said about vibe coding. He’s an extremely good programmer. But he says that he now can enter this mode where he basically just says like, OK, OK, OK, accept, accept, accept, and the computer will go off and do its thing.

I don’t know enough about programming to dive into the weeds of what the AI is doing and the decisions it’s making. I just have to look at the end result. And there’s something exciting about that, where I feel like things are just happening magically on my behalf.

But that’s also, maybe it’s inserting malicious code. Maybe it’s doing something that I don’t want it to be doing. I have no way, as a nonprogrammer, to know whether that’s happening or not.

casey newton

Have you checked your computer to see if it installed a Bitcoin miner while you weren’t looking?

kevin roose

I have not, but that would be pretty tricky.

casey newton

Well, Kevin, this experiment has me thinking about a blog post I read this week by a guy named Namanyay Goel. And his blog post was titled “New Junior Developers Can’t Actually Code.” This post got a million views, according to the post that I’m looking at.

And he is saying that when he talks to junior developers, they are having an experience very similar to you, which is that as they are building these systems, they are essentially just supervising an AI. They aren’t actually getting their hands dirty and understanding which mechanisms are leading to which results. So while this is great for you, I do think it raises the question, what happens when most of our software engineers are building systems that, in some fundamental way, they don’t understand?

kevin roose

Yeah, I think this is a very real thing. I mean, the flip side of me, a noncoder, being able to build stuff is that if real coders are using these tools, there’s no incentive for them to learn the basic skills of programming and learn the syntax of the different languages. And yeah, I don’t know what to do about that.

It seems like a version of what happened when we all got Google Maps on our phones, which is that people started losing their sense of direction. There’s this kind of skill atrophy issue that people worry about. But I think that the returns to knowing how to use these things effectively are still great enough, and still require enough knowledge of how the various pieces of code fit together, that it still does make sense for people to learn to code.

I’m not one of these people who thinks learn to code is totally over. I think for some people, it’s still a very useful skill to have. But I think that in the future, the role of the software engineer will become more like a product manager, where you are essentially supervising the product, laying out the vision, overseeing the design, stepping in to fix things when they break, but you are not actually in the trenches of the code, writing lines of code by hand.

casey newton

All right.

kevin roose

What do you think?

casey newton

What I think is that as AI systems get more and more powerful, we need people who do understand them in a very detailed, technical, complex, down-to-the-metal kind of way. And that if we don’t do that, our only alternative will be just to trust the AI when we ask it, hey, how do you work? And there are a lot of reasons why I don’t want to end up in that world.

So I’m comfortable having fewer people in this world who know the code at that level of detail. And it’s fine to me if most software engineers don’t. But I want a solid core of people who do.

kevin roose

Yeah. And I would like to continue with my vibe-coding experiments, trying to build increasingly more useful tools for myself and my friends.

casey newton

And I am thinking about starting because if you can do it, surely I can.

kevin roose

[LAUGHING]: Yes, anyone can. That is sort of the point. And I also would love to hear from our listeners, what they are vibe coding. What tools and apps are you building using AI that are solving your own personal, specific problems?

casey newton

Did you invent a novel bioweapon using ChatGPT? We’d love to hear from you.

kevin roose

Yeah, please email that one to tips@fbi.gov.

casey newton

[LAUGHING]: But the others, hardfork@nytimes.com.

[UPBEAT ELECTRONIC MUSIC]

kevin roose

“Hard Fork” is produced by Whitney Jones and Rachel Cohn. We’re edited by Rachel Dry. We’re fact-checked by Caitlin Love.

Today’s show was engineered by Alyssa Moxley. Original music by Elisheba Ittoop, Marion Lozano, Diane Wong, Rowan Niemisto, and Dan Powell. Our audience editor is Nell Gallogly. Video production by Chris Schott, Sawyer Roque, and Pat Gunther. You can watch this full episode on YouTube at youtube.com/hardfork.

Our executive producer is Jen Poyant. Special thanks to Paula Szuchman, Pui-Wing Tam, Dahlia Haddad, and Jeffrey Miranda. As always, you can email us at hardfork@nytimes.com. And you can email Casey’s hot tub at hottubnewton@gmail.com.

casey newton

And it will be reading every email.
