a16z Podcast - OpenAI如何为8亿周活跃用户构建服务:模型专业化与微调 封面

OpenAI如何为8亿周活跃用户构建服务:模型专业化与微调

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

本集简介

在本期节目中,a16z普通合伙人马丁·卡萨多与OpenAI平台工程负责人Sherwin Wu深入对话,解析OpenAI如何统筹平台架构中的模型、定价与基础设施体系,以及其如何从单一通用模型转向专业化系统组合、定制微调选项与基于节点的智能体工作流。 他们探讨了开发者为何倾向于坚守信任的模型家族、这种信任如何建立,以及行业为何摒弃了"万能模型"的理念。Sherwin还阐述了从提示工程到上下文设计的演变历程,并说明企业如何利用OpenAI的微调与RFT API通过自有数据塑造模型行为。 对话亮点包括: • OpenAI如何平衡横向API平台与ChatGPT等垂直产品 • 从Codex到Composer模型的演进之路 • 使用量定价为何有效而结果导向定价的局限 • Harmonic Labs和Rockset收购为OpenAI智能体工作带来的增益 • 新型智能体构建器为何采用确定性节点架构而非自由漫游模式 资源链接: 关注Sherwin Wu的X账号:https://x.com/sherwinwu 关注马丁·卡萨多的X账号:https://x.com/martin_casado 持续关注: 若喜欢本期节目,请点赞、订阅并分享给朋友! a16z的X账号:https://x.com/a16z a16z的领英主页:https://www.linkedin.com/company/a16z Spotify收听a16z播客:https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX Apple播客收听a16z播客:https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711 关注主持人:https://x.com/eriktorenberg 免责声明:本内容仅作信息参考,不作为法律、商业、税务或投资建议,亦不用于评估任何投资或证券,且不针对任何a16z基金的现有或潜在投资者。a16z及其关联机构可能持有讨论企业的投资。详情参见http://a16z.com/disclosures 持续关注: a16z的X账号 a16z的领英主页 Spotify收听a16z播客 Apple播客收听a16z播客 关注主持人:https://twitter.com/eriktorenberg 免责声明:本内容仅作信息参考,不作为法律、商业、税务或投资建议,亦不用于评估任何投资或证券,且不针对任何a16z基金的现有或潜在投资者。a16z及其关联机构可能持有讨论企业的投资。详情参见a16z.com/disclosures。 本节目由AdsWizz旗下Simplecast托管。个人数据收集及广告用途相关信息请参见pcm.adswizz.com。

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

我们希望ChatGPT成为第一方应用。

We want ChatGPT as a first party app.

Speaker 0

第一方应用是现在获得8亿惊叹或什么的绝佳方式。

First party app's a really great way to get 800,000,000 Wows or whatever now.

Speaker 0

全球十分之一的人口。

Tenth of the globe.

Speaker 0

对吧?

Right?

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

全球10%的人口每周都在使用它。

10% of the globe uses it every week.

Speaker 0

每周如此。

Every week.

Speaker 0

是的。

Yeah.

Speaker 0

即便是OpenAI,最初的想法也是会有一个统一所有功能的模型。

Even with an OpenAI, the the thinking was that there would be, like, one model that rolls them all.

Speaker 0

而现在,情况已经完全改变了。

And it's, like, definitely completely changed.

Speaker 0

越来越明显的是,未来将会有许多专业模型共存的空间。

It's, like, becoming increasingly clear that there will be room for a bunch of specialized models.

Speaker 0

很可能会出现多种其他类型的模型。

There will likely be a proliferation of other types of model.

Speaker 0

许多公司都坐拥大量宝贵的数据资源。

Companies just have giant treasure troves of data that they are sitting on.

Speaker 0

最近的关键突破在于强化微调技术的应用。

The big unlock that has happened recently is with the reinforcement fine tuning.

Speaker 0

通过这种设置,我们现在可以让你实际运行强化学习,从而更充分地利用你的数据。

With that setup, we're now letting you actually run RL, which allows you to leverage your data way more.

Speaker 1

OpenAI向自己的敌人出售武器。

OpenAI sells weapons to its own enemies.

Speaker 1

每天都有成千上万的初创公司基于OpenAI的API进行开发,其中许多试图直接与ChaiGPT竞争。

Every day, thousands of startups build on OpenAI's API, many trying to compete directly with ChaiGPT.

Speaker 1

这是终极的平台悖论。

It's the ultimate platform paradox.

Speaker 1

要么支持你的竞争对手,要么失去整个生态系统。

Enable your competitors or lose the ecosystem.

Speaker 1

Sherman Wu负责这项高难度的工作。

Sherman Wu runs this high wire act.

Speaker 1

他领导OpenAI开发者平台的工程团队,该API支撑着硅谷半数的人工智能雄心。

He leads engineering for OpenAI's developer platform, the API that powers half of Silicon Valley's AI ambitions.

Speaker 1

在加入OpenAI之前,他在Opendoor工作了六年,教机器给房屋定价,一个错误的预测就可能造成数百万损失。

Before OpenAI, he spent six years at Opendoor, teaching machines to price houses where a single wrong prediction could cost millions.

Speaker 1

今天,Sherwin将与16z的普通合伙人Martin Casado坐下来探讨一个前所未闻的观点:模型本身正在成为反去中介化技术。

Today, Sherwin sits down with a 16 z general partner Martin Casado to explore something nobody that the models themselves are becoming anti disintermediation technology.

Speaker 1

你无法将它们抽象化,任何试图将它们隐藏在软件背后的尝试都会失败,因为用户已经知道并关心他们正在使用哪个模型。

You can't abstract them away, and every attempt to hide them behind software fails because users already know and care which model they're using.

Speaker 1

这正在改变平台运作的一切方式。

It's changing everything about how platforms work.

Speaker 1

Sherwin和Martine讨论了为什么OpenAI放弃了‘一个模型统治一切’的梦想,他们如何为智能访问定价,以及为什么确定性工作流可能比纯粹的AI代理更重要。

Sherwin and Martine talk about why OpenAI abandoned the dream of one model to rule them all, how they price access to intelligence, and why deterministic workflows might matter more than pure AI agents.

Speaker 2

Sherwin,非常感谢你的加入。

Sherwin, thanks very much for joining.

Speaker 2

我们正在与Sherman Wu进行对话。

So we're being joined by Sherman Wu.

Speaker 2

实际上,如果你能详细介绍一下你的背景,为那些可能不了解你的人,那就太好了。

It'd be great actually if you provided the long form of your background as we get into this just for those that may not know you.

Speaker 2

我认为Sherman是顶尖的AI思想领袖之一,所以非常期待这次对话。

I mean, I view Sherman as one of the top AI thought leaders, so I'm really looking forward to this.

Speaker 0

是的。

Yeah.

Speaker 0

嗯。

Yeah.

Speaker 0

谢谢邀请。

Thanks for having me.

Speaker 0

能上这个播客我真的很兴奋。

I'm really excited to be on the podcast.

Speaker 0

对。

Yeah.

Speaker 0

简单介绍一下我的背景。

So a little bit more of my background.

Speaker 0

或许我们可以从现在开始往前回溯。

So maybe we can start from present and go backwards.

Speaker 0

我目前负责OpenAI开发者平台的工程团队。

So I currently lead the engineering team for OpenAI's developer platform.

Speaker 0

当然,其中最重要的产品就是API。

So the biggest product in there, of course, is the API.

Speaker 0

还有更多关于

Is there more for

Speaker 2

开发者平台的内容不仅仅是API吗?

the developer platform than the API?

Speaker 0

某种程度上

It's kind

Speaker 2

人们以为它们是同义词。

of assumed that it was synonymous.

Speaker 0

嗯,我还会考虑我们平台侧引入的其他内容。

Well, so I also think about other things that we put into our platform side.

Speaker 0

从技术上讲,我们的政府工作也包括在不同领域提供和部署这些服务。

So, technically, our government work is also, like, offering and deploying this into different areas.

Speaker 0

是的。

Yeah.

Speaker 0

比如,我谈到过

Like, I've talked about

Speaker 2

哦,你们有本地部署吗?

Oh, do have, like, a local deployment?

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

实际上我们在洛斯阿拉莫斯国家实验室确实有本地部署。

So we actually do have a local deployment at Los Alamos National Labs.

Speaker 0

非常酷。

It's super cool.

Speaker 0

我去参观过。

I went to visit it.

Speaker 0

这和我习惯的非常不同。

It's very different than what I'm used to.

Speaker 0

不过,在一台保密超级计算机上,这确实说得通。

But, yeah, in a classified supercomputer That makes sense.

Speaker 0

我们的模型在那里运行。

With our model running there.

Speaker 0

就是这样。

So there's that.

Speaker 0

但主要还是API,因为你去过洛斯阿拉莫斯吗?

But, like, mostly the API because Did you go to Los Alamos?

Speaker 0

我们去过吗?

We did?

Speaker 0

是的。

Yeah.

Speaker 0

我确实去过洛斯阿拉莫斯。

I did go to Los Alamos.

Speaker 0

那里很棒。

It was great.

Speaker 0

他们带我们参观了一圈。

They showed us around.

Speaker 0

他们带我们参观了一些历史遗址。

They showed us some of historic sites.

Speaker 0

真正的历史。

Real history.

Speaker 0

是啊。

Yeah.

Speaker 2

是啊。

Yeah.

Speaker 2

我以前在利弗莫尔工作过,伙计,所以我有点...

I used work at Livermore, man, so I've got, like, a

Speaker 0

哦,是啊。

Oh, yeah.

Speaker 0

是啊。

Yeah.

Speaker 0

是啊。

Yeah.

Speaker 2

大学毕业后的第一份工作。

First job out of college.

Speaker 0

没错。

So Right.

Speaker 2

对。

Right.

Speaker 2

对。

Right.

Speaker 2

是啊。

Yeah.

Speaker 2

接下来会有些那样的内容。

Some of that next.

Speaker 0

是啊。

Yeah.

Speaker 0

嗯,我们希望如此。

Well, we hope to.

Speaker 0

是的。

Yeah.

Speaker 0

我在开发者平台工作。

So I work on the developer platform.

Speaker 0

我已经在这个平台工作了大约三年。

I've been working on it for around three years now.

Speaker 0

我是2022年加入的。

So I joined in 2022.

Speaker 0

我基本上是被雇来开发API产品的,当时这是OpenAI唯一的产品。

I was basically hired to work on the API product, which at the time was the only product that OpenAI had.

Speaker 0

是的。

Yeah.

Speaker 0

基本上这段时间我一直在做这个。

And I've basically just worked on it the entire time.

Speaker 0

我一直对开发者方面和这项技术的创业故事非常感兴趣,所以见证它的发展真的非常酷。

I've always been super interested in the developer side and kind of like the startup story of this technology, and so it's been really, really cool to kinda see this evolve.

Speaker 0

这就是我在OpenAI的时光。

And so that's my time in OpenAI.

Speaker 0

在加入OpenAI之前,我在Opendoor工作了大约六年。

Before OpenAI, I was at Opendoor for around six years.

Speaker 0

我当时负责定价相关的工作。

I was working on the pricing side.

Speaker 0

我之前的大致背景是

My my general background before

Speaker 2

我觉得这真是个异类,你

I think it's such a dissident, you

Speaker 0

知道,是的。

know Yeah.

Speaker 2

是啊。

Yeah.

Speaker 2

从Opendoor的定价工作,到管理API。

Pricing at OpenDoor to, like, running API.

Speaker 0

这真是截然不同,对我来说观察这些公司之间的差异实际上非常有趣。

It's such a different it's been fascinating actually for me to see the differences between the companies.

Speaker 0

比如,它们的运营方式完全不同。

Like, they're run so differently.

Speaker 0

它们都有OpenN这个名字,所以有些重叠之处。

They both have OpenN the name, so there's some overlap.

Speaker 0

但基本上也就这些相似点了。

But that's pretty much it.

Speaker 0

是啊。

Yeah.

Speaker 0

不过确实,我在那里待了大约六年,在定价团队工作。

But, yeah, I was there for around six years working on the pricing team.

Speaker 0

所以我们团队基本上负责运行机器学习模型。

So our team basically would run the ML models.

Speaker 0

这是

This is

Speaker 2

实际上是在Opendoor平台上为资产定价。

actually pricing the assets on Opendoor.

Speaker 2

是的。

Yeah.

Speaker 2

没错。

Yeah.

Speaker 2

库存。

The inventory.

Speaker 0

正是如此。

Exactly.

Speaker 0

所以,Opendoor会买卖房屋,

So, yeah, Opendoor would buy and sell homes,

Speaker 2

而且是的。

and Yeah.

Speaker 0

他们的主要项目是用全现金报价直接从卖家手中收购房屋。

Their main project was buying homes directly from people selling them with all cash offers.

Speaker 0

是的。

Yeah.

Speaker 0

对。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

因此,我的团队负责决定我们愿意为这些房产支付的价格。

And so my team was responsible for how much we would pay for them.

Speaker 0

所以这实际上是一个很有趣的机器学习挑战。

And so it was a really fun, like, ML challenge.

Speaker 0

这其中还涉及大量的运营因素,因为...

It had a huge operational element to it as well because Yeah.

Speaker 0

显然,并非所有流程都实现了自动化。

Not everything was automated, obviously.

Speaker 0

是的。

Yeah.

Speaker 0

但这确实是一个非常引人入胜的技术挑战,

But it was a really fascinating technical challenge and

Speaker 2

在API方面是否有类似的概念,比如GPU容量采购,还是

Is there any sense of that on the API side, like, GPU capacity buying, or is

Speaker 0

完全无关?

it just totally unrelated?

Speaker 0

在API方面吗?

On the API side?

Speaker 0

确实有少量关于我们如何为模型定价的内容,但我不认为我们做得像Opendoor那样复杂。

There's is a small bit of, like, how we price the models, but I don't think we do anything as sophisticated as Opendoor.

Speaker 0

Opendoor就是一个非常棘手的问题。

Opendoor is just, like, such a hard problem.

Speaker 0

这就像是一个非常昂贵的资产。

It's, like, such a, like, expensive asset.

Speaker 0

持有成本非常高昂。

The holding costs are very expensive.

Speaker 0

你得把这些资产持有好几个月的时间。

You're, like, holding onto it for, like, months at a time.

Speaker 0

持有时间存在很大的变数。

There's, like, a variability in the holding time.

Speaker 2

而且潜在风险的长尾效应非常明显。

And that's a long tail of potential things that could go wrong.

Speaker 0

长尾效应。

Long tail.

Speaker 0

是的。

Yes.

Speaker 0

你得从投资组合的角度来考虑,如果其中一套房产你持有了两年,整个情况就会急转直下,直接变成亏损。

And, like, you, like, try to think about it from a portfolio perspective, and, like, if one of them just, like, you're holding on it for two years, it blows everything, like, goes negative.

Speaker 0

所以这是完全不同的

So it's a very, very different

Speaker 2

六年?

Six years?

Speaker 0

不同的挑战。

Different challenge.

Speaker 0

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

在那里待了六年。

Six years there.

Speaker 0

哇。

Wow.

Speaker 0

经历了许多起起落落。

Lots of ups and downs.

Speaker 0

见证了许多繁荣时期。

Saw a lot of the booms.

Speaker 0

也目睹了许多艰难时刻。

Saw a lot of the struggles.

Speaker 0

然后我们进行了很多次IPO。

And then we IPO ed far far a lot.

Speaker 0

但总的来说,这是一段非常棒的经历。

But, yeah, just in general, it was a very great experience.

Speaker 0

对我来说,它还具有非常强的业务运营性质。

I think for me, it was it also just had such a very, like, business operations Yeah.

Speaker 0

而且是一种非常按部就班的文化,而OpenAI则截然不同。

And, like, a very, like, by the book type of culture, whereas OpenAI is, like, very different.

Speaker 2

嗯,这真有趣。

Well, it's so interesting.

Speaker 2

我刚才还在想这个问题。

I was just thinking about it now.

Speaker 2

即使对于那样的公司,你也不会把它当作科技公司来看待。

It's like even for a company like that, like, you don't think about it as a tech company.

Speaker 2

但如果存在深层次的技术问题,实际上就是定价问题。

But if there is a deep technology problem, it actually is the pricing.

Speaker 2

对吧?

Right?

Speaker 2

这就像个机器学习问题。

It's like an ML problem.

Speaker 0

是啊。

Yeah.

Speaker 0

这正是吸引

That's what attracted

Speaker 2

我网站的地方。

me website.

Speaker 2

不是

It's not

Speaker 0

那个,对。

the Yeah.

Speaker 2

没错。

Yeah.

Speaker 2

是的。

Yeah.

Speaker 2

平台。

Platform.

Speaker 0

不是

It's not

Speaker 2

API。

the API.

Speaker 2

字面意思就是那样。

It's literally that.

Speaker 0

对。

Yep.

Speaker 0

对。

Yep.

Speaker 0

对。

Yep.

Speaker 0

这正是吸引我的地方。

And that's what attracted me to it.

Speaker 0

我觉得这才是真正有趣的地方。

I think that's what's what's interesting.

Speaker 0

这也是一种利润率比OpenAI低的商业模式,因为你只是在房产交易中赚取微薄的差价。

It's also a way, like, lower margin business than OpenAI because you're, like, making a tiny spread on these homes.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Right.

Speaker 0

他们会谈论基点,说什么'早餐吃几个基点'之类的话。

They would talk about, like, basis points, like eating bits for breakfast and all that stuff.

Speaker 0

是啊。

Yeah.

Speaker 0

总之,我在Opendoor工作了大约六年。

Anyways, I was at Opendoor for around six years.

Speaker 0

在那之前,我大学毕业后的第一份工作是在Quora,Adam D'Angelo的公司。

And then before that was my first job out of college, which was at Quora, Adam Dans from No.

Speaker 0

好的。

Okay.

Speaker 0

Group 是的。

Group Yeah.

Speaker 0

我当时负责的是新闻推送。

So I was working on the newsfeed.

Speaker 0

我做过一段时间的新闻推送排序,也参与过产品方面的工作。

So worked on newsfeed ranking for a bit, worked on the product side.

Speaker 0

那实际上是我第一次接触工业界的实际机器学习,从Quora的工程师那里学到了很多。

That was actually my first exposure to, like, actual ML and industry and learned a lot from the engineers at Quora.

Speaker 0

我们基本上招聘了很多早期的推送工程师。

We basically hired lot of the early feed engineers.

Speaker 2

你在那里的时候Charlie还在吗?

Was was Charlie still there when you were there?

Speaker 0

查理在我那时已经不在那里了。

Charlie was not there when I when

Speaker 2

我在那里的时候。

I was there.

Speaker 2

就在

Like, right after

Speaker 0

你离开后。

you left.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 2

那真是一支传奇团队。

And that was a really legendary team.

Speaker 2

至今仍被视为极具标志性的创始团队。

It's still known to be kind of this super iconic founding team.

Speaker 0

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

早期的创始团队确实非常稳固。

The early founding team was really solid.

Speaker 0

我至今仍觉得,即便当时我在那里时,也常常对我们团队人才的质量感到惊叹。

I still think that even while I was there, I would still, like, am amazed at the quality of the talent that we had.

Speaker 0

我想大概是在公司规模50到100人左右的时候。

I think there's, like, when the company's, like, 50 to a 100 people.

Speaker 0

不过确实,Perplexity团队的很多人都曾在那里。

But, yeah, like, a bunch of the Perplexity team was there.

Speaker 0

丹尼斯当时和我同在信息流团队。

Dennis was on the feed team with me.

Speaker 0

还有Johnny Ho、Jerry Ma。

Johnny Ho, Jerry Ma.

Speaker 2

是的。

Yeah.

Speaker 2

没错。

That's right.

Speaker 2

这太疯狂了。

This is crazy.

Speaker 0

然后是亚历山大,负责规模扩展的。

And then Alexander, the scale.

Speaker 0

是啊。

Yeah.

Speaker 0

简直疯狂。

Was crazy.

Speaker 0

他高中和大学期间在那里工作。

Was there between high school and college.

Speaker 0

那是个不可思议的团队。

It was an incredible team.

Speaker 0

我觉得当时在那里时有点把这视为理所当然。

I think I kinda took it for granted while I was there.

Speaker 2

是啊。

Yeah.

Speaker 2

那是个很棒的团队。

It was a good group.

Speaker 2

你是怎么认识Cora的?

How'd you get to Cora?

Speaker 0

你本科时学什么专业?

What did study in an undergrad?

Speaker 0

嗯。

Yeah.

Speaker 0

在那之前,我本科是在MIT读的。

So before that, I was at MIT for undergrad.

Speaker 0

我学的是计算机科学。

I studied computer science.

Speaker 0

就是那种,计算机科学和硕士学位一起读的,有点像压缩课程那种。

Did, like, one of those, like, computer science and the master's degree kinda, like, cramped it in.

Speaker 0

对。

Yeah.

Speaker 0

嗯。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

我最终去了Cora是因为在那里获得了一个我们称之为'外部实习'的机会。

I ended up at Cora because I got in what we call an externship there.

Speaker 0

在MIT,一月份其实是放假的。

So at MIT, you actually get January off.

Speaker 0

所以有秋季学期,然后一月份放假。

So there's, like, the fall semester, then January's off.

Speaker 2

这挺酷的。

That's cool.

Speaker 0

然后是春季学期。

And then you have the spring semester.

Speaker 0

所以这段时间被称为独立活动期。

And so it's called independent activities period.

Speaker 0

有些人会选择上课。

So some people just, like, take classes.

Speaker 0

有些人则什么都不做。

Some people just do nothing.

Speaker 0

但也有人会进行为期一个月的实习。

But some people will do, like, month long internships.

Speaker 0

好的。

Okay.

Speaker 0

有些疯狂的公司会给大学生提供为期一个月的实习机会。

And some crazy companies will offer a month long internship to a college student.

Speaker 0

是啊。

Yeah.

Speaker 0

是啊。

Yeah.

Speaker 0

这其实更像是一种吸引人们参与的方式

And it really is just kinda like a way to get people into

Speaker 2

你是从波士顿过来的吗?

Did you come out here from Boston?

Speaker 0

还是说具体怎么操作的?

Or how did that work?

Speaker 0

对。

Yeah.

Speaker 0

那简直太疯狂了。

It was crazy.

Speaker 0

所以你得先申请。

So you had to apply.

Speaker 0

我记得没错。

I remember yeah.

Speaker 0

我想这大概是2013年1月左右的事。

This is, I think, 2013, January or something.

Speaker 0

你需要申请,我记得核心实习项目是报酬最高的那个。

You had to apply, and I remember the core internship was the one that just paid the most.

Speaker 0

我记得他们支付的报酬大概是8000到9000美元。

They paid, I think, was, like, 8,000, $9,000.

Speaker 0

我当时就想,哇。

And I was like, wow.

Speaker 0

那差不多是一个月的生活费,而你实际上只用了大概一半时间就适应了?

That's, like, old for a month, and you're you're kinda ramping up, like, half the time?

Speaker 2

够我吃一年了。

I can eat for a year.

Speaker 0

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

作为一名大学生。

As a college student.

Speaker 0

感觉非常棒。

It's like, great.

Speaker 0

而且,他们还会负责你的往返机票。

And, yeah, they would kinda, like, fly you out here.

Speaker 0

所以我参加了面试,幸运地拿到了录用通知。

So I did the interviews and then luckily got an offer.

Speaker 0

所以,是的,我一月份就来了。

And so, yeah, I came out for January.

Speaker 0

那时他们刚搬到山景城的新办公室。

That was right when they moved into their new Mountain View office.

Speaker 0

我基本上就是...嗯。

And I basically yeah.

Speaker 0

说实话,我大概花了两周时间适应,然后有两周时间在Feed团队高效工作。

Honestly, just ramped up for, like, two weeks and then have two weeks of good productivity working on the feed team.

Speaker 2

所以那就是面向用户的,嗯,对。

So that was that, like, user facing, like Yeah.

Speaker 2

是面向用户的产品工作吗?

User facing product work?

Speaker 0

对。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

我清楚地记得那两周的实习项目就是给我们的功能商店添加几个小功能。

I distinctly remember my externship project for those two weeks was just to, like, add a couple features to our feature store.

Speaker 0

没错。

Yeah.

Speaker 0

这些功能会整合进模型里。

And that would make its way into the model.

Speaker 0

我记得我的导师是Tudor,他现在应该是在负责,我想是的。

I remember my mentor there was is Tudor, who's now running, I think Yeah.

Speaker 0

它叫Harmonic Labs。

It's called Harmonic Labs.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

疯狂的团队。

Crazy team.

Speaker 0

疯狂

Crazy

Speaker 2

难以置信。

unbelievable.

Speaker 2

我是说,他们就像是...顺便提一句,我认为这是硅谷未被充分讲述的故事之一,关于那个原始团队最终在Core的表现有多么出色。

I mean, they're like and by the way, I think it's one of the untold stories of Silicon Valley is, like, how good that original team ended up Core is.

Speaker 2

我的意思是,他们中很多人至今仍在Core并且表现优异,但从Core走出去的人才如今遍布各地。

I mean, a lot of them are still there and still good, but the diaspora from Core is everywhere.

Speaker 2

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

这其实也是我最终加入OpenAI的契机,某种程度上算是从那里快进过来的。

That's actually how I ended up at OpenAI too, kinda fast forwarding from there.

Speaker 0

因为OpenAI当时保持着相对低调的姿态。

Because OpenAI kinda kept a quiet profile ish.

Speaker 0

我一直都有关注他们,因为我认识的一些Quora员工后来都陆续去了那里。

I'd always kind of kept tabs on them because a bunch of the Quora people I knew kinda, like, ended up there.

Speaker 0

就像是,时不时关注一下,然后他们就说,是啊。

It's like, checking in on it, and they were like, yeah.

Speaker 0

这里正在发生一些疯狂的事情。

Something crazy is happening here.

Speaker 0

你真的应该来看看。

You should definitely check it out.

Speaker 0

所以,没错,我确实欠Quora很多。

So, yeah, I definitely owe a lot to Quora.

Speaker 0

但话说回来,我作为应届生选择去那里而非其他公司的部分原因是,那个团队实在太棒了,我觉得能从他们身上学到很多。

But, yeah, part of the reason why I went there versus other options as a new grad was the team was just so incredible, and I just felt like I could learn a ton from them.

Speaker 0

我当时没考虑之后的所有事情。

I didn't think about everything afterwards.

Speaker 0

我就想,天啊,如果能从这群人身上汲取些知识,那就太棒了。

I was just like, man, if I could just, like, absorb some knowledge from this group of people, it'd be great.

Speaker 2

太赞了。

Awesome.

Speaker 2

是啊。

Yeah.

Speaker 2

我想先从一个点切入,OpenAI让我觉得非常独特的一点是,它既是一家相当扁平化的公司。

So one place I wanted to start is something that I find very unique about OpenAI is it's both a pretty horizontal company.

Speaker 2

比如说,它拥有API接口。

Like, it's got an API.

Speaker 2

可以说,我们拥有一个庞大的企业客户组合。

Like, I would say, we've got this massive portfolio of companies.

Speaker 2

对吧?

Right?

Speaker 2

而且我认为其中相当一部分企业都在使用这个API。

And I would say a good fraction of them use the API.

Speaker 2

同时它又是个垂直型公司,因为你们还开发了完整的应用程序。

And then it's also a vertical company in that you've got full on apps.

Speaker 2

对吧?

Right?

Speaker 2

是的。

Yep.

Speaker 2

比如,每个人都在用ChatGPT。

Like, everybody uses ChatGPT, for example.

Speaker 2

所以你负责API和开发者工具这一块。

And so you're responsible for the API and kind of the DevTools side.

Speaker 2

那么也许一开始的问题是,两者之间是否存在内部张力?

So maybe just to begin with, is there an internal tension between the two?

Speaker 2

比如,这是个讨论点吗?

Like, is that a discussion?

Speaker 2

比如API可能会怎样,甚至可能帮助到垂直版本的竞争对手,还是说因为发展太快所以这根本不是问题?

Like, like, the API may whatever and may help a competitor to, like, the vertical version, or is it not that things are just growing so fast, it's not an issue?

Speaker 2

我很想听听你对此的看法。

I would just love to how you think about that.

Speaker 2

顺便说,很少有公司能同时拥有这两者。

By way, it's very unusual for companies to have both of that.

Speaker 2

这两件事在这么早期阶段同时出现确实非常罕见。

These two things this early is very unusual.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

我完全同意。

I completely agree.

Speaker 0

我认为确实存在一定程度的紧张关系。

I think there is some amount of tension.

Speaker 0

我觉得在这方面真正有帮助的是,从创始人角度出发,Sam和Greg从一开始就对我们处理这个问题的方式非常坚持原则。

I think one thing that really helps here is Sam and Greg, just from a founder perspective, have since day one just been very principled in the way in which we approach this.

Speaker 0

他们一直告诉我们,我们希望ChatGPT首先是一个第一方应用。

They've always have kinda told us we want ChatGPT as a first party app.

Speaker 0

同时我们也需要开放API。

We also want the API.

Speaker 0

最棒的是,我认为他们能做到这一点,归根结底是因为这符合OpenAI的使命——创造AJA并尽可能广泛地分配其带来的好处。

And the nice thing is I think they're able to do this because at the end the day, it kinda comes back to the mission of OpenAI, which is to create AJA and then to distribute the benefits as broadly as possible.

Speaker 0

因此如果你这样理解,你会希望它出现在尽可能多的平台上,而第一方应用就是个绝佳渠道——你知道的,就像现在达到了8亿个'哇'之类的。

And so if you interpret this, you want it in as many surfaces as you want, and the first party app's a really great way to get, you know, it was like 800,000,000 Wow's or whatever now.

Speaker 0

是啊。

Yeah.

Speaker 0

8亿个'哇'?

800,000,000 Wow's?

Speaker 0

没错。

Yeah.

Speaker 0

确实。

It's yeah.

Speaker 0

这真的...仔细想想简直令人难以置信。

It's pretty it's it's actually mind boggling to think about.

Speaker 0

I

展开剩余字幕(还有 480 条)
Speaker 2

我觉得很多听众可能还没意识到这个数字有多庞大,但这真的...太疯狂了。

don't think many people listening to this don't understand how big that is, but that is I mean It's crazy.

Speaker 2

是啊。

Yeah.

Speaker 2

按照达到8亿所用的时间来看,这简直可以载入史册了。

That's gonna be, like, actually historic for the time it's taken to get to 800,000,000.

Speaker 0

确实是历史性的。

It's historic.

Speaker 0

而且想想我们投入的时间和需要扩展的规模...

Also It's just like, yeah, the amount of time and just like how much we've had to scale up.

Speaker 0

这相当于全球人口的十分之一。

That's a tenth of the globe.

Speaker 0

对吧?

Right?

Speaker 0

没错。

Yeah.

Speaker 0

是啊。

Yeah.

Speaker 0

全球10%的人口每周都在使用它

10% of the globe uses it every

Speaker 2

每周。

Every week.

Speaker 0

没错。

Yeah.

Speaker 0

这太棒了。

That's great.

Speaker 0

而且还在增长。

And it's growing.

Speaker 0

而且还在增长。

And it's growing.

Speaker 0

所以,在某个时刻,你知道,它会达到甚至更高的水平。

So, like, at some point, know, it'll hit like, you know, it'll go even higher than that.

Speaker 0

所以,是的,显然,那里的覆盖范围是无与伦比的。

And so so, yeah, like, obviously, the reach there is is unmatched.

Speaker 0

但除此之外,能够拥有一个平台让我们触及更广泛的受众。

But then also just, like, being able to have a platform where we can reach even more than just that.

Speaker 0

比如,我们内部有时会讨论,我们的API最终能覆盖多少终端用户?

Like, one thing we we talk about internally sometimes is, like, what does our end user reach from the API?

Speaker 0

实际上,它的覆盖范围确实非常非常广泛。

Like, it's actually it was, really, really it is really broad.

Speaker 0

可能甚至——这很难说,因为ChatGPT增长太快了——但在某些时候,它肯定比ChatGPT还要大。

It might might even it's hard because ChatGPT is growing so quickly, but, like, at some points, it was definitely larger than than ChatGPT.

Speaker 0

我们能够利用所有这些资源并获得我们想要的覆盖范围,我认为这真的很好。

And the fact that we're able to get tap in all of this and and and get the reach that we want, I think, is really good.

Speaker 0

是的。

Yeah.

Speaker 0

不过,确实,有时候也存在一些紧张关系。

But, yeah, I mean, there's definitely some tension sometimes.

Speaker 0

我认为这个问题已经在几个地方出现了。

I think the I think it's come up in a couple of places.

Speaker 0

我认为其中之一是在产品方面。

I think one of them is is on the product side.

Speaker 0

就像你提到的,有时候会有竞争对手在我们的平台上进行开发。

So as you mentioned, you know, sometimes there are competitors kinda like building on our Yeah.

Speaker 0

在我们的平台上。

On our Yeah.

Speaker 0

这些平台上的开发者可能不太高兴如果ChatGPT推出与他们竞争的产品。

Platform who, you know, might not be happy if ChatGPT launches something that competes with them.

Speaker 0

是的。

Yeah.

Speaker 2

我的意思是,这就像云操作系统一样古老的故事。

I mean, that you know, that's the tale of as the old as the cloud operating systems or whatever.

Speaker 2

所以,更像是ChatGPT是否担心竞争对手的问题。

So, like, that's, you know I think it's more like, does ChatGPT worry about the competitor Yeah.

Speaker 2

你知道的,这类事情。

You know, type thing.

Speaker 2

就像,你知道的,你在扶持一个竞争对手。

Like, you know, you enabling a competitor.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

所以,我的意思是,有趣的是,我觉得并不特别担心。

So, I mean, the interesting thing is, like, I would say not particularly.

Speaker 0

主要只是因为我们增长得太快了。

Mostly just because we've been growing so quickly.

Speaker 0

这就像

This is like

Speaker 2

我明白了。

that I get.

Speaker 2

你知道,它现在就像一股强大的力量。

It's such a, you know, force right now.

Speaker 0

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

增长能解决太多太多不同的问题。

Growth solves so many so many different things.

Speaker 0

而且,我们另一种思考方式是,大家都在围绕AGI进行建设,朝着AGI方向努力。

And, like and and the other way we think about it is, like, everyone's kind of building building around AGI, building towards AGI.

Speaker 0

当然,这里难免会有一些重叠。

Of course, there's gonna be some overlap here.

Speaker 0

但我想说,至少从我的立场看,我更多感受到来自客户端的压力,比如API用户本身。

But I would say, at least in my position, I feel more of this tension from the customer, like the API customers themselves.

Speaker 0

就像在说,天啊,你们是不是要开发我正在做的这个东西?

It's like, my gosh, are you gonna build this thing that I'm working on?

Speaker 2

这个故事和系统一样古老。

That story is as old as the system.

Speaker 2

从来没有一个计算机平台不具备这种特性。好吧,我在这点上有些摇摆不定,想听听你的看法:历史上提供核心服务和API的问题在于可能会被绕过,对吧?

There's never not been a computer platform that have that So okay, so I kinda go back and forth on this one, wanna try one out on you, which is the problem historically with offering a core services and APIs, can get disintermediated, Right?

Speaker 2

所以我可以在它上面构建,但用户并不知道,诸如此类。

And so I can build on top of it, but then, you know, the user doesn't know, like, whatever.

Speaker 2

在云上构建,我就能绕过云服务商,然后可以切换到其他云平台等等。

Build on top of the cloud, I disintermediate from the cloud, and then I can switch to another cloud or whatever.

Speaker 2

我突然意识到,用这些模型很难做到这一点,因为它们实在太难以抽象化了。

And it occurs to me that that's kind of hard to do with these models because the models are so hard to abstract away.

Speaker 2

就像,它们就是难以驾驭。

Like, they're just they're just unruly.

Speaker 2

对吧?

Right?

Speaker 2

如果你试图用传统软件来驱动它们,它们就是不太容易管理好。

If you try to, like, have traditional software drive them, they just don't kind of manage very well.

Speaker 2

所以我部分认为这几乎就像一种反去中介化技术,你必须直接将其暴露给用户。

So part of me thinks that it's almost like this, like, anti disintermediation technology that you kind of have to expose it to the to the user directly.

Speaker 2

是的。

Yep.

Speaker 2

这样讲得通吗?

Does that make sense?

Speaker 2

所以我在想,即使我认为ChatGPT实际上只是试图将模型暴露给用户,API也只是试图将模型暴露给用户。

And so I'm wondering if, like so even if I think ChatGPT is really just trying to expose the model to the user, the API's just trying to expose the model to the user.

Speaker 2

因此我认为几乎存在这样一种论点:如果真正的价值在于模型,那么你如何将其传递给用户其实并不重要,因为要让某人以计算机科学的经典方式将其抽象化——比如用户不知道自己正在使用该模型——将会非常困难。

So I think there's almost this argument that's like, if the real value is in the models, it doesn't really matter how you get it to them, because it's gonna be very tough for someone who's gonna to abstract it away in in in classic sense of computer science of, like, they don't know that they're using the model.

Speaker 2

比如,你总是知道自己正在使用GPT-5。

Like, you always know you're using g p d five.

Speaker 0

是的。

Yeah.

Speaker 0

而且有趣的是,我认为整个行业也慢慢改变了对此的看法。

And and the interesting thing is I think, like, the entire industry kind of has slowly changed their mind around this too.

Speaker 0

想想看,一开始我们大概觉得,哦,这些都可以互相替换。

Think, like, in the beginning, we kinda thought, oh, these are all gonna be interchangeable.

Speaker 2

就像软件一样。

It's just like software.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

正是如此。

Exactly.

Speaker 0

所以很容易就能替换掉。

So the easy thing to swap out.

Speaker 0

对。

Yeah.

Speaker 0

我想我们在产品端也正在认识到这一点,比如GPT五的发布,以及四零版本,还有那么多人喜欢零三和四零版本等等。

I think we're learning this on the product side with, like, know, the GPT five launch and, like, four o and, like, how how so many people like o three and four o and and all of that.

Speaker 2

我感觉到它变化的时候。

I felt that when it changed.

Speaker 2

我就觉得,你对我没那么好了。

I'm like I'm like, you're not as nice to me.

Speaker 2

是啊。

Yeah.

Speaker 2

比如,我喜欢

Like, I like the

Speaker 0

验证。

validation.

Speaker 0

验证。

Validation.

Speaker 0

对。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

我也这么想。

Think think so.

Speaker 0

所以。

So.

Speaker 0

其实挺有趣的,因为我真的很喜欢GPT五号的个性,但我觉得我使用ChatGPT的方式非常功利性。

It's It's actually fun because I I really loved GPT five's personality, but I think it's like the way I used, you know, ChatGPT was very utilitarian.

Speaker 0

哦,它主要是用于工作或获取信息之类的。

Oh, it's it's like, you know, mostly for work or just like information.

Speaker 2

是啊。

Yeah.

Speaker 2

我确实逐渐接受了,但说实话,当它改变时我感受到了那种不协调。

I've definitely come around just, you know, but, like, I actually felt the dissonance when it changed.

Speaker 2

就像有种情感上的波动,但这几乎像是一种消除不协调的调解技术。

It's like there's this emotional thing that goes on, but it's almost like it's an anti dissonant mediation technology.

Speaker 2

你某种程度上得向用户展示这一点。

You kinda have to show this to the user.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

然后你会看到很多更成功的产品如Cursor直接这样做,尤其是那些用户想要更多控制权的编程产品。

And then you see a lot of more successful products like Cursor do this directly, especially the coding products where users want more control.

Speaker 0

我们甚至看到一些更通用的消费产品也在这样做。

We've even seen some, like, you know, like, more general consumer products do this.

Speaker 0

所以在消费者这一侧确实如此。

And so it's definitely been true on the on the consumer side.

Speaker 0

有趣的是,我认为在API这一侧也同样如此。

The interesting thing is I think it's also been true on the API side.

Speaker 0

这也是我认为不...

And that's also something that I think No.

Speaker 0

正是。

Exactly.

Speaker 0

乔纳森,这正是我想说的。

Jonathan, that's exactly what I'm saying.

Speaker 0

所以,就像,

So, like,

Speaker 2

有人可能会辩称,我可以通过API绕过你们。

the argument could be that I could use the API to disintermediate you.

Speaker 2

但实际上,这种情况并未发生,因为在模型与人之间插入一层软件实在太难了。

But, like, you don't see that happening because it's so hard to put a layer of software between a model and a and a person.

Speaker 2

你几乎必须直接暴露模型本身。

You almost have to expose the model.

Speaker 2

是的。

Yes.

Speaker 2

是的。

Yes.

Speaker 2

而且

And

Speaker 0

我认为,如果说有什么变化的话,那就是这些模型似乎在功能上越来越分化,各自擅长不同的领域和特定用例。

I think, if anything, I think the models are, like, almost like diverging in terms of, like, their what they're good at and, like, their specific use case.

Speaker 0

而且我觉得这种情况会越来越多。

And I think there's gonna be more and more of this.

Speaker 0

但基本上,令人意外的是,基于我们API开发的用户留存率出奇地高,尤其是在人们原以为可以随意切换的情况下。

But, yeah, basically, it's it's been surprisingly hard for or, like, the the retention of people building on our API is, like, surprisingly high, especially when people thought you could just kind of swap things around.

Speaker 0

要知道,现在甚至有些工具能帮你实现这种切换。

You might have, you know, like, even tools that help you swap things around.

Speaker 0

是的。

Yeah.

Speaker 0

但模型本身的用户粘性确实令人惊讶。

But, yeah, the the stickiness of the of the of the model itself has been has been surprising.

Speaker 2

你觉得这是因为用户与模型之间建立了某种关系,还是更多出于技术原因——比如我的评估体系是针对OpenAI设计的,且能保持准确性?

And do you think that is because of a relationship between the user and the model, or do you think it's more of a technical thing, which is like, my evals work for OpenAI and this, you know, and the correctness maintains?

Speaker 0

没错。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

我认为两者都有。

I think it's both.

Speaker 0

所以我认为这里确实存在终端用户层面的因素,这是我们部分客户反馈的情况。

So I think there's there's definitely an end user piece here, which is what we've heard from from some of our customers.

Speaker 0

比如,他们只是对模型本身越来越熟悉。

Like, they just get familiar with with the model itself.

Speaker 0

但我也认为存在技术层面的因素,作为开发者——尤其是初创公司——你会深入研究这些模型,不断迭代优化,试图在你的特定应用框架内让它表现得更出色。

But I also think there's a technical piece, which is like the also, as a developer, especially with startups, you're like really going deep with these models and like really like iterating on it, trying to get get it really good within your particular harness.

Speaker 0

你同时在迭代优化自己的应用框架。

You're iterating on your harness itself.

Speaker 0

你会时不时为它添加不同的工具。

You're giving it different tools here and there.

Speaker 0

所以你最终确实是在围绕这个模型打造产品。

And so you really do end up, like, building a product around the model.

Speaker 0

因此,这里存在一个技术层面,当你持续围绕某个产品(比如GPT-5)进行开发时,实际上你是在围绕它构建更多功能,从而使你的产品与该模型形成独特的适配优势。

And so there is a technical piece where, you know, as you kind of keep building with a particular product like GPT five, you're actually, like, building more around it so that your product works uniquely well with that with that model.

Speaker 2

所以我使用Cursor,主要用于各种事务,比如写博客——你知道我们是投资者——有时也用于编程,令人惊讶的是我在Cursor里会用到这么多模型。

So so I I use I use Cursor, and a lot of just for, like, a lot of stuff, like, writing blogs and, like, you know, know, we're investors, and I use it for sometimes for coding, and it's remarkable how many models I use in Cursor.

Speaker 2

比如,我最常用的模型确实是GPT-5。

So, like, literally my go to model is g p d five.

Speaker 2

我太爱GPT-5了。

I love g d five.

Speaker 2

我觉得这太棒了,你懂吗?

I think it's a phenomenal, like, you know?

Speaker 2

然后我用GPT-5的max模式来做规划,但你知道,我在Cursor里用的是tab补全模型,他们刚发布的新模型是用于一些基础的...一些功能。

And then, like, I use, like, max mode with g p d five for planning, and then but, you know, like, I mean, I the tab complete model that's in Cursor, and, like, you know, the the new model they just dropped is for, like, some basic you know, some stuff Yeah.

Speaker 2

Composer不错。

Composer good.

Speaker 2

是啊。

Yeah.

Speaker 2

所以,你知道,我觉得

And so, like, you know And I think that

Speaker 0

某种程度上也反映了这一点,因为就像是,每个特定用途都有对应的特定模型。

kinda reflects this too, because it's like, there's a particular model for each particular use Yeah.

Speaker 0

对。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

比如,我和很多用过新Composer模型的人聊过,它确实在快速处理上表现优异,速度超快。

Like, I've talked to a bunch of people who use the new Composer model, and it's just really good for fast It's super fast.

Speaker 0

第一遍草稿确实如此。

First pass Exactly.

Speaker 0

让你保持心流状态,之后如果需要更深入的思考,可以再切换到其他模型

To keep you in flow kinda thing, and then you kinda bubble out to another model if you want deeper thinking

Speaker 2

我确实会直接使用GPT-5来帮我规划事项,它在这方面非常出色。

about I literally sit down on this GPT-five to help me plan something out, and is really good at that.

Speaker 2

然后当我编码时,我会用快速聊天功能,接着切换到Composer,如果遇到什么棘手的问题或奇怪的bug之类的。

And then when I'm coding, I'm doing the quick chat thing, then I'll use Composer, and if there's whatever, there's some crazy bug or something like that.

Speaker 2

还记得早期大家都认为只会有一个模型吗?连投资者都说我们永远不会投资模型公司,因为最终只会有一个AGI模型。但现实却是模型数量爆炸式增长。

So do you remember in the early days of all of this, where there's gonna be one model, even investors, we will never invest in a model company because there will only be one model and it's gonna be AGI, but the reality it feels like there's this massive proliferation of models.

Speaker 2

就像你之前说的,它们承担着多种功能。

Like you said before, they're doing many things.

Speaker 2

所以可能有两个问题,也许太直接或粗鲁,但第一个是:这对AGI意味着什么?

And so maybe two questions, maybe too blunt or too crass, but the first one is what does that mean to a for AGI?

Speaker 2

第二个问题是:这对OpenAI意味着什么?

And the second was what does that mean for OpenAI?

Speaker 2

这是否意味着最终会形成一个模型组合?

Like, does that mean that, like, you end up with a model portfolio?

Speaker 2

你们会选择其中一部分吗?

Do you select a subset?

Speaker 2

你认为未来这一切会被某个终极模型取代吗?

Do you think this all gets superseded by some god model in the future?

Speaker 2

那么,这会如何发展呢?

Like, how does that play out?

Speaker 2

因为这和大多数人的想法相悖。

Because it's it's against what most people thought.

Speaker 2

大多数人曾认为最终会有一个全能的大型模型。

Most people thought this is all going towards one large model that does everything.

Speaker 0

是啊。

Yeah.

Speaker 0

我觉得这一切最疯狂的地方在于,大家的想法是如何随时间改变的。

I think the the crazy thing about all this is just, like, how everyone's thinking has just changed over time.

Speaker 0

完全同意。

Totally.

Speaker 0

我对此记忆犹新,最疯狂的是这并不久远。

Like the I I distinctly remember this, and and and the crazy thing is not that long ago.

Speaker 0

大概就在两三年前吧。

It's just like three like two or three years ago.

Speaker 0

是的。

Yeah.

Speaker 0

我记得,即使是OpenAI,当时的想法也是会有一个统治一切的模型。

I remember, like, even with an open AI, the the thinking was that there would be like one model that rules them all.

Speaker 0

对。

Yep.

Speaker 0

这就好比,你为什么会...我是说,这某种程度上指向了微调API产品。

And it's like, why would you I like, mean, this kinda goes to the fine tuning API product.

Speaker 0

就像是,你为什么要开发一个微调产品呢?

It's like, why would you even have a fine tuning product?

Speaker 0

你为什么会想要在上面进行迭代呢?

Why would you even want to, like, iterate on it?

Speaker 0

没错。

Yep.

Speaker 0

确实。

Yep.

Speaker 0

将会有一个模型能够涵盖一切。

There's gonna be this one model that just subsumes everything.

Speaker 0

是啊。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

这也是对通用人工智能形态最简化的看法。

And that was also kind of the that that is also like the most simplistic, like, view of what the what the AGI will look like.

Speaker 0

而且,是的,自那以后情况已经完全改变了。

And and, yeah, it's like definitely completely changed since then.

Speaker 0

但另一方面要记住的是,可能还会继续变化,甚至从我们现在所处的位置开始。

Think one and and but then the other thing to keep in mind is like, might continue to change, like, even from where we are today.

Speaker 0

对。

Yeah.

Speaker 0

但越来越清楚的是,我认为未来会有许多专业模型的存在空间。

But it's like becoming increasingly clear, I think, that there will be room for a bunch of specialized models.

Speaker 0

很可能会出现其他类型的模型大量涌现的情况。

There will likely be a proliferation of other types of models.

Speaker 0

我是说,你可以看到我们正在用Codex模型做这类事情

I mean, you see us do this with, like, the codex model

Speaker 2

是啊。

Yeah.

Speaker 2

完全同意。

Totally.

Speaker 0

本身。

Itself.

Speaker 0

我们有GPT-4.1、4.0、5.0等等这些版本。

We have, like, you know, have, GPT four one and, like, four o and, like, five and and and all of this.

Speaker 0

所以我不认为有空间容纳所有这些。

And so I I don't think there's there's room for all for all this.

Speaker 0

不过我觉得这未必是坏事。

I I don't think that's bad for what it's worth.

Speaker 0

我认为,就像我们试图迈向通用人工智能一样,事情总是出人意料,市场也在不断演变,产品组合也因此而发展。

Like, anything, I think, you know, as we've tried to move towards AGI, things have just been very unexpected, and I think the market just evolves, and the product portfolio evolves because of that.

Speaker 0

所以我完全不认为这是件坏事。

So I I don't think it's bad thing at all.

Speaker 0

而我真正认为的是

What I do think it

Speaker 2

你可以轻易地辩称这对OpenAI非常有利,对模型公司也非常有利。

means You could easily argue it's very good for OpenAI and very good for, like, the model companies to, like Yeah.

Speaker 2

因为没有那种赢家通吃的集中化动态。

Because not have, like, you know, winner take all consolidated dynamics.

Speaker 2

对吧?

Right?

Speaker 2

我是说,你会拥有一个更健康的生态系统,能提供更多解决方案。

I mean, you just have a healthier ecosystem, lot more solutions you can provide a lot.

Speaker 2

是的。

Yeah.

Speaker 2

你知道吗?

You know?

Speaker 0

是的。

Yeah.

Speaker 0

随着生态系统的扩大,这通常是有益的。

And as as the ecosystem grows, it generally is helpful.

Speaker 0

这是我们经常思考的一个问题:随着整个AI生态系统的壮大,OpenAI将从中受益匪浅。

Like, this is one thing we actually think about a lot too is is as the general, like, AI ecosystem grows, like, OpenAI just stands to benefit a lot from this.

Speaker 0

这也是为什么我们的一些产品甚至开始向其他模型开放。

And and this is also why we've like, some of our products, we even started opening up to other models.

Speaker 0

对吧?

Right?

Speaker 0

比如,我们的产品现在允许你接入其他模型。

Like, our product now allows you to bring in other models.

Speaker 0

是的。

Yeah.

Speaker 0

我们认为这一切就像水涨船高,总体上对我们有利。

So all of this, we think it's like any any rising tide generally helps us here.

Speaker 0

但随着我们进入一个将出现更多模型的世界,这就是我们投资于模型定制产品的原因,包括微调API、强化微调,并开放这些功能。

But, yeah, I think as we move into a world where there'll be a bunch more models, this is why we've kind of invested in our model customization product with fine tuning API, with the reinforcement fine tuning, opening that up as well.

Speaker 0

这也是我们开源GPT OSS的部分原因,因为我们希望能够促进发展

It's also why part of why we open sourced GPT OSS as well, because we wanna be able to, you know, facilitate

Speaker 2

我想超级

I wanna super.

Speaker 2

我稍后想详细谈谈这个,因为开源确实非常有趣。

I wanna talk about that in in just a bit, because the open source is actually very interesting.

Speaker 0

我 我

I I

Speaker 2

实际上,我认为开源模型非常棒。

mean, actually, I thought the open source model was great.

Speaker 2

是的。

Yeah.

Speaker 2

但这显然是企业必须谨慎对待的事情。

But it's clearly something that a company has to be careful with.

Speaker 2

是的。

Yep.

Speaker 2

但在那之前,我想先聊聊微调API。

But before that, I wanna talk a little bit about the fine tuning API.

Speaker 2

是这样的,我注意到你们正在向更复杂的应用方向发展,比如微调技术。某种程度上,这可以被解读为一种让步,承认存在产品特定数据和用例,通用模型无法满足这些需求。

So so so I I've I've noticed that you are moving towards kinda more sophisticated use of things like, you know, like fine tuning, which, you know, in a way, can read that as a bit of a capitulation that, like, you know, there is product specific data and there's product specific use cases that a general model won't do to your point.

Speaker 2

对吧?

Right?

Speaker 2

所以与模型扩散相反,你们选择了这条路。

So, like, as opposed to proliferation model, you do that.

Speaker 2

看起来这些数据实际上非常有价值,对吧?

It seems like a lot of that data is actually very very valuable, right?

Speaker 2

那么,在多大程度上存在一种互惠机制?比如你们开放产品数据接入微调的能力,同时也能从这些。

And so, you know, to what extent is there like interest in almost a tit for tat where you can like expose, you know, the ability to get product data into fine tuning, and then you also benefit from that data because the the vendors provide it to you versus, like, this is a 100%, you know, like, they keep their own data and there's kind of no interest in that.

Speaker 2

因为在我看来这像是下一阶段的规模化发展,我们目前就处于这个阶段。

Because it feels to me like the next level of scaling, this is kind of where we're at.

Speaker 2

所以我有点好奇如何

And so I'm just kinda curious how

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,或许退一步说,我们最初投资微调API的主要原因,一是人们对于能够更多定制模型有着巨大需求。

So I mean, maybe even like taking a step back, the the main reason why we even invested in a fine tuning API in the very beginning is, one, there's been huge demand from people to be able to customize the models a bit more.

Speaker 0

这涉及到提示工程,而且我认为行业对此的看法也发生了变化。

It kinda goes into, like, prompt engineering and also, like, I think the industry's changed their mind on that as well.

Speaker 0

就像,它已经进化了。

Like, it's evolved.

Speaker 0

但第二点正是你所说的,就是企业拥有大量他们囤积的数据宝藏,他们希望以某种方式在这次AI浪潮中加以利用。

But the second thing is exactly what you said, which is the companies just have giant treasure troves of data that they are sitting on that they would like to utilize in some fashion in this AI wave.

Speaker 0

你知道,最简单的方法就是把它放进某种向量里,比如用它做检索增强生成之类的。

And you can know, the simple thing is to put it in, you know, some like vector like, do rag with it or something.

Speaker 0

是的。

Yeah.

Speaker 0

但你也知道,如果他们有一个更专业的技术团队,他们会想看看如何利用这些数据来定制模型。

But there's also you know, if they have a more technical team, they do wanna see how they can use it to customize the models.

Speaker 0

因此,这实际上是我们投资于此的主要原因。

And and so that is actually the main reason why we've invested in in this.

Speaker 0

有趣的是,早在2223年左右,我们的微调服务功能过于有限,导致人们很难真正利用这些数据。

The the interesting thing was way back, kinda back in like 2223, our fine tuning offering was, I'd I'd say like too limited, so that it was very difficult for people to to tap into and use this data.

Speaker 0

当时它就像是一个监督式微调(SFT),没错。

So it was just like an SF like a supervised fine tuning Yep.

Speaker 0

API。

PI.

Speaker 0

我们当时觉得,'你可以勉强用用',但实际上它只适用于——说实话——基本上就是指令跟随的加强版。

Like, we're like, oh, you can kind of use it, but in practice, it really is only useful for, like like it's it's honestly just like instruction following plus plus.

Speaker 0

比如稍微调整语气,本质上还是在指导它。

Like, kind of change the tone, you're just really like instructing it.

Speaker 0

是的。

Yeah.

Speaker 0

但我认为最近的关键突破在于强化微调模型,因为通过这种设置,我们现在允许您实际运行RL(强化学习),这虽然更复杂且难度更高,需要投入更多资源。

But I think the the big unlock that has happened recently is with the reinforcement fine tuning model, because with that setup, we're now letting you actual run actually run RL, which is more finicky and it's like harder, and and, you know, like, you need to invest more in it.

Speaker 0

但它能让您更充分地利用数据。

But it allows you to leverage your data way more.

Speaker 2

顺便说一句,这对我而言是个天真的问题——根据我自己投资组合的理解,似乎存在两种使用模式。

By the way, this is just a naive question for me, which is it feels from just my understanding from my own portfolio, it feels like there's two modalities of use.

Speaker 2

一种是长期拥有大量数据宝藏,基于这些数据离线创建模型,然后部署使用。

One of them is I've got a treasure trove of data that I've had for a long time, I create my model on that treasure trove of data, and all that happens offline, and then I deploy that.

Speaker 2

对。

Yep.

Speaker 2

另一种则是产品需要实时使用的情况。

There's another one which is like, actually have the product being used in real time.

Speaker 0

我有

I've got

Speaker 2

一群用户。

a bunch of users.

Speaker 2

是的。

Yeah.

Speaker 2

而且实际上,我可以更贴近用户。

And like, I can actually get much closer to the user.

Speaker 2

我可以进行A/B测试并决定使用哪些数据,这更像是一种近乎实时的操作。

I can kind of a b test and decide which data, and, like, it's kind of more of a near real time thing.

Speaker 2

这是更侧重于产品方面还是... 是的。

Is is it like, is this focus on, like, more product stuff or more Yeah.

Speaker 2

宝藏数据?

Treasure trove?

Speaker 2

所以我们的梦想是

So the dream with the

Speaker 0

微调API的目标是能够同时处理这两种情况。

fine tuning API was that we should be able to handle both.

Speaker 0

对吧?

Right?

Speaker 0

就像我们实际上已经有了这个数据流,然后我们还有完整的LoRa微调推理设置,理论上我们应该能够扩展到数百万个这样的微调模型,这通常是在线学习场景下会出现的情况

It's like it's like we actually had this stream, then we have this whole, like, LoRa setup with the fine tuning inference where we should just be able to scale to, like, millions and millions of of these fine tuned models, which would is usually what would happen if you have, like, this online learning

Speaker 2

事情。

thing.

Speaker 2

完全正确。

Exactly.

Speaker 0

是的。

Yeah.

Speaker 0

实际上,主要还是那种形式。

In practice, it's mostly been the the form.

Speaker 0

对吧?

Right?

Speaker 0

实际上,主要还是他们已创建的离线数据,或是他们与专家合作生成的数据,或是通过使用他们产品所产生的数据,这些数据能够在这里被利用。

In practice, it's mostly been, like, the offline data that they've, like, already created or they are creating with experts or or something and, like, using their product that they're that they're able to use here.

Speaker 0

但关于强化微调API,我主要想说的是,它改变了原有范式,不再只是像SFT那样做小的渐进式改进,比如语气调整。

But the main thing I I was trying to say around the reinforcement fine tuning APIs, it kind of changes the paradigm away from just, like, small incremental improve like, tone improvements, which is what SFT did Yeah.

Speaker 0

而是真正提升模型,使其在特定用例上可能达到苏打水准,这是你所了解的。

To actually improving the model to potentially soda level on a particular use case that you you know about.

Speaker 0

对。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

这正是人们真正开始使用强化微调API的地方,也是它获得越来越多采用的原因。

Like, that's where people have really started using the reinforcement fine tuning API, and that's why it's it's it's gotten more more more uptake.

Speaker 0

因为如果讨论的内容更像是'我能让这个模型不以某种特定方式说话',那就没那么吸引人了。

Because if if the discussion is less like, hey, I can make this model, you know, not, like, speak in a certain way better, it's less compelling.

Speaker 0

但如果像是'在医疗保险编码或编码规划、代理规划等方面,能打造出世界上最优秀的...'

But if it's like, hey, for, like, you know, medical insurance coding or for, like, coding planning, agentic planning or something, can create the world's best Yeah.

Speaker 0

利用你的数据集通过RFT训练的模型,那价值就大得多了

Model using your dataset with RFT, then it becomes a lot more

Speaker 2

那么你们会不会,或者说你们有没有可能

And will you will you ever, like or maybe do you?

Speaker 2

你们会不会想办法获取那些数据?

Will you ever, like, find ways to get access to that data?

Speaker 1

比如说你

Like, you

Speaker 0

知道,是的

know Yeah.

Speaker 2

听着

So the Listen.

Speaker 2

如果我手上有数据又想要便宜的GPU,我会跟你交换的。

If I if I had the data and I wanted cheap GPUs, I'd trade you for it.

Speaker 0

这个嘛,我不确定。

Like, I don't know.

Speaker 0

嗯,是的。

Like Yeah.

Speaker 0

我是说,我们我们之前讨论过这个,我们实际上也在这里试行了一些定价方案,因为这些数据确实很有帮助,而且获取起来有点困难。

We I mean, we we we've talked about this, and we've actually been piloting some pricing here too where it's like because this data is, like, really helpful and and it's kinda hard to get.

Speaker 0

如果你真的使用强化微调API进行构建,你实际上可以获得折扣推理,如果你愿意分享数据,甚至可能获得免费训练。

And if you actually build with a reinforcement fine tuning API, you can actually get discounted inference and potentially free training too if you're willing to share the data.

Speaker 0

这总是,你知道的,取决于客户的选择。

It's always kind of, you know, it's up to the customer there.

Speaker 0

但如果他们这样做,对我们有帮助,对客户也会有好处。

But if they do, it is helpful for us, and there'll be benefits for the customer as well.

Speaker 0

那太棒了。

That's awesome.

Speaker 0

好的。

Okay.

Speaker 2

你说过对提示工程的看法已经改变了。

You said that views on prompt engineering have changed.

Speaker 2

是的。

Yeah.

Speaker 2

实际上,我之前并不知道这一点。

I wasn't actually, I wasn't aware of that.

Speaker 2

其他所有事情我都不知道,这个我也不知道。

All the other things I wasn't aware of, this one I wasn't.

Speaker 2

是的。

Yeah.

Speaker 0

我是说,我认为2022年时主流观点是这样的。

I mean, I think the prevailing view this is back in 2022.

Speaker 0

我记得当时和很多人交流过,他们基本上认为——这类似于单一模型AGI的观点——提示工程将不再重要,未来你甚至不需要考虑在上下文窗口中输入什么内容。

I remember I was talking to so many people, and they're basically I mean, this is similar to, like, the single model AGI view as well, which is like like, prompt engineering is just not gonna be a thing, and you're just not gonna have to think about what you're putting in the in the context window in the future.

Speaker 0

就像,模型会足够智能,它自己就能明白。

Like, the model would just be good enough, and it'll just, like, know.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

它会知道你需要做什么。

It'll know what what what you need to do.

Speaker 0

是啊。

Yeah.

Speaker 0

那不是

That's not

Speaker 2

那根本不存在。

that's not a thing.

Speaker 0

对。

Yeah.

Speaker 0

嗯,就像,那个...我也不确定。

Well, like, that that, like I don't know.

Speaker 0

也许人们忘了,但这确实是当时普遍的看法。

Maybe people forget it, but, like, that was, like, very common release.

Speaker 0

我觉得这可能和扩展法则之类的有关。

I think it's, like, scaling laws or whatever.

Speaker 0

在扩展法则中有某种机制,你会与模型心灵相通。

Something in scaling laws, and, like, you'll just mind meld with the model.

Speaker 0

而且,就像,你只需要提示和遵循指令,效果就会好到你根本不需要刻意去做。

And, like, you just, like like, prompting and, like, instruction following will just will be so good that you won't really need to do it.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

如果非要说的话,显然它一直是错的。

And if anything, like, yeah, it's, clearly been wrong.

Speaker 0

而且,是的。

And Yeah.

Speaker 0

对。

Yeah.

Speaker 0

嗯。

Yeah.

Speaker 0

但这很有趣,因为我认为我们现在所处的世界略有不同,模型在执行指令方面已经变得非常、非常出色,相比之前的GB三·五之类的模型。

But it is interesting because it's I think it's a slightly different world that we're in now where the models have gotten really, really good at instruction following relative to the, you know, like, GB three five or something.

Speaker 0

是的。

Yeah.

Speaker 0

但我认为现在的关键已不再是两年前我们所想的提示工程。

But I think the name of the game now is is less on, like, prompt engineering as we had thought about it two years ago.

Speaker 0

更多的是上下文工程方面,比如你给它提供什么工具?

It's more of, like, it's like the context engineering side where it's like, what are the tools you give it?

Speaker 0

它获取的是什么样的数据?

What is like the data that it pulls in?

Speaker 0

它何时会获取正确的数据?

When does it pull in the right data?

Speaker 2

嗯,这确实非常有趣。

Well, it's just very interesting.

Speaker 2

我是说,它已经简化到几乎荒谬的简单程度了。

I mean, it's reduced it to like an almost absurdly simplistic level.

Speaker 2

比如,关于RAG的奇怪之处在于,经典用法就像是用余弦相似度来选择要输入给超级智能的内容。

Like, the weird thing about rag, for example, the classic use of rag is like you're using like cosine similarity to choose something that you're gonna feed into a super intelligence.

Speaker 0

是啊。

Yeah.

Speaker 0

你知道吗,这简直像在侮辱人。

Know, you're like, I'm a rare It's like insulting.

Speaker 2

我居然要根据该死的嵌入空间随机抓取内容。

I'm gonna like randomly grab this thing based on fucking embedding space.

Speaker 2

这其实...当你想让超级智能做决策时,却要把智能推给检索环节...没错。

It doesn't really you know, I'm like and then I'm you know, when you want the super intelligence to decide the thing to do, and so it's like pushing intelligence in that retrieval Yep.

Speaker 2

显然这种做法

Clearly is something that makes

Speaker 0

很有道理。

a lot of sense.

Speaker 0

这几乎就像是在推动

It's almost like pushing

Speaker 2

将智能外推

the intelligence out in

Speaker 0

某种程度上

a way.

Speaker 0

没错

Exactly.

Speaker 0

而且公平地说,我认为RAG的引入是在模型还处于前推理阶段的时候

And and and to be fair, I think, like, RAG was kind of introduced when the models were like it was like pre reasoning models.

Speaker 0

所以当时的情况是,你只有一次机会来做这件事,而且它并不那么聪明。

So it was like, you only had to kinda like one shot to like do this, and it wasn't that smart.

Speaker 0

但现在我们有了推理模型,我的意思是,如果你喜欢的话,我最喜欢的模型实际上是o three,因为它是最勤奋的模型之一。

But now that we do have the reasoning models, now that we have I mean, if you like, one of my favorite models is actually o three because it was like one of the most diligent models.

Speaker 0

它有点

It kinda

Speaker 2

比如o three。

like o three.

Speaker 0

它它会执行所有这些工具调用,本质上就是智能体本身在尝试完成工具调用、RAG或其他类似操作,或者编写执行代码。

It It would just like do all these tool calls, and it's like really the the intelligence itself trying to like do the, you know, tool calls or reg or or anything like that or write the code to execute.

Speaker 0

对。

Yep.

Speaker 0

所以范式已经转变了,但正因如此,上下文工程、提示工程——你提供给模型的内容变得格外重要。

And so the the the paradigm has shifted there, but, yeah, because of that, think, like, context engineering, prompt engineering, what you put what you give the model is, like, extra important.

Speaker 0

没错。

Yep.

Speaker 0

是的。

Yep.

Speaker 0

对。

Yeah.

Speaker 2

好的。

Okay.

Speaker 2

所以你们有横向的API,还有像ChatGPT这样的垂直产品。

So you have API so you have the API, which is horizontal, you've got ChatGPT, other products which are vertical.

Speaker 2

我们甚至还没讨论到像素层面。

We haven't even talked about pixels.

Speaker 2

这些都还只是语言层面的东西。

This is all just this just language.

Speaker 2

智能体是一种新的模态吗?

Are agents a new modality?

Speaker 2

还是说它是别的什么?

Is that something else?

Speaker 2

比如,你知道的,像是代码本或者你这里说的模态具体指什么?

Like, you know, like a codex or What do you mean by modality here?

Speaker 2

在我看来它们似乎兼具垂直和水平特性。

Like and they feel both vertical and horizontal to me in a way.

Speaker 2

对我来说,ChatGPT就是一个产品。

Like, to me, ChatGPT is a product.

Speaker 2

对吧?

Right?

Speaker 2

它就像一个产品,连我妈妈都在用。

It's like it's a product and, like, my mom uses it.

Speaker 2

对吧?

Right?

Speaker 2

是的。

Yep.

Speaker 2

而API是开发者的工具。

And an API is a dev thing.

Speaker 2

你把它交给开发者,而CLI在我看来介于两者之间。

You kinda give it to a developer, and, like, a CLI is kinda somewhere in between to me.

Speaker 2

这就像是,它是个产品吗?

It's like, is it a product?

Speaker 2

它是横向的吗?

Is it, like, it is horizontal?

Speaker 2

嗯,是的。

Like Yeah.

Speaker 2

内部是如何处理它的?

How how is it handled internally?

Speaker 2

是一个完全独立的团队负责

Is it a totally separate team that

Speaker 0

代理相关的工作吗?

does agents?

Speaker 0

或者不。

Or No.

Speaker 0

是的,这很有趣,因为我觉得你刚才的描述方式几乎把代理看作是一个单一的概念,好像它可能有自己特定的...也许

So it's yeah, it's interesting because like, I I think the way that I I the way that you frame it just now almost seemed like agents was like this, like, singular concept that like, you know, might or like might have its own particular Maybe

Speaker 2

更好的问题是你眼中的代理是什么?

a better question is what is an agent to you?

Speaker 0

是的。

Yeah.

Speaker 0

对。

Yeah.

Speaker 0

对。

Yeah.

Speaker 0

对。

Yeah.

Speaker 2

对。

Yeah.

Speaker 2

对。

Yeah.

Speaker 2

确立共同语言对这次对话很重要。

Getting a language is, like, important for this conversation.

Speaker 2

对。

Yeah.

Speaker 0

所以我...我不知道。

So I I I don't know.

Speaker 0

其实我都不确定分享这个是否有帮助,但我对智能体的基本看法是——它是一种能代表你采取行动、并能长期运作的人工智能。

I actually don't even know if it'd be helpful for me to share, but my my general take on agents is it's it's a it's an it's an AI that will take actions on your behalf that can work over long time horizons.

Speaker 0

好的。

Okay.

Speaker 0

我认为这是更为普遍的预监管观点。

And I think that's the that's the more pretty general Pregulatory.

Speaker 2

是的。

Yeah.

Speaker 2

是的。

Yeah.

Speaker 2

确实如此。

Definitely.

Speaker 0

不过,如果你这样想的话,确实。

But, like, if you think about it that way yeah.

Speaker 0

我的意思是,也许这就是你所说的模态,但这只是使用AI的一种方式。

I mean, maybe this is what you mean by modality, but it is just a, like, way of, like, using AI.

Speaker 0

我想它可以被视为一种模态,但我们并不认为它是独立于AI之外的单独事物。

And it is a I guess it could be viewed as a modality, but we don't view it as, like, a separate thing, separate from AI.

Speaker 0

让我来说

Let let me let let

Speaker 2

让我试着给你解释一下这个问题是从何而来的。

let me just try and kinda, you know, give you a sense of where this question's coming from.

Speaker 2

就像,我知道如何打造一个产品,我们也知道如何为产品开拓市场,我们了解将它们转化为平台意味着什么,我们做这行已经很久了,对吧?

Like, I know how to build a product, like, and we know how to do go to market for products, we know how to do, like, you know, we know the implications of turning them into platforms, like, it's just we've been doing this for a very long time, right?

Speaker 2

我们也知道如何为API做同样的事情,对吧?

We know how to do the same thing for APIs, right?

Speaker 2

我们了解如何计费,明白人们基于它构建时产生的张力等等。

We know how to do billing, we know, like, the tension of, people build on top of it, and all of that stuff.

Speaker 2

而我一直在尝试的——这可能只是个人探索。

And like, what I've been trying to and this is just maybe a personal inquiry.

Speaker 2

对我来说还不明确的是,对于一个代理来说,它究竟属于这两个阵营中的哪一个,是更像产品阵营?

It's just not clear for me for an agent if you if it if it sits in one of those two camps, is it more like the product camp?

Speaker 2

还是更像...因为它有点两者兼有。

Is it more like the because it's kinda both.

Speaker 2

比如,我完全可以给你提供编码。

Like, I could like literally give you coding.

Speaker 0

对。

Yeah.

Speaker 0

对。

Yeah.

Speaker 2

然后作为用户,你只需要和它对话,或者我可以把它嵌入到我的应用里。

And like, as a user, and then you just talk to it, or I could like build in a way kind of embed it in, like, my app.

Speaker 2

所以这对你来说意味着,比如定价策略和生态系统影响的问题。

And so, like but then that means something to you as far as, like, you know, how do you price it and what does it mean for ecosystem?

Speaker 2

就像,是的。

Like Yeah.

Speaker 2

举个例子,如果我成立一家公司完全围绕Codecs来开发,你觉得可行吗?

Like, for example, like, would you be fine if I started a company and just, like, built it around Codecs?

Speaker 2

这能成立吗?

Is that a thing?

Speaker 0

创办一家公司并围绕它发展

Starting a company and building it around

Speaker 2

Codex?

Codex?

Speaker 2

是的。

Yeah.

Speaker 0

实际上我认为那会很棒。

I actually think that would be great.

Speaker 0

就像我们发布了Codex SDK,我们希望人们能够基于它进行开发和创新。

Like, it's a we release the Codex SDK, we want people to be able build it and hack it.

Speaker 0

没错。

Yeah.

Speaker 0

其实,我想你可能指的就是这个——这也是OpenAI的一个独特之处,反映了它的运营方式:归根结底,OpenAI是一家AGI(通用人工智能)公司。

Actually, I think this might be what you're getting at, which is and this is like a kind of a unique thing about OpenAI, and it kinda reflects on how how it's run, which is at the end, like, at the end of the day, OpenAI is like an AGI company.

Speaker 0

它本质上是一家智能公司。

It's like an intelligence company.

Speaker 0

是的。

Yeah.

Speaker 0

当然。

For sure.

Speaker 0

因此,智能体只是这种智能得以体现的一种方式。

And so agents are just like one way in which this intelligence kind of be manifested.

Speaker 0

没错。

Yeah.

Speaker 0

所以我认为我们内部的实际思考方式是,我们所有的不同产品线——Sora、Codex、API、ChatGPT——都只是部署的不同界面和不同方式

And so the way that I'd say we actually think about internally is all of our different product lines, Sora, Codex, API, ChatGPT, are just different interfaces and different ways So of deploying

Speaker 2

你们并没有真正

you don't really

Speaker 0

所以并不存在像这样专门考虑智能体的独立团队。

So there's no, like, single teams like this is, you know, like, thinking about agents.

Speaker 0

我认为它更多是以这种方式体现:每个产品领域都在思考如何将这种智能转化为实际可呈现的形式,使智能体行为更具可行性。

I would say the way that it it it manifests itself more is, like, each product area thinks about, like, what is, you know this intelligence is actually turning into a form where, like, you can actually agentic behavior is more possible.

Speaker 2

我明白了。

I see.

Speaker 0

在ChatGPT这样的第一方产品中,这会是什么样子?

What would that look like in a first party product like ChatGPT?

Speaker 0

是的。

Yeah.

Speaker 0

那会是什么样子?

What would that look like?

Speaker 0

这正是Codex最终成为独立产品的原因。

That's that's actually why Codex ended up becoming its own product.

Speaker 0

在编程类产品中,它会是什么样子?

Like, what would it look like in a coding style product?

Speaker 0

是的。

Yeah.

Speaker 0

我们探索过,ChatGPT在那里某种程度上可行,但实际上CLI界面更有意义。

Like, we explored it, and ChatGPT, like, kind of worked there, but, like, actually, the CLI interface actually makes a lot more sense.

Speaker 0

这是部署它的另一种界面。

That's another interface to deploy it.

Speaker 0

是的。

Yeah.

Speaker 0

我明白了。

I see.

Speaker 0

然后如果你看API本身,这又是部署它的另一种方式。

And then if you look about the API itself, it's like, this is another interface to deploy it.

Speaker 0

它是以一种略微不同的方式思考,因为采用的是开发者优先的思维模式。

It's it's thinking about it in a slightly different way because it's a developer first mindset.

Speaker 0

我们正在帮助其他人构建它。

We're helping other people build it.

Speaker 0

定价略有不同。

The pricing is slightly different.

Speaker 0

但这些都是这种核心智能的不同表现形式,也就是亚洲行为。

But it's all these, like, different manifestations of this core, like, intelligence that is the the the Asian behavior.

Speaker 0

是啊。

Yeah.

Speaker 2

整个经济体系中有如此大比例本质上只是代币洗钱,这太惊人了。

It is so remarkable how much of this entire economy is basically just token laundering.

Speaker 0

字面意义上

Literally

Speaker 2

就像我能做的任何事都是为了把英语或自然语言输入进去,然后让智能输出结果。

like anything I can do to get like like English in or like a natural language in and then like, you know, the intelligence out.

Speaker 2

对。

Yeah.

Speaker 2

我是说,这这这是因为这些系统对分层处理有很强的抵抗力。

And I mean, and and and it's because these are so resistant to layering.

Speaker 2

要把语言分层处理实在太困难了。

It's so hard to layer language out.

Speaker 2

就像,你知道的,没错。

Like, you know, like Yep.

Speaker 2

是的。

Yep.

Speaker 2

我甚至可以用编解码器轻松实现,相当容易。

I could even do it easily pretty easily with, like, codecs.

Speaker 2

我可以直接把它当作程序的一个组件使用,基本上就是将智能进行'洗白'处理

I could just, like, use it, you know, as a as a component of a of a program and just, you know, basically launder intelligence to

Speaker 0

它。

it.

Speaker 2

当然,你知道,这样做是要收费的。

I mean, of course, you know, I'd be charged to do that.

Speaker 0

所以

So

Speaker 2

对。

Yes.

Speaker 2

实际上,根据我观察过这么多不同产品的发布——包括智能体发布、你们定义的产品、各种API以及基于这些的产品——我的看法是,它们与我们习惯的东西其实大不相同。

It I I actually my my view of this and having seen now so many kind of launches of different products, I've seen agent launches and the definition that you have, I've definitely seen APIs, and I've seen products on these is like, they're actually quite different than, like, what we're used to.

Speaker 2

比如,成本结构不同,防御性也不同。

Like, the COGS is different, the defensibility is different.

Speaker 2

是的。

Yep.

Speaker 2

哦,所以我们某种程度上是在重写它。

Like, oh, so we're kind of rewriting it.

Speaker 2

所以这有点像,你知道的,你来自定价背景。

And so it's kind of like, you know, you came from a kind of pricing background.

Speaker 2

我是说,你正在研究定价的演示模型。

I mean, you're working on demo model for pricing.

Speaker 2

现在你有了API。

Now you have the API.

Speaker 2

所以我真的很想听听你的想法,我是说,你是如何逐步调整思路的,以及你如何为这些智能访问定价,毕竟你无法预知会有多少人使用它。

So I just love your thoughts on like I mean, how how how have you evolved your thinking and how do you price these, you know, access to intelligence where, you know, you don't know how many people gonna use it.

Speaker 2

几乎可以肯定的是,这会是基于使用量的计费方式,而非其他形式。

It's almost certainly usage based billing, not something else.

Speaker 2

你能稍微谈谈关于这些东西的定价理念吗?

Like, can you can you talk just a bit about, like, philosophy around pricing on these things?

Speaker 2

产品和API的定价方式是否不同?

Is it different for product versus API?

Speaker 2

比如

Like

Speaker 0

是的。

Yeah.

Speaker 0

我认为这里的实际情况是,它也随着时间的推移而演变。

I think the the the honest truth here is, like, it's evolved over time as well.

Speaker 0

而且,说实话,我们在API上采用基于使用量的定价,最根本的原因是这最接近实际使用情况。

And and, like, I actually think the simplest like, the reason why we've done usage based pricing on the API honestly is because it's been the likes it it's closest to how it's actually being used.

Speaker 0

所以我们最初就是这样开始的。

And so that's kinda how we how we started.

Speaker 0

实际上我认为API的基于使用量的定价出人意料地一直很稳固。

I actually think usage based pricing on the API has has has like surprisingly held strong.

Speaker 0

实际上,我认为这可能是我们会长期坚持的做法,主要是因为

And like, I actually think this might be something that we'll keep doing for for quite a long time, mostly because

Speaker 2

在我不明白你们为何不采用按使用量计费的背景下

In the context of how I don't know how you don't do usage based.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 2

就是不知道该怎么... 是的。

Just don't know how that Yeah.

Speaker 0

然后...然后...然后还有我们如何定价的策略。

And then and then and then there's also the strategy of, like, how we price it.

Speaker 0

在内部,我们做的一件事就是始终确保我们的按使用量定价实际上是从成本加成的角度来制定的。

And and internally, one thing we do is is we always make sure that we actually price our our our usage based pricing from a, like, cost plus perspective.

Speaker 0

就像,我们实际上只是试图确保我们在...完全合理。

Like, we're we're actually just, like, trying to make sure that we're being responsible from a Totally makes sense.

Speaker 0

从利润率的角度来看。

From a margin perspective.

Speaker 0

顺便说一下,这是

By the way, this is

Speaker 2

这是整个行业的一个巨大转变,因为,我记得从本地部署到订阅制的转变。

a huge shift in the industry in general just because, like, I remember the shift from on prem to to recurring.

Speaker 2

是的。

Yeah.

Speaker 2

那是个非常非常大的变化。

That was a big, big deal.

Speaker 2

就像,这催生了Zoro。

Like, that created Zoro.

Speaker 2

它催生了整个公司。

Like, it, created whole companies.

Speaker 2

这就像里面整本书的内容。

It's like whole books on in there.

Speaker 2

比如一大堆关于如何进行这种变革的顾问,对吧。

Like, a bunch of consultants on how you do this to change, like Yeah.

Speaker 2

你知道吗?

You know?

Speaker 2

而且我认为向按使用量付费的转变更加重大。

And, like, I think the shift to to usage is as bigger, bigger.

Speaker 2

这还是个非常棘手的技术难题。

And it's also even a really hard technical problem.

Speaker 2

是啊。

Yeah.

Speaker 2

我甚至无法想象8亿,哇,你们是怎么搭建的?

I can't even imagine 800,000,000 wow, like, do you build?

Speaker 0

是啊。

Yeah.

Speaker 0

8亿用户的情况会简单些,因为是订阅制而非按用户计费。

800,000,000 wow is a little easier because it's not user based pricing, it's subscription.

Speaker 0

所以听起来容易多了。

So it's like, that sounds like way way easier.

Speaker 0

确实容易很多。不过API上还是有大量用户,我们得处理所有账单相关的事务。

That's way way But I mean, there's still like a like a lot of users on the API that we need to, like, you know, manage all the billing side.

Speaker 2

有些超额使用的情况或者

There's some, like, overages or stuff you've

Speaker 0

需要处理的问题?

gotta deal with on that?

Speaker 0

你说的超额是指什么?

Or What do you mean by overages?

Speaker 0

我也不太清楚。

Like, I don't know.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客