Re: [新聞] OpenAI危險了！DeepSeek正式發佈V3.2 性

LoveSports 發表於 2025/12/5 下午1:00:32

看板Stock標題Re: [新聞] OpenAI危險了！DeepSeek正式發佈V3.2 性作者

(我要當一個渣攻)時間Dec 5 13:00:32 2025推噓 4 推:4 噓:0 →:2

※ 引述《xross (xross)》之銘言：
: 才沒幾天
: Deepmind 就又突然出個 Deep Think 版也是強調 IMO ICPC 數學 AI
: "gold medal winning IMO and ICPC technologies"
: https://x.com/demishassabis/status/1996683917991334300
: 時間點上不是巧合吧
: 怎麼看都像是逼對方出招啊
: 說好的垃圾時間呢???

關於這個贏得IMO金牌的Gemini pro Deep Think功能，

7/21 Google的DeepMind官網，就已經公開說明，

之後會製作一個版本，交給專家小組(包括數學家)測試後，於Google AI Ultra平台推出。

We will be making a version of this Deep Think model available to a set of
trusted testers, including mathematicians, before rolling it out to Google AIUltra subscribers.

https://i.imgur.com/4uwgTa3.png

也就是說，這本來就是計畫好要推出的東西，

只是七月到現在需要先給專家測試過用戶版本。

官網公告
https://deepmind.google/blog/advanced-version-of-gemini-with-deep-think-
officially-achieves-gold-medal-standard-at-the-international-mathematical-
olympiad/

縮網址
https://reurl.cc/KOe5Wm

順帶一提，GPT那邊也是一樣，
以下是科學人訪問OPEN AI的IMO競賽用模型的研發工程師，文章日期是今年8/21，
他們說期待在未來的模型中整合競賽用模型的推理能力。

Those contributed alot to the success here, and now we and others at OpenAI
are applying thembeyond math. It’s not in GPT-5, but in future models, we’
re excited tointegrate these capabilities.

https://i.imgur.com/wXHkN0t.png

有提到八月初推出的GPT5，並沒有包含IMO競賽模型的推論能力在內。

所以之後應該是還有精彩對決可以看。

科學人訪談網址
https://www.scientificamerican.com/article/openai-model-earns-gold-medal-score-at-international-math-olympiad-and/

縮網址
https://reurl.cc/bNVo2E

從兩篇文章看來，IMO競賽模型最主要擅長的似乎是花時間思考，處理複雜的任務。

此外最特別的是，兩家公司的模型都分別在競賽過程中，六題中只回答了五題，

有一題是在模型判斷自己不會之後，選擇不回答。

這代表這類深度思考模型可能具備不知道就承認不知道的能力。

這種能力是靠「後訓練」鍛鍊出來的，大家常說的scaling是「預訓練」。

「後訓練」強化推理能力主要有以下這些方法：

1. RLHF（以人類偏好訓練）