Hopp til hovedinnhold
 AI-nyheter, ferdig filtrert for ledere
SISTE:

AWS viser kryptert AI-inferens uten å lese dataene • AWS flytter kodeagenter inn i styrt sky-runtime • AWS gjør Bedrock til inngang for GPT- og Claude-APIer • OpenAI sender S-1 til SEC • Pentagon setter Alibaba og Baidu på militærliste

GPT-5.4 Thinking Scores 83% on GDPVal — Above Human Expert Level on Value-Creating Tasks

JH
Joachim Høgby
16. mars 202616. mars 20263 min lesingKilde: Fortune

AI Surpasses Expert Level on Work That Actually Creates Value

OpenAI has launched GPT-5.4 "Thinking," and it's not an incremental update. The model has scored 83.0% on the GDPVal benchmark — a test designed to evaluate AI performance on tasks that actually create economic value, not just text generation or quiz questions.

My take:

83% on GDPVal is a number that should be written into the next board presentation. Not to create fear, but to calibrate the pace of AI adoption. We are no longer in the "near future" phase — we are in the "now" phase.

📬 Likte du denne?

AI-nyheter for ledere. Kuratert av en CIO som bygger det selv. Daglig i innboksen.