o3 - wow

o3 isn’t one of the biggest developments in AI for 2+ years because it beats a particular benchmark. It is one because it demonstrates a reusable technique through which almost any benchmark could fall, and at short notice. I’ll cover all the highlights, the benchmarks broken, and what comes next. Plus, the costs OpenAI didn’t want us to know, Genesis, ARC-AGI 2, Gemini-Thinking, and much more.

AI Insiders ($9!):   / aiexplained

FrontierMath: https://epoch.ai/frontiermath https://arxiv.org/pdf/2411.04872
Chollet Statement: https://arcprize.org/blog/oai-o3-pub-...
MLC Paper: https://www.scientificamerican.com/ar...
AlphaCode 2: https://storage.googleapis.com/deepmi...
Human Performance on ARC-AGI: https://arxiv.org/pdf/2409.01374v1
Wei Tweet ‘3 months’: https://x.com/_jasonwei/status/187018...
Deliberative Alignment Paper: https://openai.com/index/deliberative...
Brown Safety Tweet: https://x.com/polynoamial/status/1870...
SWE-bench Verified: https://openai.com/index/introducing-...
Amodei Prediction: https://x.com/OfirPress/status/185856...
David Dohan: 16 hours https://x.com/dmdohan/status/18701714...
OpenAI Personal Writing: https://openai.com/index/learning-to-...
https://simple-bench.com/
John Hallman Tweet: https://x.com/johnohallman/status/187...

00:00 - Introduction
01:19 - What is o3?
03:18 - FrontierMath
05:15 - o4, o5
06:03 - GPQA
06:24 - Coding, Codeforces + SWE-verified, AlphaCode 2
08:13 - 1st Caveat
09:03 - Compositionality?
10:16 - SimpleBench?
13:11 - ARC-AGI, Chollet
20:25 - Safety Implications

Non-hype Newsletter: https://signaltonoise.beehiiv.com/
Podcast: https://aiexplainedopodcast.buzzsprou...
