IMO adalah 3 masalah per 4,5 jam <-> 1,5 jam per masalah Jadi kami ... tepat pada jadwal?
METR
METR19 Mar 2025
When will AI systems be able to carry out long projects independently? In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.
5,24K