ARC-AGI-2 Tests AI Models
The ARC Prize Foundation has updated its AGI benchmark. The new version, ARC-AGI-2, poses significant challenges to current AI models: even leading models such as OpenAI’s o1 and DeepSeek’s R1 score only 1% on tasks that humans solve 60% of the time. A $1 million prize has been offered for solving the benchmark.