GPT 4O Benchmark