continues to excel at coding, leading on difficult coding benchmarks,” Google wrote in a blog post. “It also shows top-tier performance [on] highly challenging benchmarks that evaluate a model’s math, science, knowledge, and reasoning capabilities.”
So what else is new? Google says it addressed feedback from its previous 2.5 Pro release, improving the model’s style and structure. Now 2.5 Pro can be “more creative with better-formatted responses,” Google claims.