Would love to see the reasoning capabilities of the OpenAI image 2 model benchmarked with this.