Analysis – CIAOPS

This recent article:

https://blog.ciaops.com/2026/03/07/comparing-ai-services-image-generation/

highlights some important considerations for me when comes to AI services. It is also important to consider that given the outputs are images how they are perceived is (aka like/dislike) is subjective. However, there are some general principles we can apply (aka correctness).

I find it interesting in a world where the current hype is around Claude that that result was the poorest I think most would agree.

Created with Claude Sonnet 4.6 in 357 seconds

espresso_technical_infographic

The clear winner, at least when it comes to image creation is Nano Banana in this test.

Created Nano Banana in 54 seconds

It is interesting how fast Grok created an image but given the output you can clearly see why that is.

Created with Grok in 7 seconds

But to me, if you look at the three images Copilot created, especially the default one here:

Created with Copilot in 75 seconds

you would have to say that produced a quality result well ahead of others.

I hear a lot of complaints about the poor quality of Copilot but in my tests, like this image generation experiment, simply don’t bear that out, at least for me. Remember, I used the same prompt with all the models and the outputs are here.

The most interesting thing for me from this test (apart from the Grok speed) was the monumental fail of Claude to event get close to the others with this test. I fully appreciate that images may noty be Claude’s strength but, as a business, do you want to be constantly switching between models for images, then number, then code? That’s were the real productivity suck is in this new AI world, copying and pasting outputs to the right place.

To me, Copilot still wins as the best option for the overall best option for business information workers and these tests reinforce that.