Testing the differences between AI services

If you are a regular reader of my blog, and I hope you are, you may have noticed a number of articles around a similar topic recently. A very common question these days is ‘What is the best AI service to use?’.

It turns out that the answer to that question is not straightforward. The reason is that AI models produce results ‘probabilistically’. This means, the answers are generated using probability based on the prompt that was made. Thus, even if you use exactly the same prompt, in exactly the same service, it is unlikely that you’ll get exactly the same answer, thanks to probability.

Thus, to provide some answers hopefully, I used the same prompt in a number of different AI tools and results can be found here:

Chatgpt (Deep Research) – https://blog.ciaops.com/2025/08/28/microsoft-365-business-premium-vs-hardware-firewalls-for-smbs/

Copilot Researcher – https://blog.ciaops.com/2025/08/29/security-without-the-high%e2%80%91priced-firewall-m365-business-premium-vs-traditional-firewalls-for-smbs/

Copilot Studio (with GPT5 reasoning) – https://blog.ciaops.com/2025/08/30/why-business-premium-can-replace-most-perimeter-security-for-typical-smbs/

Deepseek (Deep Research) – https://blog.ciaops.com/2025/08/31/how-m365-redefines-the-need-for-expensive-hardware/

Gemini (Deep Research) – https://blog.ciaops.com/2025/09/01/cybersecurity-for-the-modern-smb-a-strategic-analysis-of-m365-business-premium-vs-high-end-hardware-firewalls/

ChatGPT – https://blog.ciaops.com/2025/09/02/m365-business-premium-includes-so-many-advanced-security-controls-that-previously-required-on-premises-network-appliances/

M365 Copilot (GPT) – https://blog.ciaops.com/2025/09/03/why-the-perimeter-is-no-longer-the-control-that-matters-most/

Also, where possible, I used the same AI tool to create the image for the post, although not all tools provide this capability. I also used the ‘deep research’ option of the tool if it was available.

So, you can go and look at each results and judge the results for yourself and I’d love you to share what you think or the differences you have seen between different tools out there.

My plan going forward with these ‘baseline’ results is to use AI once again to compare and contrast them against each other to find the similarities and differences and report back.