Get a copy of my Comparing AI Services Report

bp1

I’ve bundled up all my research into different AI services and had Copilot Research generate a report which you can now request via email by filling in this form:

https://forms.office.com/r/tGK2GZPLc1

The report covers my findings from a recent series of blog posts I wrote that culminated with:

https://blog.ciaops.com/2025/09/12/comparing-ai-services-an-objective-analysis/

but the downloadable report brings all The articles nicely together with some additional insights.

CIA Brief 20250921

SharePoint Showcase highlights: Get the most out of SharePoint agents –

https://techcommunity.microsoft.com/blog/microsoft365insiderblog/sharepoint-showcase-highlights-get…

New collaborative agents in Microsoft 365 Copilot –

https://www.youtube.com/watch?v=biWymgItJ_I

Power up project management in Teams with the Project Manager Agent –

https://techcommunity.microsoft.com/blog/plannerblog/power-up-project-management-in-teams-with-the-…

Introducing Knowledge Agent in SharePoint –

https://techcommunity.microsoft.com/blog/spblog/introducing-knowledge-agent-in-sharepoint/4454154

AI and Microsoft Teams: A New Era of Collaboration –

https://techcommunity.microsoft.com/blog/microsoftteamsblog/ai-and-microsoft-teams-a-new-era-of-col…

Microsoft 365 Insider Round-Up: September 2025 –

https://www.linkedin.com/pulse/microsoft-365-insider-round-up-september-2025-microsoft-365-insider-…

Microsoft 365 Copilot: Enabling human-agent teams –

https://www.microsoft.com/en-us/microsoft-365/blog/2025/09/18/microsoft-365-copilot-enabling-human-…

Addressing multi-tenant management challenges for MSPs with Microsoft Intune and partner innovations –

https://techcommunity.microsoft.com/blog/microsoftintuneblog/addressing-multi-tenant-management-cha…

Copilot-powered file previews coming to Microsoft 365 Copilot app for iOS –

https://techcommunity.microsoft.com/blog/Microsoft365InsiderBlog/copilot-powered-file-previews-comi…

Introducing the new Network Security Hub in Azure –

https://techcommunity.microsoft.com/blog/azurenetworksecurityblog/introducing-the-new-network-secur…

Defending against evolving identity attack techniques –

https://www.microsoft.com/en-us/security/blog/2025/05/29/defending-against-evolving-identity-attack…

Microsoft seizes 338 websites to disrupt rapidly growing ‘RaccoonO365’ phishing service –

https://blogs.microsoft.com/on-the-issues/2025/09/16/microsoft-seizes-338-websites-to-disrupt-rapid…

Datacenters: The secret to connectivity –

https://news.microsoft.com/source/features/innovation/datacenters-the-secret-to-connectivity/

Face the future today by moving your application to cloud native –

https://techcommunity.microsoft.com/blog/microsoftintuneblog/face-the-future-today-by-moving-your-a…

Copilot Chat comes to the Microsoft 365 apps –

https://techcommunity.microsoft.com/blog/microsoft365copilotblog/copilot-chat-comes-to-the-microsof…

Get ready now: One month until Office 2016/2019 end of support –

https://techcommunity.microsoft.com/blog/officeeos/get-ready-now-one-month-until-office-20162019-en…

After hours

How Google Makes Billions Off You—And How To Opt Out –

https://www.youtube.com/watch?v=rtoRk6QS3i4

Editorial

If you found this valuable, the I’d appreciate a ‘like’ or perhaps a donation at https://ko-fi.com/ciaops. This helps me know that people enjoy what I have created and provides resources to allow me to create more content. If you have any feedback or suggestions around this, I’m all ears. You can also find me via email director@ciaops.com and on X (Twitter) at https://www.twitter.com/directorcia.

If you want to be part of a dedicated Microsoft Cloud community with information and interactions daily, then consider becoming a CIAOPS Patron – www.ciaopspatron.com.

Watch out for the next CIA Brief next week

Robert.Agent has been upgraded

I have now upgraded Robert.Agent to use GPT5 deep reasoning as you can see above.

All you need to do is send an email to robert.agent@ciaops365.com with your M365 question in the BODY of the email and you should get a reply to that question in a few minutes. Remember, deep reasoning models take longer to products results.

It is also important to note with Robert.Agent:

1. Each email is treated as a separate conversation. Robert.Agent has no ‘memory’ of any previous email you may have sent. Thus, treat each email you send as a single unique response or session.

2. Robert.Agent has normal Exchange Online security protection. This means, if the email you send looks like spam (i.e. no subject, signature, bad grammar and spelling, etc) then you won’t get a response.

Try the upgraded Robert.Agent out and let me know what you think.

Improved Windows Defender script

Recently, Microsoft updated Visual Studio code and Github Copilot to include the ability to auto detect which AI is best model to use when coding:

I therefore thought I’d take it for a spin and elected to use it to improve the script:

https://github.com/directorcia/Office365/blob/master/win10-def-get.ps1

which you now see the results for yourself. The script require escalation to local administrator to gather the information it needs.

Tests include:

– Attack Surface Reduction Rules

– Defender Settings

– Scanning Settings

– Latest Signature / Engine Versions

– Platform Security

and more. You will find the documentation at:

https://github.com/directorcia/Office365/wiki/Windows-Security-Audit-Script

which was also generated thanks to Github Copilot.

Let me know what you think and if you feel anything should be added.

CIA Brief 20250914

Intune AI Agent: Instant Threat Defense, Invisible Protection –

https://techcommunity.microsoft.com/blog/microsoft-security-blog/intune-ai-agent-instant-threat-def…

Microsoft Sentinel’s AI-driven UEBA ushers in the next era of behavioral analytics –

https://techcommunity.microsoft.com/blog/microsoftsentinelblog/microsoft-sentinel%E2%80%99s-ai-driv…

Zero Trust for Software Development Companies: what “good” looks like in practice –

https://techcommunity.microsoft.com/blog/marketplace-blog/zero-trust-for-software-development-compa…

Azure mandatory multifactor authentication: Phase 2 starting in October 2025 –

https://azure.microsoft.com/en-us/blog/azure-mandatory-multifactor-authentication-phase-2-starting-…

Protect Copilot Studio AI Agents in Real Time with Microsoft Defender –

https://techcommunity.microsoft.com/blog/microsoftthreatprotectionblog/protect-copilot-studio-ai-ag…

After hours

“Focusing is about saying no”- https://www.youtube.com/watch?v=H8eP99neOVs

Editorial

If you want to be part of a dedicated Microsoft Cloud community with information and interactions daily, then consider becoming a CIAOPS Patron – www.ciaopspatron.com.

Watch out for the next CIA Brief next week

Comparing AI services–an objective analysis?

If you have been following my articles about comparing AI services, you’d know that, through some ‘rule of thumb’ reasoning I was able to determine the following ranking of Ai services:

1. Deepseek

2. M365 Copilot

3. Copilot Researcher

4. Gemini

5. Copilot Studio

6. ChatGPT deep research

7. ChatGPT

The problem is that I used the same AI services to potentially evaluate the results that they in fact generated. Could that result in bias? Unsure, but I’d suggest probably, if you look at the results.

What I therefore decided to do was have the original articles evaluated by two AI services that were not on my original list, Claude and Grok. Here’s the result of jus these two:

AI Service	Claude	Grok	Total
M365 Copilot	7	4	11
Gemini	3	7	10
Copilot Studio	5	5	10
Deepseek	6	2	8
Copilot Researcher	2	6	8
ChatGPT Deep Research	4	3	7
ChatGPT	1	1	2

If I now incorporate these results in the overall results I get the following:

AI Service	Researcher	Gemini	ChatGPT	Claude	Grok	Total
M365 Copilot	7	3	4	7	4	25
Deepseek	5	4	7	6	2	24
Gemini	4	7	2	3	7	23
Copilot Studio	2	5	5	5	5	22
Copilot Researcher	6	6	1	2	6	21
ChatGPT Deep Research	3	2	3	4	3	15
ChatGPT	1	1	6	1	1	10

That changes the ranking slightly to:

1. M365 Copilot

2. Deepseek

3. Gemini

4. Copilot Studio

5. Copilot Researcher

6. ChatGPT deep research

7. ChatGPT

with the average score being 20, which most services exceed. ChatGPT still lags, even after this! Interesting, huh?

I think my original conclusion remains valid – most AI services, except for ChatGPT, seem to produce very similar quality on average when prompted in the same way.

Comparing AI Services–the final analysis

I started out to provide an indication of the differences between different AI services here:

Testing the differences between AI services

I did a quick comparison here:

An analysis of how AI services vary

I then did a deep analysis of all the generated articles using:

Copilot Researcher

Gemini Deep Thinking

ChatGPT Deep Thinking

If you now take those three results and assign a score of 7 = highest and 1 = lowest recommendations of each and total them up, you end up with this ranking table:

AI Service	Researcher	Gemini	ChatGPT	Total Score
Deepseek	5	4	7	16
M365 Copilot	7	3	4	14
Copilot Researcher	6	6	1	13
Gemini	4	7	2	13
Copilot Studio	2	5	5	12
ChatGPT Deep Research	3	2	3	8
ChatGPT	1	1	6	8

The winner then appears to be, on average, Deepseek. However, you will note that most AI services tested, except ChatGPT have similar scores, with the ‘average’ score being 12, which most services, except again ChatGPT, scored at or above.

This analysis is far from perfect or ideal or for that matter without bias. There are so many variables that possibly come into play that it very difficult, if not impossible, to get a true ‘apples vs apples’ comparison of AI services. However, I think this result still does provide value if you are looking to answer the question of the ‘best’ AI service. That answer seems to largely be that most AI services, apart from ChatGPT, are pretty much the on par when it comes to prompting, so choosing from amongst these simply based on their response to prompts, doesn’t seem to matter all that much.

Of course, there are plenty of other factors, aside from prompt results, that should be considered. The quality of the generated results also is greatly affected by the actual prompts used and I am sure that also varies across the AI services as well.

What I’ll now be interested to see is what the ‘click’ rate is on each article after a period of time. Will the Google AI service generate more article ‘hits’ than the other articles? Time will tell and I’ll report back once enough time has elapsed. These results also make a good benchmark to potentially test again down the track to see if things have changed at all and the progress these AI agents have made.

Interesting time ahead.

Comparing AI services – a third analysis

Recently, I have been analysing the results produced from the same prompt in carious AI services. After having the various AI services generate answers I compared their value using Copilot Researcher and Gemini. To provide a final alternate analysis of the articles I used ChatGPT Deep Research and received the following analysis, summarised here:

————————–

Rankings (Value to SMB Owner): Based on clarity, practicality, and depth of recommendations, we rank the articles as follows (1 = most valuable):

Deepseek (Aug 31) – Provides a step-by-step guide on replacing firewall functions with M365 features, with concrete examples (Safe Links, web filtering, Conditional Access) and even a cost comparison. Its clear bullet format and action-oriented advice make it highly accessibleblog.ciaops.com blog.ciaops.com.
ChatGPT (Sep 2) – Gives an ultra-concise Q&A answer summarizing the essentials. The “Answer in short” explicitly states that for most SMBs a basic firewall suffices and expensive NGFWs add little compared to built-in M365 protectionsblog.ciaops.com. Its brevity makes it very easy to digest (2‑minute read), though it is simple.
Copilot Studio (Aug 30) – A bullet-point summary that quickly lists the key protections in M365 (endpoint EDR, email filtering, conditional access, DLP, etc.)blog.ciaops.com blog.ciaops.com. Its Q&A style (“Short answer”) is user-friendly and covers the main points succinctly.
M365 Copilot (Sep 3) – Offers a short answer plus a configuration blueprint (steps to enforce MFA, Intune baselines, Defender settings, etc.) focused on identity/device security. This balances brevity with practical steps. It clearly states that for remote-first SMBs, a basic router is enough and effort should go into Intune/M365 policiesblog.ciaops.com blog.ciaops.com.
ChatGPT (Aug 28) – A traditional blog-style article with multiple sections on built-in security, Zero Trust, host firewalls, etc. It thoroughly covers many features of Business Premiumblog.ciaops.com blog.ciaops.com. While detailed, its long paragraphs (7‑min read) and technical tone may be harder for a busy non-expert to quickly follow.
Gemini (Sep 1) – A formal, research-style analysis (13‑min) with executive summary, threat modeling, TCO, and compliance discussionblog.ciaops.com blog.ciaops.com. It is extremely comprehensive (notably covering things like PCI DSS) but in heavy academic tone. This depth is impressive but likely more than a typical SMB owner needs.
Copilot Researcher (Aug 29) – An exhaustively detailed report (20‑min) with an executive summary, configuration guide, feature comparisons, and cost analysisblog.ciaops.com blog.ciaops.com. While it contains valuable info, its length and technicality make it the hardest to consume. It also concludes (like the others) that M365 BP can often replace expensive firewallsblog.ciaops.com, but the sheer volume can overwhelm small teams.

In summary, the shorter Q&A and bullet-list articles (Deepseek, ChatGPT Sep2, Copilot Studio, M365 Copilot) score highest for SMB owners because they deliver clear guidance quickly. The longer, more technical write-ups (Gemini, Copilot Researcher) score lower despite depth.

Overall Best: Deepseek’s article stands out as the most practical: it clearly explains why and how to use M365 features instead of buying pricey firewalls, with concrete steps and examplesblog.ciaops.com blog.ciaops.com. This level of clarity makes it particularly valuable for a small-business owner deciding on cost-effective security.