We have been using paid versions of the following AI engines for the last few months in our daily work. Here’s what we found:
Grok (xAI)
Strengths: Taps live data from X (formerly Twitter), giving unusual visibility into trending topics and public sentiment. Delivers fast, punchy answers with an opinionated tone some users find engaging. Strong on “what’s happening now” queries and cultural chatter.
Weaknesses: Strength outside X is improving but still maturing for enterprise breadth; tone can be too edgy for conservative brands. Enterprise controls and integrations trail Microsoft/Google/OpenAI.
Best applications: Real-time trend monitoring, social-media analysis, quick takes on breaking news, and exploratory ideation where timeliness matters more than depth. Agent automation can send you email alerts.
Claude (Anthropic)
Strengths: Excellent at long-form reasoning, summarisation, and careful, reliable writing. Strong safety alignment and a helpful, polite style. Handles very long documents and structured analysis with low friction and low hallucination risk.
Weaknesses: Risk-averse on sensitive topics; web and third-party tool integrations vary by plan/region. Creative risk-taking and code generation are good, though agentic/tooling breadth can lag OpenAI in some workflows.
Best applications: Policy drafts, research synthesis, RFP responses, legal-adjacent editing, large-document review—any use case prioritising caution, clarity, and reliability. We love the format of its responses.
ChatGPT (OpenAI)
Strengths: Broadly capable all-rounder in coding, reasoning, and creative writing. Large ecosystem (apps, APIs, enterprise), multimodality, and robust function/tool calling for automation. Wide community support and learning resources.
Weaknesses: Quality depends on model/tier and grounding; hallucinations can occur without retrieval/tools. Some advanced features sit behind paid tiers; enterprise governance requires correct configuration.
Best applications: General productivity, prototyping and coding assistance, content creation, data transformation, and APIs. We created two customised agents to do automations and workflows including retrievals and tools that worked very well.
Perplexity
Strengths: Web-native answer engine that cites sources by default. Great for rapid evidence gathering, link curation, and keeping you oriented to what’s verifiable. Strong follow-up flow for iterative research.
Weaknesses: Depth depends on source quality; can over-summarise when the literature is thin. Less suited to offline/long creative tasks without the web.
Best applications: Quick research briefs, literature scans, competitive-intel snapshots—any task where linked citations and current web context are essential. Tip: use Deep/Focused modes for tougher queries. Exports to PDF, markdown and Word.
Gemini (Google)
Strengths: Tight integration with Google ecosystem (Search, Workspace, YouTube, Maps) and strong image-text multimodality. Useful grounding for factual queries and collaborative tasks inside Docs/Sheets/Slides; Search grounding is a practical differentiator.
Weaknesses: Enterprise adoption hinges on admin/data controls. Can be cautious on sensitive topics.
Best applications: Workspace productivity (drafting in Docs, data wrangling in Sheets), search-adjacent tasks, educational use with rich web grounding, and multimodal prompts mixing text, images, and video references. Currently trialling NotebookLLM – will update you.
Microsoft Copilot (formerly 365 Copilot)
Strengths: Deeply embedded in Office/Teams/Outlook.
Weaknesses: Value depends on tenant data quality/permissions; licensing costs and rollout effort can be significant. Less useful outside the Microsoft stack.
Best applications: Knowledge-worker productivity inside Microsoft 365: The agent we created was one of the better ones in the way it synthesised data. Outputs can be opened in Word.
Quick Guide
- Need reliable long-form analysis? Claude.
- Want an all-purpose builder’s toolkit with code and tools? ChatGPT.
- Live, citation-first web research? Perplexity (try Deep Search for harder topics).
- Google-centric workflow or multimodal grounding? Gemini (plus Workspace side-panel/Docs).
- Microsoft-first enterprise productivity? Copilot.
- Social trend pulse and witty real-time takes? Grok.
Practical tip: Pair one generalist (ChatGPT or Claude) with one web-native(Perplexity or Gemini) and, if you’re heavily on Microsoft or X, add Copilot or Grok respectively. This covers depth, freshness, and embedded productivity without over-reliance on a single tool.
Contrarian Perspectives
• Grok: Some argue its X-centric data limits broader applicability, but others suggest its real-time social pulse offers unique predictive power for market trends Forbes on AI Trends.
• Claude: Critics note its cautious approach may hinder creative exploration, though supporters claim this ensures ethical reliability – Anthropic Safety Blog (https://www.anthropic.com/news/safety-first).
• ChatGPT: Variability in output quality is debated, but enterprise-grade configurations can mitigate this OpenAI Enterprise Case Studies.
• Perplexity: Source quality concerns exist, yet its iterative search is praised for reducing bias TechCrunch on Perplexity.
• Gemini: Regional access issues are a drawback, but Google’s data scale is seen as unmatched – Wired on Google AI (https://www.wired.com/story/google-gemini-ai-update).
• Copilot: High costs are criticized, but ROI is significant for Microsoft-heavy firms Gartner on Enterprise AI.


