AI Model Directory
Compare 20+ AI models across pricing, capabilities, context windows, and licensing. Find the right model for your use case — from frontier reasoning to edge deployment.
GPT-4o
$2.50/$10.00 per 1M tokens (input/output). OpenAI's flagship multimodal model with text, vision, and audio capabilities.
Claude 4 Sonnet
$3.00/$15.00 per 1M tokens (input/output). Anthropic's balanced model with strong reasoning and a 200K context window.
Claude 4 Opus
$15.00/$75.00 per 1M tokens (input/output). Anthropic's most intelligent model for complex reasoning and research tasks.
Gemini 2 Pro
$1.25/$5.00 per 1M tokens (input/output, prompts under 128K). Google's frontier model with a 2M token context window and multimodal capabilities.
Llama 3 405B
Free (self-hosted) / varies via cloud providers. Meta's most capable open-weight model, approaching frontier performance.
Llama 3 70B
Free (self-hosted). Meta's balanced open-weight model with strong all-round performance.
Llama 3 8B
Free (self-hosted). Meta's efficient small model for edge deployment and on-device AI.
Mistral Large
$2.00/$6.00 per 1M tokens (input/output). Mistral AI's flagship model with strong multilingual and reasoning capabilities.
Mixtral 8x22B
Free (self-hosted). Mistral's open-weight Mixture-of-Experts model with efficient inference.
Qwen 2 72B
Free (self-hosted). Alibaba's open-weight model with strong Chinese-English bilingual capabilities.
Command R+
$2.50/$10.00 per 1M tokens (input/output). Cohere's enterprise model optimised for RAG and retrieval workflows.
DeepSeek V3
Free (self-hosted) / competitive API pricing. A 671B-parameter MoE model achieving frontier performance at remarkably low training cost.
Phi-3 Mini
Free (MIT licence). Microsoft's compact 3.8B model with surprising capability for edge deployment.
Grok 2
X Premium+ subscription / API available. xAI's model with real-time X (Twitter) access and competitive reasoning.
Stable Diffusion XL
Free (self-hosted). Stability AI's open-source image generation model for text-to-image creation.
Stable Diffusion 3
Free (community) / commercial licence available. Stability AI's latest image model with improved prompt understanding.
DALL-E 3
$0.040–$0.120 per image. OpenAI's image generation model with superior prompt adherence and text rendering.
Midjourney V6.1
$10–$120/month subscription. Leading AI art platform known for exceptional aesthetic quality.
Whisper Large V3
Free (self-hosted) / $0.006/min via API. OpenAI's open-source speech recognition model supporting 99 languages.
Claude Code
Included with Claude API usage. Anthropic's agentic coding tool built on Claude for software development tasks.
GitHub Copilot
$10–$39/month per user. AI pair programmer integrated into VS Code, JetBrains, and other IDEs.
Guide
How to choose
Choosing the right AI model depends on your specific requirements across several dimensions. Start by defining your primary use case: text generation, image creation, speech processing, or code assistance. For text models, consider the tradeoff between capability and cost: frontier models like GPT-4o, Claude 4 Opus, and Gemini 2 Pro offer the highest quality at premium pricing, while models like Mistral Large and Command R+ deliver strong performance at lower cost.

Data sovereignty and deployment flexibility are increasingly important factors. If you need to self-host for privacy, compliance, or cost reasons, open-weight models like Llama 3, DeepSeek V3, and Mixtral are your best options. They give you full control over your data and infrastructure, but require ML engineering expertise to deploy and maintain. For enterprises on cloud platforms, consider models available natively on your provider: Azure (OpenAI, Mistral), AWS Bedrock (Claude, Llama), or Google Cloud (Gemini).

Finally, evaluate context window size, latency requirements, and ecosystem maturity. If you process long documents, Gemini's 2M-token context or Claude's 200K context is a significant advantage. For real-time applications, smaller models like Phi-3 or Llama 3 8B offer lower latency. For production reliability, OpenAI and Anthropic have the most mature enterprise platforms, with SLAs, compliance certifications, and support infrastructure.
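To make these tradeoffs concrete, here is a minimal sketch of shortlisting models against hard requirements, using the per-1M-token prices listed in this directory. The context-window figures are approximate assumptions and worth verifying against each provider's documentation.

```python
MODELS = [
    # (name, input $/1M, output $/1M, approx. context tokens, open weights)
    ("GPT-4o",         2.50, 10.00, 128_000,   False),
    ("Claude 4 Opus", 15.00, 75.00, 200_000,   False),
    ("Gemini 2 Pro",   1.25,  5.00, 2_000_000, False),
    ("Mistral Large",  2.00,  6.00, 128_000,   False),
    ("Llama 3 70B",    0.00,  0.00, 128_000,   True),   # self-hosted
]

def shortlist(max_output_price, min_context, require_open=False):
    """Return model names meeting a price ceiling, a context floor,
    and (optionally) an open-weight requirement."""
    return [
        name for name, _inp, out, ctx, is_open in MODELS
        if out <= max_output_price
        and ctx >= min_context
        and (is_open or not require_open)
    ]
```

For example, `shortlist(10.00, 1_000_000)` narrows the table to models that stay at or under $10/1M output tokens and can take a million-token prompt.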
FAQ
Frequently asked questions
Which AI model is the best?
There is no single 'best' model. GPT-4o is the most versatile, Claude 4 excels at reasoning and safety, Gemini 2 Pro offers the largest context window, and Llama 3 leads in open-weight flexibility. The right choice depends on your specific use case, budget, and deployment requirements.
Should I use a commercial or an open-weight model?
Commercial models (GPT-4o, Claude) offer simplicity, reliability, and enterprise support. Open-weight models (Llama 3, DeepSeek V3) offer data control, customisation, and no per-token costs. Many organisations use a mix: commercial for prototyping and critical tasks, open for high-volume and specialised workloads.
How much do AI models cost?
Costs vary dramatically. Open models are free to self-host (plus infrastructure costs). Commercial API pricing ranges from $0.15/1M tokens (small models) to $75/1M tokens (Claude 4 Opus). For a typical enterprise, expect $500–$10,000/month depending on volume and model choice.
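The arithmetic behind such estimates is simple. Here is a minimal sketch; the traffic figures in the example are illustrative assumptions, not benchmarks.

```python
def monthly_cost(requests_per_day, in_tokens, out_tokens,
                 in_price_per_m, out_price_per_m, days=30):
    """Monthly USD cost for given traffic at per-1M-token prices."""
    per_request = (in_tokens * in_price_per_m +
                   out_tokens * out_price_per_m) / 1_000_000
    return requests_per_day * days * per_request

# Example: 2,000 requests/day, 1,500 input + 500 output tokens each,
# at GPT-4o's listed $2.50/$10.00 rates -> roughly $525/month.
cost = monthly_cost(2_000, 1_500, 500, 2.50, 10.00)
```

Running the same numbers against Claude 4 Opus's $15.00/$75.00 rates multiplies the bill several times over, which is why matching model tier to task matters at volume.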
Can I switch models or providers later?
Yes, if you design for it. Use abstraction layers like LiteLLM, LangChain, or the OpenAI-compatible API format (supported by most providers). This allows switching providers without rewriting application code. Test model-specific behaviours before switching in production.
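One common pattern is to keep all provider-specific details in configuration, so a switch is a config change rather than a code change. A minimal sketch, assuming OpenAI-compatible endpoints; the base URLs and model identifiers below are illustrative and should be checked against each provider's documentation:

```python
# Provider registry: everything provider-specific lives here.
# URLs and model names are illustrative examples, not guaranteed endpoints.
PROVIDERS = {
    "openai":      {"base_url": "https://api.openai.com/v1",
                    "model": "gpt-4o"},
    "mistral":     {"base_url": "https://api.mistral.ai/v1",
                    "model": "mistral-large-latest"},
    "self_hosted": {"base_url": "http://localhost:8000/v1",  # e.g. a vLLM server
                    "model": "llama-3-70b"},
}

def client_config(provider, api_key):
    """Build keyword arguments for an OpenAI-compatible client."""
    cfg = PROVIDERS[provider]
    return {"base_url": cfg["base_url"],
            "api_key": api_key,
            "model": cfg["model"]}
```

Application code then always talks to the same client interface, and moving from a commercial API to a self-hosted Llama deployment only changes the chosen registry entry.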
How large a context window do I need?
For simple Q&A and chat: 8–32K tokens is sufficient. For document analysis: 128K+ tokens. For processing entire codebases or books: Gemini's 2M or Claude's 200K context. Larger context windows cost more per request, so match your context size to your actual needs.
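A quick way to estimate which tier you need is the rough heuristic of about four characters per token for English text (real tokenizer counts vary by model and language). A minimal sketch, with the prompt-overhead figure as an illustrative assumption:

```python
def estimate_tokens(text_chars):
    """Very rough token estimate: ~4 characters per token for English."""
    return text_chars // 4

def context_tier(doc_chars, prompt_overhead=2_000):
    """Map an input document to the context tiers discussed above,
    reserving some headroom for instructions and output."""
    needed = estimate_tokens(doc_chars) + prompt_overhead
    if needed <= 32_000:
        return "standard (8-32K)"
    if needed <= 128_000:
        return "long-context (128K)"
    return "very long context (200K-2M)"
```

For instance, a 400,000-character report (roughly 100K tokens) lands in the long-context tier, while a full book-length manuscript pushes into the 200K–2M range.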
Need help picking the right tool?
Our team can help you evaluate options and implement the best solution. Book a free strategy call.