Home / Tech / Don’t sleep on Cohere: Command A Reasoning, its first reasoning model, is built for enterprise customer service and more

Tech

Don’t sleep on Cohere: Command A Reasoning, its first reasoning model, is built for enterprise customer service and more

By Linda Park - Technology Editor

No Comments

August 22, 2025 7:04 am

Don’t sleep on Cohere: Command A Reasoning, its first reasoning model, is built for enterprise customer service and more

3. Excels at enterprise targeted benchmarks

4. Safety: built for high-stakes work

5. Early buy-in from large enterprises

8. Related

Carl Franzen 2025-08-22 03:35:00

Want smarter insights in your inbox? Sign⁣ up for our weekly newsletters to⁤ get only what matters to ‍enterprise AI, data, and⁢ security leaders. Subscribe⁤ Now

I was in more meetings than usual today so I just caught up to the fact that Cohere, the Canadian startup geared⁣ co-founded by former Transformer paper author ⁢ Aidan Gomez toward making generative AI products work easily, powerfully, and ‌securely for enterprises, has⁣ released its frist reasoning large language model (LLM), Command A⁣ Reasoning.

It looks to be a strong release. Benchmarks, technical‍ specs, and early tests ‍suggest the model‍ delivers on adaptability, efficiency, and ⁣raw reasoning power.⁢

Customer service, ⁤market research, scheduling, data analysis are some of the tasks Cohere says it’s built to handle automatically at scale inside secure enterprise environments.

It is indeed ‌a text-only model,⁣ though, ‍but it should be ‌easy enough to hook up to multimodal models and tools. In fact, tool use is one of its primary selling points.

AI Scaling Hits Its Limits

Power caps, rising token costs, and‍ inference delays are reshaping enterprise ‍AI. Join our exclusive salon to discover how top teams are:

Turning energy into ‌a strategic advantage

Architecting efficient inference for real throughput gains

Unlocking competitive ROI with sustainable AI systems

Secure your spot to stay ahead: https://bit.ly/4mwGngO

While ⁤it’s open‍ for researchers to use for non-commercial purposes, enterprises will need to pay Cohere ‍to get access and the company doesn’t publicly list its pricing because it says it makes bespoke customization and private deployment.

Cohere was valued at $6.8 billion when it announced its latest funding round of $500 million a week and a day ago.

Also Read: Anker Prime Wireless Charging Station Review: Is It Worth the Price?
Tuned for enterprises

Command ‌A Reasoning is tuned for enterprises with sprawling document libraries, long email chains, and workflows that can’t afford hallucinations.

It supports up to 256,000 tokens on multi-GPU setups, a decent size and comparable⁣ to OpenAI’s GPT-5.

The research release ⁣weighs in at 111-billion parameters, trained with tool-use and multilingual performance in mind.

It ‍supports 23 languages out of‍ the box, including English, French, Spanish, Japanese, Arabic, and Hindi. That multilingual depth ⁢is key for global enterprises that need consistent agent quality ‍across markets.

The model slots directly into North,Cohere’s new platform for deploying AI agents and automations on-premises.⁤

That⁤ means enterprises can spin up⁤ custom agents that⁣ live entirely‌ within their⁤ infrastructure, giving them control over data⁣ flows ⁢while still tapping into advanced reasoning.

Cohere looks like it’s thought‌ cleverly⁢ to identify some of the recurring functions across enterprises — onboarding, market research and analysis, advancement — and trained ⁢its model to support its ⁤agentic workflows for handling these⁢ automatically.

Controlled thinking

As with manny other recent reasoning releases including Nvidia’s new Nemotron-Nano-9B-v2, Command A Reasoning introduces a⁢ token budget feature to let users ⁤or developers specify how much reasoning to allocate to specific inputs ‍and tasks. Less⁣ budget‌ means faster, cheaper replies. More budget means deeper, more accurate ⁤reasoning.

The Hugging face release even exposes this tradeoff directly: reasoning can be ‌toggled on or ⁣off ⁢through a simple ⁢parameter.

Developers can run‍ the⁣ model in ⁣“reasoning mode” for maximum performance or switch it off for lower latency tasks—without changing models.

Excels at enterprise targeted benchmarks

So how⁣ does ⁤it ⁣perform in practice? Cohere’s benchmarks⁢ paint a clear picture.

Also Read: Prime Day 2024: 5 Free Amazon Offers You Can Get Now
On enterprise reasoning tasks, Command A Reasoning consistently ⁢outpaces peers like DeepSeek-R1 0528, gpt-oss-120b, and Mistral Magistral Medium.

It handles multilingual benchmarks with equal strength, important for⁤ global businesses.

The token budget system isn’t just a gimmick. In head-to-head comparisons against Cohere’s previous Command A model, satisfaction scores‌ climbed steadily as the budget⁢ increased.⁢ Even with “instant” ⁢minimal reasoning, Command A Reasoning beat⁤ its predecessor. At⁣ higher budgets, it pulled further ahead.

The story is the same in deep research. On the ⁣ DeepResearch Bench—which measures instruction following, readability, insight, and comprehensiveness—Cohere’s system came out on top against offerings from⁣ Gemini, OpenAI, ⁣Anthropic, Perplexity, and xAI’s Grok. The model excelled in turning sprawling questions into reports that are not only detailed but readable, a key challenge in enterprise ‍knowledge work.

beyond benchmarks,‍ the model is ⁢wired for action. Cohere ‍trained it specifically for conversational tool use — letting it call APIs, ‍connect to databases, or query external ‌systems during a task.

Developers can define tools via JSON schema and feed them into chat templates in Transformers, making it ⁤easier to integrate the model into existing enterprise systems.

That design supports Cohere’s larger ‌bet on agentic workflows: AI systems made up of multiple coordinated agents, each handling a piece of a bigger⁣ job. Command A Reasoning is the‌ reasoning engine that keeps those workflows coherent and on task.

Safety: built for high-stakes work

Cohere is also pitching safety as ‍a central feature. The model is trained to‌ avoid the common enterprise headache of over-refusal — when an AI rejects legitimate requests out of caution — while ⁤ still filtering harmful or malicious content. ‌

Also Read: X (Twitter) Grok AI: Malicious Links & Security Risks
Evaluations ⁢focused on five high-risk‌ categories: child safety, self-harm, violence and hate, explicit material, and conspiracy⁣ theories.

For companies looking to⁢ deploy AI in regulated industries or sensitive domains, this balance is meant to make⁤ the model more practical in day-to-day operations.

Early buy-in from large enterprises

SAP SE is one of the first major partners to‍ integrate ⁤the model. Dr. ‌Walter Sun, SVP⁢ and⁢ Global ⁣Head of AI,⁢ said the collaboration will enhance SAP’s generative⁢ AI capabilities within the SAP Business Technology Platform.For customers, that means agentic applications that can be customized to fit enterprise-specific needs.

Availability and licensing

Command A Reasoning is available⁤ now on the Cohere platform, and for research use on Hugging Face.

The Hugging Face repository provides open weights for research under a CC-BY-NC license,requiring users to ⁣share contact data and⁢ adhere to ⁢Cohere’s Acceptable Use Policy.

Enterprises interested in commercial or private deployments can contact ⁤Cohere’s sales team for ⁣bespoke pricing.

For enterprises, the pitch is straightforward: one model, multiple modes of deployment, fine-grained control over performance, multilingual capability, tool integration, and benchmark results that suggest it outperforms its peers.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB daily has you covered. We give you the inside scoop on⁤ what companies are doing with generative AI, ⁤from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters ⁢hear.

An error⁤ occured.