News & Analysis

Mistral launches Forge: enterprises can now train AI on their own institutional knowledge

Based on: Mistral AI

Mistral AI launched Forge, a platform that lets enterprises train frontier AI models directly on their own documentation, codebases, and operational processes. The launch marks a significant shift: intelligence no longer has to be rented from a generic cloud model. Enterprises can now own it outright.

On Tuesday, Mistral AI announced Forge, a platform designed for enterprises that want to train AI models on their own institutional knowledge rather than relying on generic, publicly trained models. The company is positioning Forge as the infrastructure layer for building what it calls "frontier-grade AI grounded in proprietary knowledge" - models that understand a company's internal terminology, compliance policies, operational procedures, and years of accumulated decisions.

Forge supports the full model training lifecycle: pre-training on large internal datasets, post-training methods to refine behavior for specific tasks, and reinforcement learning to align models with internal policies and evaluation criteria. Critically, models built with Forge can be deployed within a company's own infrastructure - meaning proprietary data, training runs, and the resulting model weights stay under the organization's control. Mistral has already partnered with ASML, Ericsson, the European Space Agency, and several others to train domain-specific models through the platform.

The platform also builds in an "agent-first" design philosophy. Mistral's own autonomous agent, Mistral Vibe, can use Forge to fine-tune models, find optimal hyperparameters, schedule jobs, and generate synthetic training data - handling the operational overhead of model development autonomously. Forge supports both dense and mixture-of-experts architectures, giving enterprises control over the tradeoff between performance and inference cost.
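The dense-versus-MoE tradeoff mentioned above comes down to how many parameters are active per token at inference time. A back-of-the-envelope sketch (all sizes and routing figures below are illustrative assumptions, not Forge or Mistral specifics):

```python
def active_params_dense(total_params: int) -> int:
    # A dense model activates every parameter for every token,
    # so inference cost tracks the full parameter count.
    return total_params

def active_params_moe(total_params: int, num_experts: int,
                      experts_per_token: int, shared_frac: float = 0.2) -> int:
    # In a mixture-of-experts model, a router sends each token to a
    # small subset of experts; only those experts' weights (plus the
    # shared, non-expert layers) are used, so inference cost tracks
    # *active* parameters rather than total parameters.
    shared = int(total_params * shared_frac)
    per_expert = (total_params - shared) // num_experts
    return shared + experts_per_token * per_expert

total = 56_000_000_000  # hypothetical 56B-parameter model
print(f"dense active params: {active_params_dense(total):,}")
print(f"MoE active params:   {active_params_moe(total, 8, 2):,}")
```

Under these assumed numbers, the MoE variant reads less than half the weights per token at comparable total capacity - which is exactly the performance-versus-inference-cost dial the article describes.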

The significance of Forge extends well beyond Mistral's product roadmap. For years, the default AI deployment pattern has been to take a general-purpose model - GPT-4, Claude, Llama - and steer its behavior through prompts, retrieval, and fine-tuning. That works for many use cases. But there is a category of enterprise knowledge that is genuinely hard to surface through these approaches: the accumulated reasoning embedded in years of internal documentation, the edge cases baked into compliance policy, the vocabulary specific to a particular industry or company. Retrieval-augmented generation helps, but a model that was never trained on your domain still has to translate across a significant gap.

Custom-trained models close that gap. When a model has been pre-trained or post-trained on a company's actual internal corpus - its contracts, engineering specifications, regulatory filings, internal wikis - it understands context that a generic model can only approximate. Mistral's framing for this is "strategic autonomy": the idea that companies operating in regulated environments should not have to depend on an external vendor's model behavior, evaluation standards, or governance framework. The model itself becomes institutional property.

There is also a direct cost implication. A model that natively understands your domain requires less retrieval infrastructure, fewer prompt tokens spent on context-setting, and produces more accurate outputs on the first pass. Lower error rates mean fewer human review cycles. For high-volume document processing pipelines - thousands of invoices, contracts, or regulatory submissions per day - even a modest improvement in straight-through processing rates compounds into significant operational savings.
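The compounding effect of straight-through processing rates is easy to quantify. A minimal sketch (document volumes, review costs, and rates are illustrative assumptions, not figures from Mistral or Laava):

```python
def annual_review_cost(docs_per_day: int, stp_rate: float,
                       review_cost_per_doc: float, workdays: int = 250) -> float:
    # Documents that fail straight-through processing trigger a human
    # review cycle; annual cost scales with the failure rate.
    failed_per_day = docs_per_day * (1 - stp_rate)
    return failed_per_day * review_cost_per_doc * workdays

# Hypothetical pipeline: 5,000 documents/day, €4 per manual review.
baseline = annual_review_cost(5_000, stp_rate=0.80, review_cost_per_doc=4.0)
improved = annual_review_cost(5_000, stp_rate=0.88, review_cost_per_doc=4.0)
print(f"baseline annual review cost: €{baseline:,.0f}")
print(f"improved annual review cost: €{improved:,.0f}")
print(f"annual saving:               €{baseline - improved:,.0f}")
```

With these assumed numbers, a modest 8-point STP improvement cuts the review workload by 40% - the "compounds into significant operational savings" point in concrete terms.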

At Laava, we have always argued that the reasoning layer of an AI system should be treated as infrastructure, not as a black box rented from a cloud provider. Our 3 Layer Architecture separates context (metadata, document governance), reasoning (the model), and action (integration with ERP, CRM, and operational systems). The model in Layer 2 is deliberately modular. We select it based on the task, the data sensitivity, and the client's infrastructure constraints - not because one vendor told us their model is best.

Forge fits directly into this architecture. For clients with large proprietary document corpora - a logistics company with ten years of charter party agreements, a construction firm with thousands of project specifications, a financial institution with its entire compliance rulebook - a domain-trained Mistral model deployed within the client's own infrastructure could meaningfully outperform a generic model accessed via API. The key phrase is "within the client's own infrastructure": Forge models stay in your environment. Your training data does not leave your perimeter. The resulting weights belong to you.

This is a meaningful differentiator for European enterprises operating under the EU AI Act's transparency and governance requirements, or under GDPR obligations around automated processing of personal data. Sending sensitive documents to a US cloud API for processing introduces legal and compliance complexity that on-premise or private cloud deployment avoids entirely.

The practical path for most enterprises is not to immediately train a full foundation model from scratch - that requires substantial compute and data infrastructure. The more accessible entry point is post-training: taking an existing open-source base model (Mistral, Llama) and fine-tuning it on internal data for a specific task. That is something organizations with moderate data volumes and standard GPU infrastructure can do today. Forge structures this process, adding evaluation pipelines, reinforcement learning from internal feedback, and the governance controls that regulated industries require.
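One reason post-training is the accessible entry point is that parameter-efficient methods such as LoRA train only small low-rank adapter matrices while the base model's weights stay frozen. A sketch of the arithmetic (the layer shape and rank below are hypothetical, chosen for illustration):

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    # LoRA freezes the original (d_out x d_in) weight matrix and learns
    # a low-rank update B @ A, where A is (rank x d_in) and B is
    # (d_out x rank); only A and B are trained.
    return rank * d_in + d_out * rank

# Hypothetical 4096x4096 projection matrix, LoRA rank 16.
full_params = 4096 * 4096
adapter_params = lora_trainable_params(4096, 4096, rank=16)
print(f"full matrix params:  {full_params:,}")
print(f"LoRA adapter params: {adapter_params:,}")
print(f"trainable fraction:  {adapter_params / full_params:.2%}")
```

Training well under 1% of a layer's parameters is what brings fine-tuning within reach of the "moderate data volumes and standard GPU infrastructure" the article describes.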

If you are evaluating AI agents for document-heavy workflows and wondering whether a generic model will produce the accuracy your operations require, the honest answer is: it depends on the domain gap. For organizations whose processes are built around highly specific internal terminology, compliance frameworks, or technical standards, that gap can be significant enough to undermine production viability. A domain-trained model can close much of that gap.

At Laava, we run a model selection process as part of every engagement. That process now explicitly includes the option of a domain-adapted open-source model, deployed within the client's infrastructure, as an alternative to cloud API access. For clients where data sovereignty or accuracy requirements make a generic model insufficient, this path is increasingly viable. If you want to understand whether your use case would benefit from a custom-trained model versus a well-structured RAG pipeline over a generic model, that is exactly the kind of question we address in our free Roadmap Session.

Want to know how this affects your organization?

We help you navigate these changes with practical solutions.

Book a conversation
