Skip to main content
Cloud / Azure / Products / Azure OpenAI in Foundry Models

Azure OpenAI in Foundry Models

Azure OpenAI in Foundry Models: access GPT-5, GPT-4.1 and o-series reasoning models with enterprise security and EU data residency.

ai-machine-learning
Pricing Model Pay-per-token (Standard) or reserved capacity as Provisioned Throughput Units (PTU)
Availability Standard, Data Zone, Provisioned, Batch and Developer deployment types
Data Sovereignty EU Data Zone within the EU Data Boundary (Sweden Central and other EU regions)
Reliability 99.9% availability SLA for standard deployments SLA

Azure OpenAI in Foundry Models provides API access to current OpenAI models, including the GPT-5 series, GPT-4.1, the o-series reasoning models, embeddings, and image, audio, and realtime models. You use these models with the security, compliance, and regional availability of Microsoft Azure.

What is Azure OpenAI in Foundry Models?

Azure OpenAI in Foundry Models is part of Microsoft Foundry (Azure AI Foundry) and delivers powerful OpenAI models as a fully managed cloud service. You access reasoning, chat, embedding, image, and audio models through a unified API, without operating your own infrastructure.

The service combines OpenAI’s models with the Azure ecosystem: unified billing, governance, identity integration, and network security. This lets you build production AI applications that meet enterprise requirements for security and data protection.

For companies with GDPR requirements, operation within the EU Data Boundary is possible. With Data Zone deployments, processing of prompts and completions stays exclusively in EU regions.

Core Features

  • Current model line-up: GPT-5 series, GPT-4.1 series, o-series reasoning models (o3, o4-mini), embeddings, image generation (gpt-image-1), and audio and realtime models.
  • Large context windows: up to around 1M tokens with GPT-4.1 and 400K tokens with the GPT-5 series for large documents and long conversations.
  • Flexible deployment types: Standard and Data Zone (pay-per-token), Provisioned (reserved PTU capacity), and Batch for asynchronous bulk processing.
  • Enterprise security: VNET integration, Private Link, managed identity, Azure Policy controls, and built-in content safety filters.
  • Data residency and compliance: EU Data Zone within the EU Data Boundary, with no use of customer data for model training.
  • Extensibility: function calling, structured outputs, the Responses API, fine-tuning (SFT, DPO, RFT), and integration with Azure AI Search for retrieval augmented generation.

Typical Use Cases

  • Conversational AI, chatbots, and internal copilots for knowledge work and support.
  • Content generation, translation, and summarization across large document sets.
  • Code generation and developer assistance within existing workflows.
  • Semantic search and RAG scenarios using embeddings and Azure AI Search.
  • Document analysis, classification, and structured data extraction from unstructured sources.

Benefits

  • GDPR-compliant operation through the EU Data Zone and processing within the EU Data Boundary.
  • Predictable performance via Provisioned Throughput Units (PTU) with guaranteed throughput.
  • 99.9% availability SLA for standard deployments and a financially backed endpoint.
  • Cost optimization by choosing the right deployment type, including Batch with around 50% savings.
  • Seamless integration into Azure: identity, monitoring, networking, and centralized governance.

Integration with innFactory

As a Microsoft Azure Partner, innFactory supports you with the architecture, rollout, and operation of Azure OpenAI in Foundry Models. We help you select the right models and deployment types, build GDPR-compliant architectures in the EU Data Zone, design RAG solutions with Azure AI Search, and optimize cost through PTU and Batch.

Contact us for a non-binding consultation on Azure OpenAI in Foundry Models and Microsoft Azure.

Available Tiers & Options

GPT-4.1 series

Strengths
  • Context window up to ~1M tokens
  • Cost-effective (mini, nano)
  • Function calling and structured outputs
Considerations
  • No reasoning mode

o-series (reasoning)

Strengths
  • o3 and o4-mini for deep analysis
  • Step-by-step reasoning
  • Strong at code and math
Considerations
  • Higher latency and cost

Typical Use Cases

Conversational AI, chatbots and copilots
Content generation and summarization
Code generation and developer assistance
Semantic search and embeddings (RAG)
Document analysis and data extraction

Technical Specifications

Content filtering Built-in Azure AI Content Safety filters
Context window Up to ~1M tokens (GPT-4.1), 400K tokens (GPT-5)
Data privacy No training on customer data
Deployment types Standard, Data Zone, Provisioned (PTU), Batch (50% discount), Developer
Fine tuning SFT, DPO and RFT for selected models (e.g. GPT-4.1, o4-mini, GPT-5)
Models GPT-5 series, GPT-4.1 series, o-series (o3, o4-mini), embeddings, gpt-image-1, Sora (preview), audio/realtime
Rate limits Tokens per minute (TPM) and requests per minute (RPM)

Frequently Asked Questions

Which models are available in Azure OpenAI in Foundry Models?

The current line-up includes the GPT-5 series (incl. mini, nano, pro), the GPT-4.1 series, the o-series reasoning models (o3, o4-mini), embeddings (text-embedding-3), image generation (gpt-image-1), and audio and realtime models. The model catalog is expanded continuously.

How is Azure OpenAI different from OpenAI's API?

Azure OpenAI offers the same models with added enterprise features: availability SLA, Azure security, VNET integration, managed identity, Azure Policy controls, and data residency in EU regions via the EU Data Zone.

Is my data used to train models?

No. Your prompts and completions are not used to train, retrain, or improve OpenAI or Microsoft models. Your data stays your data.

Is Azure OpenAI in Foundry Models GDPR compliant and EU-resident?

Yes. With Data Zone deployments in the EU, prompts and completions are processed only within the EU Data Boundary. This covers regions in countries such as Germany, France, the Netherlands, Sweden, and Switzerland. Microsoft also provides data processing agreements and comprehensive compliance certifications.

How does the pricing model work?

There are two billing models: pay-per-token usage (Standard, Data Zone) or reserved capacity via Provisioned Throughput Units (PTU) for predictable performance. For asynchronous bulk processing, Batch deployments offer around 50% cost savings.

What SLA applies to Azure OpenAI?

Standard deployments carry a financially backed availability of 99.9% per month for the inference endpoint. Provisioned deployments additionally provide guaranteed throughput and lower latency variance.

Microsoft Solutions Partner

innFactory is a Microsoft Solutions Partner. We provide expert consulting, implementation, and managed services for Azure.

Microsoft Solutions Partner Microsoft Data & AI

Similar Products from Other Clouds

Other cloud providers offer comparable services in this category. As a multi-cloud partner, we help you choose the right solution.

74 comparable products found across other clouds.

Ready to start with Azure OpenAI in Foundry Models?

Our certified Azure experts help you with architecture, integration, and optimization.

Schedule Consultation