Azure OpenAI in Foundry Models provides API access to current OpenAI models, including the GPT-5 series, GPT-4.1, the o-series reasoning models, embeddings, and image, audio, and realtime models. You use these models with the security, compliance, and regional availability of Microsoft Azure.
What is Azure OpenAI in Foundry Models?
Azure OpenAI in Foundry Models is part of Microsoft Foundry (Azure AI Foundry) and delivers powerful OpenAI models as a fully managed cloud service. You access reasoning, chat, embedding, image, and audio models through a unified API, without operating your own infrastructure.
The service combines OpenAI’s models with the Azure ecosystem: unified billing, governance, identity integration, and network security. This lets you build production AI applications that meet enterprise requirements for security and data protection.
For companies with GDPR requirements, operation within the EU Data Boundary is possible. With Data Zone deployments, processing of prompts and completions stays exclusively in EU regions.
Core Features
- Current model line-up: GPT-5 series, GPT-4.1 series, o-series reasoning models (o3, o4-mini), embeddings, image generation (gpt-image-1), and audio and realtime models.
- Large context windows: up to around 1M tokens with GPT-4.1 and 400K tokens with the GPT-5 series for large documents and long conversations.
- Flexible deployment types: Standard and Data Zone (pay-per-token), Provisioned (reserved PTU capacity), and Batch for asynchronous bulk processing.
- Enterprise security: VNET integration, Private Link, managed identity, Azure Policy controls, and built-in content safety filters.
- Data residency and compliance: EU Data Zone within the EU Data Boundary, with no use of customer data for model training.
- Extensibility: function calling, structured outputs, the Responses API, fine-tuning (SFT, DPO, RFT), and integration with Azure AI Search for retrieval augmented generation.
Typical Use Cases
- Conversational AI, chatbots, and internal copilots for knowledge work and support.
- Content generation, translation, and summarization across large document sets.
- Code generation and developer assistance within existing workflows.
- Semantic search and RAG scenarios using embeddings and Azure AI Search.
- Document analysis, classification, and structured data extraction from unstructured sources.
Benefits
- GDPR-compliant operation through the EU Data Zone and processing within the EU Data Boundary.
- Predictable performance via Provisioned Throughput Units (PTU) with guaranteed throughput.
- 99.9% availability SLA for standard deployments and a financially backed endpoint.
- Cost optimization by choosing the right deployment type, including Batch with around 50% savings.
- Seamless integration into Azure: identity, monitoring, networking, and centralized governance.
Integration with innFactory
As a Microsoft Azure Partner, innFactory supports you with the architecture, rollout, and operation of Azure OpenAI in Foundry Models. We help you select the right models and deployment types, build GDPR-compliant architectures in the EU Data Zone, design RAG solutions with Azure AI Search, and optimize cost through PTU and Batch.
Contact us for a non-binding consultation on Azure OpenAI in Foundry Models and Microsoft Azure.
Available Tiers & Options
GPT-5 series
- Reasoning models for complex tasks
- Text and image processing
- Up to 400K token context
- Variants: gpt-5, mini, nano, pro
- Higher cost with heavy reasoning
GPT-4.1 series
- Context window up to ~1M tokens
- Cost-effective (mini, nano)
- Function calling and structured outputs
- No reasoning mode
o-series (reasoning)
- o3 and o4-mini for deep analysis
- Step-by-step reasoning
- Strong at code and math
- Higher latency and cost
Typical Use Cases
Technical Specifications
Frequently Asked Questions
Which models are available in Azure OpenAI in Foundry Models?
The current line-up includes the GPT-5 series (incl. mini, nano, pro), the GPT-4.1 series, the o-series reasoning models (o3, o4-mini), embeddings (text-embedding-3), image generation (gpt-image-1), and audio and realtime models. The model catalog is expanded continuously.
How is Azure OpenAI different from OpenAI's API?
Azure OpenAI offers the same models with added enterprise features: availability SLA, Azure security, VNET integration, managed identity, Azure Policy controls, and data residency in EU regions via the EU Data Zone.
Is my data used to train models?
No. Your prompts and completions are not used to train, retrain, or improve OpenAI or Microsoft models. Your data stays your data.
Is Azure OpenAI in Foundry Models GDPR compliant and EU-resident?
Yes. With Data Zone deployments in the EU, prompts and completions are processed only within the EU Data Boundary. This covers regions in countries such as Germany, France, the Netherlands, Sweden, and Switzerland. Microsoft also provides data processing agreements and comprehensive compliance certifications.
How does the pricing model work?
There are two billing models: pay-per-token usage (Standard, Data Zone) or reserved capacity via Provisioned Throughput Units (PTU) for predictable performance. For asynchronous bulk processing, Batch deployments offer around 50% cost savings.
What SLA applies to Azure OpenAI?
Standard deployments carry a financially backed availability of 99.9% per month for the inference endpoint. Provisioned deployments additionally provide guaranteed throughput and lower latency variance.
