What is Amazon DataZone?
Amazon DataZone is a data management service that helps organizations catalog, discover, and securely share their data assets. The service provides a central business data portal where employees can find relevant data and use it for their analyses.
DataZone addresses one of the biggest challenges in modern data management: the gap between available data and the teams that need it. Through automated metadata collection, unified governance policies, and self-service access, DataZone democratizes data access across the organization.
The service integrates seamlessly with existing AWS data services such as Redshift, Athena, Glue, and S3. Data producers can publish assets to a catalog, while consumers search for relevant data through a portal and request access.
Core Features
- Business Data Portal: Central portal where users can discover, understand, and request data for analytics
- Automatic Metadata Collection: Automatically crawls and catalogs metadata from connected data sources
- Governance Workflows: Structured approval workflows for data access requests with audit trail
- Domains and Projects: Organizational model for logical grouping of data and teams
- Data Quality Integration: Integration of data quality metrics directly into the catalog
Typical Use Cases
Enterprise Data Catalog: Large organizations use DataZone to build a central, searchable catalog of all data assets. Employees find relevant data without IT support and understand context through business metadata.
Data Governance: Compliance teams use DataZone to enforce unified governance policies across all data sources. Every access is documented, and sensitive data is automatically classified.
Self-Service Analytics: Analysts and data scientists use the DataZone portal to independently find relevant datasets, request access, and load them into their analytics environments.
Benefits
- Central data catalog reduces silos and duplicate work
- Self-service access accelerates data-driven decisions
- Automated governance ensures compliance
- Seamless integration with the existing AWS data ecosystem
Integration with innFactory
As an AWS Reseller, innFactory supports you with Amazon DataZone: from planning data governance strategy and setting up domains and projects to integrating existing data sources and training business departments.
Typical Use Cases
Frequently Asked Questions
What is Amazon DataZone?
Amazon DataZone is a data management service that enables you to catalog, discover, share, and govern data across your organization. It provides a business data portal for self-service access to curated data assets.
What data sources does DataZone support?
DataZone natively integrates with AWS data sources such as Amazon Redshift, Amazon Athena, AWS Glue, and Amazon S3. External data sources can also be connected through custom connectors.
How does access control work in DataZone?
DataZone uses a concept of domains and projects to organize data access. Data producers publish assets to a catalog, and data consumers can request access through subscriptions that are approved by data owners.