STACKIT Dremio is a fully managed Data Lakehouse platform based on the Apache Dremio open-source project. The service enables SQL queries on heterogeneous data sources such as Object Storage (S3-compatible), file systems, and relational databases — without data movement. As part of the STACKIT platform, Dremio meets all GDPR requirements and runs exclusively in German data centers.
Features
- Apache Arrow Flight: High-performance data queries with columnar in-memory processing
- Data Virtualization: Query data without prior ETL movement
- Automatic Reflections: Materializing caches for accelerated repeat queries
- Sovereign Data Storage: Complete data sovereignty in German data centers
- Integration: Connectivity to STACKIT Object Storage, PostgreSQL Flex, and other STACKIT services
Typical Use Cases
Data Lakehouse: Organizations store raw data in STACKIT Object Storage and run SQL analytics directly on that data — without expensive ETL pipelines or data duplication.
Self-Service Analytics: Business teams can access distributed data sources using familiar SQL tools without relying on IT support.
Benefits
- GDPR-compliant: All data remains in German data centers
- No lock-in: Based on Apache Dremio Open Source
- Cost-efficient: No data movement reduces storage and transfer costs
- Scalable: Horizontal scaling for large data volumes
Integration with innFactory
As an official STACKIT partner, innFactory supports you with Dremio: data source connectivity, performance optimization with Reflections, and integration into existing data platform architectures.
