Market Research Logo

IDC PeerScape: Practices for Building a Data Lake on AWS

IDC PeerScape: Practices for Building a Data Lake on AWS

This IDC PeerScape explores how data architects are building and deploying a high-performing, cost-efficient, and secure data lake on AWS, where different teams within their organization can publish and consume data in a self-service manner."Data lakes are proving to be highly useful data management architectures for deriving value in the DX era, when deployed appropriately," said Ritu Jyoti, program vice president, IDC's Infrastructure Platforms and Technology. "With data being increasingly distributed across on-premises and the public cloud, organizations could tap into public cloud solutions and associated industry best practices to build and deploy a high-performing, secure, cost-effective integrated data lake; get timely access to data; generate insights; and unlock a new world of business opportunities."


IDC PeerScape Figure
Executive Summary
Peer Insights
Practice 1: Always Use Large Files and Columnar Data Formats
Challenge
Example
Guidance
Practice 2: Use EC2 Spot Instances and Adopt S3 Lifecycle Management
Challenge
Example
Guidance
Practice 3: Use Encryption and KMS for Security; Use Redacted or Anonymization for PII Data to Ensure Privacy
Challenge
Example
Guidance
LEARN MORE
Related Research

Download our eBook: How to Succeed Using Market Research

Learn how to effectively navigate the market research process to help guide your organization on the journey to success.

Download eBook

Share this report