Reputation: 43
I am new to azure world. I am going through blob storage and adls2. I observed that apart from space constraint and herieracial namespace, everything what adls2 can do , same can be done through blob.
But still people recommend adls2 for analytical workload. Please advise what are things that can be done through adls2 but not possible through blob storage(apart from space and herieracial namespace).
Upvotes: 1
Views: 2232
Reputation: 3240
Data Lake Storage Gen2 makes Azure Storage the foundation for building enterprise data lakes on Azure. Designed from the start to service multiple petabytes of information while sustaining hundreds of gigabits of throughput, Data Lake Storage Gen2 allows you to easily manage massive amounts of data.
Data Lake Storage Gen2 builds on Blob storage and enhances performance, management, and security in the following ways:
Performance is optimized because you do not need to copy or transform data as a prerequisite for analysis. Compared to the flat namespace on Blob storage, the hierarchical namespace greatly improves the performance of directory management operations, which improves overall job performance.
Management is easier because you can organize and manipulate files through directories and subdirectories.
Security is enforceable because you can define POSIX permissions on directories or individual files.
Also, Data Lake Storage Gen2 is very cost effective because it is built on top of the low-cost Azure Blob Storage. The additional features further lower the total cost of ownership for running big data analytics on Azure.
No limits on account sizes, file sizes, or number of files in ADLS
For more information refer this article by Ashish Patel
Upvotes: 2