What are Data Lakes

  1. Collect
  2. Organize
  3. Analyze
  4. Infuse

What is a data lake ?

What are the benefits of data lake ?

Scalability : Data lakes offer massive scalability up to the exabyte scale. This is important because when creating a data lake you generally don’t know in advance the volume of data it will need to hold. Traditional data storage systems can’t scale in this way. Data lakes are based on the Hadoop framework, which is a framework that helps in the balanced processing of huge data sets across clusters of systems using simple models. It scales up from a single server to thousands, offering local computation and storage at each node. Hadoop supports huge clusters maintaining a constant price per execution bereft of scaling. To accommodate more one just has to plug in a new cluster.



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Abhishek Satish Gore

Abhishek Satish Gore


Freelancer content writer with a profound understanding of data engineering, big data, and cloud. drop a mail: abhishekgore0605@outlook.com for further details