Data Lake and Data Warehouse merging

Data Lake and Data Warehouse merging

Data Lake and Data Warehouse merging

Another one trend is data lake and data warehouse combine that promotes data stack simplification. Until recent times data lake and data warehouse subsist separately. Both objects are intended to data holding. But they are not synonymous and there is a principal difference between them.

The first object is a repository for a big volume of raw data in its original form from different sources. Data can be of different types: structured, semi-structured and unstructured. Data lake is characterized by high data flexibility and availability and a big choice oh machine learning usage.

The second object is also a repository for a big volume of data. But in this case data runs processing and gets into the storage already structured strictly regulated ways. Data warehouse is characterized by less flexibility, fixed configuration and transactional analytics and BI support.

Wishing to get the best of both sides, organizations try to combine 2 variants. As a result, they have both data lake and data warehouse (sometimes several with many parallel pipelines). Today’s data storage solution providers offer more such possibilities. For example, Snowflake – its platform allows to connect data warehouse and data lake; Microsoft Synapse – its cloud warehouse has integrated capabilities of data lake.

Previous post #maindatainsfrastucturetrends 

Comments

1
  1. […] Попередній пост #maindatainfrastructuretrends  […]

Reply to Роль аналітиків даних зростає – Datalabs Cancel

Email will not be published. Required: *

0 / 1500


Previous Post Next Post

Related posts

Why Your Qlik Deployments Keep Breaking

Every Qlik team has a deployment horror story. Maybe it was the app launch load script bug that decided to release an app to production with a broken ...

Read more

Qlik Deployment Best Practices: From Manual Chaos to Reliable Releases

Are you the type of person who deploys Qlik apps by simply exporting a QVF, renaming it, and then importing it to your target environment? If so you&#...

Read more

The Rumsfeld Matrix as an effective tool in the decision-making process

During a briefing on the Iraq War, Donald Rumsfeld divided information into 4 categories: known known, known unknown, unknown known, unknown unknown. ...

Read more
GoUp Chat