HomeData LakeData lake architecture

Comments

Data lake architecture — 6 Comments

  1. Great article to help people with all the data lake synonyms in use.. Raw layer = staging = bronze = landing zone.. etc. nice overview of what to think about for each data lake.

  2. Hi James,
    I’m not confortable with Delta Format to support DWH workload in terms of concurrency and performance.
    By design delta log is based on parquet files. Until vaccuum, read and write are a mixed of processing this log parquet files and other organized parquet files.
    Imagine read or write process need to parse huge among of text/json data.
    Also when vaccum hapen you loose time travel over your delta table.
    My sentiment is that Delta Format never meet performance and concurrency needed.

  3. Pingback:Serving layers with a data lake – SQLServerCentral

Leave a Reply

Your email address will not be published. Required fields are marked *

HTML tags allowed in your comment: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>