James Serra's Blog

Big Data and Data Warehousing


Category Archives: Azure Data Lake

Using a data lakehouse

Posted on March 9, 2023 by James Serra | March 8, 2023

As I mentioned in my Data Mesh, Data Fabric, Data Lakehouse presentation, the data lakehouse architecture, where you use a data lake with delta lake as a software layer and skip using a relational data warehouse, is becoming more and … Continue reading →

Posted in Azure Data Lake, Data Lake, SQLServerPedia Syndication | 1 Reply

When to have multiple data lakes

Posted on January 23, 2023 by James Serra | March 6, 2023

A question customers frequently ask me when discussing data lake architecture is “Should I use one data lake for all my data, or multiple lakes?” Ideally, you would use just one data lake, but I have seen many … Continue reading →

Posted in Azure Data Lake, Data Lake, SQLServerPedia Syndication | 3 Replies

Data Fabric defined

Posted on June 9, 2021 by James Serra | July 21, 2021

Another buzzword that you may have been hearing a lot about lately is Data Fabric. In short, a data fabric is a single environment consisting of a unified architecture, with services and technologies running on that architecture, that helps … Continue reading →

Posted in Azure Data Lake, Data Lake, Data Mesh, Data warehouse, SQLServerPedia Syndication | 58 Replies

Data Mesh defined

Posted on February 16, 2021 by James Serra | September 2, 2021

The two latest trends in emerging data platform architectures are the Data Lakehouse (the subject of my last blog Data Lakehouse defined), and the Data Mesh, the subject of this blog. Data Mesh was first introduced by ThoughtWorks via the … Continue reading →

Posted in Azure Data Lake, Azure Synapse Analytics/SQL DW, Data Lake, Data Mesh, Data warehouse, SQLServerPedia Syndication | 29 Replies

Data Lakehouse defined

Posted on January 25, 2021 by James Serra | June 22, 2021

As a follow-up to my blog Data Lakehouse & Synapse, I wanted to talk about the various definitions I am seeing of what a data lakehouse is, including a recent paper by Databricks. Databricks uses the term “Lakehouse” in their … Continue reading →

Posted in Azure Data Lake, Azure Databricks, Azure Synapse Analytics/SQL DW, Data Lake, SQLServerPedia Syndication | 10 Replies

External tables vs T-SQL views on files in a data lake

Posted on November 3, 2020 by James Serra | January 12, 2022

A question that I have been hearing recently from customers using Azure Synapse Analytics (the public preview version) is: what is the difference between using an external table and a T-SQL view on a file in a data lake? Note … Continue reading →

Posted in Azure Data Lake, Azure Synapse Analytics/SQL DW, SQLServerPedia Syndication | 6 Replies

Synapse and federated queries

Posted on October 15, 2020 by James Serra | November 10, 2020

As I was finishing up a demo script for my presentation at the SQL PASS Virtual Summit on 11/13 (details on my session here), I wanted to blog about part of the demo that shows a feature in the public … Continue reading →

Posted in Azure Cosmos DB, Azure Data Lake, Azure Synapse Analytics/SQL DW, SQLServerPedia Syndication | 6 Replies

Data Lakehouse & Synapse

Posted on September 10, 2020 by James Serra | June 22, 2021

I am starting to see this relatively new phrase, “Data Lakehouse”, being used in the data platform world. It’s the combination of “Data Lake” and “Data Warehouse”. In this post I’ll give my thoughts on it, and how the next … Continue reading →

Posted in Azure Data Lake, Azure Synapse Analytics/SQL DW, Delta Lake, SQLServerPedia Syndication | 18 Replies

More ADLS Gen2 enhancements

Posted on July 6, 2020 by James Serra | January 30, 2021

On the heels of More Azure Blob Storage enhancements come more enhancements for Azure Data Lake Store Gen2 (ADLS Gen2): Archive tier is now GA: The archive tier provides an ultra-low cost tier for long term retention of data while … Continue reading →

Posted in Azure Data Lake, SQLServerPedia Syndication | Leave a reply

Upload files to ADLS Gen2

Posted on June 22, 2020 by James Serra | June 21, 2020

I discovered a small new feature the other day, but a very useful one. Previously, there was no way to upload files to an ADLS Gen2 storage account via the Azure portal. You had to use Azure Storage Explorer – … Continue reading →

Posted in Azure Data Lake, SQLServerPedia Syndication | 2 Replies

Query Acceleration for ADLS

Posted on May 4, 2020 by James Serra | May 18, 2020

Just announced is Query Acceleration for Azure Data Lake Storage Gen2 (ADLS) as well as Blob Storage. This is a new capability for ADLS that enables applications and analytics frameworks to dramatically optimize data processing by retrieving only the data … Continue reading →

Posted in Azure Data Lake, Blob Storage, SQLServerPedia Syndication | 5 Replies

Azure Storage tips

Posted on January 16, 2020 by James Serra | June 20, 2020

As a follow-up to my blog post Azure Data Lake Store Gen2 is GA, I wanted to give some pointers when using ADLS Gen2 as well as blob storage, as it can get a bit confusing with all the options … Continue reading →

Posted in Azure Data Lake, Blob Storage, SQLServerPedia Syndication | Leave a reply

Ways to access data in ADLS Gen2

Posted on September 16, 2019 by James Serra | March 9, 2020

With data lakes becoming popular, and Azure Data Lake Store (ADLS) Gen2 being used for many of them, a common question I am asked is “How can I access data in ADLS Gen2 instead of a copy of the … Continue reading →

Posted in Azure Data Factory, Azure Data Lake, Azure Databricks, HDInsight, PolyBase, Power BI, SQLServerPedia Syndication | 5 Replies

Common Data Model

Posted on June 6, 2019 by James Serra | February 1, 2022

The Common Data Model (CDM) is a shared data model that is a place to keep all common data to be shared between applications and data sources. Another way to think of it is as a way to organize data … Continue reading →

Posted in Azure Data Lake, SQLServerPedia Syndication | 1 Reply

Where should I clean my data?

Posted on April 25, 2019 by James Serra | January 30, 2021

As a follow-up to my blogs What product to use to transform my data? and Should I load structured data into my data lake?, I wanted to talk about where you should clean your data when building a modern data warehouse … Continue reading →

Posted in Azure Data Factory, Azure Data Lake, Azure Databricks, Azure Synapse Analytics/SQL DW, PolyBase, Power BI, SQLServerPedia Syndication, SSIS | 10 Replies

Azure Data Lake Store Gen2 is GA

Posted on February 19, 2019 by James Serra | March 10, 2019

Azure Data Lake Store (ADLS) Gen2 was made generally available on February 7th.  In short, ADLS Gen2 is the best of the previous version of ADLS (now called ADLS Gen1) and Azure Blob Storage.  ADLS Gen2 is built on Blob storage … Continue reading →

Posted in Azure Data Lake, SQLServerPedia Syndication | 4 Replies

Azure Data Lake Store Gen2

Posted on June 28, 2018 by James Serra | June 28, 2018

Big news!  The next generation of Azure Data Lake Store (ADLS) has arrived.  See the official announcement. In short, ADLS Gen2 is the combination of the current ADLS (now called Gen1) and Blob storage.  Gen2 is built on Blob storage.  By … Continue reading →

Posted in Azure Data Lake, SQLServerPedia Syndication | 3 Replies

Use cases of various products for a big data cloud solution

Posted on October 11, 2017 by James Serra | June 12, 2020

There is a tremendous number of cloud-based Microsoft products for building big data solutions. It’s great that there are so many products to choose from, but it does lead to confusion on what are the best products to … Continue reading →

Posted in Azure Analysis Services, Azure Cosmos DB, Azure Data Lake, Azure Synapse Analytics/SQL DW, HDInsight, SQLServerPedia Syndication | 3 Replies

Data lake details

Posted on June 29, 2017 by James Serra | July 20, 2022

I have blogged before about data lakes (see What is a data lake? and Why use a data lake?), and wanted to provide more details on this popular technology, some of which I cover in my presentation “Big data architectures and the data lake”. … Continue reading →

Posted in Azure Data Lake, Data Lake, SQLServerPedia Syndication | 3 Replies

U-SQL Defined

Posted on March 23, 2017 by James Serra | March 23, 2017

Azure Data Lake Analytics (ADLA) is a distributed analytics service built on Apache YARN that allows developers to be productive immediately on big data.  This is accomplished by submitting a job to the service where the service will automatically run it … Continue reading →

Posted in Azure Data Lake, PolyBase, SQLServerPedia Syndication | 2 Replies

About James Serra

I work at Microsoft as a big data and data warehousing solution architect, where I have been for most of the last ten years. Prior to that I was an independent consultant working as a Data Warehouse/Business Intelligence architect and developer. I am a prior SQL Server MVP with over 35 years of IT experience. The views and opinions on this blog are mine and not those of Microsoft. Check out my book Deciphering Data Architectures: Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh.



©2025 - James Serra's Blog - Weaver Xtreme Theme