Category Archives: PolyBase

U-SQL Defined

Azure Data Lake Analytics (ADLA) is a distributed analytics service built on Apache YARN that allows developers to be productive immediately on big data.  You submit a job to the service, and the service will automatically run it … Continue reading

Posted in Azure Data Lake, PolyBase, SQLServerPedia Syndication | 2 Comments

PolyBase use cases clarified

I previously talked about PolyBase and its enhancements (see PASS Summit Announcements: PolyBase enhancements).  There is some confusion about PolyBase use cases, as they differ depending on whether you are using PolyBase with Azure SQL Data Warehouse (SQL DW) or … Continue reading

Posted in PolyBase, SQLServerPedia Syndication | 3 Comments

Power BI and Excel options for Hadoop

Below I have attempted to list the various options for reporting off of Hadoop (HDInsight, HDP, Cloudera) using Power BI Desktop and Excel.  Some of the data sources prompt you to choose the Data Connectivity mode of either Import or … Continue reading

Posted in Excel, Hadoop, PolyBase, Power BI, SQLServerPedia Syndication | Comments Off

PASS Summit Announcements: PolyBase enhancements

Microsoft usually has some interesting announcements at the PASS Summit, and this year was no exception.  I’m writing a set of blogs covering the major announcements.  Next up are the PolyBase enhancements. PolyBase is a technology that accesses and combines both … Continue reading

Posted in PolyBase, SQLServerPedia Syndication | 2 Comments

Copying data from Azure Blob Storage

In a previous blog I talked about copying on-prem data to Azure Blob Storage (Getting data into Azure Blob Storage).  Let’s say you have copied the data and it is sitting in Azure Blob Storage (or an Azure Data Lake) … Continue reading

Posted in Azure, Azure SQL Database, Azure SQL DW, Data Lake, PolyBase, SQLServerPedia Syndication | Comments Off

Getting data into Azure Blob Storage

If you have on-prem data and want to copy it to Azure Blob Storage in the cloud, what are all the possible ways to do it?  There are many, and here is a quick review of them: AzCopy: A popular command-line … Continue reading

Posted in Azure, Azure SQL DW, Data Lake, PolyBase, SQLServerPedia Syndication | 2 Comments

Azure SQL Data Warehouse

Analytics Platform System (APS) is Microsoft’s massively parallel processing (MPP) data warehouse technology.  This has only been available as an on-prem solution (see video Overview of Microsoft Analytics Platform System).  Until now.  At the recent Microsoft Build Developer Conference, Executive Vice … Continue reading

Posted in Azure SQL DW, Data warehouse, PDW/APS, PolyBase, SQLServerPedia Syndication | 14 Comments

Hadoop and Data Warehouses

I see a lot of confusion when it comes to Hadoop and its role in a data warehouse solution.  Hadoop should not be a replacement for a data warehouse, but rather should augment/complement a data warehouse.  Hadoop and a data warehouse … Continue reading

Posted in Data Lake, Data warehouse, Hadoop, PDW/APS, PolyBase, SQLServerPedia Syndication | 5 Comments

PolyBase explained

PolyBase is a new technology that integrates Microsoft’s MPP product, SQL Server Parallel Data Warehouse (PDW), with Hadoop.  It is designed to enable queries across relational data stored in PDW and in non-relational Hadoop data that is stored in the Hadoop Distributed File … Continue reading

Posted in Hadoop, PDW/APS, PolyBase, SQLServerPedia Syndication | 26 Comments