Azure Data Explorer Heads Microsoft Cloud Analytics Updates -- Visual Studio Magazine

By David Ramel
03/04/2019

Microsoft has beefed up several data analytics offerings in its Azure cloud platform, headed by the general availability of Azure Data Explorer.

Azure Data Explorer supports real-time analytics of streaming data. The company says it can quickly execute queries on huge volumes of data emanating from applications, Web sites, devices connected to the Internet of Things (IoT) ecosystem and other sources.

Corporate exec Julia White said it was "useful to query streaming data to identify trends, detect anomalies and diagnose problems."

Microsoft last month announced the streaming analytics tool had reached general availability, along with Azure Data Lake Storage. Also announced was a preview of new Mapping Data Flow capabilities in Azure Data Factory.

Azure Data Explorer *(source: Microsoft).*

Azure Data Lake Storage, built on Azure Blob Storage, was described by White as "the first cloud storage that combines the best of hierarchical files system and blob storage." Blob storage is used to house unstructured object data, including text or binary data.
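
One way to see why pairing a hierarchical file system with blob storage matters for analytics is to compare how each model handles a directory rename, an operation that Hadoop-style jobs commonly perform when committing output. The following is a minimal Python sketch under stated assumptions: `FlatBlobStore` and `HierarchicalStore` are hypothetical in-memory toys written for this illustration, not Azure SDK types, and they model only the namespace, not real storage behavior.

```python
# Toy, in-memory models only. These classes are hypothetical illustrations
# and are not part of any Azure SDK.


class FlatBlobStore:
    """Flat namespace: each object has one full key; '/' is just a character."""

    def __init__(self):
        self.blobs = {}  # full key -> bytes

    def put(self, key, data):
        self.blobs[key] = data

    def rename_dir(self, old_prefix, new_prefix):
        """Renaming a 'directory' rewrites every blob that shares the prefix."""
        matching = [k for k in self.blobs if k.startswith(old_prefix)]
        for key in matching:
            new_key = new_prefix + key[len(old_prefix):]
            self.blobs[new_key] = self.blobs.pop(key)
        return len(matching)  # number of objects touched


class HierarchicalStore:
    """Hierarchical namespace: directories are real nodes holding children."""

    def __init__(self):
        self.root = {}  # name -> dict (directory) or bytes (file)

    def put(self, path, data):
        *dirs, name = path.strip("/").split("/")
        node = self.root
        for d in dirs:
            node = node.setdefault(d, {})
        node[name] = data

    def rename_dir(self, old_name, new_name):
        """Renaming a top-level directory re-links a single node."""
        self.root[new_name] = self.root.pop(old_name)
        return 1  # number of entries touched


flat, tree = FlatBlobStore(), HierarchicalStore()
for i in range(1000):
    flat.put(f"raw/2019/event-{i}.json", b"{}")
    tree.put(f"raw/2019/event-{i}.json", b"{}")

flat_touched = flat.rename_dir("raw/", "staged/")
tree_touched = tree.rename_dir("raw", "staged")
print(flat_touched, tree_touched)  # prints "1000 1"
```

In the flat model the cost of the rename grows with the number of objects under the prefix, while in the hierarchical model it is a single metadata update regardless of how many files the directory holds.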

"Azure Data Lake Storage (ADLS) combines the scalability, cost effectiveness, security model, and rich capabilities of Azure Blob Storage with a high-performance file system that is built for analytics and is compatible with the Hadoop Distributed File System," said Jurgen Willis, director of Product Management, Azure Engineering, in his own post. "Customers no longer have to tradeoff between cost effectiveness and performance when choosing a cloud data lake."

Microsoft also announced a preview of Mapping Data Flow, a visual, no-code way to build data transformations in Azure Data Factory, a hybrid data integration service used to orchestrate and automate both data movement and data transformation.

"With Mapping Data Flow in ADF, customers can visually design, build and manage data transformation processes without learning Spark or having a deep understanding of their distributed infrastructure," Willis said. "Mapping Data Flow combines a rich expression language with an interactive debugger to easily execute, trigger, and monitor ETL jobs and data integration processes."

White said the new capabilities complement Azure Data Factory's code-first experience, helping data engineers of all types collaborate and build powerful hybrid data transformation pipelines. These pipelines are made up of "activities" in Azure Data Factory that transform and process raw data so it can be used for predictions and business insights. The Big Data activities center on several Apache open source analytics projects, such as Hive, Pig, MapReduce and Spark, which run on Azure HDInsight clusters.

About the Author

David Ramel is an editor and writer at Converge 360.
