Azure Data Explorer Heads Microsoft Cloud Analytics Updates

Microsoft has beefed up several data analytics offerings in its Azure cloud platform, headed by the general availability of Azure Data Explorer.

Azure Data Explorer is designed for real-time analytics on streaming data. The company says it can quickly run queries over huge volumes of data coming from applications, Web sites, Internet of Things (IoT) devices and similar sources.

Corporate exec Julia White said it was "useful to query streaming data to identify trends, detect anomalies and diagnose problems."

Microsoft last month announced the streaming analytics tool had reached general availability, along with Azure Data Lake Storage. Also announced was a preview of new Mapping Data Flow capabilities in Azure Data Factory.

Azure Data Explorer (source: Microsoft).

Azure Data Lake Storage, built on Azure Blob Storage, was described by White as "the first cloud storage that combines the best of hierarchical files system and blob storage." Blob storage is used to house unstructured object data, including text or binary data.

"Azure Data Lake Storage (ADLS) combines the scalability, cost effectiveness, security model, and rich capabilities of Azure Blob Storage with a high-performance file system that is built for analytics and is compatible with the Hadoop Distributed File System," said Jurgen Willis, director of Product Management, Azure Engineering, in his own post. "Customers no longer have to tradeoff between cost effectiveness and performance when choosing a cloud data lake."
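To illustrate why pairing a hierarchical file system with blob storage can matter for analytics workloads, here is a minimal Python sketch. It is a toy model only: it does not use the Azure SDK and does not describe how ADLS is actually implemented. The function names and sample paths are hypothetical. It compares renaming a "directory" in a flat key/value store, where every matching object key has to be rewritten, with renaming a directory in a tree, where a single entry changes.

```python
# Toy model only: not the Azure SDK, and not how ADLS is implemented.
# All names and paths here are hypothetical, chosen for illustration.


def rename_dir_flat(store, old, new):
    """Rename a 'directory' in a flat key/value store.

    Directories exist only as key prefixes, so every key under the
    prefix must be rewritten. Returns the number of keys rewritten.
    """
    old_prefix = old.rstrip("/") + "/"
    new_prefix = new.rstrip("/") + "/"
    # Collect matching keys first so the dict is not mutated while iterating.
    moved = [key for key in store if key.startswith(old_prefix)]
    for key in moved:
        store[new_prefix + key[len(old_prefix):]] = store.pop(key)
    return len(moved)


def rename_dir_tree(root, old, new):
    """Rename a top-level directory in a nested-dict tree.

    Only the parent's entry changes, however many files sit below it.
    Returns the number of entries touched.
    """
    root[new] = root.pop(old)
    return 1


flat = {
    "logs/2019/01.csv": b"a",
    "logs/2019/02.csv": b"b",
    "raw/data.bin": b"c",
}
tree = {
    "logs": {"2019": {"01.csv": b"a", "02.csv": b"b"}},
    "raw": {"data.bin": b"c"},
}

flat_cost = rename_dir_flat(flat, "logs", "archive")
tree_cost = rename_dir_tree(tree, "logs", "archive")
print(flat_cost, tree_cost)
```

In the flat model the cost of the rename grows with the number of objects under the prefix, while in the tree model it stays constant. Analytics jobs that write to a temporary directory and then rename it into place are the kind of workload where that difference tends to show up.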

Microsoft also announced Mapping Data Flow, a visual, no-code way to build data transformations in Azure Data Factory, its hybrid data integration service for orchestrating and automating data movement and transformation.

"With Mapping Data Flow in ADF, customers can visually design, build and manage data transformation processes without learning Spark or having a deep understanding of their distributed infrastructure," Willis said. "Mapping Data Flow combines a rich expression language with an interactive debugger to easily execute, trigger, and monitor ETL jobs and data integration processes."

White said the new capabilities complement Azure Data Factory's code-first experience, helping data engineers of all types collaborate and build powerful hybrid data transformation pipelines. These pipelines group "activities" in Azure Data Factory that transform and process raw data so it can be used for predictions and business insights. Many of these Big Data activities are built around Apache open source analytics projects, such as Hive, Pig, MapReduce and Spark, running on Azure HDInsight.

About the Author

David Ramel is an editor and writer for Converge360.
