Onward and Upward

Microsoft, Hortonworks Combine to Bring Hadoop to Windows Server

Hortonworks has partnered with Microsoft to provide the first Hadoop implementation geared specifically toward Windows Server.

Called the Hortonworks Data Platform (HDP), it "enables organizations to capture, process and share data in any format and at scale," according to a Hortonworks press release.

HDP allows customers to use Hadoop on-premises or in the cloud, via Windows Azure. It's a 100 percent open-source project -- all code is made available to the Apache Software Foundation.

For Microsoft-focused developers, it should make creating Big Data apps easier. "Applications built on HDP for Windows should just work on Microsoft's HDInsight server and the Azure HDInsight service," wrote Herain Oberoi on the SQL Server Blog.

HDP is also completely interoperable between Windows and Linux, making it the industry's first Hadoop distribution available on both platforms, according to Hortonworks.

Hadoop is becoming more important to Microsoft. HDInsight was featured at last year's Build conference, Microsoft's main developer show. Although Big Data has grown mostly in the realm of Linux and Java, Redmond has been steadily increasing its presence in the market. Given that Windows Server has 73 percent of the market (according to Hortonworks, quoting IDC numbers), it makes sense to open up Windows to a larger segment of the development community.

Visual Studio Magazine columnist Andrew Brust nicely summed up the benefits of HDInsight in his December 2012 article:

"With HDInsight, developers can write MapReduce code in C# instead of Java, or use a LINQ provider to manipulate MapReduce indirectly through Hive. A NuGet package provides the C# MapReduce support, and a single-node developer version of HDInsight allows local debugging of such code in Visual Studio. A command-line utility provides deployment of the assembly to the local Hadoop instance. Deployment directly from Visual Studio to remote clusters, including the Windows Azure HDInsight implementation, seems a safe bet for future releases."
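For readers new to the model Brust describes, here is a rough idea of what a MapReduce job does, sketched as a word count in plain Python. This is not HDInsight code and doesn't use the C# NuGet package or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and everything runs in a single process rather than across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: collapse all values for one key into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over an iterable of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "Big apps"]))
```

On a real cluster, the framework would run the map and reduce steps on many machines and handle the grouping itself; the developer supplies only the two functions.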

The announcement continues to solidify two recent trends for Microsoft: Big Data integration and open-source collaboration. For instance, the recent revelation that Visual Studio and Team Foundation Service (TFS) will support Git source control demonstrated that Microsoft isn't just teasing when it says it's committed to the open-source community.

Hortonworks Data Platform 1.2 is available here.

Posted by Keith Ward on 02/28/2013

