Onward and Upward

Microsoft, Hortonworks Combine to Bring Hadoop to Windows Server

Hortonworks has partnered with Microsoft to provide the first Hadoop implementation geared specifically toward Windows Server.

Called the Hortonworks Data Platform (HDP), it "enables organizations to capture, process and share data in any format and at scale," according to a Hortonworks press release.

HDP allows customers to use Hadoop on-premises or in the cloud, via Windows Azure. It's a 100 percent open-source project -- all code is made available to the Apache Software Foundation.

For Microsoft-focused developers, it should make creating Big Data apps easier. "Applications built on HDP for Windows should just work on Microsoft's HDInsight server and the Azure HDInsight service," wrote Herain Oberoi on the SQL Server Blog.  

HDP is also completely interoperable between Windows and Linux, making it the industry's first Hadoop distribution available on both platforms, according to Hortonworks.

Hadoop is becoming more important to Microsoft. HDInsight was featured at last year's Build conference, Microsoft's main developer show. Although Big Data has grown mostly in the realm of Linux and Java, Redmond has been steadily increasing its presence in the market. And considering that Windows Server has 73 percent of the market (according to Hortonworks, quoting IDC numbers), it makes sense to open up Windows to a larger segment of the development community.

Visual Studio Magazine columnist Andrew Brust nicely summed up the benefits of HDInsight in his December 2012 article:

"With HDInsight, developers can write MapReduce code in C# instead of Java, or use a LINQ provider to manipulate MapReduce indirectly through Hive. A NuGet package provides the C# MapReduce support, and a single-node developer version of HDInsight allows local debugging of such code in Visual Studio. A command-line utility provides deployment of the assembly to the local Hadoop instance. Deployment directly from Visual Studio to remote clusters, including the Windows Azure HDInsight implementation, seems a safe bet for future releases."

The announcement continues to solidify two recent trends for Microsoft: Big Data integration and open-source collaboration. For instance, the recent revelation that Visual Studio and Team Foundation Service (TFS) will support Git source control demonstrated that Microsoft isn't just teasing when it says it's committed to the open-source community.

Hortonworks Data Platform 1.2 is available here.

Posted by Keith Ward on 02/28/2013 at 1:15 PM
