News

Bigger Data Comes to Windows

The Hortonworks Data Platform for Windows works with Microsoft's Windows Azure HDInsight Service.

The next step in integrating Windows and big data was taken today, continuing Microsoft's push into expanded business intelligence capabilities.

Hortonworks Data Platform (HDP) for Windows was commercially released today, offering another way to run Apache Hadoop "big data" workloads.

The product was at the beta stage in February. While many Apache Hadoop workloads run on Linux servers, HDP for Windows offers native support for both Linux servers and Windows Server, with "a common user experience," according to Hortonworks' announcement. Moreover, Hortonworks claims that its platform is 100 percent open source, which isn't the case with some Hadoop implementations.

Microsoft collaborated with Hortonworks on the HDP for Windows product. Hadoop is an Apache open source project, largely fostered by Yahoo, with some of the Yahoo Hadoop team members later joining Hortonworks. So Microsoft's collaboration with Hortonworks will add support for organizations running Hadoop in mixed computing environments.

The collaboration also paves the way for Microsoft's big data business intelligence tools. Microsoft PowerPivot for Excel and Power View for SharePoint Services can both be used to display Hadoop query results. Hadoop is an open source framework for MapReduce, which supports scale-out data processing across clusters using piles of unstructured and structured data, allowing ad hoc queries to be run. So, in theory, Microsoft will make it easier to graph such data and gain insights. Microsoft worked with the Apache Software Foundation on the Open Database Connectivity (ODBC) driver for Hive, Hadoop's data warehouse system, to build support for its business intelligence tools.
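For readers new to the MapReduce model mentioned above, the following is a minimal single-machine sketch of its three steps (map, shuffle, reduce) applied to a word count. It is plain Python for illustration only and does not use Hadoop's API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and a real cluster would spread each step across many nodes.

```python
from collections import defaultdict


def map_phase(record):
    # Map step: emit a (word, 1) pair for every word in one input record.
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    # Shuffle step: group all emitted values by key, as the framework
    # would do between the map and reduce stages.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    # Reduce step: collapse the values for one key into a single total.
    return key, sum(values)


def word_count(records):
    # Run the three steps in sequence on one machine (illustrative only).
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Data on Windows"]))
```

Because each record is mapped independently and each key is reduced independently, both steps can in principle run in parallel on separate machines, which is what allows this style of processing to scale out.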

There's also a System Center integration effort with Apache Ambari, which enables System Center to manage Hadoop clusters alongside other computing assets. The Web-based Ambari tool is used to install, monitor and manage Apache Hadoop clusters.

Microsoft is also touting the ability of HDP for Windows to work with its own Windows Azure HDInsight Service. Supposedly, users of HDP for Windows can "migrate seamlessly" to Microsoft's cloud-based Windows Azure Hadoop implementation. Microsoft also has its own Hadoop implementation for Windows, which is called "Microsoft HDInsight Server for Windows."

Microsoft's Windows Azure HDInsight Service is currently in beta. It could be released this summer, according to a recent talk by expert Andrew Brust. He noted that Microsoft still needs to do some work with HDInsight to get the tooling up to speed for enterprise use.

Hortonworks' HDP for Windows 1.1 product can be downloaded at this page. It contains Hadoop components such as Pig, Hive and Sqoop, among others. HDP for Windows 1.1 runs on Windows Server 2008 or Windows Server 2012.

About the Author

Kurt Mackie is senior news producer for 1105 Media's Converge360 group.
