News

Syncfusion Big Data Platform Now Available

The Windows-based Big Data tools and frameworks suite adds support for Apache Spark, Apache HBase and Scientific Python.

Syncfusion Big Data Platform, described as "a collection of Big Data tools and frameworks" that features simplified installers and visual development tools designed to shorten the Big Data learning curve, has finally crossed over from preview to production. It adds support for Apache Spark, Apache HBase and Scientific Python. Syncfusion claims it's the singular Apache Hadoop-based distribution designed for Windows.

That learning curve for Windows-centric developers was complicated by the extra overhead of messing with Linux virtual machines (VMs) and command-line tools, said Syncfusion, which offers its Windows-native approach running on commodity hardware to eliminate that additional complexity. In fact, the company promises to get developers up and running with a Hadoop cluster within 15 minutes.

With the company's Windows-first focus, developers can run Hadoop jobs and access Apache Hive data with Microsoft's C# programming language, though more "traditional" Big Data languages such as Java, Pig, Hive, Python and Scala are also supported.

As part of the platform, the Syncfusion Big Data Studio gives developers an easy-to-use environment for working with Big Data software such as Pig and Hive and accessing the Hadoop Distributed File System (HDFS).

"The Big Data Studio ships with a local install of the Syncfusion Big Data SDK, which provides a complete working Hadoop distribution right on your laptop," the company said. "No virtual machines are needed, so there is no need to juggle between Linux and Windows. You don't even have to be connected to a cluster to work on Hadoop jobs. You can work with Hadoop on your Windows machine, even when offline, and then deploy to a cluster for production when you are ready."

The Research Triangle Park, N.C., company listed the following enhancements to the platform, in addition to the new support for Spark, HBase and Scientific Python:

  • Direct support for managing Oozie jobs.
  • Improved integration with Syncfusion's machine learning runtime.
  • Support to create and manage pseudo-node Hadoop clusters.
  • Enhanced HDFS file browser usability.

Along with on-premises installations, Syncfusion said users can run their own Hadoop clusters on VMs supplied by cloud service providers such as Microsoft Azure and Amazon Web Services (AWS), with customization functionality not found in other cloud-based Hadoop services. Also, the Microsoft partner said, its platform is 100 percent compatible with Azure HDInsight, Microsoft's cloud implementation of Hadoop.

"We are very excited to declare the Syncfusion Big Data Platform a comprehensive, stand-alone Big Data solution for live production environments," said exec Daniel Jebaraj in a statement yesterday. "Its robust feature set and wide-ranging support for tools like Apache Spark, HBase, Pig and Hive make it a vital component for Big Data computing on Windows. Furthermore, the numerous enhancements that have led to this release make it a great option for on-premise and cloud-based cluster deployment."

The platform, which has been in a preview period that started with a September 2014 beta, is no longer totally free, but a free community license is available for those who qualify. The company invited developers to contact it for full pricing details.

About the Author

David Ramel is an editor and writer at Converge 360.

comments powered by Disqus

Featured

  • Hands On: New VS Code Insiders Build Creates Web Page from Image in Seconds

    New Vision support with GitHub Copilot in the latest Visual Studio Code Insiders build takes a user-supplied mockup image and creates a web page from it in seconds, handling all the HTML and CSS.

  • Naive Bayes Regression Using C#

    Dr. James McCaffrey from Microsoft Research presents a complete end-to-end demonstration of the naive Bayes regression technique, where the goal is to predict a single numeric value. Compared to other machine learning regression techniques, naive Bayes regression is usually less accurate, but is simple, easy to implement and customize, works on both large and small datasets, is highly interpretable, and doesn't require tuning any hyperparameters.

  • VS Code Copilot Previews New GPT-4o AI Code Completion Model

    The 4o upgrade includes additional training on more than 275,000 high-quality public repositories in over 30 popular programming languages, said Microsoft-owned GitHub, which created the original "AI pair programmer" years ago.

  • Microsoft's Rust Embrace Continues with Azure SDK Beta

    "Rust's strong type system and ownership model help prevent common programming errors such as null pointer dereferencing and buffer overflows, leading to more secure and stable code."

  • Xcode IDE from Microsoft Archrival Apple Gets Copilot AI

    Just after expanding the reach of its Copilot AI coding assistant to the open-source Eclipse IDE, Microsoft showcased how it's going even further, providing details about a preview version for the Xcode IDE from archrival Apple.

Subscribe on YouTube

Upcoming Training Events