News

Two Big Data Companies Form Partnership

Cloudera and MongoDB also announce the MongoDB Connector for Hadoop.

Developers are increasingly focusing on Big Data development, and Hadoop is one of the most popular technologies for the .NET crowd.

That tent just got a little bigger, as Cloudera Inc. and MongoDB Inc. announced a strategic partnership to integrate their Big Data technologies.

Cloudera is one of the main players in the Big Data space, with a popular Hadoop distribution and accompanying bundled packages for enterprises. MongoDB is billed as the most widely used NoSQL database and is often used in Big Data analytics.

Details of the partnership were scant, though the companies said they had the lofty goal to "transform how organizations approach Big Data."

One immediate aspect of the partnership was the certification of the MongoDB Connector for Hadoop to be used with Cloudera Enterprise 5, the company's latest package that bundles the Cloudera Distribution Including Apache Hadoop (CDH) with a subscription to Cloudera Manager, a Hadoop administration tool, and technical support.

"Certifying MongoDB's Connector for Hadoop on Cloudera Enterprise 5 was our first step to help our joint customers," Kelly Stirman, director of product at MongoDB, said in an interview. "Resources are being invested even further to match our companies' joint vision, including building a tighter integration of the next version of the MongoDB Connector for Hadoop."

The companies said no further news on the partnership will be forthcoming until the June 24 kick-off of the MongoDB World conference in New York.

The MongoDB Connector for Hadoop plug-in lets developers more easily use MongoDB as an input source or output destination for processing Big Data with the open source Hadoop framework. The connector obviates the need to use custom code or cumbersome import/export scripts to move data around. It takes advantage of multi-core parallelism, has full integration with the Hadoop and Java Virtual Machine (JVM) ecosystems, is compatible with Amazon Elastic MapReduce, and can use local filesystems, Hadoop Distributed File System (HDFS), or S3 to read and write backup files, MongoDB said in a Webinar last year.

"The connector presents MongoDB as a Hadoop-compatible file system allowing a MapReduce job to read from MongoDB directly without first copying it to HDFS, thereby eliminating the need to move terabytes of data across the network," MongoDB said last year when the connector was updated. "MapReduce jobs can pass queries as filters, so avoiding the need to scan entire collections, and can also take advantage of MongoDB's rich indexing capabilities including geospatial, text-search, array, compound and sparse indexes."

Beyond the connector, the companies hinted at more to come regarding the collaboration. "More than a simple technology integration, the partnership brings the two companies' leadership to bear in enabling enterprises to fundamentally rethink how data can be shared and put to work across the enterprise," they said in a statement. "The combination of Cloudera Enterprise and MongoDB will enable customers to easily develop, operate and manage Big Data infrastructure that powers modern applications."

The announcement was the latest in a string regarding MongoDB, which earlier this month released a major upgrade of its database. Last week, Microsoft announced new high-memory MongoDB instances were available in its Microsoft Azure cloud platform.

About the Author

David Ramel is an editor and writer for Converge360.

comments powered by Disqus

Featured

  • AI for GitHub Collaboration? Maybe Not So Much

    No doubt GitHub Copilot has been a boon for developers, but AI might not be the best tool for collaboration, according to developers weighing in on a recent social media post from the GitHub team.

  • Visual Studio 2022 Getting VS Code 'Command Palette' Equivalent

    As any Visual Studio Code user knows, the editor's command palette is a powerful tool for getting things done quickly, without having to navigate through menus and dialogs. Now, we learn how an equivalent is coming for Microsoft's flagship Visual Studio IDE, invoked by the same familiar Ctrl+Shift+P keyboard shortcut.

  • .NET 9 Preview 3: 'I've Been Waiting 9 Years for This API!'

    Microsoft's third preview of .NET 9 sees a lot of minor tweaks and fixes with no earth-shaking new functionality, but little things can be important to individual developers.

  • Data Anomaly Detection Using a Neural Autoencoder with C#

    Dr. James McCaffrey of Microsoft Research tackles the process of examining a set of source data to find data items that are different in some way from the majority of the source items.

  • What's New for Python, Java in Visual Studio Code

    Microsoft announced March 2024 updates to its Python and Java extensions for Visual Studio Code, the open source-based, cross-platform code editor that has repeatedly been named the No. 1 tool in major development surveys.

Subscribe on YouTube