
Alteryx and Databricks to Lead Development of Apache SparkR for Scalable Hadoop Analytics

Alteryx and Databricks Announce Technology and Go-to-Market Partnership to Drive Adoption of SparkR and SparkSQL

Alteryx to Focus on Apache Spark Framework as the Primary Technology for Customers to Achieve Scalable, Analytic Freedom in Hadoop.

San Francisco, Calif. – Spark Summit – June 30, 2014 – Alteryx and Databricks today announced they are collaborating to drive the value of Apache Hadoop and Spark into the hands of everyday analysts. These companies will become the primary committers to SparkR, a subset of the overall Spark framework. In addition, Alteryx and Databricks are announcing a technology and go-to-market partnership to accelerate the adoption of SparkR and SparkSQL, in order to help analysts get greater value from Spark as the leading open-source in-memory engine.

“We are focused on becoming the most complete option for data analysts across the Hadoop landscape. Our goal is to empower analysts to utilize data everywhere to make the best analytic decisions possible,” said George Mathew, President and COO of Alteryx. “We believe the Apache Spark framework to be the primary method for our customers to achieve scalable, analytic freedom with their Hadoop investment. We’re delighted to be driving the new analytic stack with Databricks.”

Apache Spark, an open-source data analytics framework, has quickly been gaining traction for its fast and scalable in-memory analytic processing capabilities, both inside and independent of Hadoop. SparkR is an R package that enables the R programming language to run inside the Spark framework in order to manipulate data for analytics. The collaboration between Alteryx and Databricks will foster faster delivery of a market-leading in-memory engine for R-based analytics within Hadoop that is available to the Spark community. Together, the companies will work to bring the SparkR package to a 1.0 production version, utilizing a growing array of machine learning algorithms.

“The strong traction that Apache Spark has gained in the industry is a clear indication of the value to the broad user community and the need to further invest in the development of projects such as SparkR,” said Amr Awadallah, chief technology officer at Cloudera. “Eliminating the complexities of analytics in Hadoop for users will enable everyday analysts to deliver highly scalable analytics in their Hadoop-based enterprise data hubs.”

“The Databricks team is putting forward the best technology for the betterment and adoption of Apache Spark,” said Ion Stoica, CEO of Databricks. “Our collaboration with Alteryx on SparkR will only accelerate this value to a wider audience.”

Alteryx and Databricks will also collaborate on joint technology and go-to-market activities to improve the ease of use and speed the adoption of SparkR and SparkSQL technologies for data blending and advanced analytics on the Spark platform.

Alteryx will adopt the Apache Spark framework in a future release of the Alteryx Analytics platform to allow its customers to achieve faster, scalable analytics across all of their data. As an important foundation, Alteryx will support the ability to read and write directly to Hadoop HDFS in an upcoming release of the Alteryx Analytics platform.

For more information


About Alteryx, Inc.

Alteryx is the leader in data blending and advanced analytics software. Alteryx Analytics provides analysts with an intuitive workflow for data blending and advanced analytics that leads to deeper insights in hours, not the weeks typical of traditional approaches. Analysts love the Alteryx Analytics platform because they can deliver deeper insights by seamlessly blending internal, third-party, and cloud data, and then analyzing it using spatial and predictive drag-and-drop tools. This is all done in a single workflow, with no programming required. More than 500 customers, including Experian, Kaiser, Ford, and McDonald’s, and 200,000+ users worldwide rely on Alteryx daily. Visit or call 1-888-836-4274. Alteryx is a registered trademark of Alteryx, Inc.

About Databricks

Databricks was founded by the creators of Apache Spark and is using cutting-edge technology based on years of research to build next-generation software for analyzing and extracting value from Big Data. The company believes Big Data is a tremendous opportunity that is still largely untapped, and is working to revolutionize what enterprises can do with it. Databricks is venture-backed by Andreessen Horowitz.

Media Contacts
Brandy S. Baxter
Alteryx, Inc.
Office: 650-375-2907
Twitter: @brandysbaxter
Alex Koritz
Office: 801-461-9795