Databricks-Conducted Survey Reveals Spark Is Increasingly Being Used in Production
SAN FRANCISCO, CA--(Marketwired - Sep 27, 2016) - Databricks, the company founded by the creators of the Apache Spark project, today released the findings of its second annual Apache Spark survey to determine how enterprises and users are utilizing the data analytics and processing engine. The 2016 Databricks Apache Spark Survey collected more than 1,600 responses from 900 organizations. The results show a rise in deployments of Spark in the public cloud, increased usage of Spark in industry verticals, and an uptick in the use of Spark Streaming and machine learning. The survey also reveals that most developers employ two or more Spark components simultaneously to build increasingly sophisticated solutions.
Spark remains the most active open-source project in the big data space today, with over 1,000 contributors from more than 250 organizations. Spark's adoption continued to accelerate throughout the past year, and its growth continues across various industries, where people in various functional roles use it to build sophisticated data solutions. In fact, Spark has moved well beyond the early-adopter phase and is now considered mainstream in large data-driven enterprises in sectors such as banking, medical, bio-tech, and pharmacy.
Download the full report here: http://go.databricks.com/2016-spark-survey
"Since inception, Spark's core mission has been to make big data simple and accessible for everyone -- for organizations of all sizes and across all industries. And we have not deviated from that mission," said Matei Zaharia, creator of Apache Spark and Databricks' Chief Technologist. "I'm excited to see more Apache Spark deployments in the cloud and interest from users to build real-time applications using Spark Streaming, machine learning libraries, and other components, tackling complex problems across a broad range of industries."
Key findings from the survey include:
"As Spark becomes easier, faster, and smarter outside the Web Industry, a newer audience is adopting it, as results from the survey suggest," said Reynold Xin, chief architect and co-founder at Databricks. "Performance, ease-of-use, streaming, and reliability top the list as the most important features. These attributes make Spark an attractive engine for performing advanced analytics across industry verticals in solving complex data problems, by users from different functional roles."
About the survey:
A total of 1,615 people from 900 distinct organizations responded to this survey. Of the respondents, 41 percent identified themselves as data engineers, 23 percent as data scientists, and 21 percent as architects; the rest came from technical management and academia. Survey respondents were predominantly Apache Spark users.
About Databricks:
Databricks' vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team that created Apache® Spark™, a powerful open source data processing engine built for sophisticated analytics, ease of use, and speed. Databricks is the largest contributor to the open source Apache Spark project, providing 10x more code than any other company. The company has also trained over 20,000 users on Apache Spark and has the largest number of customers deploying Spark to date. Databricks provides a just-in-time data platform to simplify data integration, real-time experimentation, and robust deployment of production applications. Databricks is venture-backed by Andreessen Horowitz and NEA. For more information, contact info@databricks.com.
© Databricks 2016. All rights reserved. Apache, Apache Spark and Spark are trademarks of the Apache Software Foundation.
Contact Information:
Suzanne Block for Databricks
P: 617-824-0981
E: databricks@merrittgrp.com