Processing and Serving Data with Apache Spark

Written by: Edward Yeh, Principal Big Data Consultant

So what is Apache Spark, and why do we care? Spark is a fast, general-purpose cluster computing system used for large-scale processing of both structured and unstructured data. The project was initially developed by the AMPLab at UC Berkeley and has now evolved …