Apache Spark™ is a fast and general engine for large-scale data processing. It has quickly become a powerful and necessary tool in the world of Big Data. This webinar will provide a technical overview of Apache Spark Data Frame. Agenda: – Definition of a DataFrame including various data sources, primary feature and architecture – Use cases of sample operations that can be performed on DataFrames – Live Demo of a user program written in Scala that will process a complex input file having a variety of record types

Share this: