We help people get the most out of their investment in Oracle Data, Analytics and AI.

emr

ETL Offload with Spark and Amazon EMR - Part 3 - Running pySpark on EMR

In the previous articles (here [https://www.rittmanmead.com/blog/2016/12/etl-offload-with-spark-and-amazon-emr-part-1], and here [https://www.rittmanmead.com/blog/2016/12/etl-offload-with-spark-and-amazon-emr-part-2-code-development-with-notebooks-and-docker/] ) I gave the background to a project we did for a client, exploring the benefits of Spark-based ETL processing running on Amazon's Elastic Map Reduce

DVD

Combining Google Analytics and JSON data through Apache Drill in Oracle Data Visualization Desktop

I've been talking a lot about Oracle's Data Visualization Desktop (DVD) recently, explaining DVD 12.2.2.0 new features [https://www.rittmanmead.com/blog/2016/10/data-visualisation-desktop-12-2-2-0-new-features-2/] and the details of Data Flow [https://www.rittmanmead.com/blog/2016/10/data-visualisation-desktop-data-loading-2/] component via a fantasy