Spark Open Source Testing
Recently I tried helping out more with the open source Apache Spark community. As of the writing of this blog the 2.3.0 has had some release candidates submitted for review by the community. To that end I decided to help out & chronicle my experience. So, to start with, I installed Oracle Virtual Box (technically already had) and grabbed a fresh image of Ubuntu 16.04.3 & loaded that up. (Note, we'll be using a lot of space, make sure you have over 20 gb at least of space otherwise you'll have to add memory as you go using virtualbox commands & gparted in linux, which, I can now say, is very much, not fun😒) After I had that I did some basic stuff, like installing openjdk version 8 (technically I forgot openjdk at first... which maven did not like 😃): sudo apt-get install openjdk-8 Also, you have to pull down a copy of R, if you want to have the R related spark stuff to work, I'm going to skip here how to install R on Linux/Ubuntu, but there are lots ...