We’ve developed a beta version of Spark notebook support in our fork of OpenStack Sahara. With this version it is possible to have a development environment for Spark in a few clicks, ready to be used by a data scientist to develop his applications.
Spark Notebooks offer the same features as iPython notebooks, but with the distributed computing capabilities of Spark behind the scenes. They are programmed in Scala: code and text can be mixed, with plots and other data visualizations generated on the fly.
With our changes in Sahara, Noteboks processes can be deployed together with a Spark cluster and will be configured and ready to go.