
Build the jar

cd spark-all-pairs-shortest-path/

sbt package

Commands to run Spark on the cluster. First copy the jar over and log in:

scp -i keyname.pem target/scala-2.10/all-pairs-shortest-path_2.10-1.0.jar [email protected]:~/

ssh -i keyname.pem [email protected]

Now put the jar in HDFS (remove any old copy first):

./persistent-hdfs/bin/hadoop fs -rm /vol/all-pairs-shortest-path_2.10-1.0.jar

./persistent-hdfs/bin/hadoop fs -put all-pairs-shortest-path_2.10-1.0.jar hdfs://

cd spark

./bin/spark-submit --class AllPairsShortestPath --master spark:// \
  --deploy-mode cluster hdfs://
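The submit command above needs a master URL and a jar location that these notes do not spell out. As a minimal sketch, the helper below assembles the command line from two arguments so it can be inspected before it is run. The function name `submit_cmd` and the `<master-url>` / `<jar-uri>` placeholders are hypothetical and only for illustration; substitute the real values for your cluster.

```shell
#!/bin/sh
# Sketch only: prints the spark-submit command instead of running it.
# MAIN_CLASS comes from the notes above; submit_cmd is a hypothetical helper name.

MAIN_CLASS="AllPairsShortestPath"

# Print the spark-submit command line.
#   $1 = master URL (the notes only show the "spark://" prefix)
#   $2 = URI of the jar in HDFS (the notes only show the "hdfs://" prefix)
submit_cmd() {
    printf './bin/spark-submit --class %s --master %s --deploy-mode cluster %s\n' \
        "$MAIN_CLASS" "$1" "$2"
}

# Placeholder arguments; replace them with the real master URL and jar URI.
submit_cmd "<master-url>" "<jar-uri>"
```

Once the printed line looks right, it can be pasted into the shell from inside the spark directory on the cluster.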


  1. ask Rezar (distance first / procedures)
  2. design API
  3. implement API + commenting + cleaning the code
  4. write unit tests
  5. write readme file