tomek.xyz

Reputation: 139

Understanding how Spark applications use dependencies

Let's say we have a Spark application that writes to/reads from HDFS, and we have some additional dependency; let's call it dep.

Now, let's run spark-submit on our jar built with sbt. I know that spark-submit ships some jars (known as spark-libs). However, my questions are:
(1) How does the Spark version influence the shipped dependencies? I mean, what is the difference between spark-with-hadoop/bin/spark-submit and spark-without-hadoop/bin/spark-submit?
(2) How does the Hadoop version installed on the cluster (Hadoop cluster) influence the dependencies?
(3) Who is responsible for providing my dependency dep? Should I build a fat jar (assembly)?

Please note that the first two questions are about where the HDFS calls come from (I mean the calls made by my Spark application, like write/read).
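To make the setup concrete, this is the kind of submit I mean (the class and jar names are just placeholders):

```bash
# placeholder names, only to illustrate the setup
spark-submit \
  --class com.example.MyApp \
  --master yarn \
  target/scala-2.12/my-app_2.12-0.1.jar
```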

Thanks in advance

Upvotes: 0

Views: 250

Answers (1)

OneCricketeer

Reputation: 191738

spark-without-hadoop refers only to the downloaded package, not to application development.

The more correct phrasing is "Bring your own Hadoop," meaning you are still required to have the base Hadoop dependencies for any Spark application.
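With the spark-without-hadoop ("Hadoop free") package, that bringing-your-own is done at launch time: per Spark's Hadoop-free build docs you point the launcher at the cluster's Hadoop jars, roughly like this (adjust the file location for your install):

```bash
# conf/spark-env.sh on the machine that runs spark-submit (Hadoop-free build only)
export SPARK_DIST_CLASSPATH=$(hadoop classpath)
```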

Should I build a fat jar (assembly)?

If you have libraries that are outside of hadoop-client and those provided by Spark (core, mllib, streaming), then yes.
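A minimal build.sbt sketch of that split, assuming the sbt-assembly plugin is added in project/plugins.sbt; all group IDs, artifact names, and versions below are placeholders:

```scala
// build.sbt -- sketch only
name := "my-spark-app"
scalaVersion := "2.12.18"

libraryDependencies ++= Seq(
  // Supplied by the cluster's Spark/Hadoop installation at runtime,
  // so marked "provided" and NOT packed into the fat jar
  "org.apache.spark"  %% "spark-core"    % "3.3.2" % "provided",
  "org.apache.spark"  %% "spark-sql"     % "3.3.2" % "provided",
  "org.apache.hadoop" %  "hadoop-client" % "3.3.4" % "provided",

  // The extra dependency "dep" from the question: nothing on the cluster
  // ships this for you, so it goes into the assembly
  "com.example" %% "dep" % "1.0.0"
)
```

Running `sbt assembly` then produces a single jar that contains dep (but not Spark or Hadoop classes), and that is the jar you pass to spark-submit. Alternatively, you can leave dep out of the jar and supply it at submit time with --jars or --packages.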

Upvotes: 0
