How to work efficiently with SBT, Spark and “provided” dependencies?

野性不改 2021-01-31 02:47

I'm building an Apache Spark application in Scala and I'm using SBT to build it. Here is the thing:

  1. when I'm developing under IntelliJ IDEA, I want the Spark dependencies to be included in the classpath, so that I can run and debug the application straight from the IDE;
  2. when I package the application to submit it with spark-submit, I want the Spark dependencies to be "provided", i.e. excluded from the packaged JAR, since the cluster already supplies them at runtime.
8 Answers
  • 2021-01-31 03:27

    Why not bypass sbt and manually add spark-core and spark-streaming as libraries to your module dependencies?

    • Open the Project Structure dialog (e.g. ⌘;).
    • In the left-hand pane of the dialog, select Modules.
    • In the pane to the right, select the module of interest.
    • In the right-hand part of the dialog, on the Module page, select the Dependencies tab.
    • On the Dependencies tab, click the add (+) button and select Library.
    • In the Choose Libraries dialog, select New Library > From Maven...
    • Search for spark-core, e.g. org.apache.spark:spark-core_2.10:1.6.1
    • Profit

    https://www.jetbrains.com/help/idea/2016.1/configuring-module-dependencies-and-libraries.html?origin=old_help#add_existing_lib
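    As background, a typical build.sbt for this situation keeps Spark in "provided" scope so it stays out of the packaged JAR; the manual library trick above only affects what IDEA sees. A minimal sketch, with illustrative versions matching the artifact above:

    // build.sbt -- versions are illustrative; match them to your cluster
    scalaVersion := "2.10.6"

    libraryDependencies ++= Seq(
      "org.apache.spark" %% "spark-core"      % "1.6.1" % "provided",
      "org.apache.spark" %% "spark-streaming" % "1.6.1" % "provided"
    )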

  • 2021-01-31 03:29

    [Obsolete] See the newer answer below: use the 'Include dependencies with "Provided" scope' option in an IntelliJ run configuration.

    The easiest way to add provided dependencies to debug a task with IntelliJ is to:

    • Right-click src/main/scala
    • Select Mark Directory as... > Test Sources Root

    This tells IntelliJ to treat src/main/scala as a test folder for which it adds all the dependencies tagged as provided to any run config (debug/run).

    Every time you do an SBT refresh, redo these steps, as IntelliJ will reset the folder to a regular source folder.

  • 2021-01-31 03:38

    A solution based on creating another subproject for running the project locally is described here.

    Basically, you would need to modify the build.sbt file with the following:

    // sparkVersion needs to be defined somewhere in the build; illustrative value:
    val sparkVersion = "1.6.1"

    lazy val sparkDependencies = Seq(
      "org.apache.spark" %% "spark-streaming" % sparkVersion
    )
    
    libraryDependencies ++= sparkDependencies.map(_ % "provided")
    
    lazy val localRunner = project.in(file("mainRunner")).dependsOn(RootProject(file("."))).settings(
       libraryDependencies ++= sparkDependencies.map(_ % "compile")
    )
    

    Then run the new subproject locally by selecting Use classpath of module: localRunner in the Run Configuration.
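
    If you prefer the command line, the subproject can also be run directly from sbt (sbt 1.x syntax, assuming the localRunner name from the snippet above):

    sbt "localRunner/run"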

  • 2021-01-31 03:44

    Use the 'Include dependencies with "Provided" scope' checkbox, available in the run configuration settings of recent IntelliJ IDEA versions.
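
    With that checkbox enabled, the build file can stay in its clean, deployment-friendly form with no IDE-specific workarounds. A minimal sketch, with illustrative versions:

    // build.sbt -- Spark stays "provided" so it is excluded from the packaged JAR;
    // the run-configuration checkbox puts it back on IDEA's runtime classpath
    val sparkVersion = "2.4.8"  // illustrative

    libraryDependencies ++= Seq(
      "org.apache.spark" %% "spark-core"      % sparkVersion % "provided",
      "org.apache.spark" %% "spark-streaming" % sparkVersion % "provided"
    )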

  • 2021-01-31 03:44

    You need to make IntelliJ work with the provided dependencies.

    The main trick here is to create another subproject that will depend on the main subproject and will have all its provided libraries in compile scope. To do this I add the following lines to build.sbt:

    lazy val mainRunner = project.in(file("mainRunner")).dependsOn(RootProject(file("."))).settings(
      libraryDependencies ++= spark.map(_ % "compile")
    )
    
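    For completeness, the spark value referenced above is assumed to be the project's Spark dependency list, declared earlier in build.sbt along these lines (version illustrative):

    val sparkVersion = "2.0.0"  // illustrative
    lazy val spark = Seq(
      "org.apache.spark" %% "spark-core" % sparkVersion
    )
    // the main project keeps them "provided" for packaging
    libraryDependencies ++= spark.map(_ % "provided")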

    Now refresh the project in IDEA and adjust the previous run configuration so that it uses the new mainRunner module's classpath (Use classpath of module: mainRunner).

    Works flawlessly for me.

    Source: https://github.com/JetBrains/intellij-scala/wiki/%5BSBT%5D-How-to-use-provided-libraries-in-run-configurations

  • 2021-01-31 03:45

    You shouldn't be looking at SBT for an IDEA-specific setting. First of all, if the program is supposed to be run with spark-submit, how are you running it in IDEA? I am guessing you'd be running it standalone in IDEA, while normally running it through spark-submit. If that's the case, manually add the Spark libraries in IDEA, using File | Project Structure | Libraries. You'll see all the dependencies listed from SBT, but you can add arbitrary jar/Maven artifacts using the + (plus) sign. That should do the trick.
