Replies: 1 comment 1 reply
- @msgurikar I'd highly recommend using full UNC paths (\\server\shareddata...). I'd also recommend downloading Process Monitor (Sysinternals) and watching the file I/O. It shows both successful file operations and failures, and you can use filters to see only the I/O for certain processes and exclude others.
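For example, the environment variables from the submission step below could point at the UNC path of the share that K:\ maps to instead of the mapped drive letter. This is a minimal sketch only; \\server\shareddata is a placeholder for the actual share.
:: Illustrative only: \\server\shareddata is a placeholder for whatever share
:: K:\ is mapped to. Spark worker processes may run in a session that does not
:: have the per-user drive mapping, so full UNC paths avoid that pitfall.
set DOTNET_WORKER_DIR=\\server\shareddata\Microsoft.Spark.Worker-1.0.0
set DOTNET_ASSEMBLY_SEARCH_PATHS=\\server\shareddata\app_binaries\Debug\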
- We have set up a development Windows cluster to run our .NET for Apache Spark jobs across different Windows machines.
On Windows Machine 1, we started the master and a worker:
spark-class2.cmd org.apache.spark.deploy.master.Master --host xx.xx.100.2
spark-class2.cmd org.apache.spark.deploy.worker.Worker spark://xx.xx.100.2:7077
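As a quick sanity check (assuming Spark's default standalone web UI port), the master UI should list the worker once it has registered:
:: Assumes the default master web UI port 8080; the registered worker should
:: appear under "Workers" on this page.
start http://xx.xx.100.2:8080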
From Windows Machine 2, we submitted the job:
set SPARK_HOME=%~dp0\spark-3.0.0-bin-hadoop2.7
set DOTNET_WORKER_DEBUG=1
set DOTNET_WORKER_DIR=K:\Microsoft.Spark.Worker-1.0.0
set DOTNET_ASSEMBLY_SEARCH_PATHS=K:/app_binaries/Debug/
set PATH=%SPARK_HOME%\bin;%DOTNET_WORKER_DIR%;%PATH%
Submit-Job.cmd --class org.apache.spark.deploy.dotnet.DotnetRunner --master spark://xx.xx.100.2:7077 --conf spark.driver.host=xx.xx.100.4 --files ./Debug.zip .\microsoft-spark-3-0_2.12-1.0.0.jar, .\Debug.zip .\app.exe
The K:\ network drive is accessible from both machines.
I can see the driver code executing and the UDF being called. Our UDF depends on other DLLs, and the point where the UDF calls into one of those DLLs throws an error; the worker stderr shows "Unable to load this .dll or its dependencies."
My question is: how do we pass the UDF's dependency DLLs to the workers that execute the UDF remotely?
I can see microsoft-spark-3-0_2.12-1.0.0.jar being copied to the spark-3.0.0-bin-hadoop2.7\work\app-20210409135645-0016\0 folder during the run, but where will .NET for Spark look for the UDF's dependency DLLs?
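For reference, a minimal sketch of how the dependency DLLs could be bundled into the Debug.zip that is already passed via --files; the K:\app_binaries\Debug path is taken from DOTNET_ASSEMBLY_SEARCH_PATHS above, and whether the workers probe the extracted zip for these DLLs is exactly what we are unsure about.
:: Sketch only: package app.exe together with all of its dependency DLLs from
:: the Debug output folder into the Debug.zip that spark-submit ships to the
:: workers via --files. Paths are assumptions based on the settings above.
dir K:\app_binaries\Debug\*.dll
powershell -Command "Compress-Archive -Path 'K:\app_binaries\Debug\*' -DestinationPath '.\Debug.zip' -Force"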
Thank you.