fix(executor): switch process working dir via native chdir by my-vegetable-has-exploded · Pull Request #475 · ray-project/raydp

my-vegetable-has-exploded · 2026-05-17T06:24:25Z

Motivation

The RayDP executor process CWD remains the container default directory (e.g., / or /opt) rather than the executor's workingDir. This causes Spark distributed files (--files) and archives (--archives) to be extracted to the wrong location, making it impossible for executor code to find them at the expected paths.

Approach

JNA native chdir call: Add the JNA dependency to pom.xml and define a LibC interface that binds chdir() from libc. After setUserDir() (which only changes the user.dir system property), call switchProcessWorkingDirBestEffort() to actually switch the process-level CWD.
Align SparkEnv.driverTmpDir: Change driverTmpDir from workingDir/_tmp to workingDir itself, so that the root directory for Spark distributed files matches the executor's workingDir and files/archives are extracted to the correct path.
Fault tolerance: A chdir failure only logs a warning and does not interrupt executor startup. The CWD is read before and after the switch for diagnostic logging.
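
The three points above can be sketched roughly as follows. This is a minimal illustration rather than the PR's actual code: it assumes JNA 5.x is on the classpath, and the names `LibC`, `ChdirSketch`, `processCwd`, and `switchCwdBestEffort` are hypothetical stand-ins. The `/proc/self/cwd` probe works only on Linux and is used purely for diagnostics.

```scala
import com.sun.jna.{Library, Native}
import java.nio.file.{Files, Paths}
import scala.util.Try

// Hypothetical minimal binding: only chdir(2) is declared.
trait LibC extends Library {
  def chdir(path: String): Int
}

object ChdirSketch {
  // "c" asks JNA for the platform C library. Loading is lazy so that a
  // load failure surfaces inside the try block below instead of at class init.
  private lazy val libc: LibC = Native.load("c", classOf[LibC])

  // Linux-only diagnostic: /proc/self/cwd is a symlink to the real process CWD.
  def processCwd(): Option[String] =
    Try(Paths.get("/proc/self/cwd").toRealPath().toString).toOption

  // Returns true if the native CWD was switched; logs and returns false otherwise.
  def switchCwdBestEffort(targetDir: String): Boolean = {
    val before = processCwd().getOrElse("<unknown>")
    try {
      val rc = libc.chdir(targetDir)
      if (rc == 0) {
        println(s"cwd switched from $before to ${processCwd().getOrElse("<unknown>")}")
        true
      } else {
        System.err.println(
          s"chdir($targetDir) failed: rc=$rc, errno=${Native.getLastError()}")
        false
      }
    } catch {
      // Covers a missing libc or symbol (UnsatisfiedLinkError) and JNA runtime errors.
      case e @ (_: LinkageError | _: RuntimeException) =>
        System.err.println(s"native chdir unavailable: $e")
        false
    }
  }

  def main(args: Array[String]): Unit = {
    // A temporary directory stands in for the executor's workingDir.
    val target = Files.createTempDirectory("cwd-sketch").toRealPath().toString
    switchCwdBestEffort(target)
  }
}
```

Returning a Boolean instead of throwing keeps the call best-effort, so a caller can continue startup regardless of the outcome.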

Add JNA dependency to invoke native chdir syscall for switching the executor process working directory. Align SparkEnv.driverTmpDir with workingDir to ensure distributed files and archives are extracted to the correct root directory.

Signed-off-by: wangyi <epsilonwang@didiglobal.com>

Copilot

Pull request overview

This PR updates RayDP executor startup so the executor process attempts to switch its native working directory to the executor workingDir, aligning Spark distributed file/archive placement with executor-local expectations.

Changes:

Adds JNA and a native libc chdir call during executor startup.
Adds diagnostic logging around process CWD switching.
Changes SparkEnv.driverTmpDir to point at workingDir instead of workingDir/_tmp.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| `core/raydp-main/src/main/scala/org/apache/spark/executor/RayDPExecutor.scala` | Adds best-effort native CWD switching and aligns Spark distributed file root with executor working directory. |
| `core/raydp-main/pom.xml` | Adds the JNA dependency required for native libc access. |

+        logWarning(s"Failed to switch executor process cwd from ${beforeCwd} to ${targetDir}, " +
+          s"chdir returned rc=${rc}, errno=${Native.getLastError}")


pang-wu · 2026-05-23T07:08:48Z

      assert(workerTmpDir.exists() && workerTmpDir.isDirectory)
-      SparkEnv.get.driverTmpDir = Some(workerTmpDir.getAbsolutePath)
+      // Keep Spark's distributed file/archive root aligned with executor workingDir.
+      SparkEnv.get.driverTmpDir = Some(workingDir.getAbsolutePath)


Would this change alone solve the problem?

pang-wu · 2026-05-23T07:54:05Z

+
+  private def getProcessWorkingDir: String = {
+    try {
+      val procCwd = Paths.get("/proc/self/cwd")


This is Linux-specific.
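
One possible way to keep such a probe from misleading on other platforms is to report where the value came from. The sketch below is an assumption-laden illustration, not code from the PR: `CwdProbe`, `CwdSource`, and `describeProcessCwd` are hypothetical names. When `/proc/self/cwd` cannot be resolved, it falls back to the `user.dir` property, which reflects the JVM's own view rather than the native CWD, so the result is tagged accordingly.

```scala
import java.nio.file.Paths
import scala.util.Try

object CwdProbe {
  // Records which mechanism produced the reported directory.
  sealed trait CwdSource
  case object ProcFs extends CwdSource          // real process CWD (Linux)
  case object UserDirProperty extends CwdSource // JVM property, may be stale

  def describeProcessCwd(): (String, CwdSource) = {
    // Resolving the symlink fails on systems without procfs (e.g. macOS).
    val fromProc = Try(Paths.get("/proc/self/cwd").toRealPath().toString).toOption
    fromProc match {
      case Some(path) => (path, ProcFs)
      case None       => (System.getProperty("user.dir"), UserDirProperty)
    }
  }

  def main(args: Array[String]): Unit = {
    val (path, source) = describeProcessCwd()
    println(s"cwd=$path (source=$source)")
  }
}
```

Logging the source alongside the path makes it clear when a diagnostic line shows the system property rather than the directory the operating system actually reports.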

pang-wu · 2026-05-23T08:00:00Z

Can we add some tests to verify that the fix works?

wangyi added 2 commits May 17, 2026 03:27

fix: correct import order in RayDPExecutor.scala for scalastyle

fc35aff

slfan1989 requested a review from Copilot May 17, 2026 12:19

Copilot started reviewing on behalf of slfan1989 May 17, 2026 12:20

Copilot AI reviewed May 17, 2026

Comment thread core/raydp-main/src/main/scala/org/apache/spark/executor/RayDPExecutor.scala

Comment on lines +201 to +202

logWarning(s"Failed to switch executor process cwd from ${beforeCwd} to ${targetDir}, " +
  s"chdir returned rc=${rc}, errno=${Native.getLastError}")

pang-wu reviewed May 23, 2026

my-vegetable-has-exploded wants to merge 2 commits into ray-project:master from my-vegetable-has-exploded:ch-dir

3 participants
