Allocation in execv, execvp, and execve

The `exec` family of functions replace the current process with the new one, thus programs that wish to spawn a child normally call `exec*()` soon after a `fork()`. Unfortunately, fork [interacts badly with threads](http://www.linuxprogrammingblog.com/threads-and-fork-think-twice-before-using-them), because of which [POSIX restricts](http://pubs.opengroup.org/onlinepubs/009695399/functions/fork.html) the functions that are allowed to be called after `fork()` to a small [set of async-signal-safe functions.](http://man7.org/linux/man-pages/man7/signal-safety.7.html) The part that affects Rust is that, in particular, calls to `malloc()` are not allowed, which means any heap allocation after `fork()` is forbidden. See [`std::process` documentation](https://doc.rust-lang.org/std/os/unix/process/trait.CommandExt.html#notes) and [source](https://github.com/rust-lang/rust/blob/1.15.1/src/libstd/sys/unix/process/process_unix.rs#L114) for additional details.

The `nix` interface to `execv`, `execve`, and `execvp` calls `to_exec_array` to allocate the kind of array that the corresponding C functions expect. It makes sense that this is done inside `exec`; the `argv` and `envp` pointers are laid out in a way completely alien to Rust, and in violation to its safety rules. However, since `exec*` functions are typically invoked after a fork, it would be very nice to make the allocation optional, while still providing a fully safe wrapper.

One way to avoid allocation after fork and retain safety is allocate the needed buffers *before* forking. For example, a type that encapsulates an owning vector of C strings could be instantiated before `fork()`, and passed to actual `exec`. Since it would always store both the vector of `CString` objects and a vector of pointers to their content, its `as_c_vec()` method could produce the layout needed by `exec*` without further allocation:

```rust
struct CVec {
    strings: Vec<CString>,

    // nullptr-terminated vector of pointers to data inside self.strings.
    ptrs: Vec<*const libc::c_char>,
}

impl CVec {
    fn new<S>(slice: &[S]) -> Result<CVec>
        where S: AsRef<OsStr>
    {
        // ...
    }

    pub fn as_c_vec(&self) -> *const *const libc::c_char {
        self.ptrs.as_ptr()
    }
}
```

A safe variant of `unistd::execvp()` would only accept `CVec` objects:

```rust
pub fn safe_execv(path: &CString, argv: &CVec) -> Result<Void> {
    unsafe {
        libc::execv(path.as_ptr(), argv.as_c_vec())
    };
    Err(Error::Sys(Errno::last()))
}
```

Another option, which is more of a hassle to implement, but perhaps more convenient to use, is to completely encapsulate the allocation done inside the exec wrapper. It would effectively support staged execution, where first stage would prepare the needed data and return a closure that carries out the actual call to `libc::exec`, to be called after forking:

```rust
let do_exec = unistd::stage_execvp("cmd", &["arg0", "arg1", "arg2"])?;
if unistd::fork()?.is_child() {
    do_exec();  // handle errors
    _exit(127);
}
```

String arguments are supported by accepting `AsRef<OsStr>`, again converted to C strings before `fork()`. Here is an [example](https://github.com/hniksic/rust-subprocess/blob/release/0.1.9/src/posix.rs#L234) of a library that implements this approach to wrap `exec*` and implement the equivalent of `execvpe`. (While developing this library, the code exhibited real hangs before its own exec wrapper was switched to the approach described above.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allocation in execv, execvp, and execve #555

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Allocation in execv, execvp, and execve #555

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions