- Set up a box so that wrapping works correctly. - Print a `+` at the start of each trace line to show it is trace output. - Format location tag with the rest of the output (fixes wrapping). - Add tests for traceln.
eio -- effects based parallel IO for OCaml
This library implements an effects-based direct-style IO stack for multicore OCaml.
The library is very much a work-in-progress, so this is an unreleased repository.
Contents
- Motivation
- Structure of the code
- Getting started
- Testing with mocks
- Fibres
- Tracing
- Switches, errors and cancellation
- Design note: results vs exceptions
- Performance
- Networking
- Design note: object capabilities
- Filesystem access
- Time
- Multicore support
- Design note: thread-safety
- Design note: determinism
- Further reading
Motivation
The Unix
library provided with OCaml uses blocking IO operations, and is not well suited to concurrent programs such as network services or interactive applications.
For many years, the solution to this has been libraries such as Lwt and Async, which provide a monadic interface.
These libraries allow writing code as if there were multiple threads of execution, each with their own stack, but the stacks are simulated using the heap.
The multicore version of OCaml adds support for "effects", removing the need for monadic code here. Using effects brings several advantages:
- It is faster, because no heap allocations are needed to simulate a stack.
- Concurrent code can be written in the same style as plain non-concurrent code.
- Because a real stack is used, backtraces from exceptions work as expected.
- Other features of the language (such as
try ... with ...
) can be used in concurrent code.
In addition, modern operating systems provide high-performance alternatives to the old Unix select
call.
For example, Linux's io-uring system has applications write the operations they want to perform to a ring buffer,
which Linux handles asynchronously.
Due to this, we anticipate many OCaml users will want to rewrite their IO code at some point, once effects have been merged into the official version of OCaml. It would be very beneficial if we could use this opportunity to standardise on a single concurrency API for OCaml.
This project is therefore exploring what this new API should look like by building an effects-based IO library and then using it to create or port larger applications.
The API is expected to change a great deal over the next year or so. If you are looking for a stable library for your application, you should continue using Lwt or Async for now. However, if you'd like to help with these experiments, please get in touch!
At present, Linux with io-uring is the only backend available. It is able to run a web-server with good performance, but many features are still missing.
Structure of the code
eio
provides concurrency primitives (promises, etc), and a high-level, cross-platform OS API.eio_linux
provides a Linux io-uring backend for these APIs, plus a low-level API that can be used directly (in non-portable code).eunix
provides some common code shared by multiple backends.eio_main
selects an appropriate backend (e.g.eio_linux
), depending on your platform.ctf
provides tracing support.
Getting started
You will need a version of the OCaml compiler with effects. You can get one like this:
opam switch create 4.12.0+domains+effects --repositories=multicore=git+https://github.com/ocaml-multicore/multicore-opam.git,default
opam pin add -yn ppxlib 0.22.0+effect-syntax
opam pin add -yn ocaml-migrate-parsetree 2.1.0+effect-syntax
Then you'll need to install this library (and utop
if you want to try it interactively):
git clone --recursive https://github.com/ocaml-multicore/eio.git
cd eio
opam pin -yn ./ocaml-uring
sed -i '/dune" "subst/d' *.opam
opam pin -yn -k path .
opam depext -i eio_main utop
To try out the examples interactively, run utop
and require
the eio_main
library.
It is also convenient to open the Eio.Std
module:
# #require "eio_main";;
# open Eio.Std;;
This function writes a greeting to stdout:
let main ~stdout =
Eio.Flow.copy_string "Hello, world!\n" stdout
To run it, we use Eio_main.run
to run the event loop and call it from there:
# Eio_main.run @@ fun env ->
main ~stdout:(Eio.Stdenv.stdout env);;
Hello, world!
- : unit = ()
Note that:
-
The
env
argument represents the standard environment of a Unix process, allowing it to interact with the outside world. A program will typically start by extracting fromenv
whatever things the program will need and then callingmain
with them. -
The type of the
main
function here tells us that this program only interacts viastdout
. -
Eio_main.run
automatically calls the appropriate run function for your platform. For example, on Linux this will callEio_linux.run
. For non-portable code you can use the platform-specific library directly.
Testing with mocks
Because external resources are provided to main
as arguments, we can easily replace them with mocks for testing.
e.g.
# Eio_main.run @@ fun _env ->
let buffer = Buffer.create 20 in
main ~stdout:(Eio.Flow.buffer_sink buffer);
traceln "Main would print %S" (Buffer.contents buffer);;
+Main would print "Hello, world!\n"
- : unit = ()
traceln
provides convenient printf-style debugging, without requiring you to plumb stderr
through your code.
It's actually using the Format
module, so you can use the extended formatting directives here too.
Fibres
Here's an example running two threads of execution (fibres) concurrently:
let main _env =
Switch.top @@ fun sw ->
Fibre.both ~sw
(fun () -> for x = 1 to 3 do traceln "x = %d" x; Fibre.yield ~sw () done)
(fun () -> for y = 1 to 3 do traceln "y = %d" y; Fibre.yield ~sw () done)
# Eio_main.run main;;
+x = 1
+y = 1
+x = 2
+y = 2
+x = 3
+y = 3
- : unit = ()
Notes:
-
The two fibres run on a single core, so only one can be running at a time. Calling an operation that performs an effect (such as
yield
) can switch to a different thread. -
The
sw
argument is used to handle exceptions (described later).
Tracing
The library can write traces in CTF format, showing when threads (fibres) are created, when they run, and how they interact.
We can run the previous code with tracing enabled (writing to a new trace.ctf
file) like this:
# let () =
let buffer = Ctf.Unix.mmap_buffer ~size:0x100000 "trace.ctf" in
let trace_config = Ctf.Control.make buffer in
Ctf.Control.start trace_config;
Eio_main.run main;
Ctf.Control.stop trace_config;;
+x = 1
+y = 1
+x = 2
+y = 2
+x = 3
+y = 3
The trace can be viewed using mirage-trace-viewer. This should work even while the program is still running. The file is a ring buffer, so when it gets full old events will start to be overwritten with new ones.
This shows the two counting threads, as well as the lifetime of the sw
switch.
Note that the output from traceln
appears in the trace as well as on the console.
Switches, errors and cancellation
A switch is used to group fibres together so that they can be cancelled or waited on together. This is a form of structured concurrency.
Here's what happens if one of the two threads above fails:
# Eio_main.run @@ fun _env ->
Switch.top @@ fun sw ->
Fibre.both ~sw
(fun () -> for x = 1 to 3 do traceln "x = %d" x; Fibre.yield ~sw () done)
(fun () -> failwith "Simulated error");;
+x = 1
Exception: Failure "Simulated error".
What happened here was:
- The first fibre ran, printed
x = 1
and yielded. - The second fibre raised an exception.
Fibre.both
caught the exception and turned off the switch.- The first thread's
yield
saw the switch was off and raised the exception there too. - Once both threads had finished,
Fibre.both
re-raised the exception.
Note that turning off a switch only asks the other thread(s) to cancel. A thread is free to ignore the switch and continue (perhaps to clean up some resources).
Any operation that can be cancelled should take a ~sw
argument.
Switches can also be used to wait for threads even when there isn't an error. e.g.
# Eio_main.run @@ fun _env ->
Switch.top (fun sw ->
Fibre.fork_ignore ~sw
(fun () -> for i = 1 to 3 do traceln "i = %d" i; Fibre.yield ~sw () done);
traceln "First thread forked";
Fibre.fork_ignore ~sw
(fun () -> for j = 1 to 3 do traceln "j = %d" j; Fibre.yield ~sw () done);
traceln "Second thread forked; top-level code is finished"
);
traceln "Switch is finished";;
+i = 1
+First thread forked
+j = 1
+i = 2
+Second thread forked; top-level code is finished
+j = 2
+i = 3
+j = 3
+Switch is finished
- : unit = ()
Switch.top
is used for top-level switches. You can also use Fibre.fork_sub_ignore
to create a child sub-switch.
Turning off the parent switch will also turn off the child switch, but turning off the child doesn't disable the parent.
For example, a web-server might use one switch for the whole server and then create one sub-switch for each incoming connection. This allows you to end all fibres handling a single connection by turning off that connection's switch, or to exit the whole application using the top-level switch.
Design note: results vs exceptions
The OCaml standard library uses exceptions to report errors in most cases.
Many libraries instead use the result
type, which has the advantage of tracking the possible errors in the type system.
However, using result
is slower, as it requires more allocations, and explicit code to propagate errors.
As part of the effects work, OCaml is expected to gain a typed effects extension to the type system,
allowing it to track both effects and exceptions statically.
In anticipation of this, the Eio library prefers to use exceptions in most cases,
reserving the use of result
for cases where the caller is likely to want to handle the problem immediately
rather than propagate it.
Performance
As mentioned above, Eio allows you to supply your own implementations of its abstract interfaces.
This is in contrast to OCaml's standard library, for example, which only operates on OS file descriptors.
You might wonder what the performance impact of this is.
Here's a simple implementation of cat
using the standard OCaml functions:
# let () =
let buf = Bytes.create 4096 in
let rec copy () =
match input stdin buf 0 4096 with
| 0 -> ()
| got ->
output stdout buf 0 got;
copy ()
in
copy ()
And here is the equivalent using Eio:
# let () =
Eio_main.run @@ fun env ->
Eio.Flow.copy
(Eio.Stdenv.stdin env)
(Eio.Stdenv.stdout env)
Testing on a fresh 10G file with pv on my machine gives:
$ truncate -s 10G dummy
$ cat_ocaml_unix.exe <dummy | pv >/dev/null
10.0GiB 0:00:04 [2.33GiB/s]
$ cat <dummy | pv >/dev/null
10.0GiB 0:00:04 [2.42GiB/s]
$ cat_ocaml_eio.exe <dummy | pv >/dev/null
10.0GiB 0:00:03 [3.01GiB/s]
Eio.Flow.copy src dst
asks dst
to copy from src
.
As dst
here is a Unix file descriptor,
it first calls the probe
method on the src
object to check whether it is too.
Discovering that src
is also a file descriptor, it switches to a faster code path optimised for that case.
On my machine, this code path uses the Linux-specific splice
system call for maximum performance.
Note that not all cases are well optimised yet, but the idea is for each backend to choose the most efficient way to implement the operation.
Networking
Eio provides a simple high-level API for networking.
Here is a client that connects to address addr
using network
and sends a message:
let run_client ~sw ~net ~addr =
traceln "Connecting to server...";
let flow = Eio.Net.connect ~sw net addr in
Eio.Flow.copy_string "Hello from client" flow;
Eio.Flow.shutdown flow `Send
Note: The flow
is attached to sw
and will be closed automatically when it finishes.
Here is a server that listens on socket
and handles a single connection by reading a message:
let run_server ~sw socket =
Eio.Net.accept_sub socket ~sw (fun ~sw flow _addr ->
traceln "Server accepted connection from client";
let b = Buffer.create 100 in
Eio.Flow.copy flow (Eio.Flow.buffer_sink b);
traceln "Server received: %S" (Buffer.contents b)
) ~on_error:(fun ex -> traceln "Error handling connection: %s" (Printexc.to_string ex));
traceln "(normally we'd loop and accept more connections here)"
Notes:
accept_sub
handles the connection in a new fibre, with its own sub-switch.- Normally, a server would call
accept_sub
in a loop to handle multiple connections. - When the child switch created by
accept_sub
finishes,flow
is closed automatically.
We can test them in a single process using Fibre.both
:
let main ~net ~addr =
Switch.top @@ fun sw ->
let server = Eio.Net.listen net ~sw ~reuse_addr:true ~backlog:5 addr in
traceln "Server ready...";
Fibre.both ~sw
(fun () -> run_server ~sw server)
(fun () -> run_client ~sw ~net ~addr)
# Eio_main.run @@ fun env ->
main
~net:(Eio.Stdenv.net env)
~addr:(`Tcp (Unix.inet_addr_loopback, 8080))
+Server ready...
+Connecting to server...
+Server accepted connection from client
+(normally we'd loop and accept more connections here)
+Server received: "Hello from client"
- : unit = ()
Design note: object capabilities
The Eio
high-level API follows the principles of the Object-capability model (ocaps).
In this model, having a reference to an "object" (which could be a function or closure) grants permission to use it.
The only ways to get a reference are to create a new object, or to be passed an existing reference by another object.
For A to pass a reference B to another object C, A requires access (i.e. references) to both B and C.
In particular, for B to get a reference to C there must be a path in the reference graph between them
on which all objects allow it.
This is all just standard programming practice, really, except that it disallows patterns that break this model:
- Global variables are not permitted. Otherwise, B could store itself in a global variable and C could collect it.
- Modules that use C code or the OS to provide the effect of globals are also not permitted.
For example, OCaml's Unix
module provides access to the network and filesystem to any code that wants it.
By contrast, an Eio module that wants such access must receive it explicitly.
Consider the network example in the previous section. Imagine this is a large program and we want to know:
- Does this program modify the filesystem?
- Does this program send telemetry data over the network?
In an ocap language, we don't have to read the entire code-base to find the answers:
- All authority starts at the (privileged)
run
function with theenv
parameter, so we must check this code. - We see that only
env
's network access is used, so we know this program doesn't access the filesystem, answering question 1 immediately. - To check whether telemetry is sent, we need to follow the
network
authority as it is passed tomain
. main
usesnetwork
to open a listening socket on the loopback interface, which it passes torun_server
.run_server
does not get the fullnetwork
access, so we probably don't need to read that code (we might want to check whether we granted other parties access to this port on our loopback network).run_client
does getnetwork
, so we do need to read that. We could make that code easier to audit by passing it(fun () -> Eio.Net.connect network addr)
instead ofnetwork
. Then we could see thatrun_client
could only connect to our loopback address.
Since OCaml is not an ocap language, code can ignore Eio and use the non-ocap APIs directly. Therefore, this cannot be used as a security mechanism. However, it still makes non-malicious code easier to understand and test, and may allow for an ocap extension to the language in the future. See Emily for a previous attempt at this.
Filesystem access
Access to the filesystem is also controlled by capabilities, and env
provides two:
fs
provides full access (just like OCaml's stdlib).cwd
restricts access to files beneath the current working directory.
For example:
let try_write_file dir path data =
match
Eio.Dir.with_open_out ~create:(`Exclusive 0o600) dir path @@ fun flow ->
Eio.Flow.copy_string data flow
with
| () -> traceln "write %S -> ok" path
| exception ex -> traceln "write %S -> %a" path Fmt.exn ex
let try_mkdir dir path =
match Eio.Dir.mkdir dir path ~perm:0o700 with
| () -> traceln "mkdir %S -> ok" path
| exception ex -> traceln "mkdir %S -> %a" path Fmt.exn ex
# Eio_main.run @@ fun env ->
let cwd = Eio.Stdenv.cwd env in
try_mkdir cwd "dir1";
try_mkdir cwd "../dir2";
try_mkdir cwd "/tmp/dir3";
+mkdir "dir1" -> ok
+mkdir "../dir2" -> Eio.Dir.Permission_denied("..", _)
+mkdir "/tmp/dir3" -> Eio.Dir.Permission_denied("/tmp", _)
- : unit = ()
The checks also apply to following symlinks:
# Unix.symlink "dir1" "link-to-dir1"; Unix.symlink "/tmp" "link-to-tmp";;
- : unit = ()
# Eio_main.run @@ fun env ->
let cwd = Eio.Stdenv.cwd env in
try_write_file cwd "dir1/file1" "A";
try_write_file cwd "link-to-dir1/file2" "B";
try_write_file cwd "link-to-tmp/file3" "C"
+write "dir1/file1" -> ok
+write "link-to-dir1/file2" -> ok
+write "link-to-tmp/file3" -> Eio.Dir.Permission_denied("link-to-tmp/file3", _)
- : unit = ()
You can use open_dir
(or with_open_dir
) to create a restricted capability to a sub-directory:
# Eio_main.run @@ fun env ->
let cwd = Eio.Stdenv.cwd env in
Eio.Dir.with_open_dir cwd "dir1" @@ fun dir1 ->
try_write_file dir1 "file4" "D";
try_write_file dir1 "../file5" "E"
+write "file4" -> ok
+write "../file5" -> Eio.Dir.Permission_denied("../file5", _)
- : unit = ()
Note that you only need to use open_dir
if you want to create a new sandboxed environment.
You can use a single directory object to access all paths beneath it,
and this allows following symlinks within that sub-tree.
A program that operates on the current directory will probably want to use cwd
,
whereas a program that accepts a path from the user will probably want to use fs
,
perhaps with open_dir
to constrain all access to be within that directory.
Time
The standard environment provides a clock with the usual POSIX time:
# Eio_main.run @@ fun env ->
let clock = Eio.Stdenv.clock env in
traceln "The time is now %f" (Eio.Time.now clock);
Eio.Time.sleep clock 1.0;
traceln "The time is now %f" (Eio.Time.now clock)
+The time is now 1623940778.270336
+The time is now 1623940779.270336
- : unit = ()
You might like to replace this clock with a mock for tests. In fact, this README does just that - see doc/prelude.ml for the fake clock used in the example above!
Multicore support
Fibres are scheduled cooperatively within a single domain, but you can also create new domains that run in parallel. This is useful to perform CPU-intensive operations quickly. For example, let's say we have a CPU intensive task:
let sum_to n =
traceln "Starting CPU-intensive task...";
let total = ref 0 in
for i = 1 to n do
total := !total + i
done;
traceln "Finished";
!total
We can use Eio.Domain_manager
to run this in a separate domain:
let main ~domain_mgr =
Switch.top @@ fun sw ->
let test n =
traceln "sum 1..%d = %d" n
(Eio.Domain_manager.run_compute_unsafe domain_mgr
(fun () -> sum_to n))
in
Fibre.both ~sw
(fun () -> test 100000)
(fun () -> test 50000)
# Eio_main.run @@ fun env ->
main ~domain_mgr:(Eio.Stdenv.domain_mgr env)
+Starting CPU-intensive task...
+Starting CPU-intensive task...
+Finished
+sum 1..50000 = 1250025000
+Finished
+sum 1..100000 = 5000050000
- : unit = ()
Notes:
traceln
can be used safely from multiple domains. It takes a mutex, so that trace lines are output atomically.- The exact traceln output of this example is non-deterministic, because the OS is free to schedule domains as it likes.
run_compute_unsafe
is "unsafe" because you must ensure that the function doesn't have access to any non-threadsafe values. The type system does not check this.run_compute_unsafe
waits for the domain to finish, but allows other fibres to run while waiting. This is why we useFibre.both
to create multiple fibres.run_compute_unsafe
does not start an event loop in the new domain, so it cannot perform IO or create fibres. There will be a separate API for that in the future.
Design note: thread-safety
OCaml spent the first 25 years of its existence without multicore support, and so most libraries are not thread-safe. Even in languages which had parallelism from the beginning thread safety is a very common cause of bugs. Eio therefore defaults to a conservative model, in which each mutable value is owned and used by a single domain.
There are several ways to share values between domains:
- Immutable values (such as strings) can always be shared safely.
- Values specifically designed to be thread-safe (e.g. a
Mutex.t
) can be shared. - A non-threadsafe value's owning domain can update it in response to messages from other domains.
- A non-threadsafe value can be wrapped with a mutex.
- A non-threadsafe value can passed to another domain if the sending domain will never use it again.
Note that (3) and (4) are not quite the same. Consider this code:
let example q =
assert (Queue.length q = Queue.length q)
If q
is only updated by its owning domain (as in 3) then this assertion will always pass.
Queue.length
will not perform an effect which could switch fibres, and so nothing else can update q
.
If another domain wants to change it, it sends a message to q
's domain, which is added to the domain's
run-queue and will take effect later.
However, if q
is wrapped by a mutex (as in 4) then the assertion could fail.
The first Queue.length
will lock and then release the queue, then the second will lock and release it again.
Another domain could change the value between these two calls.
Design note: determinism
Within a domain, fibres are scheduled deterministically. Programs using only the eio APIs can only behave non-deterministically if given a capability to do so from somewhere else.
For example, Fibre.both f g
always starts running f
first,
and only switches to g
when f
finishes or performs an effect that can switch fibres.
Performing IO with external objects (e.g. stdout
, files or network sockets) will introduce non-determinism,
as will using multiple domains.
Note that traceln
is unusual. Although it writes (by default) to stderr, it will not switch fibres.
Instead, if the OS is not ready to receive trace output, the whole domain is paused until it is ready.
This means that adding traceln
to deterministic code will not affect its scheduling.
In particular, if you test your code by providing (deterministic) mocks then the tests will be deterministic.
An easy way to write tests is by having the mocks call traceln
and then comparing the trace output with the expected output.
See Eio's own tests for examples, e.g. tests/test_switch.md.
Further reading
Some background about the effects system can be found in:
- "Retrofitting Concurrency onto OCaml" (to appear, PLDI 2021)
- https://kcsrk.info/ocaml/multicore/2015/05/20/effects-multicore/
- Effects examples: https://github.com/ocaml-multicore/effects-examples/tree/master/aio
- Concurrent System Programming with Effect Handlers
- Asynchronous effect based IO using effect handlers