mirror of https://github.com/ocaml-multicore/eio.git synced 2025-11-27 00:04:06 -05:00

Go to file

Thomas Leonard 1b853cca44 Add Time.now and Time.sleep_until

Providers now provide `sleep_until`. `sleep` is implemented using that.
This makes things a little more precise.

2021-06-17 15:58:54 +01:00

doc

Add Time.now and Time.sleep_until

2021-06-17 15:58:54 +01:00

lib_ctf

Add switches for structured concurrency

2021-05-14 10:33:42 +01:00

lib_eio

Add Time.now and Time.sleep_until

2021-06-17 15:58:54 +01:00

lib_eio_linux

Add Time.now and Time.sleep_until

2021-06-17 15:58:54 +01:00

lib_eunix

Moved Linux implemention to new eio_linux library

2021-06-15 13:35:52 +01:00

lib_main

Moved Linux implemention to new eio_linux library

2021-06-15 13:35:52 +01:00

ocaml-uring @ f73cc8c73c

Initial FS abstraction

2021-06-17 13:05:53 +01:00

tests

Clean up the Network API a bit

2021-06-17 13:29:50 +01:00

.gitignore

initial import

2021-03-02 15:19:17 +01:00

.gitmodules

Fix error in tracing patch

2021-04-28 17:34:36 +01:00

ctf.opam

Rename repository to eio

2021-05-14 10:53:10 +01:00

dune

README improvements

2021-06-12 14:12:19 +01:00

dune-project

Moved Linux implemention to new eio_linux library

2021-06-15 13:35:52 +01:00

eio_linux.opam

Moved Linux implemention to new eio_linux library

2021-06-15 13:35:52 +01:00

eio_main.opam

Add eio_main library to select backend

2021-05-25 11:27:36 +01:00

eio.opam

Merge fibreslib into eio

2021-06-09 09:50:44 +01:00

eunix.opam

Moved Linux implemention to new eio_linux library

2021-06-15 13:35:52 +01:00

Makefile

Split promises into their own library

2021-04-14 09:21:06 +01:00

README.md

Add Time.now and Time.sleep_until

2021-06-17 15:58:54 +01:00

README.md

eio -- effects based parallel IO for OCaml

This library implements an effects-based direct-style IO stack for multicore OCaml.

The library is very much a work-in-progress, so this is an unreleased repository.

Motivation
Structure of the code
Getting started
Testing with mocks
Fibres
Tracing
Switches, errors and cancellation
Design note: results vs exceptions
Performance
Networking
Design note: object capabilities
Filesystem access
Time
Multicore support
Design note: thread-safety
Design note: determinism
Further reading

Motivation

The Unix library provided with OCaml uses blocking IO operations, and is not well suited to concurrent programs such as network services or interactive applications. For many years, the solution to this has been libraries such as Lwt and Async, which provide a monadic interface. These libraries allow writing code as if there were multiple threads of execution, each with their own stack, but the stacks are simulated using the heap.

The multicore version of OCaml adds support for "effects", removing the need for monadic code here. Using effects brings several advantages:

It is faster, because no heap allocations are needed to simulate a stack.
Concurrent code can be written in the same style as plain non-concurrent code.
Because a real stack is used, backtraces from exceptions work as expected.
Other features of the language (such as try ... with ...) can be used in concurrent code.

In addition, modern operating systems provide high-performance alternatives to the old Unix select call. For example, Linux's io-uring system has applications write the operations they want to perform to a ring buffer, which Linux handles asynchronously.

Due to this, we anticipate many OCaml users will want to rewrite their IO code at some point, once effects have been merged into the official version of OCaml. It would be very beneficial if we could use this opportunity to standardise on a single concurrency API for OCaml.

This project is therefore exploring what this new API should look like by building an effects-based IO library and then using it to create or port larger applications.

The API is expected to change a great deal over the next year or so. If you are looking for a stable library for your application, you should continue using Lwt or Async for now. However, if you'd like to help with these experiments, please get in touch!

At present, Linux with io-uring is the only backend available. It is able to run a web-server with good performance, but many features are still missing.

Structure of the code

eio provides concurrency primitives (promises, etc), and a high-level, cross-platform OS API.
eio_linux provides a Linux io-uring backend for these APIs, plus a low-level API that can be used directly (in non-portable code).
eunix provides some common code shared by multiple backends.
eio_main selects an appropriate backend (e.g. eio_linux), depending on your platform.
ctf provides tracing support.

Getting started

You will need a version of the OCaml compiler with effects. You can get one like this:

opam switch create 4.12.0+domains+effects --packages=ocaml-variants.4.12.0+domains+effects --repositories=multicore=git+https://github.com/ocaml-multicore/multicore-opam.git,default

Then you'll need to install this library (and utop if you want to try it interactively):

git clone --recursive https://github.com/ocaml-multicore/eio.git
cd eio
opam pin -yn ./ocaml-uring
opam pin -yn .
opam depext -i eio_main utop

To try out the examples interactively, run utop and require the eio_main library. It is also convenient to open the Eio.Std module:

# #require "eio_main";;
# open Eio.Std;;

This function writes a greeting to stdout:

let main ~stdout =
  Eio.Flow.copy_string "Hello, world!\n" stdout

To run it, we use Eio_main.run to run the event loop and call it from there:

# Eio_main.run @@ fun env ->
  main ~stdout:(Eio.Stdenv.stdout env);;
Hello, world!
- : unit = ()

Note that:

The env argument represents the standard environment of a Unix process, allowing it to interact with the outside world. A program will typically start by extracting from env whatever things the program will need and then calling main with them.
The type of the main function here tells us that this program only interacts via stdout.
Eio_main.run automatically calls the appropriate run function for your platform. For example, on Linux this will call Eio_linux.run. For non-portable code you can use the platform-specific library directly.

Testing with mocks

Because external resources are provided to main as arguments, we can easily replace them with mocks for testing. e.g.

# Eio_main.run @@ fun _env ->
  let buffer = Buffer.create 20 in
  main ~stdout:(Eio.Flow.buffer_sink buffer);
  traceln "Main would print %S" (Buffer.contents buffer);;
Main would print "Hello, world!\n"
- : unit = ()

traceln provides convenient printf-style debugging, without requiring you to plumb stderr through your code. It's actually using the Format module, so you can use the extended formatting directives here too.

Fibres

Here's an example running two threads of execution (fibres) concurrently:

let main _env =
  Switch.top @@ fun sw ->
  Fibre.both ~sw
    (fun () -> for x = 1 to 3 do traceln "x = %d" x; Fibre.yield ~sw () done)
    (fun () -> for y = 1 to 3 do traceln "y = %d" y; Fibre.yield ~sw () done)

# Eio_main.run main;;
x = 1
y = 1
x = 2
y = 2
x = 3
y = 3
- : unit = ()

Notes:

The two fibres run on a single core, so only one can be running at a time. Calling an operation that performs an effect (such as yield) can switch to a different thread.
The sw argument is used to handle exceptions (described later).

Tracing

The library can write traces in CTF format, showing when threads (fibres) are created, when they run, and how they interact. We can run the previous code with tracing enabled (writing to a new trace.ctf file) like this:

# let () =
    let buffer = Ctf.Unix.mmap_buffer ~size:0x100000 "trace.ctf" in
    let trace_config = Ctf.Control.make buffer in
    Ctf.Control.start trace_config;
    Eio_main.run main;
    Ctf.Control.stop trace_config;;
x = 1
y = 1
x = 2
y = 2
x = 3
y = 3

The trace can be viewed using mirage-trace-viewer. This should work even while the program is still running. The file is a ring buffer, so when it gets full old events will start to be overwritten with new ones.

This shows the two counting threads, as well as the lifetime of the sw switch. Note that the output from traceln appears in the trace as well as on the console.

Switches, errors and cancellation

A switch is used to group fibres together so that they can be cancelled or waited on together. This is a form of structured concurrency.

Here's what happens if one of the two threads above fails:

# Eio_main.run @@ fun _env ->
  Switch.top @@ fun sw ->
  Fibre.both ~sw
    (fun () -> for x = 1 to 3 do traceln "x = %d" x; Fibre.yield ~sw () done)
    (fun () -> failwith "Simulated error");;
x = 1
Error: Simulated error
- : unit = ()

What happened here was:

The first fibre ran, printed x = 1 and yielded.
The second fibre raised an exception.
Fibre.both caught the exception and turned off the switch.
The first thread's yield saw the switch was off and raised the exception there too.
Once both threads had finished, Fibre.both re-raised the exception.

Note that turning off a switch only asks the other thread(s) to cancel. A thread is free to ignore the switch and continue (perhaps to clean up some resources).

Any operation that can be cancelled should take a ~sw argument.

Switches can also be used to wait for threads even when there isn't an error. e.g.

# Eio_main.run @@ fun _env ->
  Switch.top (fun sw ->
    Fibre.fork_ignore ~sw
      (fun () -> for i = 1 to 3 do traceln "i = %d" i; Fibre.yield ~sw () done);
    traceln "First thread forked";
    Fibre.fork_ignore ~sw
      (fun () -> for j = 1 to 3 do traceln "j = %d" j; Fibre.yield ~sw () done);
    traceln "Second thread forked; top-level code is finished"
  );
  traceln "Switch is finished";;
i = 1
First thread forked
j = 1
i = 2
Second thread forked; top-level code is finished
j = 2
i = 3
j = 3
Switch is finished
- : unit = ()

Switch.top is used for top-level switches. You can also use Fibre.fork_sub_ignore to create a child sub-switch. Turning off the parent switch will also turn off the child switch, but turning off the child doesn't disable the parent.

For example, a web-server might use one switch for the whole server and then create one sub-switch for each incoming connection. This allows you to end all fibres handling a single connection by turning off that connection's switch, or to exit the whole application using the top-level switch.

Design note: results vs exceptions

The OCaml standard library uses exceptions to report errors in most cases. Many libraries instead use the result type, which has the advantage of tracking the possible errors in the type system. However, using result is slower, as it requires more allocations, and explicit code to propagate errors.

As part of the effects work, OCaml is expected to gain a typed effects extension to the type system, allowing it to track both effects and exceptions statically. In anticipation of this, the Eio library prefers to use exceptions in most cases, reserving the use of result for cases where the caller is likely to want to handle the problem immediately rather than propagate it.

Performance

As mentioned above, Eio allows you to supply your own implementations of its abstract interfaces. This is in contrast to OCaml's standard library, for example, which only operates on OS file descriptors. You might wonder what the performance impact of this is. Here's a simple implementation of cat using the standard OCaml functions:

# let () =
    let buf = Bytes.create 4096 in
    let rec copy () =
      match input stdin buf 0 4096 with
      | 0 -> ()
      | got ->
        output stdout buf 0 got;
        copy ()
    in
    copy ()

And here is the equivalent using Eio:

# let () =
    Eio_main.run @@ fun env ->
    Eio.Flow.copy
      (Eio.Stdenv.stdin env)
      (Eio.Stdenv.stdout env)

Testing on a fresh 10G file with pv on my machine gives:

$ truncate -s 10G dummy

$ cat_ocaml_unix.exe <dummy | pv >/dev/null
10.0GiB 0:00:04 [2.33GiB/s]

$ cat                <dummy | pv >/dev/null
10.0GiB 0:00:04 [2.42GiB/s]

$ cat_ocaml_eio.exe  <dummy | pv >/dev/null
10.0GiB 0:00:03 [3.01GiB/s]

Eio.Flow.copy src dst asks dst to copy from src. As dst here is a Unix file descriptor, it first calls the probe method on the src object to check whether it is too. Discovering that src is also a file descriptor, it switches to a faster code path optimised for that case. On my machine, this code path uses the Linux-specific splice system call for maximum performance.

Note that not all cases are well optimised yet, but the idea is for each backend to choose the most efficient way to implement the operation.

Networking

Eio provides a simple high-level API for networking. Here is a client that connects to address addr using network and sends a message:

let run_client ~sw ~net ~addr =
  traceln "Connecting to server...";
  let flow = Eio.Net.connect ~sw net addr in
  Eio.Flow.copy_string "Hello from client" flow;
  Eio.Flow.shutdown flow `Send

Note: The flow is attached to sw and will be closed automatically when it finishes.

Here is a server that listens on socket and handles a single connection by reading a message:

let run_server ~sw socket =
  Eio.Net.accept_sub socket ~sw (fun ~sw flow _addr ->
    traceln "Server accepted connection from client";
    let b = Buffer.create 100 in
    Eio.Flow.copy flow (Eio.Flow.buffer_sink b);
    traceln "Server received: %S" (Buffer.contents b)
  ) ~on_error:(fun ex -> traceln "Error handling connection: %s" (Printexc.to_string ex));
  traceln "(normally we'd loop and accept more connections here)"

Notes:

accept_sub handles the connection in a new fibre, with its own sub-switch.
Normally, a server would call accept_sub in a loop to handle multiple connections.
When the child switch created by accept_sub finishes, flow is closed automatically.

We can test them in a single process using Fibre.both:

let main ~net ~addr =
  Switch.top @@ fun sw ->
  let server = Eio.Net.listen net ~sw ~reuse_addr:true ~backlog:5 addr in
  traceln "Server ready...";
  Fibre.both ~sw
    (fun () -> run_server ~sw server)
    (fun () -> run_client ~sw ~net ~addr)

# Eio_main.run @@ fun env ->
  main
    ~net:(Eio.Stdenv.net env)
    ~addr:(`Tcp (Unix.inet_addr_loopback, 8080))
Server ready...
Connecting to server...
Server accepted connection from client
(normally we'd loop and accept more connections here)
Server received: "Hello from client"
- : unit = ()

Design note: object capabilities

The Eio high-level API follows the principles of the Object-capability model (ocaps). In this model, having a reference to an "object" (which could be a function or closure) grants permission to use it. The only ways to get a reference are to create a new object, or to be passed an existing reference by another object. For A to pass a reference B to another object C, A requires access (i.e. references) to both B and C. In particular, for B to get a reference to C there must be a path in the reference graph between them on which all objects allow it.

This is all just standard programming practice, really, except that it disallows patterns that break this model:

Global variables are not permitted. Otherwise, B could store itself in a global variable and C could collect it.
Modules that use C code or the OS to provide the effect of globals are also not permitted.

For example, OCaml's Unix module provides access to the network and filesystem to any code that wants it. By contrast, an Eio module that wants such access must receive it explicitly.

Consider the network example in the previous section. Imagine this is a large program and we want to know:

Does this program modify the filesystem?
Does this program send telemetry data over the network?

In an ocap language, we don't have to read the entire code-base to find the answers:

All authority starts at the (privileged) run function with the env parameter, so we must check this code.
We see that only env's network access is used, so we know this program doesn't access the filesystem, answering question 1 immediately.
To check whether telemetry is sent, we need to follow the network authority as it is passed to main.
main uses network to open a listening socket on the loopback interface, which it passes to run_server. run_server does not get the full network access, so we probably don't need to read that code (we might want to check whether we granted other parties access to this port on our loopback network).
run_client does get network, so we do need to read that. We could make that code easier to audit by passing it (fun () -> Eio.Net.connect network addr) instead of network. Then we could see that run_client could only connect to our loopback address.

Since OCaml is not an ocap language, code can ignore Eio and use the non-ocap APIs directly. Therefore, this cannot be used as a security mechanism. However, it still makes non-malicious code easier to understand and test, and may allow for an ocap extension to the language in the future. See Emily for a previous attempt at this.

Filesystem access

Access to the filesystem is also controlled by capabilities, and env provides two:

fs provides full access (just like OCaml's stdlib).
cwd restricts access to files beneath the current working directory.

For example:

let try_write_file dir path data =
  match
    Switch.top @@ fun sw ->
    let flow = Eio.Dir.open_out dir ~sw path ~create:(`Exclusive 0o600) in
    Eio.Flow.copy_string data flow
  with
  | () -> traceln "write %S -> ok" path
  | exception ex -> traceln "write %S -> %a" path Fmt.exn ex

let try_mkdir dir path =
  match Eio.Dir.mkdir dir path ~perm:0o700 with
  | () -> traceln "mkdir %S -> ok" path
  | exception ex -> traceln "mkdir %S -> %a" path Fmt.exn ex

# Eio_main.run @@ fun env ->
  let cwd = Eio.Stdenv.cwd env in
  try_mkdir cwd "dir1";
  try_mkdir cwd "../dir2";
  try_mkdir cwd "/tmp/dir3";
mkdir "dir1" -> ok
mkdir "../dir2" -> Eio.Dir.Permission_denied("..", _)
mkdir "/tmp/dir3" -> Eio.Dir.Permission_denied("/tmp", _)
- : unit = ()

The checks also apply to following symlinks:

# Unix.symlink "dir1" "link-to-dir1"; Unix.symlink "/tmp" "link-to-tmp";;
- : unit = ()

# Eio_main.run @@ fun env ->
  let cwd = Eio.Stdenv.cwd env in
  try_write_file cwd "dir1/file1" "A";
  try_write_file cwd "link-to-dir1/file2" "B";
  try_write_file cwd "link-to-tmp/file3" "C"
write "dir1/file1" -> ok
write "link-to-dir1/file2" -> ok
write "link-to-tmp/file3" -> Eio.Dir.Permission_denied("link-to-tmp/file3", _)
- : unit = ()

You can use open_dir to create a restricted capability to a sub-directory:

# Eio_main.run @@ fun env ->
  Switch.top @@ fun sw ->
  let cwd = Eio.Stdenv.cwd env in
  let dir1 = Eio.Dir.open_dir ~sw cwd "dir1" in
  try_write_file dir1 "file4" "D";
  try_write_file dir1 "../file5" "E"
write "file4" -> ok
write "../file5" -> Eio.Dir.Permission_denied("../file5", _)
- : unit = ()

Note that you only need to use open_dir if you want to create a new sandboxed environment. You can use a single directory object to access all paths beneath it, and this allows following symlinks within that sub-tree.

A program that operates on the current directory will probably want to use cwd, whereas a program that accepts a path from the user will probably want to use fs, perhaps with open_dir to constrain all access to be within that directory.

Time

The standard environment provides a clock with the usual POSIX time:

# Eio_main.run @@ fun env ->
  let clock = Eio.Stdenv.clock env in
  traceln "The time is now %f" (Eio.Time.now clock);
  Eio.Time.sleep clock 1.0;
  traceln "The time is now %f" (Eio.Time.now clock)
The time is now 1623940778.270336
The time is now 1623940779.270336
- : unit = ()

You might like to replace this clock with a mock for tests. In fact, this README does just that - see doc/prelude.ml for the fake clock used in the example above!

Multicore support

Fibres are scheduled cooperatively within a single domain, but you can also create new domains that run in parallel. This is useful to perform CPU-intensive operations quickly. For example, let's say we have a CPU intensive task:

let sum_to n =
  traceln "Starting CPU-intensive task...";
  let total = ref 0 in
  for i = 1 to n do
    total := !total + i
  done;
  traceln "Finished";
  !total

We can use Eio.Domain_manager to run this in a separate domain:

let main ~domain_mgr =
  Switch.top @@ fun sw ->
  let test n =
    traceln "sum 1..%d = %d" n
      (Eio.Domain_manager.run_compute_unsafe domain_mgr
        (fun () -> sum_to n))
  in
  Fibre.both ~sw
    (fun () -> test 100000)
    (fun () -> test 50000)

# Eio_main.run @@ fun env ->
  main ~domain_mgr:(Eio.Stdenv.domain_mgr env)
Starting CPU-intensive task...
Starting CPU-intensive task...
Finished
sum 1..50000 = 1250025000
Finished
sum 1..100000 = 5000050000
- : unit = ()

Notes:

traceln can be used safely from multiple domains. It takes a mutex, so that trace lines are output atomically.
The exact traceln output of this example is non-deterministic, because the OS is free to schedule domains as it likes.
run_compute_unsafe is "unsafe" because you must ensure that the function doesn't have access to any non-threadsafe values. The type system does not check this.
run_compute_unsafe waits for the domain to finish, but allows other fibres to run while waiting. This is why we use Fibre.both to create multiple fibres.
run_compute_unsafe does not start an event loop in the new domain, so it cannot perform IO or create fibres. There will be a separate API for that in the future.

Design note: thread-safety

OCaml spent the first 25 years of its existence without multicore support, and so most libraries are not thread-safe. Even in languages which had parallelism from the beginning thread safety is a very common cause of bugs. Eio therefore defaults to a conservative model, in which each mutable value is owned and used by a single domain.

There are several ways to share values between domains:

Immutable values (such as strings) can always be shared safely.
Values specifically designed to be thread-safe (e.g. a Mutex.t) can be shared.
A non-threadsafe value's owning domain can update it in response to messages from other domains.
A non-threadsafe value can be wrapped with a mutex.
A non-threadsafe value can passed to another domain if the sending domain will never use it again.

Note that (3) and (4) are not quite the same. Consider this code:

let example q =
  assert (Queue.length q = Queue.length q)

If q is only updated by its owning domain (as in 3) then this assertion will always pass. Queue.length will not perform an effect which could switch fibres, and so nothing else can update q. If another domain wants to change it, it sends a message to q's domain, which is added to the domain's run-queue and will take effect later.

However, if q is wrapped by a mutex (as in 4) then the assertion could fail. The first Queue.length will lock and then release the queue, then the second will lock and release it again. Another domain could change the value between these two calls.

Design note: determinism

Within a domain, fibres are scheduled deterministically. Programs using only the eio APIs can only behave non-deterministically if given a capability to do so from somewhere else.

For example, Fibre.both f g always starts running f first, and only switches to g when f finishes or performs an effect that can switch fibres.

Performing IO with external objects (e.g. stdout, files or network sockets) will introduce non-determinism, as will using multiple domains.

Note that traceln is unusual. Although it writes (by default) to stderr, it will not switch fibres. Instead, if the OS is not ready to receive trace output, the whole domain is paused until it is ready. This means that adding traceln to deterministic code will not affect its scheduling.

README.md

eio -- effects based parallel IO for OCaml

Contents

Motivation

Structure of the code

Getting started

Testing with mocks

Fibres

Tracing

Switches, errors and cancellation

Design note: results vs exceptions

Performance

Networking

Design note: object capabilities

Filesystem access

Time

Multicore support

Design note: thread-safety

Design note: determinism

Further reading