Sel4 unikernel vs multi-kernel vs SMP on genode/sculpt

fdelizy · May 21, 2026, 5:05pm

Hi,

I have been spending some time to investigate why sculpt-os+sel4 was rather slow on ARM IMX8MP, and aside from finding some stability issues that I am still working on, I also found the default config for seL4 is actually mono-core, (non-SMP).

kinda filling silly for not checking that earlier TBH ^^;

Reviewing a few threads and papers, I can understand the SMP version isn’t on a path to verification, so choosing unikernel over SMP makes some sense from a security standpoint, yet, the multi-kernel configuration could be an option. It would require a bit more tooling I am sure, but I was wondering, has genode team had a look at the multi-kernel configuration of seL4 for integration?

Right now, mono-thread payloads are quite a bit faster on sel4 compared to hw, but obviously, hw beats sel4 on SMP (since SMP is not activated ^^; )

any thougths?

thanks!

skalk · May 22, 2026, 9:58am

The short answer is: no.

I would be careful with too general assumptions about the runtime performance in between two different underlying kernels in Genode.
When looking at different workloads of our nightly tests, I can see diverse results regarding time needed to succeed in between both mentioned kernels. There are plenty reasons not necessarily directly related to the kernel itself that are responsible therefore. For instance, the initial costs being spent by a client to establish a session are generic, but the actual costs differ on the kernels (what needs to be paid for kernel-resource management). Therefore, it happens that the initial session establishment can be very costly on one kernel, where the client has to try this several times (each time delivering more ram/caps) in a loop. If you have a scenario centered on a lot of dynamic regarding session opening/closing, you get quite different result than maybe I/O heavy loads using shared memory and signaling, and so on. In our regular tests I can see for instance a slightly faster success of the “nic_router_stress” test (artificial network test with no real I/O) on sel4, but regarding the “bomb” test (creating and destroying tons of components) it performance collapses in constrast to base-hw. But even here I would be cautious to assume too much from this observation. One would need to first analyze what is actually bottleneck in which situation. Sometimes it might be simply a configuration issue.

fdelizy · May 22, 2026, 10:06am

True. That being said, my observations are based on a few benchmarks I created to avoid that bias, I am looking at the best way to share those results in a meaningful fashion (but I digress). You mention you have regular tests, could you point me at what are those? I would be interested to learn from them and maybe add that to my benchmarks as well

Either way, I need to study a bit more the base-sel4 layer (and base-hw layer for that matter) to see if the multi-kernel integration would be possible or even make sense then. That will be a low priority for this project though, so it might come rather late ^^;

Thanks for the feedback

alex-ab · May 22, 2026, 10:52am

What I want to add: for x86 we enabled SMP support, but not for any seL4 ARM platform, simply there was no need to do so.

fdelizy · May 22, 2026, 10:59am

got it, I might look into that as well, but since the SMP version is not verfiable according to seL4 community, I might just activate and fix or go for the MultiKernel directly (still trying to understand the limitations).

skalk · May 22, 2026, 11:04am

The list of tests we run nightly can be found here:

The actual results (logs) are not publicly available, but are part of our internal test-suite.
One of the potentially interesting run-scripts is for instance depot_autopilot.run, which itself contains a bunch of independent ~90 tests that are not dependent on additional peripherals.

fdelizy · June 25, 2026, 4:21pm

a followup question, do you also publish a list of platforms that are subjected to those tests? does this include qemu versions or only actual hardware? or both?

I am interested to see what I should validate my code against to ensure stability too

skalk · June 29, 2026, 9:59am

In general, no we don’t publish that data, because nobody had the time to sanity the test-data from infrastructural sensitive information yet.

The following table shows which kernels are tested on which hardware/emulator currently:

Hardware	hw	nova	sel4	Fiasco.OC
Raspberry Pi 1 (arm_v6)	yes	no	no	no
i.MX53 QSB	yes	no	no	no
i.MX6Q Sabrelite	yes	no	yes	yes
i.MX7D Sabre	yes	no	no	yes
Qemu PBXA9	yes	no	no	yes
Qemu Zynq	yes	no	no	no
Zynq QRSP 31x	yes	no	no	no
iMX8MQ EVK	yes	no	yes	no
Qemu Raspberry 3	yes	no	no	yes
Qemu RiscV	yes	no	no	no
PC (32-bit)	no	yes	no	yes
PC (64-bit)	yes	yes	yes	yes
Qemu PC (32-bit)	no	yes	no	yes
Qemu PC (64-bit)	yes	yes	yes	yes

I did not listed Linux, Pistachio, OKL4, and old Fiasco as kernels, which are tested on Qemu and PC hardware as well.

fdelizy · June 29, 2026, 2:41pm

Thanks a lot for sharing!