Mastodawn

Oh! There's one more feature that was planned for, and is nearly ready: support for Propeller-like multitasking. The processor's internal state can be cleanly separated from the logic and rotated at instruction boundaries, at a cost of zero clocks (but a pipeline flush). This way, to have multiple functional processor cores, you'd only need logic blocks for the register files (+ a few other tidbits), while the control and arithmetic subsystems would be reused.

The reason for doing it this way is that it effectively gives you the benefits of an RTOS without the cost of actually implementing a software kernel (or the instructions needed to support a modern kernel). It hard-limits the number of parallel tasks, which modern software (RT)OSes tend to frown upon, but the limit is a design-time parameter with no fixed upper bound, so it actually works quite well in gateware contexts.

The catch is, the instruction pipeline is (currently) not included in the internal state, so it will need to be fully flushed at every task switch. This can probably be improved on, but then again, the constraints of this design didn't call for aggressive pipelining, anyway. Plus, a pipeline flush is still cheaper than a software-controlled context switch.

I did some work on assigning tasks timeslices at different frequencies, but that subsystem is kind of messy and incomplete. What I can release without significant preparation work would likely be a simple task ring, with a wee bit of optimisation for skipping over a halted (= waiting for interrupts) task. As I said, it's basically Propeller-like task sequencing.
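
To make the task ring idea concrete, here is a minimal behavioural sketch in Python. It is not the actual gateware, just a software model of the sequencing described above. The names `TaskState`, `next_task` and `run_ring`, the 32-register layout, and the one-slot-per-task granularity are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TaskState:
    """Per-task state that would be rotated at a task switch (hypothetical layout)."""
    pc: int = 0
    regs: List[int] = field(default_factory=lambda: [0] * 32)
    halted: bool = False  # True while the task is waiting for an interrupt


def next_task(tasks: List[TaskState], current: int) -> Optional[int]:
    """Return the next runnable task after `current` in ring order.

    Halted tasks are skipped. If `current` is the only runnable task it is
    returned again; if every task is halted, None is returned.
    """
    n = len(tasks)
    for step in range(1, n + 1):
        candidate = (current + step) % n
        if not tasks[candidate].halted:
            return candidate
    return None


def run_ring(tasks: List[TaskState], start: int, slots: int) -> List[int]:
    """Record which task owns each of the next `slots` switch points."""
    order: List[int] = []
    current = start
    for _ in range(slots):
        nxt = next_task(tasks, current)
        if nxt is None:
            break  # everything is halted; nothing to schedule
        order.append(nxt)
        current = nxt
    return order
```

With four tasks and task 1 halted, `run_ring(tasks, 0, 5)` visits the ring in order and simply steps over the halted slot. In hardware the same selection would presumably be a small priority/rotate structure rather than a loop.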

EDIT: For clarity, specific interrupts can be tied to specific tasks. The intent was to allow combining several apparent MicroBlaze-compatible cores within a design to bring down the logic block count, so the whole thing is supposed to behave approximately like multiple independent CPU cores sitting on common buses.
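
As a rough illustration of tying interrupts to tasks, the sketch below models a static routing table in Python. Everything here is hypothetical: the `irq_owner` mapping, the `deliver_interrupt` name, and the assumption that an incoming interrupt both queues for its owning task and wakes it from the halted state.

```python
from typing import Dict, List


def deliver_interrupt(
    irq_owner: Dict[int, int],
    halted: List[bool],
    pending: List[List[int]],
    irq: int,
) -> int:
    """Route `irq` to the task that owns it and wake that task.

    irq_owner maps an interrupt line to a task index (assumed fixed at
    design time). halted and pending are indexed by task. Returns the
    index of the task that received the interrupt.
    """
    task = irq_owner[irq]
    pending[task].append(irq)  # only the owning task ever sees this line
    halted[task] = False       # a halted task becomes runnable again
    return task
```

From the point of view of each task this looks like a private interrupt input, which is what you would expect from a standalone core on a shared bus.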