Thinking about moving more of the 100baseT1 decode pipeline to the GPU. This is definitely going to end up being one of the more heavily end to end accelerated protocol decodes in the library, at least for now
Ok yeah it's definitely going to happen and i have a plan.
After some data shuffling that currently happens on the CPU but will probably move to GPU long term, I'll run one GPU thread per detected packet (packet start search already happens on GPU) and decode the rest of the packet out to timestamps and data bytes.