@tallison Do you know of extractous? It uses GraalVM to compile Tika natively, and then exposes it as a Rust crate.

https://github.com/yobix-ai/extractous

Makes me wonder whether anyone's used wasm to use Tika in the browser...

GitHub - yobix-ai/extractous: Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages. - yobix-ai/extractous

GitHub
@codeslack Oh, my head hurts.Thank you for this!
@tallison This could potentially open up new use cases where the JVM acts as a barrier... though I wonder what the executable sizes are like.