#OpenDataLoader PDF: #opensource #PDF parser for #AI & #accessibility. #1 in benchmarks (0.90 overall), runs 100% locally, zero cloud, no GPU needed. #RAG #LLM #Python #NodeJS #Java
Extracts #Markdown, #JSON (with bounding boxes) & #HTML from any PDF: correct reading order, heading hierarchy, list & image detection with the XY-Cut++ algorithm
🧵 #pdf
