metachris/pdfx

Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.

1,072
PythonApache License 2.0
Stars

1,072

Updated

Nov 23, 2025

Stars Over Time

Top Contributors

Related Repositories

Track developers from metachris/pdfx

Join 1,000+ companies finding quality developer leads