metachris/pdfx
Extract text, metadata, and references (PDF, URL, DOI, arXiv) from a PDF. Optionally download all referenced PDFs.
Language: Python
License: Apache License 2.0
Stars: 1,072
Updated: Nov 23, 2025