PDF To CSV
Tabula
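import tabula
# Read tables from the PDF; tabula-py 2.0+ returns a list of DataFrames and reads only page 1 by default
dfs = tabula.read_pdf("file.pdf")
# Convert the tables in the PDF directly to a CSV file
tabula.convert_into("file.pdf", "output.csv", output_format="csv")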
Dealing with multi-line tables
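# lattice=True extracts tables that have ruling lines between cells (older tabula-py releases called this option spreadsheet=True)
tabula.read_pdf("BC Civil Forefeiture 2014-15 Grant Recipients.pdf", lattice=True, pages='all')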
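
If wrapped cells still come out as separate rows after extraction, one possible clean-up is to fold each continuation row into the row above it. The sketch below is only an illustration and works on CSV text using the standard library. It assumes that a continuation row can be recognised by an empty first column, which may not hold for every PDF. The function name `merge_continuation_rows`, the `key_col` parameter, and the sample data are all hypothetical and are not part of tabula.

```python
import csv
import io


def merge_continuation_rows(rows, key_col=0):
    """Fold rows whose key column is empty into the row above them.

    Assumption: a row with an empty key column is the wrapped
    remainder of the previous row, not a record of its own.
    """
    merged = []
    for row in rows:
        is_continuation = bool(merged) and not row[key_col].strip()
        if is_continuation:
            previous = merged[-1]
            for i, cell in enumerate(row):
                text = cell.strip()
                # Skip empty cells and any cells beyond the previous row's width.
                if text and i < len(previous):
                    # Join the wrapped text with a single space.
                    previous[i] = (previous[i] + " " + text).strip()
        else:
            merged.append([cell.strip() for cell in row])
    return merged


# Made-up CSV text resembling an extracted table with one wrapped cell.
raw = (
    "Recipient,Project,Amount\n"
    "Example Society,Youth outreach,1000\n"
    ",and mentoring,\n"
    "Sample Association,Safety workshop,2500\n"
)

rows = list(csv.reader(io.StringIO(raw)))
cleaned = merge_continuation_rows(rows)
for row in cleaned:
    print(row)
```

The same idea could be applied to a file written by `tabula.convert_into` by reading it with `csv.reader` instead of the in-memory string used here.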