Data Transformation Template:

Extract Valuable Information by Parsing PDF Files

Understand data in your PDF files better by parsing, extracting, and importing data into individual datasets

Wrangling PDF Flow The flow view of this template

Data Sources:
PDF files

This template allows you to see how you can wrangle and parse PDF files.. Right click the data source in this template to see various parsing options available. You can extract all the tables in the PDF document into a single dataset or have each table be imported in as a separate dataset. Once you understand these options, feel free to replace the example data with your own PDF files.

For more information, please read this detailed documentation guide.

New user?

If your data is mostly on Google Cloud Platform, please use Dataprep. Otherwise, choose Designer Cloud.

Use in Designer Cloud Use in Dataprep