Data Transformation Template:

Filter Required Data Based on Values from Another Dataset

Conditional filtering of data Flow The flow view of this template

array functions (arrayintersect, arraylen), extractlist, list, join, filtering (keep)

This template shows how you can filter your data based on values found in another reference dataset. It makes use of array functions such as arrayintersect after extracting all the values to look for in the reference dataset into an array via the list function.

To customize this template for your own use case, supply your own reference dataset in place of categories.txt and modify the Find values and filter recipe to accommodate the number of columns in your source data for the filtering.

New user?

If your data is mostly on Google Cloud Platform, please use Dataprep. Otherwise, choose Designer Cloud.

Use in Designer Cloud Use in Dataprep