This article will cover how to use regular expressions (RegEx) to match and extract data.
1. Add RegEx Extract in Your Dataflow
2. RegEx Extract Options
Add RegEx Extract to any node in your Dataflow. Select any node and use the Transform (+) button to open the list of options to select RegEx Extract. You could work with the node in the Dataflow tab or use the Properties panel.
RegEx Extract node has four fields to help extract values from structured data. Read along to see how we extract the first five digits from sample data.
The first dropdown lets you choose which data field you want to change.
The Match field is a textbox used to type a regular expression to find data with specific criteria.
Note: A column can only be used once (in a single operation row). Click on the Extract Another button to add a new RegEx condition.
The third field lets you choose from two RegEx flags.
Note: You may select either Global, Case Insensitive, or both.
The third option lets you type in a custom name for your extracted data.
Note: A new colum will be create for each regular expression.
Here's what the sample data looks like after our regular expression.