The Pervasive DataRush Profiler™ module helps you profile your data based on a set of quality metrics that you control. Forget time-consuming, error-prone manual checking for data assurance.
There’s a lot riding on your data. Make sure it’s accurate.
Figure: DataRush Product Architecture (High Performance Data-intensive Application)
Capabilities
- Intuitive API for specifying the set of pre-defined and user-defined metrics to execute on a data source.
- Splits input data into clean and dirty data streams according to the configured metrics.
- Configurable outputs, including an object model, embedded database, XML, and PDF.
- Extensive set of quality metrics such as field comparison, is blank, is null, is value contained in lookup, and a regex matcher.
- Statistical metrics such as min, max, mode, median, standard deviation, and variance.
- Data discovery metrics such as equal-range binning with outlier handling, most frequent values, distinct values, data ranges, and quantiles.
- Extend with user-defined metrics written in an easy-to-use scripting language.
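To make the capabilities above concrete, here is a minimal Python sketch of the general technique: quality metrics that split records into clean and dirty streams, followed by simple statistics and equal-range binning with outlier counts. It is an illustration only and is not the DataRush Profiler API. Every name in it (`not_null`, `matches`, `in_lookup`, `split_clean_dirty`, `equal_range_bins`) and the sample records are hypothetical.

```python
import re
import statistics
from typing import Callable, Dict, List, Optional, Set, Tuple

# Hypothetical types for illustration; not part of any DataRush API.
Record = Dict[str, Optional[str]]
Metric = Callable[[Record], bool]  # returns True when the record passes


def not_null(field: str) -> Metric:
    """Quality metric: the field must be present and not None."""
    return lambda rec: rec.get(field) is not None


def matches(field: str, pattern: str) -> Metric:
    """Quality metric: the whole field value must match the regex."""
    compiled = re.compile(pattern)
    return lambda rec: compiled.fullmatch(rec.get(field) or "") is not None


def in_lookup(field: str, allowed: Set[str]) -> Metric:
    """Quality metric: the field value must appear in a lookup set."""
    return lambda rec: rec.get(field) in allowed


def split_clean_dirty(
    records: List[Record], metrics: List[Metric]
) -> Tuple[List[Record], List[Record]]:
    """Route each record to 'clean' if it passes every metric, else 'dirty'."""
    clean: List[Record] = []
    dirty: List[Record] = []
    for rec in records:
        (clean if all(m(rec) for m in metrics) else dirty).append(rec)
    return clean, dirty


def equal_range_bins(
    values: List[float], bin_count: int, low: float, high: float
) -> Tuple[int, List[int], int]:
    """Count values in equal-width bins over [low, high).

    Values outside the range are counted as outliers (below, above)
    rather than being forced into the first or last bin.
    """
    width = (high - low) / bin_count
    bins = [0] * bin_count
    below = above = 0
    for v in values:
        if v < low:
            below += 1
        elif v >= high:
            above += 1
        else:
            # Clamp to guard against floating-point rounding at the top edge.
            bins[min(int((v - low) / width), bin_count - 1)] += 1
    return below, bins, above


# Made-up sample data.
records: List[Record] = [
    {"id": "1", "state": "TX", "age": "34"},
    {"id": "2", "state": "ZZ", "age": "41"},   # state not in lookup
    {"id": "3", "state": "CA", "age": None},   # null age
    {"id": "4", "state": "NY", "age": "29"},
    {"id": "5", "state": "TX", "age": "abc"},  # fails the regex
    {"id": "6", "state": "CA", "age": "250"},  # clean, but a binning outlier
]

metrics: List[Metric] = [
    not_null("age"),
    matches("age", r"\d+"),
    in_lookup("state", {"TX", "CA", "NY"}),
]

clean, dirty = split_clean_dirty(records, metrics)

# Statistical and discovery metrics computed on the clean stream only.
ages = [int(rec["age"]) for rec in clean]
print("clean ids:", [rec["id"] for rec in clean])
print("dirty ids:", [rec["id"] for rec in dirty])
print("min/max/median:", min(ages), max(ages), statistics.median(ages))
print("bins (below, counts, above):", equal_range_bins(ages, 4, 0, 120))
```

In this sketch a user-defined metric is just another function that takes a record and returns True or False, so it can be appended to the `metrics` list alongside the built-in ones.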