Tools

Currently, the proposed tools for profiling are open source tools, SQL scripts or existing the enterprise data governance / data management tools:

  • Open Source Tools:

Pandas Profiling: a Python package that generates interactive data profiling reports from pandas DataFrame. It provides a quick overview of data by generating descriptive statistics.

  • SQL Scripts:

SQL scripts can be used to execute standard data profiling functions.

  • Data Management Tools:

Some of the existing the enterprise data management / data governance tools offer functionalities that can be used to execute profiling and report results.