About data profiling

To profile data in a data source (for example, a database or a file) means to examine that data and collect information about it. The purpose of the examination can be, for example, to determine whether the data is accurate and complete, or whether it can be used for business analysis. The information collected during data profiling covers data type, structure, content, relationships, and so on.
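As a rough illustration of what collecting this kind of information can look like, the following Python sketch profiles a small in-memory CSV sample. It is not how Data Preparation performs profiling. The function names (`infer_type`, `profile`), the sample data, and the assumed `YYYY-MM-DD` date format are all hypothetical and chosen only for this example.

```python
import csv
import io
from datetime import datetime


def infer_type(value: str) -> str:
    """Classify one non-empty string as integer, decimal, date, or text."""
    try:
        int(value)
        return "integer"
    except ValueError:
        pass
    try:
        float(value)
        return "decimal"
    except ValueError:
        pass
    try:
        datetime.strptime(value, "%Y-%m-%d")  # assumed date format
        return "date"
    except ValueError:
        return "text"


def profile(rows: list[dict[str, str]]) -> dict[str, dict]:
    """Collect per-column type, completeness, and distinct-value count."""
    result = {}
    columns = list(rows[0].keys()) if rows else []
    for col in columns:
        # Treat missing or whitespace-only cells as empty.
        values = [(r[col] or "").strip() for r in rows]
        filled = [v for v in values if v]
        types = {infer_type(v) for v in filled}
        if not types:
            col_type = "empty"
        elif len(types) == 1:
            col_type = types.pop()
        else:
            col_type = "mixed"
        result[col] = {
            "type": col_type,
            "completeness": len(filled) / len(values),
            "distinct": len(set(filled)),
        }
    return result


# Hypothetical sample data; the second row has no region.
SAMPLE = """id,amount,order_date,region
1,10.5,2024-01-03,North
2,7.25,2024-01-04,
3,3.0,2024-01-04,South
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
report = profile(rows)
for column, stats in report.items():
    print(column, stats)
```

In this sketch, a completeness value below 1.0 (as for the `region` column) flags a column with missing values, and a `mixed` type flags a column whose values do not all share one format. Both are the kind of findings that help decide whether data is accurate and complete enough for analysis.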

When you upload data sources to Data Preparation, the data is automatically profiled. To get the best results from automatic data profiling, you can prepare your data beforehand (for example, set up the date formats or exclude footer rows). For details, see Preparing data files.

The automatic profiling system identifies the following:

For files, automatic data profiling may require manual adjustments. For details, see Change the data role and Define joins between data sources.