The CRISP-DM Methodology

Data Understanding

An initial collection of raw data is undertaken resulting in a series of reports detailing information such as: data sources, nature of data (incl. integer, float, structured/unstructured), correlations and data quality issues. Some initial analysis and data visualisation may take place to discover useful insights and relevant data subsets.