Member-only story

The Data Science Method (DSM) -Exploratory Data Analysis

Aiden V Johnson
6 min readFeb 11, 2019

--

This is the third article in a series about how to take your data science projects to the next level by using a methodological approach similar to the scientific method coined the Data Science Method. This article is focused on the number of step three Exploratory Data Analysis. If you missed the previous article(s) in this series, you can go to the beginning here, or click on each step title below to read a specific step in the process.

The Data Science Method

  1. Problem Identification
  2. Data Wrangling
  3. Exploratory Data Analysis
  4. Pre-processing and Training Data Development
  5. Modeling
  6. Documentation
Camp 3 EDA

EXPLORATORY DATA ANALYSIS (EDA)

Step number three in the Data Science Method (DSM) assumes that both steps one and two have already been completed. At this point in your data science project, you have a well-structured and defined hypothesis or problem description. The model development data set is up and ready to be explored, and your early data cleaning steps are already completed. At a minimum, you have one column per variable and have a clear understanding of your response variable.

Based on step two in the DSM you have already reviewed the following items about each variable in your data:

  1. Column Name
  2. Data Type (numeric, categorical, timestamp, etc)
  3. Description of Column
  4. Count or percent per unique values or codes (including NA)
  5. The range of values or codes

There are many sub-steps in a proper exploratory data analysis (EDA) workflow. Depending on your familiarity with your data and the complexity of the data and the problem you are solving the scale of the EDA necessary may change. Generally, the exploratory analysis workflow can be broken down into four critical steps:

  1. Build data profile tables and plots
  2. Explore data relationships
  3. Identification and creation of features

1. DATA PROFILES — PLOTS AND TABLES

--

--

Aiden V Johnson
Aiden V Johnson

Responses (1)

Write a response