Stop wasting your time with EDA

Work smarter with dataprep.ai and get to training faster

Dawn Moyer
4 min readJun 13, 2022
Photo by Danny Lines on Unsplash

Please stop running your Exploratory Data Analysis (EDA)…one…notebook…cell…at…a…time. You are wasting your time when comprehensive tools are available. dataprep.ai is one of those tools.

This past week, I was able to iterate through multiple dataset transformations and review the changes blazing fast using dataprep.eda. The value of the tool jumped out at me so much that I sent a quick tutorial to my team. Most folks still run just run the same EDA code snippets for each project out of habit. Time to shake it up, expand into new calculations, and complete EDA faster.

I will review the code, and the interactive report and show how the report can be used to identify areas of concern for Kaggle’s Spaceship Titanic competition.

  1. The code
  2. The report
  3. The Usage

The code is SIMPLE!

This article focuses on the EDA functions within the dataprep python package. There is a load_dataset class but I prefer to use pandas to read in the csv file. This is as simple as import, read, create_report, save/show. That’s it.

--

--

Dawn Moyer

Data Enthusiast, fallible human. A data scientist with a background in both psychology and IT. https://www.linkedin.com/in/dawn-e-moyer/