Excel for Data Science

Introduction

Microsoft Excel is one of the most important and widely used tool in data science for quick data analysis. Even if we have other tools to handle big datasets, Excel always comes handy in day to day analytics projects for quick EDA or even making exhaustive reports.

So, we will we covering the most used Excel features by Data Scientists or Data Analysts in the analytics domain.

Table of Contents

  • Basic Excel Functions
  • VLook Up
  • Pivot Table
  • Basic Data Analysis (with Example DataSet)
  • Data Science Excel Case Studies

Basic Excel Functions

Article: Check out 15 most useful Functions for Excel Data Analysis Link

Video: Excel Beginners – Complete Crash Course in 30 minutes

VLOOK UP

Similarly, HLOOK UP can be used for fetch data from rows.

Pivot Table

Part 1

P.S. Ignore the where PivotPal is being used.

Part 2:

Part 3:

Data Analysis (with Example DataSet)

The below video will cover the following Excel features:

  1. Using Tables
  2. Formulas
  3. Pivot Tables & Power Pivot measures
  4. Conditional formatting
  5. Charts
  6. Data Validation
  7. Keyboard Shortcuts & tricks

Bonus Video: 20 Excel Tricks for Data Analysis | Excel Tutorials

Data Science Excel Case Study

Suppose you have to complete the first cut version of Data Science Project on Excel. Let’s work on few case studies below and do full end-to-end data analysis work using Excel.

Case Study 1:

The client is a Motorcycle Shop Chain based in India. The client hired KeytoDataScience to run make a dashboard for their business team to understand their sales and ratings over the years. The below dataset contains yearly sales data for accessories, bikes, clothing and bike components.

Task:

Provide an overview of the motorcycle’s sales by the following attributes:

  • Total yearly sales across all categories
  • Year-on-year total sales and average rating
  • Year-on-year change in sales and average rating

Output:

Create an Excel Dashboard on showing the overview of the motorcycles sales and ratings.

Motorcycle Shop Sales Excel Dashboard

Please include a section on highlighting the insights you can generate by observing the dashboard.

Case Study 2:

The client is a leading Fashion retailer in Australia. The client hired KeytoDataScience to run a display advertising campaign for their brand, and shows ads to users leading them to make a purchase on the brand’s website. The given dataset is the Sales data for all users who made a purchase online in the first half of October ’21.

Task:

Provide an overview of the brand’s sales by the following attributes:

  • Overall – Total Sales and Revenue in the given time period
  • Basket – Avg. unique quantity, revenue per order
  • Attributes – Time of Day, Day of Week, Geography, Payment Type, Gender
  • Frequency – How many are single/multiple purchasers? What is the frequency of multi-purchase? Any typical attributes?

Output:

Create a Excel Dashboard/Report on your findings covering the overview of the brand’s sales performance.

Please include a section on how would you broadly go about targeting more users for this brand. Come up with a well-defined media strategy.

Data Science Life Cycle

SQL for Data Science

Leave a Comment

Keytodatascience Logo

Connect

Subscribe

Join our email list to receive the latest updates.

© 2022 KeyToDataScience