Load the Superstore dataset and libraries needed.
A summary table or boxplot can be useful to visualize the distribution of key variables. Generate a boxplot to visualize Sales and another to visualizr Profit
Create a bar plot to show the top 10 orders with highest value of sales.
Use a heatmap to visualize the pattern of missing data.
# Install and load the VIM package if not already installed
library(VIM)
# Visualize missing data pattern
missing_pattern <- aggr(dataset, col=c('navyblue','red'), numbers=TRUE, sortVars=TRUE, labels=names(dataset), cex.axis=.7, gap=3, ylab=c("Missing data","Pattern"))
Generate a histogram that can show the distribution of the Profit column.
Generate a bar plot that shows Total Sales by Category and a barplot that shows Profit by Category.
- Submitted notebook (or file) with your responses to each of the exercises.
- Upon completion, add your deliverables to git.
- Then commit git and push your branch to the remote.
- Make a pull request and paste the PR link in the submission field in the Student Portal.
Good luck!