#shows requested cpu memory for array jobs alongside requested cpu memory for non array jobs for easy comparison.
CPU_arraytask_fig = sns.distplot(CPU_arraytask['ReqMemCPU'], kde=False, label='CPU Array Task', color = "green")
CPU_arraytask_fig.set_yscale('log')
CPU_nonarraytask_fig = sns.distplot(CPU_nonarraytask['ReqMemCPU'], kde=False, label='CPU Non Array Task')
CPU_nonarraytask_fig.set_yscale('log')
plt.legend(prop={'size': 12})
plt.title('User Request of CPU Memory for Array Job vs Not Array Job')
plt.xlabel('Requested Gigs')
plt.ylabel('Amount of Users Requesting')
```
%% Cell type:code id: tags:
```
#shows requested node memory for array jobs alongside requested node memory for non array jobs for easy comparison.
Node_arraytask_fig = sns.distplot(Node_arraytask['ReqMemCPU'], kde=False, label='Node Array Task', color = "green")
Node_arraytask_fig.set_yscale('log')
Node_nonarraytask_fig = sns.distplot(Node_nonarraytask['ReqMemNode'], kde=False, label='Node Non Array Task')
Node_nonarraytask_fig.set_yscale('log')
plt.legend(prop={'size': 12})
plt.title('User Request of Node Memory for Array Job vs Not Array Job')
plt.xlabel('Requested Gigs')
plt.ylabel('Amount of Users Requesting')
```
%% Cell type:markdown id: tags:
# # These are Plotly Express Graphs of the some of the above Seaborn graphs. Run them only if you need more details about the data in the graph. They will make your notebook run slower.
# These are Plotly Express Graphs of the some of the above Seaborn graphs. Run them only if you need more details about the data in the graph. They will make your notebook run slower.
%% Cell type:markdown id: tags:
Graphs: User Request of CPU Memory for all Jobs
User Request of CPU Memory for Array Job
User Request of CPU Memory for Non Array Job
User Request of Node Memory for all Jobs
User Request of Node Memory for Array Job
User Request of Node Memory for Non Array Job
Graphs: > User Request of CPU Memory for all Jobs
> User Request of CPU Memory for Array Job
> User Request of CPU Memory for Non Array Job
> User Request of Node Memory for all Jobs
> User Request of Node Memory for Array Job
> User Request of Node Memory for Non Array Job
These graphs create histograms using the data for the month of March 2020
They use cutoff cpu and node memory declared in CPU_cutoff and Node_cutoff - 5 gigs.
Can also show box or violin graph above to show where min, max, median, and 3rd quartile is.
%% Cell type:code id: tags:
```
CPU_fig = px.histogram(CPU_cutoff, x="ReqMemCPU",
title='User Request of CPU Memory for all Jobs',
labels={'ReqMemCPU':'ReqMemCPU'}, # can specify one label per df column
opacity=0.8,
log_y=True, # represent bars with log scale
marginal="box", # can be `box`, `violin`
hover_data=CPU_cutoff.columns,
nbins=30,
color_discrete_sequence=['goldenrod'] # color of histogram bars