21
Bar Chart Race Python Package Official Release
I’m excited to announce the official release of bar_chart_race, a python package for creating bar chart races. In this post, I'll cover many of the major available options. Navigate to the official documentation for all of the options.
Bar chart races have become very popular over the last year and no python package existed to create them. I built some for my coronavirus dashboard.
This post is available as a tutorial on YouTube
Install with:
pip install bar\_chart\_race
This is the first major release — version 0.1.0
>>> import bar\_chart\_race as bcr
>>> bcr.\_\_version\_\_
'0.1.0'
In order to use bar_chart_race, your data must be in ‘wide’ pandas DataFrame where:
- Every row represents a single period of time
- Each column holds the value for a particular category
- The index contains the time component (optional)
A few sample datasets are available to download via the load_dataset function. The covid19_tutorial dataset contains the total deaths due to COVID-19 of selected countries over a period of 10 days.
df = bcr.load\_dataset('covid19\_tutorial')
df
Once you have data in the correct format, you can pass it directly to bar_chart_race.
bcr.bar\_chart\_race(df)
By default, bars are horizontal, but can be made vertical with the orientation parameter.
bcr.bar\_chart\_race(df, orientation='v')
Set sort to 'asc' to change the order of the bars.
bcr.bar\_chart\_race(df, sort='asc')
Limit the number of bars plotted with n_bars.
bcr.bar\_chart\_race(df, n\_bars=6)
Fix the order of the bars for the duration of the animation by setting fixed_order to a list.
bcr.bar\_chart\_race(df, fixed\_order=['Iran', 'USA', 'Italy',
'Spain', 'Belgium'])
Fix the maximum value for the entire duration of the animation.
bcr.bar\_chart\_race(df, fixed\_max=True)
By default, 10 frames are used per time period with the entire period lasting 500 milliseconds (half of a second). Both of these are changed below.
bcr.bar\_chart\_race(df, steps\_per\_period=20, period\_length=200)
Linearly interpolate the period label.
bcr.bar\_chart\_race(df, interpolate\_period=True)
bar_chart_race uses matplotlib for all of the underlying plotting. Many properties can be set by using parameters common to matplotlib.
bcr.bar\_chart\_race(df,
figsize=(5, 3),
dpi=100,
label\_bars=False,
period\_label={'x': .99, 'y': .1, 'ha': 'right', 'color': 'red'},
title='COVID-19 Deaths by Country')
Bar properties can also be set.
bcr.bar\_chart\_race(df, bar\_kwargs={'alpha': .2, 'ec': 'black', 'lw': 3})
The period label can be formatted with date directives or new-style string formatting.
bcr.bar\_chart\_race(df, period\_fmt='%b %-d, %Y')
Add a custom label to the plot that summarizes the current time period.
def summary(values, ranks):
total\_deaths = int(round(values.sum(), -2))
s = f'Total Deaths - {total\_deaths:,.0f}'
return {'x': .99, 'y': .05, 's': s, 'ha': 'right', 'size': 8}
bcr.bar\_chart\_race(df, period\_summary\_func=summary)
A single perpendicular bar can be added to summarize each period as well.
def func(values, ranks):
return values.quantile(.9)
bcr.bar\_chart\_race(df, perpendicular\_bar\_func=func)
This blog post was published with jupyter_to_medium, a python package I createdto automate the process of publishing Jupyter Notebooks as Medium blog posts.
21