Capital in the 21st Century: Chapter 7

Data provenance

The data were downloaded as Excel files from: http://piketty.pse.ens.fr/en/capital21c2.

Loading relevant libraries and data

This document depends on the xlsx, reshape2, and ggplot2 packages.

Below, we separately load the data required to make each figure.

library(ggplot2)
library(xlsx)
library(reshape2)
 
#First we'll make a quick function for melting and renaming tables
melt_name<-function(x,name,id.vars="Year",...){
  #if all measurements are missing for a given year, we will generate a versionof the figure with interpolations for that year.
  #We'll show both interpolated and non interpolated figures
  NArows<-apply(x,1,function(r) {all(is.na(r[-1]))} )
  out_interp<-melt(x[!NArows,],id.vars=id.vars,...)
  out_raw<-melt(x,id.vars=id.vars,...)
  list(out_interp,name,out_raw)
}
 
 
#####################################
 
# Spread Sheet TS7.7
 
# Inequality of labor income across time and space
 
#####################################
 
#Gini-Lorenz curves
ts7.7 = read.xlsx("../_data/Chapter7TablesFigures.xlsx",sheetName="TS7.7",rowIndex=8:108,colIndex=1:3,header=FALSE)
names(ts7.7) <- c("x","linear","continuous")

Recreate Figures

There is basically only one figure in this chapter, the rest are all tables

baseline <- expression(paste("45",degree," degree line (perfect equality)"))
ggplot(data=ts7.7)+
  geom_point(aes(x=x,y=x,color= 'black')) +
  geom_point(aes(x=x,y=linear,color= 'red')) +
  geom_point(aes(x=x,y=continuous,color= 'green')) +
  ylab('Total income or capital share owned by the poorest x%') +
  xlab('Curve 1 assumes that the poorest 90% and the richest 10% own 50% of total income or capital each, and that both groups are homogenous \n (hence a linear curve); curve 2 assumes a continuous distribution') + 
  scale_color_discrete("Legend", labels = c(baseline, "Curve #1: distribution with 2 groups", "Curve #2: continuous distribution"))

plot of chunk unnamed-chunk-1