Career Tracks at UC Part I: Cleaning and Tidying

In 2017 the University of California launched Career Tracks, a job classification system for staff not represented by a union. The goal was threefold: (1) to give employees better-defined career paths; (2) to better align university compensation with the market; and (3) to better reflect primary job responsibilities for each employee. Additionally,... [Read More]
Tags: R

The Data Analyst as Wanderer: Pre-Exploratory Data Analysis with R

This post is about pre-exploratory data analysis: answering questions about the data at two junctures, before you know anything about it and when you know only very little about it. There are roughly three overlapping questions to ask: What is this? What’s in this? What can I... [Read More]
Tags: R

Simple Probability Trees in R

This may surprise you, but there isn’t an easy, “canonical” method to construct simple probability trees in R. Google uncovers some hacky attempts from years past, but the problem obviously hasn’t been a pressing issue or priority in the community. The reason for this, I think, is threefold: (1) probability trees... [Read More]
Tags: R

R at the Golden State Sprint Triathlon

Earlier today I completed my first (sprint) triathlon. For me, it was 2 hours and 12 minutes of barbarism: a 1/2-mile swim, a 15-mile bike ride, and a three-mile run to boot. I knew my time was poor; I struggled to wiggle out of my wetsuit, was... [Read More]
Tags: R

I am a tidyverse Enthusiast

I am a tidyverse enthusiast. The proof is in the pudding: of my six packages on GitHub, only one DESCRIPTION contains a non-tidyverse package (rcicero, tidyjson). I once contemplated rewriting these packages sans the tidyverse (for science, learning, growth, bragging rights, and character building), but I broke into a cold sweat once... [Read More]
Tags: R UNIX bash