Year 2 of Locke Data
by steph
Hey folks, I wanted to give y’all an update about Locke Data one year on from when I started it up.
Using tabulizer we’re able to extract information from PDFs so it comes in really handy when people publish data as a PDF! This post takes you through using tabulizer and tidyverse packages to scrape and clean up some budget data from PASS, an association for the Microsoft Data Platform community. The goal is to mainly show some of the tricks of the data wrangling trade that you may need to utilise when you scrape data from PDFs.
If you decide you want to use R in your existing Hugo blog, it’s really easy to convert over. There’s a single command you need to know from blogdown and the rest is working out your deployment process.