Conferences

SatRdays Cardiff

by ellen

8 minute read

Hey again lovely readers! This blog is a very special one indeed, you get to hear about our great day out at SatRdays in Cardiff recently not once, not twice, but five times, from each of our team members perspectives! I think it’s fair to say that it was a very different experience for each of us - from seasoned conference attendees like Steph and MaĆ«lle, Amy who had never presented before, sponsorship newbie Oz and then Ellen somewhere inbetween, we all had very different (but great)…

6 minute read

Using tabulizer we’re able to extract information from PDFs so it comes in really handy when people publish data as a PDF! This post takes you through using tabulizer and tidyverse packages to scrape and clean up some budget data from PASS, an association for the Microsoft Data Platform community. The goal is to mainly show some of the tricks of the data wrangling trade that you may need to utilise when you scrape data from PDFs.