Digital Data Collection and Wrangling

14 January 2020, 11:30 - 13:00

IT Training Room, Cambridge University Library, West Rd, Cambridge CB3 9DR

Cambridge Digital Humanities Workshop

This session addresses the technical and ethical aspects of digital data collection and wrangling – two fundamental stages in the lifecycle of a digital research project. Participants will be introduced to online data sources and practices of internet-mediated data collection, including retrieving data from social media platforms. As data collected from online sources is often dirty and messy, we will also provide a short practical introduction to the process of transforming raw data into a clean and structured dataset using free and open-source software.