Webscraping for Beginners

21 November 2017, 14:00 - 16:00

Room B4, Institute of Criminology, Sidgwick Site

Introductory digital methods training from Cambridge Digital Humanities

Led by: Dr Gabe Recchia (Concept Lab, CRASSH)

Digital research projects commonly require the researcher to collect of a large number of documents from the Internet. Frequently, although the researcher can find the documents online, they are in a format that is impossible to use, and/or there are so many documents that obtaining a large number would be incredibly tedious to do by hand. Webscraping -- the practice of using computational tools to extract data from websites -- offers a solution to this challenge. We will cover a variety of webscraping techniques, as well as tips and tricks to make manual collection of digital data more efficient when it is necessary.

This event is open to staff and PhD students at the University of Cambridge. Spaces are limited and must be booked online here