24 Apr 2018 11:00am - 1:00pm S2, Alison Richard Building, Sidgwick Site

Description

Course leader: Dr Paul Nulty (CRASSH, University of Cambridge)

This session will introduce basic methods for reading and processing text files in Python. We will proceed slowly through an example that demonstrates reading in a large text corpus from structured or unstructured files, basic string processing, word frequency counting, and syntactic analysis. Attendees will learn to perform these tasks in base Python and using the natural language processing library spaCy. No prior knowledge of Python is assumed, but attendees may wish to follow an online guide to configure a python installation in order to follow along with the examples.

Pre-registration is essential: please book here or use the online registration link on this page.  PhD students and staff from the University of Cambridge have priority for bookings on this course – if the course appears fully booked and you fall into this category please contact the course organisers directly.

Upcoming Events

Walking with Constable: the Cambridge Edition
Cambridge Festival, Walk
A group of young people look towards a laptop as if they are researching something for a group project.
Cambridge Festival, Workshop
Sculpture made from twisting layers of metal sheeting.
Closed Event, Data School
AI-generated tiles of satellite imagery
Data School

9 Sep 2024 - 13 Sep 2024

CDH | Social Data School: September 2024

CENTRE FOR RESEARCH IN THE ARTS, SOCIAL SCIENCES AND HUMANITIES

Tel: +44 1223 766886
Email enquiries@crassh.cam.ac.uk