This repository contains all of the code and data related to the Spring 2021 module Language Analytics as part of the bachelor's tilvalg in Cultural Data Science at Aarhus University.
This repository is in active development, with new material being pushed on a weekly basis.
For the sake of convenience, I recommend using our own JupyterHub server for development purposes. The first time you use the server, you'll need to create your own version of the repo and install relevant dependencies in a virtual environment:
git clone https://github.com/CDS-AU-DK/cds-language.git
bash ./create_lang_venv.sh
From then on, every time you use the server, make sure you update the repo and install any new dependencies:
cd lang101
git pull origin main
bash ./create_lang_venv.sh
This repository has the following directory structure:
Column | Description |
---|---|
data |
A folder to be used for sample datasets that we use in class. |
notebooks |
This is where you should save all exploratory and experimental notebooks. |
src |
For Python scripts developed in class and as part of assignments. |
utils |
Utility functions that are written by me, and which we'll use in class. |
This class takes place on Wednesday mornings from 8-12. Teaching will take place on Zoom, the link for which will be posted on Slack.
A detailed breakdown of the course structure and the associated readings can be found in the syllabus. Also, be sure to familiarise yourself with the studieordning for the course, especially in relation to examination and academic regulations.
The instructor is me! That is to say, Ross.
All communication to you will be sent both on Slack and via Blackboard. If you need to get in touch with me, Slack should be your first port-of-call!