- 09.00-09.30: Registration
- 09.30-10.30: Sally Chambers. General introduction to DH basics and Research Data Management (RDM); introduction to the Data Management Plan (DMP)
- 10.30-11.00: Break
- 11.00-13.00: Maxim Romanov. Basics of open linked data; organising data for research; open linked data vs databases; version control and collaboration
  - data organization and data formats:
    - data formats (CSV/TSV, YAML, XML, etc.): explanations
    - OpenITI example: URIs, folder structure, metadata, structural tagging (detailed descriptions)
    - practical: participants organizing their own data (model)
  - version control and collaboration
- 13.00-14.00: Lunch break
- 14.00-16.00: Maxim Romanov. Text analysis - Regular Expressions
- 16.00-16.30: Break
- 16.30-18.00: Maxim Romanov. Text analysis - Simple Scripting in Python
  - Preprocessing Arabic texts;
  - Keywords in Context (KWIC);
    - combining regular expressions and simple scripting, saving results;
  - Off-the-shelf solution: AntConc;
  - Frequency Lists;
    - generating frequency lists: step-by-step explanations;
  - Usage of Frequency Lists;
    - document distance: identifying similar texts;
    - creating frequency-based readers.
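
For orientation, the frequency-list and KWIC topics of the last session can be sketched in a few lines of Python. This is only a minimal illustration, not the workshop's own material: the sample sentence and the helper names `tokenize`, `frequency_list`, and `kwic` are hypothetical, and no Arabic-specific preprocessing (normalization, removal of diacritics) is attempted.

```python
import re
from collections import Counter

# Hypothetical sample text; not taken from the workshop materials.
TEXT = "the cat sat on the mat and the dog sat on the rug"


def tokenize(text):
    """Lowercase the text and split it into word tokens with a regular expression."""
    return re.findall(r"\w+", text.lower())


def frequency_list(tokens):
    """Return (token, count) pairs, most frequent first."""
    return Counter(tokens).most_common()


def kwic(tokens, keyword, window=2):
    """Return one (left context, keyword, right context) tuple per occurrence."""
    hits = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            hits.append((left, tok, right))
    return hits


tokens = tokenize(TEXT)

# Three most frequent tokens.
for word, count in frequency_list(tokens)[:3]:
    print(word, count)

# Every occurrence of "sat" with two tokens of context on each side.
for left, kw, right in kwic(tokens, "sat"):
    print(f"{left:>10} [{kw}] {right}")
```

The `window` parameter sets how many tokens of context are kept on each side of the keyword; results could be written to a CSV/TSV file instead of printed.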