• 09.00-09.30: Registration
  • 09.30-10.30: Sally Chambers. General introduction into DH-basics, Research Data Management (RDM) and introduction on Data Management Plan (DMP)
  • 10.30-11.00 Break
  • 11.00-13.00: Maxim Romanov. Basics of open linked data; organising data for research; open linked data vs databases; version control and collaboration
    • data organization and data formats:
      • data formats (csv/tsv, YAML, xml, etc.): explanations
      • OpenITI example: URIs, folder structure, metadata, structural tagging (Detailed descriptions)
      • practical: participants organizing their own data (model)
    • version control and collaboration
  • 13.00-14.00 Lunch break
  • 14.00-16.00: Maxim Romanov. Text analysis - Regular Expressions
  • 16.00-16.30 Break
  • 16.30-18.00: Maxim Romanov. Text analysis - Simple Scripting in Python
    • Preprocessing Arabic texts;
    • Keywords in Context (KWIC);
      • combining regular expressions and simple scripting, saving results;
      • Off-the-shelf solution: Antconc;
    • Frequency Lists;
      • generating frequency lists: step-by-step explanations;
    • Usage of Frequency Lists;
      • document distance: identifying similar texts;
      • creating frequency-based readers.