Leipzig Workshop Week

14–18 December 2015

Leipzig Workshop Week is a series of three related workshops in the week beginning 14th of December. In addition, the Sunoikisis DC Planning Seminar will run on the 16th and 17th.

The workshops showcase the range and depth of research undertaken in the Department of Digital Humanities and provide an opportunity for scholars working in related fields to come together to share their knowledge and learn new practical skills.

  • Visitors may attend one or several workshop streams according to their interest.
  • There is no fee to attend the workshops. If you would like to attend then please send an email noting your interest to the person organising each workshop stream (listed below).
  • Workshops are held in rooms P402, P502 and P702 of the Paulinum building at Augustplatz 10. Click here for a map.
  • Further information for visitors to Leipzig can be found on our website here.

Details on each of the workshops are available below or you can download the full programme here.

ANALYZING TEXT REUSE AT SCALE / WORKING WITH BIG HUMANITIES DATA
14–16 December
14–15: Room P402. 16: Room P502
A Dariah-DE Methods Workshop

The methodological workshop will cover topics including building a corpus, creating training data, topic modeling, text reuse, georeferencing, mapping, social network analysis, and semantic analysis.

The workshop will be useful for humanists working with corpora of texts too large to read in their entirety. With the availability of automatically transcribed books, newspapers, and other sources from Dariah, TextGrid, the German state libraries, other national collections, Google, the Internet Archive, and Europeana, many humanists currently working with textual sources could now benefit from the methods we will discuss. Ideally, participants will have some prior experience using software for classification and clustering of texts, including popular packages for topic modeling, sentiment analysis, and stylometrics, and will have identified a corpus of texts to use during practical sessions. We will, however, cover much of the relevant background and explore several end-to-end examples. A VM with the tools used in the workshop is available here.

The workshop will discuss the background for and applications of text reuse analysis for making sense of large corpora of texts. We will cover relevant methods in information retrieval, sequence alignment, computational linguistics, and machine learning. We will discuss modeling decisions that participants will have to make when applying text reuse or other text-mining techniques to their own scholarly questions. Finally, we will work through examples of text reuse analysis and classification with the Proteus Project’s passim system and with some complementary tools. The goal is to provide hands-on familiarity with the methods in order to enhance understanding of how different digital approaches can be drawn together into a coherent research approach. Participants are encouraged to submit their own textual resources prior to the workshop.

This workshop builds upon results from several complementary and collaborating projects: the Proteus Project, led by University of Massachusetts Amherst; the Viral Texts Project at Northeastern; the Kitab Project; the Open Philology Project including a Dariah-DE exploratory effort at Leipzig University; and the Perseids Project at Tufts University.

For further information contact thomas.koentges (at) uni-leipzig.de or maxim.romanov (at) uni-leipzig.de

The workshop program can be downloaded here.

Recordings of the first workshop day:
Link 1: the first 8 hours – https://www.youtube.com/watch?v=TQAWXa-HB4I
Link 2: the last 30 minutes – https://www.youtube.com/watch?v=bziwhMAYrzk

Recordings of the second workshop day:
Link 3: the first 3.5 hours (morning) – https://www.youtube.com/watch?v=euiRrJ1oZ3s

TREEBANKING ANCIENT LANGUAGES: CURRENT AND PROSPECTIVE RESEARCH
14–15 December
Room P702

This workshop aims to gather people who produce or employ treebanks for computational analysis of historical languages. The development of treebanks is crucial not only for linguistic but also for more content-based research.

There is a growing need for communication, coordination, and share of tools and documentation among scholars, including those working on ancient languages, some of which are still underrepresented in the digital world. The workshop aims to promote awareness and knowledge of the computational methods enabling automated and semi-automated research on historical corpora by using treebanks.

Topics covered in the workshop include: Character encoding; Tokenizers; Sentence-splitters; Lemmatizers; POS-taggers; Parsers; Layers of linguistic annotation; Annotation schemes; Annotation guidelines; Annotation evaluation; Annotation tools; Machine learning; and Universal dependencies.

For further information contact celano (at) informatik.uni-leipzig.de

The workshop program can be downloaded here.

SUNOIKISIS 2016 PLANNING SEMINAR
16–17 December
Room P402

SunoikisDC is an international consortium of Digital Classics programs developed by the Alexander von Humboldt Chair of Digital Humanities at the University of Leipzig in collaboration with the Harvard’s Center for Hellenic Studies and Perseids.

Click here for further information about the Sunoikisis 2016 Planning Seminar or contact monica.berti (at) uni-leipzig.de

DIGITAL ARABIC AND DIGITAL PERSIAN RESEARCH WORKSHOP
17–18 December
17: Room P702. 18: Room P502

The workshop brings together scholars of Arabic and Persian textual traditions (broadly defined), including both those who are already using digital methods and those who want to start using them. The workshop will allow us to share our findings, get feedback on work in progress, and to chart future research projects. The event will also give us all an opportunity to work out collaborative strategies that will be beneficial for all involved in the DH-ization of Arabic and Persian studies.

Participants will present their work—completed, in progress, planned, contemplated—and will have an opportunity to receive feedback from colleagues with more experience working with historical languages. The 2-day workshop will include up to 20 sessions, with each 90-min session accommodating 2 or 3 presentations (20 or 30 mins each), followed by 30 mins of discussion. All participants are strongly encouraged to discuss collaborative efforts, with the workshop ending with a charted course for action.

For further information contact maxim.romanov (at) uni-leipzig.de

The workshop program can be downloaded here.