License: cc-by-4.0
Authors: Sahala, Aleksi; Svärd, Saana
Dates: 2025-03-24; 2021-09-30; 2021-09-30
URL: https://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/9375

Verbs of seeing in Akkadian

This repository contains scripts and data for the paper "Language Technology Approach to Seeing in Akkadian".

/data

aug18-nolex.txt
    Lemmatized dataset from Oracc.

results-pmi2-top50.log
    Script parameters for pmizer (see https://github.com/asahala/Pmizer).

results-pmi2-top50.tsv
    Results in .tsv format. Fields in the file:
        keyword
        translation from Oracc
        collocate
        translation from Oracc
        period distribution
        genre distribution
        period and genre distribution
        keyword freq
        collocate freq
        co-occurrence freq
        PMI2 score
        average distance between keyword and collocate (in words)
        url to Korp (not all links return results, because Korp Oracc had a major update in 2019; see https://www.kielipankki.fi/corpora/oracc/ for more information and a user guide)

    Note that the co-occurrence of words (a, b) is symmetric, meaning that (a, b) == (b, a). If a Korp link returns no results, try swapping the contents of the two search boxes.

period/genre-distribution-matrix.tsv
    Distribution of seeing verbs in different genres and periods, represented as a matrix.

Access: Open
Keywords: Akkadian; Distributional Semantics; PMI; Assyriology
Title: Language Technology Approach to "Seeing" in Akkadian
Type: dataset
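
The field list and the symmetry note for results-pmi2-top50.tsv can be illustrated with a short Python sketch. It is not part of the repository and makes several assumptions: the file has no header row, it has exactly 13 tab-separated columns in the order listed in the description, and the column names in FIELDS are hypothetical labels chosen here for readability. The sample row is made up and does not come from the dataset.

```python
import csv
import io

# Hypothetical labels for the 13 columns, in the order the description lists them.
FIELDS = [
    "keyword", "keyword_translation",
    "collocate", "collocate_translation",
    "period_dist", "genre_dist", "period_genre_dist",
    "keyword_freq", "collocate_freq", "cooc_freq",
    "pmi2", "avg_distance", "korp_url",
]


def read_rows(handle):
    """Yield one dict per TSV row (assumes no header line)."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for cols in reader:
        if len(cols) != len(FIELDS):
            continue  # skip rows that do not match the assumed layout
        yield dict(zip(FIELDS, cols))


def pair_key(a, b):
    """Order-independent key, because (a, b) == (b, a)."""
    return frozenset((a, b))


def index_by_pair(rows):
    """Map each unordered keyword/collocate pair to its row."""
    return {pair_key(r["keyword"], r["collocate"]): r for r in rows}


# Made-up sample row for illustration only.
sample = "\t".join([
    "word_a", "gloss a", "word_b", "gloss b",
    "p", "g", "pg", "10", "20", "5", "1.5", "2.0",
    "https://example.org",
]) + "\n"

index = index_by_pair(read_rows(io.StringIO(sample)))

# The lookup succeeds even with the pair given in reverse order.
print(index[pair_key("word_b", "word_a")]["pmi2"])
```

To try this on the real file, replace the io.StringIO sample with something like open("data/results-pmi2-top50.tsv", encoding="utf-8", newline=""); the relative path and the encoding are assumptions. If the file turns out to have a header row, skip the first line before indexing.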