Common Lab Research Infrastructure for the Arts and
Common Lab Research Infrastructure for the Arts and Humanities LLOD Use Case: MWE lexicon Jan Odijk CLARIAH-CORE LD 4 LR Workshop Utrecht, 2017 -02 -06/07 1
MWE-LEXICON • Du. ELME: database of multiword expressions (MWE) • MWE: word combination that has idiosyncratic properties – E. g. de plaat poetsen the plate polish = ‘to bolt’ • Du. ELME= – Set of MWE descriptions for MWEs – Set of pattern descriptions
MWE DESCRIPTION • Component list (seq. of lemmas) – [plaat, poetsen] • Morphosyntactic properties – Conjugated with hebben • Some semantic properties – Takes [Human] argument • Example sentence (strongly restricted) – Hij heeft de plaat gepoetst
MWE DESCRIPTION • Pattern id: reference to the description of its global syntactic structure – ec 1 • Parameters (to define its fine syntactic structure – Plaat: sg def
PATTERN DESCRIPTION • Pattern id – ec 1 • Syntactic structure with open slots – [. VP [. obj 1: NP [. det: D (1) ] [. hd: N (2) ]] [. hd: V (3) ]] • Description • Relation between components and example sentence • …
FORMATS • Originally: Set of CSV files • Later: LMF-compatible format – One minor deviation: error – One deviation that [I think (now)] can be remedied
LOD? • Any advantages of converting LMF format into some LOD format? • Can Lemon model handle it? • Or other LD-based lexicon models (if any)? • Linking with other lexicons?
Thanks for your attention
- Slides: 8