{"id":8,"date":"2025-11-14T20:51:10","date_gmt":"2025-11-14T20:51:10","guid":{"rendered":"https:\/\/plutarch.uw.edu.pl\/?page_id=8"},"modified":"2025-11-14T23:01:50","modified_gmt":"2025-11-14T23:01:50","slug":"corpus-data","status":"publish","type":"page","link":"https:\/\/plutarch.uw.edu.pl\/?page_id=8","title":{"rendered":"Corpus &#038; Data"},"content":{"rendered":"<p class=\"page-intro\">The Plutarch Metaphor Observatory corpus combines diplomatic texts, structured IDs, and annotation metadata so every metaphor unit is traceable from source passage to analytic output.<\/p>\n<div class=\"pmo-card-grid\"><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Corpus scope<\/p><h3>42 works \u00b7 18k metaphor loci<\/h3>\n<ul>\n<li>Moralia focus with select <em>Vitae<\/em> where cognition metaphors cluster.<\/li>\n<li>Fragments normalized to standard abbreviations (e.g., <em>De aud.<\/em> 40D).<\/li>\n<li>Each locus links out to translation refs, manuscript notes, and citation tokens.<\/li>\n<\/ul>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Text preparation<\/p><h3>Diplomatic ingest pipeline<\/h3>\n<ul>\n<li>Custom QDPX parser preserves paragraphing, speaker cues, and apparatus markers.<\/li>\n<li>Witness alignment + stable IDs keep Explorer, Works, and exports in sync.<\/li>\n<li>Automatic QA checks flag missing lemmas, duplicated spans, or orphan IDs.<\/li>\n<\/ul>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Metadata graph<\/p><h3>Linked coders &amp; lexical data<\/h3>\n<ul>\n<li>Coding sessions log annotator, timestamp, and protocol version.<\/li>\n<li>Lexical units carry lemma, role (source\/target), and conceptual domain paths.<\/li>\n<li>Paraphrases + freeform notes support qualitative follow-up in workshops.<\/li>\n<\/ul>\n<\/article><br \/>\n<\/div>\n<h2 class=\"pmo-section-heading\">Annotation protocol<\/h2>\n<div class=\"pmo-card-grid\"><br 
\/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Domains<\/p><h3>Target\/source hierarchy<\/h3><p>\nEvery metaphor unit records both domains with a hierarchical path (e.g., <code>BODY \u2192 ORGAN \u2192 HEART<\/code>). This enables cross-work comparisons, radial charts, and domain co-occurrence analysis.<\/p>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Lexical roles<\/p><h3>Token-level tagging<\/h3><p>\nTokens are tagged as source\/target carriers, linkers, or elaborations. Lemmas sync with CLTK assets so we can export morphological bundles alongside the Shiny views.<\/p>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Paraphrase layer<\/p><h3>Coder interpretation<\/h3><p>\nAnnotators capture a short paraphrase plus optional note categories (emotion, epistemic stance, rhetoric). These strings feed the Works citation builder and future NLP experiments.<\/p>\n<\/article><br \/>\n<\/div>\n<h2 class=\"pmo-section-heading\">Workflow<\/h2>\n<section class=\"pmo-timeline\"><br \/>\n<div class=\"pmo-timeline-item\"><div class=\"pmo-timeline-dot\"><\/div><div class=\"pmo-timeline-content\"><span class=\"pmo-timeline-year\">Source<\/span><h4>Text preparation<\/h4><p>\nImport QDPX + diplomatic editions, normalize paragraph IDs, and attach bibliographic metadata.<\/p>\n<\/div><\/div><br \/>\n<div class=\"pmo-timeline-item\"><div class=\"pmo-timeline-dot\"><\/div><div class=\"pmo-timeline-content\"><span class=\"pmo-timeline-year\">Annotate<\/span><h4>Metaphor coding<\/h4><p>\nDual annotators mark spans, domains, lemmas, and paraphrases following the Thinking of Thinking handbook.<\/p>\n<\/div><\/div><br \/>\n<div class=\"pmo-timeline-item\"><div class=\"pmo-timeline-dot\"><\/div><div class=\"pmo-timeline-content\"><span class=\"pmo-timeline-year\">Validate<\/span><h4>QA + reconciliation<\/h4><p>\nAutomated checks highlight conflicts, then editors reconcile and freeze the release 
candidate.<\/p>\n<\/div><\/div><br \/>\n<div class=\"pmo-timeline-item\"><div class=\"pmo-timeline-dot\"><\/div><div class=\"pmo-timeline-content\"><span class=\"pmo-timeline-year\">Publish<\/span><h4>Explorer &amp; exports<\/h4><p>\nData powers the Metaphor Explorer, Works catalogue, and downloadable CSV \/ JSON bundles.<\/p>\n<\/div><\/div><br \/>\n<\/section>\n<h2 class=\"pmo-section-heading\">Access &#038; reuse<\/h2>\n<div class=\"pmo-card-grid\"><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Interactive<\/p><h3>Metaphor Explorer<\/h3><p>\nNavigate domains, coders, and paraphrases inside the Shiny dashboard. <a href=\"https:\/\/plutarch.uw.edu\/observatory\" target=\"_blank\" rel=\"noopener\">Launch the Explorer<\/a>.<\/p>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Works focus<\/p><h3>Digital corpus of works<\/h3><p>\nBrowse essays with curated summaries, citation-ready references, and cross-links to annotation slices. <a href=\"https:\/\/plutarch.uw.edu\/works\" target=\"_blank\" rel=\"noopener\">Open the Works app<\/a>.<\/p>\n<\/article><br \/>\n<article class=\"pmo-card\"><p class=\"pmo-card-label\">Bulk data<\/p><h3>Research exports<\/h3><p>\nNeed CSV\/JSON dumps or API (beta) access? Email <a href=\"mailto:observatory@uw.edu.pl\">observatory@uw.edu.pl<\/a> describing your use case and we will provision credentials.<\/p>\n<\/article><br \/>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>The Plutarch Metaphor Observatory corpus combines diplomatic texts, structured IDs, and annotation metadata so every metaphor unit is traceable from source passage to analytic output. 
<\/p>\n","protected":false},"author":0,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-8","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=\/wp\/v2\/pages\/8","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=\/wp\/v2\/types\/page"}],"replies":[{"embeddable":true,"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=8"}],"version-history":[{"count":3,"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=\/wp\/v2\/pages\/8\/revisions"}],"predecessor-version":[{"id":41,"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=\/wp\/v2\/pages\/8\/revisions\/41"}],"wp:attachment":[{"href":"https:\/\/plutarch.uw.edu.pl\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=8"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}