Lene Offersgaard, Center for Sprogteknologi, Københavns Universitet: DK-CLARIN status, WP 5

Slide 1: Lene Offersgaard, Center for Sprogteknologi, Københavns Universitet. DK-CLARIN status, WP 5.

Slide 2: Possible architecture of the clarin.dk portal.

Slide 3: Possible overall architecture of the clarin.dk site, with the back-end aggregator systems shown.

Slide 4: Resource types. We work with the following resource types:
  - Base resources
  - Annotation resources
  - Tool resources
  - Services

Slide 5: Reported types
  Written language
    1. Monolingual text collections: 11
    2. Multilingual text collections: 4
    3. Word collections: 7
  4. Images (obj)
    Image databases: 1 (2)
  5. Speech/Audio
    Audio with transcription: >3
    Audio without transcription
  6. Multimodal
    Multimodal without transcription, without annotation
    Multimodal with transcription, with or without annotation: >1
  7. Other
Do the reported types cover the whole CLARIN world? We assume so for now…

Slide 6: Metadata. Survey among the partners in DK-CLARIN: clarin/?q=ressourceoversigt_juni2008. EU-CLARIN: (to view the submissions), (to add one yourself).

Slide 7: EU-CLARIN's metadata proposal [1] contains the following common metadata fields, annotated with whether there is a clear parallel in the DC (Dublin Core) element set:
  - ResourceType: resembles DC element Type
  - Name: resembles DC element Title
  - Language: DC element
  - Description: DC element
  - Country
  - Institute: resembles DC element Creator
  - Creator: DC element
  - Year: resembles DC element Date, although Date is used explicitly under several of the individual groups of data
  - Format: DC element
  - MetadataLink
  - ReferenceLink: resembles DC element Identifier
[1] See the proposal at clarin_ad-hoc-registry-v6_0.pdf
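The field-by-field comparison on slide 7 can be sketched as a lookup table. This is only an illustration: the names `CLARIN_TO_DC` and `to_dublin_core` are hypothetical, and the `exact` flag is an assumed reading of the slide (True where it says "DC element", False where it says "resembles"). Fields for which the slide names no DC counterpart map to `None`.

```python
# EU-CLARIN proposal field -> (Dublin Core element, exact match?) as read from slide 7.
# None means the slide names no DC counterpart. The "exact" flag is an assumption.
CLARIN_TO_DC = {
    "ResourceType": ("Type", False),
    "Name": ("Title", False),
    "Language": ("Language", True),
    "Description": ("Description", True),
    "Country": None,
    "Institute": ("Creator", False),
    "Creator": ("Creator", True),
    "Year": ("Date", False),
    "Format": ("Format", True),
    "MetadataLink": None,
    "ReferenceLink": ("Identifier", False),
}


def to_dublin_core(record):
    """Split a CLARIN-style record into DC fields and fields with no DC parallel.

    DC values are collected in lists because two source fields
    (Institute and Creator) both map to the DC element Creator.
    """
    dc, unmapped = {}, {}
    for field, value in record.items():
        target = CLARIN_TO_DC.get(field)
        if target is None:
            unmapped[field] = value
        else:
            dc_name, _exact = target
            dc.setdefault(dc_name, []).append(value)
    return dc, unmapped
```

Collecting values in lists is a design choice made here so that no information is lost when Institute and Creator collapse into the same DC element.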

Slide 8: DK-CLARIN's proposal for general metadata for all resources. The DC descriptions are also given.
  - Type (DC element set): The nature or genre of the resource.
  - Title (DC element set): A name given to the resource.
  - Language (DC element set): A language of the resource.
  - Description (DC element set): An account of the resource.
  - Creator (DC element set): An entity primarily responsible for making the resource.
  - Date (DC element set): A point or period of time associated with an event in the lifecycle of the resource.
  - Format (DC element set): The file format, physical medium, or dimensions of the resource.
  - Identifier (DC element set): An unambiguous reference to the resource within a given context.
  - isVersionOf (refined DC element set: relation): A related resource of which the described resource is a version, edition, or adaptation.
  - More??
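As a rough sketch of what a record following the slide 8 proposal might look like, the class below holds the nine listed elements and serializes them using the standard Dublin Core namespaces. The class name `ResourceMetadata`, the wrapper element `metadata`, and the choice to make `is_version_of` optional are assumptions for illustration; the slide does not specify a serialization.

```python
from dataclasses import dataclass
from typing import Optional
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"   # Dublin Core element set namespace
DCTERMS = "http://purl.org/dc/terms/"     # namespace of the refinement isVersionOf

SIMPLE_FIELDS = ("type", "title", "language", "description",
                 "creator", "date", "format", "identifier")


@dataclass
class ResourceMetadata:
    """Hypothetical container for the general metadata listed on slide 8."""
    type: str
    title: str
    language: str
    description: str
    creator: str
    date: str
    format: str
    identifier: str
    is_version_of: Optional[str] = None  # assumed optional; the slide does not say

    def to_xml(self) -> ET.Element:
        root = ET.Element("metadata")  # wrapper element name is an assumption
        for name in SIMPLE_FIELDS:
            ET.SubElement(root, f"{{{DC}}}{name}").text = getattr(self, name)
        if self.is_version_of is not None:
            ET.SubElement(root, f"{{{DCTERMS}}}isVersionOf").text = self.is_version_of
        return root
```

Calling `ET.tostring(record.to_xml(), encoding="unicode")` on an instance yields an XML string with one child element per metadata field.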

Slide 9: Structure for text collections

Slide 10: Metadata for text units. From the TEI header description (http://www.tei-c.org/release/doc/tei-p5-doc/html/HD.html) we have that "the <teiHeader> element has four principal components:
  - fileDesc (file description) contains a full bibliographic description of an electronic file.
  - encodingDesc (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived.
  - profileDesc (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting.
  - revisionDesc (revision description) summarizes the revision history for a file.
Of these, only the <fileDesc> element is required in all TEI headers; the others are optional."
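To make the quoted structure concrete, here is a minimal sketch that builds a TEI header containing only the required fileDesc component, with its three mandatory parts (titleStmt, publicationStmt, sourceDesc). The function names and sample arguments are hypothetical, and the optional components (encodingDesc, profileDesc, revisionDesc) are deliberately left out.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"  # TEI P5 namespace


def _q(name):
    """Qualify a local element name with the TEI namespace."""
    return f"{{{TEI_NS}}}{name}"


def local_name(element):
    """Strip the namespace part from an element's tag."""
    return element.tag.split("}", 1)[-1]


def build_tei_header(title, publication_note, source_note):
    """Build a teiHeader holding only the required fileDesc component."""
    header = ET.Element(_q("teiHeader"))
    file_desc = ET.SubElement(header, _q("fileDesc"))

    title_stmt = ET.SubElement(file_desc, _q("titleStmt"))
    ET.SubElement(title_stmt, _q("title")).text = title

    publication_stmt = ET.SubElement(file_desc, _q("publicationStmt"))
    ET.SubElement(publication_stmt, _q("p")).text = publication_note

    source_desc = ET.SubElement(file_desc, _q("sourceDesc"))
    ET.SubElement(source_desc, _q("p")).text = source_note
    return header


def principal_components(header):
    """List the principal components present in a header, in document order."""
    return [local_name(child) for child in header]
```

A header built this way can be serialized with `ET.tostring(header, encoding="unicode")`; validating it against the TEI schema is outside the scope of this sketch.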

