This page last changed on Feb 09, 2008 by bill-parod@northwestern.edu.

The NCF collection has been ingested into the Fedora repository running on monk.lis.uiuc.edu on port 9080.

To access the objects in the repository you will need to login using the same authentication information that has been established for the monk datastor on scribe.

The content and access models for these materials in Fedora are preliminary. Some discussion is needed to determine what would be best for Monk in that regard. The object model implemented for this collection has a Fedora object for each unadorned and each MophAdnorned work in the NCF.

Object identifiers are formed from the NCF identifiers. For example, "Oliver Twist", by Charles Dickens has NCF id 22511.
The Fedora persistent identifier (PID) for this item is set to monk:ncf-22511. The Fedora PID for the MorphAdorned object is monk:ncf-22511-ma.

These objects each assert a relationship that relate one to the other in their RELS-EXT datastreams. There is also a datastream in each object that references the xml file for the other. So there a couple of ways to obtain adorned/unadorned files from each other.

The object also reference two disseminators.

There is a "core" disseminator which we've found it useful to associate with all objects regardless of their 'type' or content. This provides a sort of "base class" which consuming applications can rely on without concern for what the object is. This "core" disseminator includes methods such as "getWebView" which provides an html "view" of the object, "getXMLForIndexing" which provides the object as xml for search engine indexing, "getLabel" which provides a simple string that can be used as a caption or title for the item. I won't document all methods here - one can see these by inspecting any of the objects.

These text objects also support a "book" disseminator which provides "book" specific views of the object. For example, "getTOC", "getHeader", "getChunk", "viewChunk", and others. These methods will be use by the proxy and so we provide some examples here.

For "Oliver Twist", you can obtain the TEI Header with: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/monk:behav-def-book/getHeader

The TOC as XML: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/monk:behav-def-book/getTOC

A list of all the divs with their titles and types: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/monk:behav-def-book/getChunklist

You can obtain the xml for any element in the text that contains an "id" attribute. For example you can obtain the xml for Chapter 1 (id=ncf-22511-1-1) with: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/monk:behav-def-book/getChunk?xmlid=ncf-22511-1-1

You can obtain the html for any element in the text that contains an "id" attribute. For example you can obtain the html for Chapter 1 (id=ncf-22511-1-1) with: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/monk:behav-def-book/viewChunk?xmlid=ncf-22511-1-1

The output format of these disseminations controlled by the following xslt programs in /contents/web/root/collections/ncf/datastreams/xsl respectively:bib_xml.xsl, toc_xml.xsl, chunklist_xml.xsl, chunk_xml.xsl, chunk_html.xsl.

The "chunk_html.xsl" program that affects html output is minimal at this point - it just copies the source xml, wrapped in html/body tags. It will need work to achieve desired presentation.

A full list of all dissemination methods available for these items can be obtained with: http://monk.lis.uiuc.edu:9080/fedora/get/monk:ncf-22511/fedora-system:3/viewMethodIndex

Document generated by Confluence on Apr 19, 2009 15:04