|
MONK : Conference call, 2008 May 2
This page last changed on May 02, 2008 by martinmueller@northwestern.edu.
Present: Amit, John N. Loretta, Martin, Phil, Sara, Steve, Tanya Amit reported on a plan to create a platform for curatorial work and hopes he can deliver a first version of it by May 12. This platform will let Sara, Martin, and other curators review data from the teiHeader and add appropriate metadata for use by the data store Sara will be able to start work in mid-May. John N. hopes to have a first version of the new Prior completed by the middle of May. We discussed pseudo-pages and the kinds of information that MorphAdorner needs to make decisions about the div elements that form the basis for pseudo-pagination. Amit asked whether the process of pseudo-paging can be seaprated from MorphAdorner so that the information could be used in a workflow that uses a different NLP tool. Phil responded that MorphAdorner can emit output that would strip some or all tags. The chief purpose of pseudo-pages is to give human readers an easy way of orienting themselves in a text--analogous to the Stephanus pages of Plato with their page quartiles or quintiles. We discussed the question of where "rend" information is kept in the workflow. In the current version of Prior, which harks back to WordHoard, rend information and structural information are kept together. Ideally, the data store should not have to concern at all with any formating issues, which are managed by XSLT stylesheets outside the data store. There was considerable disucssion, with some residual uncertainty or disagreement, about how mch of the decisions can be pushed back to the Abbot stages of the process. This will require further and careful discussion and collaboration. The next conference call will take place next week, Friday, May 9, at 2pm. |
| Document generated by Confluence on Apr 19, 2009 15:04 |