This page last changed on Feb 23, 2008 by martinmueller@northwestern.edu.
Monk Data Cell Conference Call
Tuesday, June 26, 2007, 3-4 p.m. Central Time
Present: Phil Burns (Pib), Bill Parod, Loretta Auvil, Bernie A'cs, Duane Searsmith, Vered Goren, Joe Paris, John Norstad (secretary)
Bill reported that Pib is finishing his last rounds of updates to MorphAdorner. He will adorn the initial collection of texts as described by Martin, including the approximately 250 novels from NCF and Tanya's Stein text.
Pib says he is nearly ready and expects to begin generating reasonable output for our initial large set of texts next week. He will put the adorned texts on our local Ariadne host for access by Monk members, all of whom have been assigned NU netids and passwords.
Bill will send out email with the netid and password information.
Bill plans to begin looking at how we might extract sparse matrices of counts from the current WordHoard datastore to feed D2K itineraries.
Loretta reported that she has worked on integrating new texts into the Feature Lens MySQL database.
Bill has been working with Martin to run repeated-phrase scripts on the MorphAdorned Stein texts. These scripts were originally developed several years ago to investigate repeated phrases in Homer.
John is writing code to ingest the MorphAdorned texts produced by Pib into the current WordHoard datastore. This is a first step that will let us explore and experiment with large relational database stores and compare this approach to alternatives. After Pib and Martin have finished adorning the approximately 250 NCF texts, John will ingest the full collection, giving us our first very large relational database store.
Loretta says that Kelly Searsmith, an expert in Victorian literature, has been hired to work on the SEASR project and is interested in helping Martin prepare the texts.
Loretta also reminded us that Monk has a TeraGrid account, ready to be used when we need it.