|
MONK : Conference call, 2007 Oct. 16, Data
This page last changed on Feb 23, 2008 by martinmueller@northwestern.edu.
Present: Bill Parod (chair, secretary), Phil Burns (Pib), Loretta Auvil, Amit Kumar, Vered Goren, Joe Paris, Martin Mueller

Bill reported that chapter-level label data for sentimentalism in NCF is available from Sara Steger. He also reported that a D2K InputModule is available for bringing NCF count data into a D2K sparse matrix. He asked about next steps for running analytics for this experiment at NCSA. Loretta suggested he check the InputModule code into svn so she can work with it. Bill will do that.

Bill asked what types of date ranges are anticipated for works and perhaps authors. Martin indicated work publication dates, 'enters circulation' dates, and author birth/death dates, as well as author origin and sex. Martin asked where that data should be recorded: in the teisimple header or elsewhere? Bill suggested such data could be added to the teiheader or provided separately in the accompanying Submission Information Package (SIP). Martin will discuss these preferences with the Standards and Metadata subgroup of the Data Cell.

Martin asked about the methodology behind Sara Steger's recent sentimentalism classifications. An interesting discussion between Martin and Loretta ensued. Martin will post a summary via email, which we'll add here. |