|
This page last changed on Mar 20, 2007 by plaisant@cs.umd.edu.
TEMPLATE (pretty good draft now)
(this is to guide your answers, go in edit mode, grab the text of the template, keep the headers - but you can remove the details about the questions to make your answers easier to read - and copy that text in your own page).
In general try to give EXAMPLES. e.g. example hypothesis you may have or examples of dream insight that you hope to gain from using the tools. Imagine that Monk is here and that all is possible, tell us what you dream of, then give example of 1st steps you like to see toward those dream goals (i.e. what you really hope the monk development team start with).
Short title of case study (replace by your text)
Long title
Name: Name and user type you represent (Scholar? curator?)
Problem/Question: a paragraph and links to other materials if needed, describing the research you want to conduct
Status of the research: Are you working on this now e.g. in a small manual fashion or with other tools? If this is your thesis, when do you hope to graduate? (i.e. when do you hope to be able to use the tools at the latest)
Measure of success:: What would be a good indicator of success in this case study? For you personally (e.g. a paper, a paragraph of my thesis talking abouy the work I did)? Being able to analyse more than 2 books? Prove Joe Shmoe wrong?
Texts needed in the collection:
- If you get only one book/work, which one do you need? We can put that in nora or wordhoard or may be Tapor to understand what the current tools can do today.
text
- If you get 5 books/works? (the 1st steps of Monk)
text
- In your dreams: what collections do you want?
text
- Is there multiple versions of your documents you would need to see in paralel or combined? Is there foreign language or unusual characters?
Generality: what other questions other users might ask that would be similar to your question?
Granularity: can you guess the granularity(ties) you will need to use? word, paragraph, page, books etc. Multiple levels?
Characteristics: what low level characteristics of the text you think will be useful for your research? (e.g. POS, Ngrams, Soundex).
Patterns Can you try to express examples of complex patterns you want to identify, or hope to find?
Morphology: example of use?
Tags:
- Would you make use of existing xml tags in those collection? For the standard ones (date, author etc)? or is there special ones on the collection you want that will be useful.
- Other Dream tags: you would like to have? (e.g. things/entities you wish would be extracted for you, e.g. places, mentions of money, numbers, etc.)
Classification: Is classification interesting (e.g. supervised learning like in nora or not). Give examples of questions
Comparisons: Are comparisons between texts useful? which comparisons?
Topic extraction: interested?
Lexicon, counts of words, most common occurences, concordance Describe need and importance
Annotation: _if you could annotate.. give example of what type of annotation you would love to have in the tool itself.
Collaboration If we had collaborative tools, who would you collaborate with? what annotation, results, tools etc. would you like to share, or not share
Bonus question: What's your favorite example of text analysis results paper? and why?
|