This page last changed on Apr 04, 2007 by plaisant@cs.umd.edu.

Long title

Name: Name and user type you represent (Scholar? Curator?)

Problem/Question: a paragraph and links to other materials if needed, describing the research you want to conduct

Status of the research: Are you working on this now, e.g. manually on a small scale or with other tools? If this is your thesis, when do you hope to graduate? (i.e. what is the latest date by which you hope to be able to use the tools?)

Measure of success: What would be a good indicator of success in this case study? For you personally (e.g. a paper, a paragraph of your thesis talking about the work you did)? Being able to analyse more than 2 books? Proving Joe Shmoe wrong?

Texts needed in the collection:

  • If you get only one book/work, which one do you need? We can put that in nora, WordHoard, or maybe TAPoR to understand what the current tools can do today.

text

  • If you get 5 books/works? (the first steps of MONK)

text

  • In your dreams: what collections do you want?

text

  • Are there multiple versions of your documents that you would need to see in parallel or combined? Are there foreign-language passages or unusual characters?

Generality: What other questions might other users ask that would be similar to yours?

Granularity: Can you guess the granularity (or granularities) you will need to use? Word, paragraph, page, book, etc.? Multiple levels?

Characteristics: Which low-level characteristics of the text do you think will be useful for your research (e.g. POS, n-grams, Soundex)?
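
To make this question concrete, here is a minimal Python sketch of two of the characteristics named above: word n-grams and a Soundex code. It is only an illustration under stated assumptions. The function names `ngrams` and `soundex` are hypothetical and are not part of nora, WordHoard, TAPoR, or MONK; the Soundex variant shown is the common American one, and tokens are assumed to be already split into words.

```python
# Illustrative sketch only; names are hypothetical, not from any existing tool.

# Build the letter-to-digit table used by American Soundex.
CODES = {}
for digit, letters in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for ch in letters:
        CODES[ch] = str(digit)


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def soundex(word):
    """Return a 4-character Soundex code, or "" if the word has no letters."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept after it
    return (result + "000")[:4]


print(ngrams("to be or not to be".split(), 2))
print(soundex("Robert"), soundex("Rupert"))
```

Names that sound alike, such as "Robert" and "Rupert", receive the same code, which is why a characteristic like this can help with spelling variation across a collection.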

Patterns: Can you give examples of complex patterns you want to identify, or hope to find?

Morphology: example of use?

Tags:

  • Would you make use of existing XML tags in those collections? Only the standard ones (date, author, etc.), or are there special ones in the collection you want that would be useful?
  • Other dream tags: Which tags would you like to have? (e.g. things/entities you wish would be extracted for you, such as places, mentions of money, numbers, etc.)

Classification: Is classification interesting to you (e.g. supervised learning, as in nora)? Give examples of questions.

Comparisons: Are comparisons between texts useful? Which comparisons?

Topic extraction: interested?

Lexicon, counts of words, most common occurrences, concordance: Describe need and importance.
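
As a reference point when describing your needs, the following is a minimal Python sketch of word counts and a keyword-in-context concordance. It is an assumption-laden illustration, not the method of any existing tool: the helper names `tokenize`, `word_counts`, and `concordance` are hypothetical, the tokenizer is a crude regular expression that only handles unaccented lowercase letters and apostrophes, and the sample sentence is made up.

```python
import re
from collections import Counter


def tokenize(text):
    """Crude tokenizer (an assumption): lowercase runs of a-z and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def word_counts(text):
    """Count how often each token occurs."""
    return Counter(tokenize(text))


def concordance(text, keyword, width=2):
    """Return (left context, keyword, right context) for every occurrence."""
    tokens = tokenize(text)
    target = keyword.lower()
    lines = []
    for i, tok in enumerate(tokens):
        if tok == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines


sample = "The whale surfaced. The crew watched the whale dive."
print(word_counts(sample).most_common(2))
for left, word, right in concordance(sample, "whale"):
    print(f"{left:>15} [{word}] {right}")
```

Real collections would need a tokenizer that copes with the foreign-language text and unusual characters asked about earlier in this form.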

Annotation: If you could annotate, give examples of the types of annotation you would love to have in the tool itself.

Collaboration: If we had collaborative tools, who would you collaborate with? What annotations, results, tools, etc. would you like to share, or not share?

Bonus question: What is your favorite example of a paper presenting text analysis results, and why?
