Reputation: 181
I want to implement "Template-based information extraction without the templates" paper and for the first step, I have to do "clustering Events". I have the MUC dataset and have parsed and tokenized it. But there is something that I cannot understand. In the paper, it says that : "We cluster event patterns to create templates. An event pattern is either (1) a verb, (2) a noun in WordNet under the Event synset, or (3) a verb and the head word of its syntactic object. Examples of each include (1) ‘explode’, (2) ‘explosion’, and (3) ‘explode:bomb’." I was wondering what are these three conditions. How can I apply them to cluster my data? I mean should I first go search in wordnet and remove some words? I just do not know how to achieve the goal. Any help will be appreciated.
Upvotes: 0
Views: 242
Reputation: 1261
I am writing this reply to help future readers.
You've posed 4 questions. I will try and address all of them. All your questions are surrounding a singular portion of the paper: the approximation of template topics using what the authors term "event patterns".
The three "conditions" are the definition of event patterns (This is explained in the paper --- its also in the text you've quoted).
The authors define an event pattern as (a) a verb, (b) a noun you can extract from an event's synset on WordNet (recall that there are six types of events that have templates in the corpus i.e. bombing, kidnap, attack, and others), and (c) the concatenation of the verb with the head of its object.
You're not using them to cluster your "data" -- they are the data to be clustered. You're supposed to extract these so-called event patterns from the documents and cluster them using the two algorithms discussed in [2, p978-979]. The documents are from the terrorism corpus [1].
No. You're supposed to determine what nouns belong to an event's synset on Wordnet.
You need to improve the way you read a paper. Read a paper and make notes. After that, re-read the paper and revisit the questions + points you jotted down. See [3] for tips. If you read it in that manner, you should be able to get sufficient understanding to achieve your goal.
Refs
Upvotes: 1