Prepositional Phrase Attachment: data files

The files have one of two extensions:

The prefix indicate which are the test files and which are the training files.



 

  train.lema.occs
  train.lema.feat
 

  test.lema.occs
  test.lema.feat

In addition, the files pp.v.out and pp.n.out consists of complete parse trees of the sentences, along with the the 4 tuples extracted from them. The .v and .n files contain the v and n labeled examples, , respectively.