Friday, 3 April 2009

Reading again today (and keeping the machine working).

Trying to find out how people have done geoparsing in the past, whether they try to use context, and if so how. I am hoping to take the view that the other georefs in a page are all the context needed and to use gazetteers to work out the relationships between them. This has been done before, but I am not sure anyone has investigated why certain assumptions should hold. Most of it has been done with implementation in mind and evaluated by testing the results.

2 comments:

  1. Types of context:
    immediate text window
    distant (like other georefs)
    sliding window of georefs

    uses:
    positive (it is Sheffield, South Yorkshire)
    negative (it is John Sheffield, a name and therefore not a place)
    nearest places
    supporting places

    ReplyDelete
  2. wrappers use REs to look at html near place names (or better find likely placename sections)

    ReplyDelete