Posted by: Ian | September 20, 2011

Trying to get my head around the options….

OK, so here’s the problem:

An organisation can have multiple names, and it can have multiple URLs…. and sometimes one can identify a straight one-to-one relationship between the two.

For example: Riga Technical University is the english name for Rīgas Tehniskā Universitāte. Being clever, I have identified http://www.rtu.lv as the home page (in its native Latvian) and http://www.rtu.lv/en (as the english-language version). I can even associate the URLs as appropriate: the english name links to the english-language pages, and the Latvian name links to the Latvian-language pages.

Life is a tad more complex in other places. For example “Đại học Quốc gia Hồ Chí Minh” can be called either “National University of Ho Chi Minh” or “Ho Chi Minh City Vietnam National University” in english….  yet I have only one URL: http://www.vnuhcm.edu.vn

Contrary-wise: EDINA has just one name, but two URLs (http://edina.ac.uk and http://www.edina.ac.uk )

There are, naturally, some unknown number of instances where the name and the URL have not been linked – where the harvesting code was unable to make a “sensible” correlation.

The problem is working out how to model this sometimes-present relationship of many-to-many – in code, in data-returns, and on the screen.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Categories

%d bloggers like this: