Saturday, 25 February 2012

 History and Linguistics
I went on to the historyspot site through Twitter and watched Magnus Huber talking about Corpus Linguistics and how that related to the recording of 134 million words which is a colossal amount of data on the Old Bailey online. He acknowledged Tim Hitchcock's role in that.
From the the linguist's point of view it is historical corpus of spoken English. From the historian's angle it is a record of spoken language related to historical events more than 200 years ago. XML tags were mentioned and a layout was used rather than metalinguistic data to record speech events, pronouns (which signify spoken language). This material was used as a basis for a linguistic corpus.
Someone point out my mistakes if I got some of this information incorrect!
I really need to watch the whole video properly. I was just taking a Twitter 'break' from planning my assignments for both History modules; and this is how I got side-tracked!

No comments:

Post a Comment