• Kat Gupta’s research blog

    caution: may contain corpus linguistics, feminism, activism, LGB, queer and trans stuff, parrots, London

My tools

Recently I’ve been going to bike maintenance workshops. It’s been an interesting and often satisfying experience. There’s the comfort of knowing how to check your own bike for damage, how to mend punctures and how to be a more self-sufficient cyclist. There’s also a sense of satisfaction about doing something that has physical results, something that gets you covered in bike grease and dirt, something that requires you to work with your hands as well as your mind.

There seem to be a few books out discussing the issue of working with one’s hands and the dangers of office work. It’s hard to write about this without romanticising manual work from a smug, clean fingernailed, academic stance, from the privileged stance of this being a choice for me, of this being as unnatural for me as Marie Antoinette playing milkmaid at her hameau.

However, for me at least, there’s also a sense of responsibility; if I use something every day I should be able to understand how some aspects of it work, be able to fix some problems, know when I’m out of my depth. It gives me a better understanding of my tool’s capabilities and limitations. Does it make me a better corpus linguist? In some ways, yes – I built my desktop myself to a specification I designed for corpus linguistics. It’s skewed towards processing power and data storage, and light on the graphics. In other ways, not yet. While a computer is a bit more than a magic box for me, there’s still so much I don’t know about what it can do and how it works. The more sensitive I become to how computers work, what they’re capable of, what they are incapable of, and the implications these have for investigating language use…well, it can only be a good thing, right?

Bit of a ramble I’m afraid, I’ve caught some horrible student lurgy and am feeling a bit fuzzy-headed.

creating a history

I don’t remember where I first came across the suffragettes. I was an avid reader as a child, the sort whose parents attempted to ban reading at the table because their beloved offspring would become so engrossed in a book they’d forget to eat, and the sort whose parents were thwarted in this ban when their beloved offspring would read sauce bottles and milk cartons and cereal packets pointedly and out loud. I had certainly come across them by secondary school, and I remember being frustrated when my GCSE history textbook contained a few pages on the suffrage campaign but it wasn’t on the syllabus.

The textbook had the sort of information you’d expect – women chaining themselves to the railings, suffragettes, an image of a poster of a large, evil-eyed cat with the limp, helpless body of a long-skirted woman in its month, photographs and a short case study of Emily Wilding Davison at the 1913 Derby. It wasn’t much, but I was fascinated.

Some ten years later and I found myself actually researching the suffrage movement. In some ways, not studying the movement as presented by my GCSE textbook was perhaps beneficial; it meant I had fewer preconceived ideas about the movement. The more I read (and continue to read), the stranger the focus of that textbook appeared. Why the focus on women chaining themselves to railings, when my reading suggests that was a fairly minor part of suffrage activities – things like large demonstrations, deputations and window-smashing seem more prevalent both in the literature and in the texts I’m studying? Why the focus on the Women’s Social and Political Union rather than reflecting the diversity of the movement and the myriad groups involved? And why the consistent use of the term suffragette, rather than the more inclusive term suffragist?

It’s remarkably persistent too – I’ve lost count of the times people have asked me if my research focuses on these popular representations of the suffrage movement, and I’ve had people email me back to ask if I meant to write suffragette instead in my abstracts. I don’t think it’s a lack of interest – far from it. People seem intrigued by the movement, its actions and figures within it. This event, called “Remember the Suffragettes: a Black Friday vigil in honour of direct action” was held in November 2010 and clearly acknowledges suffragette direct action and its consequences.

I don’t have answers about why this simplified portrayal of the suffrage movement exists, why the Pankhursts and the WSPU and women chained to the railings linger in people’s memories and have this resonance. The cynical feminist in me says that we like our women feisty but not dangerous, that chaining oneself to railings is provocative yet non-threatening, that we are more comfortable with women’s martyrdom that we are with their bombs and arson. We like equal rights and are a bit shocked that women couldn’t vote less than 100 years ago, but we’re a bit scared by what it might take to attain those rights. We’d rather hear about action and grand gestures than endless rounds of legal and political debate and scrutiny. My thinking here is almost certainly simplistic, but I rather suspect that our focus on these particular aspects of the suffrage movement (and neglect of others) says more about our concerns than theirs.

swearing with Google ngrams

Back, after an unwelcome hiatus. I’ve learnt my lesson though, and will be backing my wordpress database up. Regularly.

Anyway, being a linguist of the sweary variety, I was intrigued to see someone on twitter use Google lab’s ngram viewer to look at cunt and express surprise and delight that cunt was being used so frequently rather earlier than expected.

I thought the graph looked interesting.  The frequency of cunt was rather erratic: an isolated big peak in around 1625-35; an isolated smaller peak in around 1675; peaks in 1690ish and 1705ish; a rather spiky presence between 1705 and 1800; then fairly consistently low frequency until around 1950 when its frequency increases again.

This seemed puzzling – rather than being fairly low-level but present, there were these huge spikes in the 17th century.  I decided to have a look at the texts themselves.  These turned out to be in Latin, and the following image rather neatly illustrates the two different meanings at work here:

The books themselves seem to be religious texts written in Latin, even if Google’s ever-helpful advertising algorithm seems to interpret things rather differently. As you can see in the first image, I selected texts from the English corpus. It’s possible that the books are assigned a corpus based on their place of publication, but it’s not very intuitive.

I took a closer look at the texts to try and work out what was going on.  Some of the texts were in Latin, as this example taken from De paradiso voluptatis quem scriptura sacra Genesis secundo et tertio capite:

However, this was not the only issue.  I found at least one example of a musical score – this example taken from Liber primus motectorum quatuor vocibus:

Here, the full lexical item is benedicunt. In both of these examples, cunt is not a full lexical item; I can understand why the layout of the score might have led to it being parsed as a separate item, but I’m a bit confused why the same seems to have happened with dicunt.

The high frequency of cunt can also be attributed to Optical Character Recognition (OCR). Basically, the text is scanned and a computer program tries to convert the images into text.  This has varying degrees of accuracy – it can be very good, but things like size and font of print, the paper it was printed on and age of the texts all have an effect. The text obtained through scanning with OCR is then linked to the image.

This example, taken from Incogniti clariss. olim theologi Michaelis Aygnani Carmelitarum Generalis, is probably familiar to those working with OCR scanned texts. The text actually reads cont. but the OCR has read it as cunt. The search program can’t read the image files; all it has to go on are OCR scanned texts. When these aren’t accurate, you get results like these.

I think Google ngram is interesting, but with some caveats. Corpora can be tiny – the researcher can have read every single text in their corpus and know it inside-outside. Corpora can be large and highly structured, like the British National Corpus. Corpora can be large and the researcher doesn’t need to have read every single text contained in them, but through careful compilation the researcher knows where the texts have come from, where they were published and so on – for example, corpora assembled through LexisNexis. This is a bit different – it’s not really clear what’s even in the collection of texts and the researcher has to trust that Google has put the right texts in the right language section. I’ve seen Google ngrams being used to gauge relative frequencies or two or more phrases, but for now I think I’ll stick to more traditional corpora for most in-depth work.

Mark Davis also has a post comparing the Corpus of Historical American English with Google Books/Culturomics. His post is in-depth, interesting and systematic; I just swear a lot. You should probably read his.

Arson and unresolutions

Happy New Year! I’ve started the new year with a bit of arson. Not mine, I hasten to add.

I’ve been kind of struggling to find a good record of suffrage actions; part of the problem of interdisciplinary research is that if you’re basically teaching yourself a subject, there are almost certainly going to be resources well-known to ‘natives’ (for want of a better term) of the field but which you’re blissfully unaware of. It feels a bit rude to keep bothering historians with what must be rather inane questions, but happily I’ve found Andrew Rosen’s Rise Up, Women! The Militant Campaign of the Women’s Social and Political Union 1903-1914 and that’s doing rather nicely to flesh out the spring and early summer for me. He calculates that the WSPU caused an estimated £145,025 of damage between March and August; out of curiosity I looked for a way of converting this into current values and to my joy, found not only a huge list of resources but an online converter. It gives two figures depending on which dates you plug in:

In 1910, £145,025 0s 0d would have the same spending worth of today’s £8,275,126.50
In 1915, £145,025 0s 0d would have the same spending worth of today’s £6,244,776.50

It’s probably not going into the chapter, but it is useful to understand the non-suffragist reaction. I can understand why there was so much hostility directed at the WSPU both from outside and within the suffrage movement. I don’t think the feeling of “stop doing that, you’re making our side look bad!” is an unfamiliar one to activists, then as now.

Today I’ve also shuffled around some stuff on the Nottingham Students Against Fees and Cuts blog, added more photos and posted up an article. This leads in to my set of…well. I don’t do resolutions, but here are some things ‘d like to sort out this year:

  1. Better balance with my various commitments. Especially in the last few months of 2010, I was juggling LGBT trans welfare and committee stuff, protest stuff, thesis work, various jobs, German class and being the responsible one in the house. These things are all important and I don’t want to drop any of them (except for being the responsible one in the house) but I need to find a way of not stressing myself out with them.
  2. Join a dojo and get back into ju jitsu training. It’s something I enjoy, I’ve found a dojo near me that seems to offer classes on different days and the exercise will hopefully make me sleep better. I need to check that the dojo has people of a range of sizes and abilities (not so much because I can’t throw big people around, but because I don’t appreciate being thrown around as if I, too, am a big person).
  3. Be more disciplined about thesis work; stop unproductive fretting and agonising about what to do, email my supervisor before I work myself into a state, be more decisive and confident in my work
  4. Be more organised. Hah.

Does anyone else have academic-related resolutions?

it lives! it liiiiiives!

I’m currently working on something that seem to have mutated out of a chapter. No, wait, that’s a rubbish description.

In Chapter 4 I examined words derived from Mutual Information for the words suffragette, suffragettes, suffragist and suffragists. Between the historical research and my data-driven categories, I identified the following categories: constitutionalist vs militant, class, geography, gender/gender roles, origins, direct action, legal and prison, proper names, organisational, politics and opposition. I then investigated the direct action category in more detail.

The terms I looked at (disturbance*, outrage*, violence, crime*, disorder and incident?) were evaluative on a lexico-grammatical level. However, upon reading the texts, I realised that there were other types of evaluation at work in the texts. These were longer and more analytical, operating at the discourse level and could only be discovered by reading the texts. So I read texts. One of the themes that emerged was the tension between organised actions and individual actions. I started planning out Chapter 5, worked out which period to focus on, worked out which articles in that period I was going to analyse, and read a lot of discourse analysis.

I then started to analyse the articles, only to discover something a bit interesting in the arrangement of texts within the articles. It wasn’t mentioned in the scholarship about historical newspapers I’d read. I’m still searching to see if someone, anyone has researched it. I thought it was interesting though, and talked to my supervisor about it. She encouraged me to explore that idea a bit more; maybe it would be interesting in its own right, maybe it would make another chapter stronger.

It’s now becoming something that I think makes the link between Chapters 4 and 5 stronger, but which I think ought to be a chapter itself. This is very much data-driven research; I thought I’d be doing some fairly straightforward (critical) discourse analysis and I wasn’t expecting to find something like this, but instead I’ve found something that makes me reconsider the structure of my thesis.

In a way, I like the chaos. I like having a sense of freedom to explore things, I like being able to say “wow, this is interesting, I should pursue it”, I like getting excited about new things and part of me is going what, wait, how has no one else discovered this? Am I really the first?. I’m a perfectionist, and I have a hard time committing to something because I’m convinced that if I fussed over it just a little more it would be even better. But at the same time, I’m incredibly conscious of the time restraints and the fact that I need to knuckle down and get this thesis done.

Does anyone else feel like this about their thesis? How do you decide between sticking solidly to your plan or haring off after something interesting? Am I setting up a false dichotomy here and it’s possible to have a compromise?

Thoughts on direct action

In my research, it’s impossible not to come across issues of direct action. Earlier this year I was looking at words derived from Mutual Information for suffragist, suffragists, suffragette and suffragettes. A category for direct action terms emerged from this data, and I started to look at disturbance*, outrage*, violence, crime*, disorder and incident? in more detail.

To quote from a draft of this chapter:

The terms examined in this chapter can be paired – two relatively high frequency, non-specific terms disturbance* and outrage*, two low frequency, specific terms violence and crime* and two terms, disorder and incident?, which are both used to describe suffrage campaigners’ interactions with non-suffrage supporters, but differ in which groups they are used in conjunction with. Reports shy away from using violence and crime* to describe suffrage campaigning and instead use more ambiguous terms such as disturbance and outrage; the actions encompassed by disturbance and outrage include the disruption of meetings and heckling as well as more destructive acts such as ‘fire outrages’ and bombs.

The damage and destruction was largely confined to property. If there was physical aggression, it was nearly always faced by the suffrage campaigners than posed by them. The Times describes “large hostile crowds”, “active hostility to suffragists”, several “scenes of great disorder” and meetings broken up by “young roughs”. The texts I am working with at the moment describe “disorderly scene[s]” at suffragist meetings, but on closer examination the meetings themselves seem to have been highly organised and the disruption was due to members of the public, who formed a “hooting and jostling mob”, hassled the speakers and attempted to hustle them out of the park.

And then the Browne Review was announced. As a young researcher in arts, humanities and social sciences (I might declare which area I’m in if someone decides to fund me), I am worried about what this might mean for these areas. I’m worried about the Education Maintenance Allowance, AimHigher and Lifelong Learning UK being scrapped with no clear information on what will replace them. I’m worried about the privatisation of Higher Education. These cuts to education take place against a background of cuts to public services, and it’s difficult not to interpret them as ideological.

I was not the only one to feel like this, and direct action began to seem rather closer to home. Students marched on the 10th November, 24th November and 9th December. The argument that “the broken pane of glass is the most valuable argument in modern politics” began to be heard again and yet again, damage to property was positioned as worse than damage done to people’s lives.

With other students at my university, I engaged in peaceful direct action. I made my placard, marched, chanted, and was duly kettled for around six hours. I’m pretty sure that this video is from where I was kettled. Protesters were injured – a woman beaten and racially abused by police, a man nearly died after being beaten and Green and Black Cross has an appeal for witnesses. People found being inside a kettle a “shocking experience”. There are photos of the day (link 1, link 2) but I’m not sure how well they capture most protesters’ experience – the vast majority were not hurling paintbombs and setting benches on fire.

Parliament still voted to allow an increase in tuition fees.

There are criticisms of the focus of reporting, criticism of kettling tactics and outrage over the state (and state-sanctioned) response to these protests. I am confident that more criticisms will emerge.

There are things I will take from this on a personal and political level, but at the moment I’m trying to work out what I can take from this as a researcher in direct action.

One of these things is the sheer courage it takes to engage in direct action. I can understand why people may be ambivalent towards militant direct action in the suffrage movement – at times it can come across as misdirected or petty. But it takes bravery to demonstrate when you know this could lead to police violence against you.

A second point is about disenfranchisement. Many of those on the demonstrations were young and without a political voice. Demonstrating – being physically present at the gates of Parliament – was one of the few ways open to them to get their demands heard. While these people are excluded on the basis of youth – a temporary condition, rather than the more fixed one of sex – it still shows how easily those not able to vote can be ignored by government.

A third is about the nature of direct action. There is a fair amount of discussion going on about what direct action should encompass. There are calls for peaceful demonstrations, candle-lit vigils, writing to MPs…but these have been going on for years with no demonstrable effect. There’s a sense of frustration amongst protesters. I was marching against top-up fees in 2003 and I think these protests have partly come about because people have tried to be nice and polite only to be ignored. Suffragists, too, tried a peaceful approach – they had demonstrations and rallies, they lobbied MPs, they sent deputations to members of the Cabinet, they organised petitions. If I can feel frustrated after a mere seven years of campaigning, it makes me more sympathetic to women who’d been campaigning for over 30 years with no real progress.

A fourth is a about the fractures in a movement that can result from direct action, and the willingness for some parts of the movement to disown those who do take part in property damage. You can almost hear the ‘respectable’ protesters scrambling to distance themselves from the balaclava-clad youths clutching spraypaint and lighter fuel. No one seems to be asking why they’re engaging in such actions, yet these “bainlieue-style youth from Croydon, Peckam, the council estates of Islington” are well aware of what university will cost them and are one of the most vulnerable groups: “We’re from the slums of London, how do they expect us to pay £9,000 a year uni fees? EMA is the only thing keeping us in college, what’s stopping us from doing drug deals in the street now? Nothing.”

While direct action within the suffrage movement was organised by a different demographic, there still seem to be the same split between those urging peaceful direct action – such as the NUS’s Right to Recall campaign – and those engaging in different forms of protest.

What I’d take to these protests from my research is an understanding that there’s room in a movement for lots of different forms of protest. It’s not an either/or decision between things like the Right to Recall campaign and more confrontational things like occupations, and it’s important not to fracture along these lines.

I didn’t really expect to be a participant-observer of direct action when I started my PhD, but I hope it makes me a better researcher – a less judgemental one, and one better able to understand the pressures that lead to direct action

oh no, not another research blog

Apparently this is how the cool kids are disseminating their research.  I’m Kat and I’m an apprentice academic (of sorts).  My background is in corpus linguistics but my thesis also involves discourse analysis, history and politics.  My thesis focuses on how the British women’s suffrage movement was represented in The Times newspaper between 1908 and 1914.  More broadly, my research interests include language and gender, language and power, sociolinguistics, discourse analysis and corpus linguistics.

I also like tea, feminism, activism, books, webcomics, music, LGBT and queer issues, rats, and procrastination.