In order to visualise and compare the emphasis of each political party's election manifesto, we have text-mined them into Word Clouds. The seven manifestos shown here have been treated in exactly the same way in order to achieve maximum objectivity.

The process of producing these Word Clouds was as follows:

  • download the manifestos from each party's web site as a PDF file;
  • export all the text each contains into a text file (note: text embedded within images could not be exported from the PDF files);
  • page headers and or footers (where present) were removed;
  • each chapter in a manifesto was grouped into a single block of text;
  • a two-column CSV file was constructed containing all the manifestos with party name in the first column and chapter texts in the second (a total of 121 chapters, approximately 189,000 words;
  • the CSV file was loaded into open-source statistical package R using the text mining package tm and package wordcloud;
  • the text is transformed in a corpus (a data type for analysing texts) and further cleaned to turn all the text to lower case, remove all punctuation and numbers, remove unwanted words such as 'a', 'the' and 'we', as well as political party names;
  • the remaining text for each party is turned into its own Word Cloud (using exactly the same parameters for all parties) such that the most frequent words are larger, a maximum word count of the 200 most frequently used words and a maximum size to the plot (smaller plots occur where the total word count of the manifesto is smaller).

Presented below are the Word Clouds for each political party as well as one derived from all the manifestos together.

Further text mining will be carried out to produce more analytical visualisations.
This type of text mining is just a small part of what our students learn on the MSc and Professional Doctorate in Data Science at the University of East London.

Conservative Election Manifesto 2015 (Text Mining)

Word cloud

Commonality between Conservatives in 2010 and 2015 (Text Mining)

Word cloud

Comparison between Conservatives in 2010 and 2015 (Text Mining)

Word cloud

Liberal Democrat Election Manifesto 2015 (Text Mining)

Word cloud Lib Dem

Commonality between LibDem in 2010 and 2015 (Text Mining)

Word cloud Lib Dem

Comparison between LibDem in 2010 and 2015 (Text Mining)

Word cloud Lib Dem

Labour Election Manifesto 2015 (Text Mining)

Word cloud Labour

Commonality between labour in 2010 and 2015 (Text Mining)

Word cloud Labour

Comparison between labour in 2010 and 2015 (Text Mining)

Word cloud Labour

UKIP Election Manifesto 2015 (Text Mining)

Word cloud UKIP

Green Election Manifesto 2015 (Text Mining)

Word cloud Greens

Plaid Cymru Election Manifesto 2015 (Text Mining)

Word cloud Plaid Cymru

SNP Election Manifesto 2015 (Text Mining)

Word cloud SNP

Commonality between Conservative and Labour Manifestos (Text Mining)

Word cloud Con Lab

Comparison between Conservative and Labour Manifestos (Text Mining)

Word cloud Con Lab

Centre for Geo-Information Studies

The Centre for Geo-Information Studies is an established research centre specialising specialise in all aspects of geo-information science.

Read more