Full data for #Covid19uk during UK lockdown

This blog provides access to data on tweets using the #Covid19UK hashtag during the UK lockdown, which began on 24 March 2020. The data were extracted using TAGS, and then mapped using NodeXL. There were of course other UK-focused hashtags used during this period and some UK-based Covid-19 tweeting that did not use any hashtag. However it was unfeasible to capture all the data. I therefore stuck with one hashtag all the way through. You can see further information, including other search terms and ways of presenting data, in a tweet thread. These data are updated on a weekly basis (except where there is too much data to plot at once, in which case they are charted by day). There’s more about this work on the BMJ Opinion blog.

The URLs in the table point to NodeXL reports that describe the data more fully. Note that some of the early days of the UK lockdown had so much tweeting that it was not possible to obtain a complete data set (either because of the 18,000 tweet & retweet limit or because of difficulties accessing the Twitter API; you can only extract Twitter data 7-10 days into the past). Fortunately NodeXL also allows us to look at the tweet data for posts retweeted during the periods where there are data. That means that we can access some of the more popular tweets from the periods that were “missed”. For example, for 20 March, pre-lockdown, there were only 8 hours 21 minutes of data extracted (15:38 to 23:59). However, retweets during this period allowed the identification of tweets going back over 2 weeks. The following graph shows daily data from 3 to 17 March and hourly data thereafter, plotting tweets but not retweets.

Click into the links in the table below to see the NodeXL reports. These are typically weekly reports, starting each Tuesday (as the lockdown began on a Tuesday). Scroll down to the bottom of the NodeXL reports to download the raw data (unless the table below has a link for “data”, in which case the file was too large to upload to the NodeXL Graph Gallery, and needed to be uploaded to Dropbox instead). In the NodeXL reports select the link – “Download the Graph Data as a NodeXL Workbook”. Sometimes there was just too much data to analyse in one go, so I have processed them in days instead. Most of the NodeXL reports were produced in September or October. That means that some of the tweets had been deleted, retweets retracted, sometimes accounts deleted. For example, for 24 March – re-extracted on 1 October – 14,735 tweets and retweets were identified out of 16,192 originally collected. There may be some interesting findings from looking at the posts that have disappeared in the intervening period – perhaps some were from “troll farm” and bot accounts that have been removed. I have a lot of the original TAGS data, which will allow further study of this question.

NodeXL graphs (e.g. the graph below for 24 March) show Twitter accounts that tweeted, retweeted, and/or were mentioned in tweets. These are called “vertices”. You can read more about how to interpret these maps in an article by NodeXL and Pew Research. You can also read a lot more about how to analyse and interpret the raw data on the ScotPublicHealth blog and in my papers published in peer-reviewed medical journals.

The number of Twitter accounts included in the weekly extracts has varied over time, peaking at the start of lockdown, a quiet period over the summer, and an increase during September as we contemplated a return of stricter lockdown rules.

Hopefully there will be time over coming months to analyse these data in collaboration with academic teams. Please contact me if you are interested in contributing. However, for the moment my focus is on GP exams and clinical work. Watch this space for further data over coming months.

Week of lockdownDatesVertices (number of tweeters, retweeters & mentioned accountsURL to access data
Pre lockdown
(4 days)
20-23 March5073720 March  
21 March  
22 March  
23 March
124 – 30 March8634824 March
25 March
26 March
27 March
28 March
29 March
30 March
231 March – 6 April6108431 March
1 April
2 April
3 April
4 April
5 April
6 April
37 – 13 April42330Report
414 – 20 April40443Report
521 – 27 April36964Report
628 April – 4 May35491Report
75– 11 May55688Report
812- 18 May5537712 May
13 May
14 May
15 May
16 May
17 May
18 May
919 – 25 May44803Full week
1026 May – 1 June44714Full week
112– 8 June25091Full week
129 – 15 June21481Full week
1316-22 June18857Full week
1423-29 June30340Full week
1530 June – 6 July23065Full week
167 – 13 July24174Full week
1714 – 20 July14243Full week
1821- 27 July9191Full week
1928 July – 3 August14549Full week
204 – 10 August8357Full week
2111 – 17 August7618Full week
2218 – 24 August6125Full week
2325– 31 August8143Full week
241 – 7 September9774Full week
258 – 14 September35498Full week
2615 – 21 September24634Full week
2722 – 28 September24250Full week
2829 Sep – 5 October13756Full week
 296 – 12 October  17766 Full week 
 3013 – 19 October  19491Full week 
#covid19uk data by week from the start of UK lockdown

Dr Graham Mackenzie, GPST2, Edinburgh, Scotland

12 October 2020.

graham mackenzie on twitter

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s