User Tools

Site Tools



This is a wiki for Taking science to the crowd: Researchers, programmers and volunteer contributors transforming science online, a research project based at the University of Gothenburg.

Feel free to contact us with any questions.


Blog posts



The purpose of this analysis is to analyse three types of heterogenous network data found in one single policy paper called the Green Paper on Citizen Science, written by the Socientize project, which is funded by the EC. In this report the following types of data will be analysed:

  1. Scientific Literature (DOI lists searchable in the Web of Science)
  2. Web sites (URLs crawlable with Issuecrawler, or similar)
  3. Other Policy documents (work in progress).

This way, the network of each policy paper (in this case the Green Paper on Citizen Science) could perhaps be mapped. Then, the following questions can be asked:

  • What is the scientific evidence for this policy?
  • What are the influencing networks?
  • What previous policy informs the report?

The challenge is, however, to find ways of merging and visualizing very different kinds of network nodes. For example, scientific literature is very rich on meta-data that shows several different qualities of networks. So do URLs that can be crawled throughout the internet. But books, printed reports and journals outside the indexed webs of science are poor in metadata. Thus, some experimentation is required.

Note: this is just a page for work-in-progress. Every line of analysis is highly tentative and in constant flight.

1. Visualizations of Scientific Literature (WoS data (n=21))

Cited Referenes: Journals

This map was created using VosViewer to find out what journals the Green Paper cites (indirectly, as cited references of the 21 articles).

Cited References: Authors

This map was also created in VosViewer to find the central authors.

Cited References: Authors 2

This visualization shows the Cited References network of the 21 articles found in the Green Paper, visualized using Gephi. The size of the nodes is based on Cited References in the local dataset (see list here). Articles that have less than 2 local citations have been excluded for the purpose of clarity.

Source Data (gephi) 21recs, Cited References, Minimum 1 Local Citation, Author Nodes, Rank by local Citations.

Except for Irwin, the other large nodes represent ecology researchers.

Keyword Map

This map was also created using Gephi. It is based on the keywords of the 21 articles.

The trend of ecology is very much present here.

2. Visualisation of URLs (n=25) found in the Green Paper

In the entire Green Paper Document there are 25 URLs. These were crawled with Issuecrawler and Navicrawler and then exported to Gephi.

This visualisation primarily shows that the Green paper document is very EU-centric. This is of course no surprise as an isolated observation. However, when looking at the various links, there seem to be an interesting intertwinement of NGOs/Lobby groups and specific (funding) agencies in the EU.

IssueCrawler Results

The same 25 URLs found in the Green Paper were processed in Issuecrawler. Similar, but not identical results were obtained.

IssueCrawler Visualisation

Issuecrawler has a deeper crawl and it reveals further networks. The primary Eu institutions are clearly clustered together on both maps. But also the national government institutions appear. The network also ties in the various framework programme projects of the EU. US projects, such as and the American universities (Berkeley, Stanford) are in the network periphery.

IssueCrawler Visualization II (gephi)

HYPHE results

The same 25 URLs were processed using the Hyphe Software by Médialab. The pages were crawled and I manually excluded some of the big social media platforms, such as Facebook and Twitter, for the sake of clarity. Hyphe is less blackboxed than IssueCrawler, and it is thus possible to make decisions across the whole network, not just the “core network” that IssueCrawler lets you download. This way, the dataset is much more flexible and allows for a greater zoom.

You can produce your own visuals with the following files:

Hyphe I: Entire network

Note: This is just a zoom on one of the nodes. The entire network must be viewed as a pdf-file.

Here, we see the Socientize web page. It is very easy to see what pages Socientize links to, and what pages it receives links from.

Hyphe II: Blogosphere zoom

This image is produced from the same large network as above. Here we can zoom in on two important blogs that are linked from the Green Paper. and NYecology have two links in common (besides being linked from the same Green Paper): and

3. Visualisation of other policy reports

(not yet started)

Source Data

greenpaper.txt · Last modified: 2015/01/03 11:37 by christopher