Dataset : Interactions of pharmaceutical companies with world countries, cancers and rare diseases from Wikipedia network analysis

RightsAttribution, Non Commercial, Share Alike
vignette
Collection

Quotation

José Lages, Dima Shepelyansky, Guillaume Rollin (2019): Interactions of pharmaceutical companies with world countries, cancers and rare diseases from Wikipedia network analysis. UTINAM. FR-18008901306731-2019-08-12

General metadata

Identifier : local : FR-18008901306731-2019-08-12
Description :
Using the English Wikipedia network of more than 5 million articles we analyze interactions and interlinks between the 34 largest pharmaceutical companies, 195 world countries, 47 rare renal diseases and 37 types of cancer. The recently developed algorithm of reduced Google matrix (REGOMAX) allows us to take into account direct Markov transitions between these articles but also all indirect ones generated by the pathways between these articles via the global Wikipedia network. Thus this approach provides a compact description of interactions between these articles that allows us to determine the friendship networks between articles, the PageRank sensitivity of countries to pharmaceutical companies and rare renal diseases. We also show that the top pharmaceutical companies of Wikipedia PageRank are not those of the top list of market capitalization.
Disciplines :
computer science, artificial intelligence (engineering science), computer science, information systems (engineering science), genetics & heredity (fundamental biology), health care sciences & services (medical research), oncology (medical research), pharmacology & pharmacy (medical research), public, environmental & occupational health (medical research, social sciences), physics, mathematical (physics), multidisciplinary sciences
Keywords :

Dates :
Data acquisition : from 1 May 2017 to 31 May 2017
Data provision : 20 Apr 2019
Metadata record :
Creation : 12 Aug 2019

Language : English (eng)
Audience : General, Research, Stakeholder, Policy maker, Informal Education

Coverages

Spatial coverage :

  • Monde: on Earth, latitude between 85° N and 85° S, longitude between 180° W and 180° E

Time coverage :

Taxonomic coverage :

  • Species
    Homo sapiens MSW (Human)

Administrative metadata

Data creatorsAffiliation
José LagesUTINAM
Dima ShepelyanskyLPT
Guillaume RollinUTINAM
Publisher : Institut UTINAM
Label : Initiative pour le SITE Bourgogne Franche-Comté
Science contact : José Lages website e-mail
Computing contact : José Lages website e-mail
Projects and funders :
Access : available

Technical metadata

Formats : application/pdf, image/png, image/svg+xml, image/x-eps, text/csv, text/html, text/plain
Data acquisition methods :
  • Derived or compiled data :
    Web crawling of Wikipedia editions (May 2017) to retrieve information.
  • Simulation or computational data :
    PageRank, CheiRank and 2DRank algorithms have been used to rank articles of the English Wikipedia language edition (May 2017).
    Reduced Google matrix method has been used to infer interactions between articles.
Datatype : Dataset

Publications

  • Interactions of pharmaceutical companies with world countries, cancers and rare diseases from Wikipedia network analysis (doi:10.1101/614016)
About
Terms of use
Contact
© OSU-THETA, CNRS