Skip to content
This repository has been archived by the owner on Sep 24, 2023. It is now read-only.

Latest commit

 

History

History
25 lines (19 loc) · 1.41 KB

README.md

File metadata and controls

25 lines (19 loc) · 1.41 KB

get-latest-cord-19-publication-data

CORD-19 Dataset Author Statistics

The CORD-19 dataset of COVID-19 publications is published weekly at CORD-19 Historical Releases.

In order to understand the distribution of authors on papers in this dataset, this repository:

  • requests data from the latest published dataset,
  • calculates author statistics on publications in that dataset,
  • appends the latest statistics to the data file
Author Statistics Description
Date Date of CORD-19 release
Article Count Total number of articles in the dataset
Mean Author Count Mean number of authors per paper
Std Author Count Standard deviation of authors per paper
Min Author Count Minimum number of author per paper in dataset
25% Percentile 25% percentile of number of authors per paper in dataset
50% Percentile 50% percentile of number of authors per paper in dataset
75% Percentile 75% percentile of number of authors per paper in dataset
Max Author Count Maximum number of authors per paper in dataset

Inspired by: https://github.com/canovasjm/covid-19-san-juan