Automated Data Collection: COVID-19/SARS-COV-2 Cases in EU by Country, State/Province/Local Authorities, and Date
covid19-eu-data
is a dataset repository for COVID-19/SARS-CoV-2 cases in Europe. We pull data from official government websites regularly using the open-source scripts inside the repository.
On 2022-02-29, IE stopped updating the detailed covid infection data for weekends.
On 2021-11-01, we stopped collecting PL data as there are some data quality issues. But we update the original data here.
Breaking Change:
On 2021-01-03, we dropped the whole commit history and removed the cache files. This is done because the repo is growing into a behemoth.
On 2020-12-31, we stopped caching most of the webpages due to oversize of the repo.
On 2020-05-22, we removed documents/be
and documents/dk
. These two folders are bloating and our repo reached the GitHub storage hard limit (2GB). The files have been moved to covid19-eu-zh/covid19-eu-data-20200522 as a snapshot.
Full changelog: CHANGELOG.md
Commit Status:
Workflow status by countries:
Country | Status | Data Source |
---|---|---|
AT | ||
BE | ||
CH | ||
CZ | ||
DE | ||
DK | ||
ES | ||
FR | ||
GR | ||
HU | ||
IE | ||
IT | ||
NL | ||
NO | ||
PL | ||
PT | ||
SE | ||
FI | ||
SI | ||
UK | ||
EU(ECDC) |
The tabular data files are located in dataset
folder. The folder dataset/daily
holds the daily updates in each country.
The metadata for the tabular data is found in
.dataherb/metadata.yml
.
Some of the countries publish more than simple tabular data. We cache the files in documents
folder.
The scripts that are being used to update the data are located in scripts
folder. Most of the scripts require the utils.py
module to run. Create a new environment and run pip install -r requirements.txt
to install the requirements.
The workflows that update the dataset are defined in .github/workflows
. The python scripts are scheduled to run on GitHub Actions.
Caveats:
There is a repo cleaning up the raw data on ArcGis.
Caveats:
We stopped tracking UK data.
cases_lower
and cases_upper
, to reflect the range of the number of cases.Northern Ireland does not publish detailed data.
-f
flag to true
for scripts/download_it.py
to redownload all dates.Bugs and requests: PRs are welcome.
Telegram Channel (in Chinese): 新冠肺炎欧洲中文臺