jakubvalenta/covid-berlin-scraper

Scraper for Covid-19 Data in Berlin

Download Covid-19 data from the official sources of the city of Berlin.

See covid-berlin-data for the data itself (updated daily).

Installation

Mac

$ brew install python
$ pip install poetry
$ make setup

Arch Linux

# pacman -S poetry
$ make setup

Other systems

Install these dependencies manually:

  • Python >= 3.8.1
  • poetry

Then run:

$ make setup

Usage

This program works in several steps:

  1. Download press releases from the current RSS feed and save their metadata to a database in the passed cache directory:

    $ ./covid-berlin-scraper --cache my_cache_dir --verbose download-feed
  2. Download the current district table (Verteilung in den Bezirken) and save the data to a database in the passed cache directory:

    $ ./covid-berlin-scraper --cache my_cache_dir --verbose download-district-table
  3. Download the current dashboard and save the data to a database in the passed cache directory:

    $ ./covid-berlin-scraper --cache my_cache_dir --verbose download-dashboard
  4. (Optional) Download press releases from the press release archive and save their metadata to the same database:

    $ ./covid-berlin-scraper --cache my_cache_dir --verbose download-archives
  5. Parse the content of all press releases, district tables, and dashboards stored in the database and generate CSV output:

    $ ./covid-berlin-scraper --cache my_cache_dir --verbose parse-press-releases \
        -o my_output.csv \
        --output-hosp my_output_incl_hospitalized.csv
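The steps above amount to a download-cache-parse pipeline: fetch press-release metadata, store it in a database in the cache directory, then export CSV. The following is a minimal sketch of that idea using only the Python standard library (sqlite3 and xml.etree); the `press_releases` schema and function names are illustrative assumptions, not the scraper's actual internals.

```python
import sqlite3
import xml.etree.ElementTree as ET


def init_cache(conn):
    # One row per press release; the URL is the natural key, so
    # re-running the download step does not create duplicates.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS press_releases ("
        " url TEXT PRIMARY KEY, title TEXT, published TEXT)"
    )


def store_feed_entries(conn, rss_xml):
    """Parse RSS XML and upsert each item's metadata into the cache."""
    root = ET.fromstring(rss_xml)
    for item in root.iter("item"):
        conn.execute(
            "INSERT OR REPLACE INTO press_releases VALUES (?, ?, ?)",
            (
                item.findtext("link"),
                item.findtext("title"),
                item.findtext("pubDate"),
            ),
        )
    conn.commit()


def export_csv(conn):
    """Render the cached metadata as CSV lines, oldest first."""
    rows = conn.execute(
        "SELECT published, title, url FROM press_releases ORDER BY published"
    )
    lines = ["published,title,url"]
    for published, title, url in rows:
        lines.append(f"{published},{title},{url}")
    return "\n".join(lines)


# Example with an inline feed instead of a network download:
feed = """<rss><channel>
<item><title>Corona update</title><link>https://example.org/1</link>
<pubDate>2020-03-01</pubDate></item>
</channel></rss>"""
conn = sqlite3.connect(":memory:")
init_cache(conn)
store_feed_entries(conn, feed)
```

Keying the cache on the URL is what makes it safe to run the download steps repeatedly (e.g. from a daily cron job), which matches how the data repository is updated daily.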

Help

See all command line options:

$ ./covid-berlin-scraper --help

Development

Installation

$ make setup

Testing and linting

$ make test
$ make lint

Help

$ make help

Contributing

Feel free to remix this project under the terms of the Apache License, Version 2.0.