STAR

An implementation of the threat-based STAR biodiversity metric by Muir et al. (also known as STAR(t)).

See method.md for a description of the methodology, or scripts/run.sh for how to execute the pipeline.

Checking out the code

The code is available on GitHub and can be checked out from there:

$ git clone https://github.com/quantifyearth/STAR.git
...
$ cd STAR

Additional inputs

There are some additional inputs required to run the pipeline, which should be placed in the directory you use to store the pipeline results.

SpeciesList_generalisedRangePolygons.csv - A list of species with generalised ranges on the IUCN Redlist.
BL_Species_Elevations_2023.csv (optional) - Corrections to the elevations of bird species on the IUCN Redlist, taken from the BirdLife data.

The script also assumes you have a Postgres database with the IUCN Redlist database in it.
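
Before starting a long run it can help to confirm that the input files listed above are in place. The following is a minimal sketch, not part of the pipeline itself: the function name check_inputs is hypothetical, and it assumes the files sit directly in the top level of the results directory.

```python
from pathlib import Path

# File names as given in the list above; the second one is optional.
REQUIRED_INPUTS = ["SpeciesList_generalisedRangePolygons.csv"]
OPTIONAL_INPUTS = ["BL_Species_Elevations_2023.csv"]


def check_inputs(datadir):
    """Return (missing_required, missing_optional) file names for datadir.

    Hypothetical helper: assumes the inputs live directly inside datadir.
    """
    root = Path(datadir)
    missing_required = [n for n in REQUIRED_INPUTS if not (root / n).is_file()]
    missing_optional = [n for n in OPTIONAL_INPUTS if not (root / n).is_file()]
    return missing_required, missing_optional
```

An empty first list means the required input is present; a non-empty second list is only a warning, since that file is optional.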

Species data acquisition

There are two scripts for getting the species data from the Redlist. Those within the IUCN who have access to the database version of the Redlist should use extract_species_data_psql.py.

For those outside the IUCN, there is a script called extract_species_data_redlist.py that gets the data via the V4 Redlist API. You will need an API key, which you can request by signing up on the API website. Even with a key, you still need to download the ranges for the taxa you are interested in, as those are not available from the API. Before running the script, go to the spatial data portal and download the files for those taxa.

Running the pipeline

There are two ways to run the pipeline. The easiest is to use Docker, if you have it available, as it manages all the dependencies for you. You can also check out the code and run it locally, but that requires a little more effort.

Running with Docker

A Dockerfile is included, based on the GDAL container image, that installs everything ready to use. You can build it using:

$ docker buildx build -t star .

You can then invoke the run script using this image. You should map an external folder into the container as a place to store the intermediary data and final results, and you should provide the connection details for the Postgres instance holding the IUCN Redlist:

$ docker run --rm -v /some/local/dir:/data \
	-e DB_HOST=localhost \
	-e DB_NAME=iucnredlist \
	-e DB_PASSWORD=supersecretpassword \
	-e DB_USER=postgres \
	star ./scripts/run.sh
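
If you launch the container from a script rather than by hand, the same invocation can be assembled programmatically. This is a sketch under stated assumptions: docker_run_args is a hypothetical helper, and it only reproduces the flags shown in the command above.

```python
def docker_run_args(local_dir, db_env, image="star"):
    """Build the argument list for the `docker run` command shown above.

    Hypothetical helper: local_dir is mapped to /data in the container and
    db_env is a dict of the DB_* settings passed through with -e.
    """
    args = ["docker", "run", "--rm", "-v", f"{local_dir}:/data"]
    # Sort the names so the generated command is stable between runs.
    for name in sorted(db_env):
        args += ["-e", f"{name}={db_env[name]}"]
    args += [image, "./scripts/run.sh"]
    return args
```

The resulting list can be passed to subprocess.run(args, check=True), which avoids shell quoting problems with passwords that contain special characters.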

Running without Docker

If you prefer not to use Docker, you will need:

Python3 >= 3.10
GDAL
R (required for validation)
Reclaimer - a Go tool for fetching data from Zenodo
Littlejohn - a Go tool for running scripts in parallel

If you are using macOS, please note that the default Python install that Apple ships is now several years out of date (Python 3.9, released Oct 2020), so you will need to install a more recent version (for example, using Homebrew).

With those installed, you should set up a Python virtual environment to install all the required packages. The one trick is that the version of the Python GDAL package must match your installed GDAL version. For example, on my machine I did the following:

$ python3 -m venv ./venv
$ . ./venv/bin/activate
(venv) $ pip install "gdal[numpy]==$(gdal-config --version)"
...
(venv) $ pip install -r requirements.txt
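
The version matching step above can also be done from Python, for example in a setup script. This is only a sketch: the function names are hypothetical, and it assumes gdal-config is on your PATH and prints a bare version string.

```python
import subprocess


def gdal_requirement(version_output):
    """Build a pip requirement string from `gdal-config --version` output."""
    version = version_output.strip()
    if not version:
        raise ValueError("empty GDAL version string")
    return f"gdal[numpy]=={version}"


def installed_gdal_requirement():
    """Pin the Python GDAL package to the system GDAL version.

    Assumes the gdal-config tool is installed and on the PATH.
    """
    result = subprocess.run(
        ["gdal-config", "--version"], capture_output=True, text=True, check=True
    )
    return gdal_requirement(result.stdout)
```

The string returned by installed_gdal_requirement() is what you would hand to pip install, in place of the shell substitution used above.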

You will also need to install the R stats packages required for the validation stage:

$ R -e "install.packages(c('lme4', 'lmerTest'), repos='https://cran.rstudio.com/')"

Before running the pipeline you will need to set several environment variables to tell the script where to store data and where the database with the IUCN Redlist is. You can set these manually, but we recommend using a tool like direnv.

export DATADIR=[PATH WHERE YOU WANT THE RESULTS]
export DB_HOST=localhost
export DB_NAME=iucnredlist
export DB_PASSWORD=supersecretpassword
export DB_USER=postgres
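
A quick check that all of these variables are set can save a failed run partway through the pipeline. The sketch below is illustrative only: missing_variables is a hypothetical helper, and the variable names are taken from the export lines above.

```python
import os

# Variable names taken from the export lines above.
REQUIRED_VARS = ["DATADIR", "DB_HOST", "DB_NAME", "DB_PASSWORD", "DB_USER"]


def missing_variables(environ=None):
    """Return the required variable names that are unset or empty.

    Hypothetical helper: checks os.environ unless a mapping is passed in.
    """
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

Calling missing_variables() with no argument inspects the current environment; an empty list means everything the run script expects is present.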

Once all of that is in place, you can run the pipeline:

(venv) $ ./scripts/run.sh

Credits

The author of this package is greatly indebted to both Francesca Ridley from the University of Newcastle and Simon Tarr of the IUCN for their guidance and review.

Data Attribution

The crosswalk table data/crosswalk_bin_T.csv was created by Francesca Ridley and is derived from:

Lumbierres, M., Dahal, P.R., Di Marco, M., Butchart, S.H.M., Donald, P.F.,
& Rondinini, C. (2022). Translating habitat class to land cover to map area
of habitat of terrestrial vertebrates. Conservation Biology, 36, e13851.
https://doi.org/10.1111/cobi.13851

The paper is licensed under CC BY-NC. The crosswalk table is used in this STAR implementation to translate between the IUCN habitat classes in the Redlist and the land cover classes in the Copernicus data layers.
