About the project

Inland water bodies provide a variety of critical services for both human and aquatic life, including drinking water, recreational and economic opportunities, and aquatic habitats. A significant challenge water quality managers face is the formation of harmful algal blooms, which can harm human health, threaten other mammals like pets, and damage aquatic ecosystems.

Cyanobacteria, often called blue-green algae, are microscopic photosynthetic bacteria that can multiply very quickly in warm, nutrient-rich environments, often creating visible blue or green blooms. These blooms can block sunlight from reaching the rest of the aquatic ecosystem beneath the surface, and deplete the oxygen and nutrients available to other organisms. Cyanobacteria can also produce toxins that are poisonous to humans, pets, and livestock. Climate change is likely making harmful algal blooms form more often in aquatic environments.

Manual water sampling, or “in situ” sampling, is generally used to monitor cyanobacteria in inland water bodies. In situ sampling is accurate, but it is time-intensive and difficult to perform continuously. Public health managers also rely on the public to notice and report blooms.

The goal of CyFi is to help water quality managers better allocate resources for in situ sampling and make more informed decisions about public health warnings for critical resources like drinking water reservoirs. Ultimately, more accurate and more timely detection of algal blooms helps keep both the human and aquatic life that rely on these water bodies safe and healthy.

CyFi was born out of the Tick Tick Bloom machine learning competition, hosted by DrivenData. The goal of that challenge was to detect and classify the severity of cyanobacteria blooms in small, inland water bodies using publicly available satellite, climate, and elevation data. Labels were based on in situ samples that were collected manually by many organizations across the U.S. The model in CyFi is based on the winning solutions from that challenge, and has been optimized for generalizability and efficiency.

For more details, see the reference paper.

Cite this work

Dorne, E., Wetstone, K., Cerquera, T. B., & Gupta, S. (2024). Cyanobacteria detection in small, inland water bodies with CyFi. In Proceedings of the 23rd Python in Science Conference (pp. 154–173). https://doi.org/10.25080/pdhk7238

Additional resources

Tick Tick Bloom machine learning competition

About harmful algal blooms (HABs)

Related tools

There are other groups working on cyanobacteria estimates from satellite imagery. Here are a few that use Sentinel-3 (300 m resolution) imagery:

EPA guidance on HABs

Related research on using satellite imagery to monitor HABs