Javascript must be enabled to continue!
Physical and Biogeochemical Drivers for Forecasting Red Tides in Southwest Florida: A Regionally Integrated Machine Learning Framework
View through CrossRef
Harmful algal blooms (HABs) caused by Karenia brevis (K. brevis) present a persistent ecological and public health challenge across coastal Florida. Reliable bloom forecasting is critical for protecting public health, supporting coastal economies, and enabling timely management responses. This study develops a regionally integrated machine learning framework to predict weekly K. brevis bloom occurrence using environmental data from both the Peace and Caloosahatchee Rivers, combined with coastal bloom records from Southwest Florida and Tampa Bay to enhance the spatial and temporal continuity of the response record. A Random Forest classifier was trained on a multi-decadal dataset incorporating river discharge, nutrient concentrations (total nitrogen and total phosphorus), wind forcing, sea surface temperature, salinity, and sea surface height anomalies as a proxy for Loop Current variability. The model achieved strong predictive performance on a chronologically withheld test set, with an overall accuracy of ~90%, balanced accuracy of 87.6%, and ROC–AUC of 0.972, indicating strong discrimination between bloom and non-bloom conditions with high precision and recall for bloom events. Bloom timing and persistence were captured with strong agreement during ongoing bloom periods, while non-bloom conditions were identified with low false-positive rates. Feature-response analyses indicated that bloom probability increased most sharply under moderate discharge and nutrient conditions, with diminished sensitivity at higher extremes. Learning curve analysis demonstrated robust training performance and stable generalization, with validation accuracy plateauing near 84%, suggesting a data-limited ceiling on forecast skill. By aggregating nutrient inputs across multiple watersheds and integrating spatially aligned bloom observations, this study demonstrates the utility of multi-source machine learning frameworks for regional-scale HAB prediction. The results support the development of early warning tools and provide a reproducible foundation for evaluating how combined watershed loading and physical forcing are associated with K. brevis bloom occurrence in complex estuary systems with watershed and coastal coupling.
Title: Physical and Biogeochemical Drivers for Forecasting Red Tides in Southwest Florida: A Regionally Integrated Machine Learning Framework
Description:
Harmful algal blooms (HABs) caused by Karenia brevis (K.
brevis) present a persistent ecological and public health challenge across coastal Florida.
Reliable bloom forecasting is critical for protecting public health, supporting coastal economies, and enabling timely management responses.
This study develops a regionally integrated machine learning framework to predict weekly K.
brevis bloom occurrence using environmental data from both the Peace and Caloosahatchee Rivers, combined with coastal bloom records from Southwest Florida and Tampa Bay to enhance the spatial and temporal continuity of the response record.
A Random Forest classifier was trained on a multi-decadal dataset incorporating river discharge, nutrient concentrations (total nitrogen and total phosphorus), wind forcing, sea surface temperature, salinity, and sea surface height anomalies as a proxy for Loop Current variability.
The model achieved strong predictive performance on a chronologically withheld test set, with an overall accuracy of ~90%, balanced accuracy of 87.
6%, and ROC–AUC of 0.
972, indicating strong discrimination between bloom and non-bloom conditions with high precision and recall for bloom events.
Bloom timing and persistence were captured with strong agreement during ongoing bloom periods, while non-bloom conditions were identified with low false-positive rates.
Feature-response analyses indicated that bloom probability increased most sharply under moderate discharge and nutrient conditions, with diminished sensitivity at higher extremes.
Learning curve analysis demonstrated robust training performance and stable generalization, with validation accuracy plateauing near 84%, suggesting a data-limited ceiling on forecast skill.
By aggregating nutrient inputs across multiple watersheds and integrating spatially aligned bloom observations, this study demonstrates the utility of multi-source machine learning frameworks for regional-scale HAB prediction.
The results support the development of early warning tools and provide a reproducible foundation for evaluating how combined watershed loading and physical forcing are associated with K.
brevis bloom occurrence in complex estuary systems with watershed and coastal coupling.
Related Results
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation
Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation
In recent years, the development of artificial intelligence has led to rapid advances in data-driven weather forecasting models, some of which rival or even surpass traditional met...
Scenario Dependence of Biogeochemical and Biogeophysical Effects of Reforestation 
Scenario Dependence of Biogeochemical and Biogeophysical Effects of Reforestation 
Reforestation is a widely considered nature-based method for climate mitigation. The net effect of reforestation on the climate system has two components: i) biogeochemical and ii)...
Physical and Biogeochemical Drivers for Forecasting Red Tides in Southwest Florida: A Regionally Integrated Machine Learning Framework
Physical and Biogeochemical Drivers for Forecasting Red Tides in Southwest Florida: A Regionally Integrated Machine Learning Framework
Harmful algal blooms (HABs) caused by Karenia brevis (K. brevis) present a persistent ecological and public health challenge across coastal Florida. This study develops a regionall...
Improving tidal modeling for rocky worlds
Improving tidal modeling for rocky worlds
<p>The high number of discovered close-in planets motivates the improvement of tidal modeling.Among the five thousand exoplanets discovered up to now, half of them ha...
Establishment and Application of the Multi-Peak Forecasting Model
Establishment and Application of the Multi-Peak Forecasting Model
Abstract
After the development of the oil field, it is an important task to predict the production and the recoverable reserve opportunely by the production data....
Forecasting
Forecasting
The history of forecasting goes back at least as far as the Oracle at Delphi in Greece. Stripped of its mystique, this was what we now refer to as “unaided judgment,” the only fore...

