CCPortal
DOI10.5194/hess-23-1015-2019
Identifying rainfall-runoff events in discharge time series: A data-driven method based on information theory
Thiesen S.; Darscheid P.; Ehret U.
发表日期2019
ISSN1027-5606
起始页码1015
结束页码1034
卷号23期号:2
英文摘要In this study, we propose a data-driven approach for automatically identifying rainfall-runoff events in discharge time series. The core of the concept is to construct and apply discrete multivariate probability distributions to obtain probabilistic predictions of each time step that is part of an event. The approach permits any data to serve as predictors, and it is non-parametric in the sense that it can handle any kind of relation between the predictor(s) and the target. Each choice of a particular predictor data set is equivalent to formulating a model hypothesis. Among competing models, the best is found by comparing their predictive power in a training data set with user-classified events. For evaluation, we use measures from information theory such as Shannon entropy and conditional entropy to select the best predictors and models and, additionally, measure the risk of overfitting via cross entropy and Kullback-Leibler divergence. As all these measures are expressed in "bit", we can combine them to identify models with the best tradeoff between predictive power and robustness given the available data. We applied the method to data from the Dornbirner Ach catchment in Austria, distinguishing three different model types: Models relying on discharge data, models using both discharge and precipitation data, and recursive models, i.e., models using their own predictions of a previous time step as an additional predictor. In the case study, the additional use of precipitation reduced predictive uncertainty only by a small amount, likely because the information provided by precipitation is already contained in the discharge data. More generally, we found that the robustness of a model quickly dropped with the increase in the number of predictors used (an effect well known as the curse of dimensionality) such that, in the end, the best model was a recursive one applying four predictors (three standard and one recursive): Discharge from two distinct time steps, the relative magnitude of discharge compared with all discharge values in a surrounding 65 h time window and event predictions from the previous time step. Applying the model reduced the uncertainty in event classification by 77.8 %, decreasing conditional entropy from 0.516 to 0.114 bits. To assess the quality of the proposed method, its results were binarized and validated through a holdout method and then compared to a physically based approach. The comparison showed similar behavior of both models (both with accuracy near 90 %), and the crossvalidation reinforced the quality of the proposed model. Given enough data to build data-driven models, their potential lies in the way they learn and exploit relations between data unconstrained by functional or parametric assumptions and choices. And, beyond that, the use of these models to reproduce a hydrologist's way of identifying rainfall-runoff events is just one of many potential applications. © Author(s) 2019.
语种英语
scopus关键词Catchments; Forecasting; Information theory; Probability distributions; Rain; Risk assessment; Time series; Curse of dimensionality; Data-driven approach; Event classification; Kullback Leibler divergence; Multivariate probability distributions; Predictive uncertainty; Probabilistic prediction; Rainfall-runoff events; Runoff; catchment; data set; discharge; identification method; numerical method; precipitation (climatology); prediction; rainfall-runoff modeling; Austria
来源期刊Hydrology and Earth System Sciences
文献类型期刊论文
条目标识符http://gcip.llas.ac.cn/handle/2XKMVOVA/159753
作者单位Thiesen, S., Institute of Water Resources and River Basin Management, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany; Darscheid, P., Institute of Water Resources and River Basin Management, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany; Ehret, U., Institute of Water Resources and River Basin Management, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
推荐引用方式
GB/T 7714
Thiesen S.,Darscheid P.,Ehret U.. Identifying rainfall-runoff events in discharge time series: A data-driven method based on information theory[J],2019,23(2).
APA Thiesen S.,Darscheid P.,&Ehret U..(2019).Identifying rainfall-runoff events in discharge time series: A data-driven method based on information theory.Hydrology and Earth System Sciences,23(2).
MLA Thiesen S.,et al."Identifying rainfall-runoff events in discharge time series: A data-driven method based on information theory".Hydrology and Earth System Sciences 23.2(2019).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Thiesen S.]的文章
[Darscheid P.]的文章
[Ehret U.]的文章
百度学术
百度学术中相似的文章
[Thiesen S.]的文章
[Darscheid P.]的文章
[Ehret U.]的文章
必应学术
必应学术中相似的文章
[Thiesen S.]的文章
[Darscheid P.]的文章
[Ehret U.]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。