Last year, Bart de Beus was nominated for the 2025 CWI Best BSc Thesis in Applied Math Award.
Below are two photos of Itteren, a town in the south of the Netherlands. The first one shows the town as it looks normally. The second picture shows, well, quite an extreme: a flood. In 1995, this flood led to the evacuation of 250 000 people and a million animals. This was one of a handful of heavy floods in the 90's, so the Dutch regional water authorities (water boards, or "waterschappen" in Dutch) were tasked with calculating how often an area overflows. Based on these calculations, the Dutch government took better protective measures against floods. How do these calculations work?


Figure 1 and 2: The town of Itteren, Limburg, in 2016 (left; Van de Biezen, 2016) and 1995 (right; Rijkswaterstaat, n.d.)
These calculations are called flood frequency analyses. The objective: calculate if an area does not overflow "too often". Although floods are rarely desirable, they may be acceptable on occasion. For example, preventing a flood might be more expensive than dealing with the consequences if water reaches your house only once every hundred years. In this case, you may choose to let the flood happen, whenever that may be, and repair the damage when it does. Flood frequency analyses are done using one of two statistical methods; the "timeseries method" or the "stochastic method". Different water boards use different methods, the division being roughly 50/50. We don’t actually know, however, if these methods lead to different results. Spoiler alert: they do!
I had the opportunity to research both methods for Waternet, the company that works on behalf of water board Amstel, Gooi en Vecht. They use the timeseries method for the flood frequency analyses, but were also interested in implementing the stochastic method as there may be advantages to using one method over the other.
The objective
First, we must ask ourselves: how often is 'too often'? The Dutch provinces have decided on an acceptable return period
, measured in years, for each type of area. According to the provinces, the type of area is allowed to flood once every
years. For instance, the provincial bylaws note that a house is allowed to flood once every hundred years (
). By law, the water boards have to take measures making sure that water does not reach houses more often than that. Another example is farmland, which is allowed to flood only once every twenty-five years.
The water authorities use statistics and hydrological software to calculate the water level
that corresponds to return period
. What does this mean exactly? Say, we want to do a flood frequency analysis for farmland. That means that, given the provincial bylaws, we look for the water level corresponding to
. If the water level corresponding to
is higher than the level of the farmland, Waternet has to take measures to ensure this area overflows less often than once every tenty-five years. The number
can take every value above zero, but common values (extracted from the bylaws) are ten, twenty-five, fifty and a hundred years.
A digital tool for mapping floods
The hydrological software D-Hydro can be used for both the stochastic and timeseries method. Using D-Hydro it is possible to map different areas into systems of canals, rivers, pumps, bridges and more. Figure 3 shows how this looks.

Figure 3: The Noordse Buurt and Westveense polder as simulated in D-Hydro.
D-Hydro can be given various input parameters and calculates water levels based on those inputs. Both the stochastic and timeseries method use the water levels calculated by D-Hydro. The difference between the methods is the input and the way its output is used to calculate how often an area floods.
The timeseries method
Let's start by looking at the timeseries method. In De Bilt the Royal Dutch Meteorological Institute (KNMI) records the amount of precipitation. These records are available as a dataset with hourly measurements, going back over a hundred years. These "rainfall events" are the input for D-Hydro, which then calculates the water levels over time.
After entering the rainfall events into D-Hydro and calculating the water levels, we select the highest water level per year. So, over the
total years, we obtain with
different maximum water levels. We denote these by
. We call the maximum water level in a year
, which is a random variable. The water levels
are realizations of
.
Ideally, we want to know the distribution of
as it holds all kinds of relevant information. Extreme Value Statistics (EVS) can help with this. EVS has important results on the the maximum values of random variables, namely that the distribution of the maximum over many independent draws always converges to a specific probability distribution. If we assume that necessary conditions for EVS are met, we can use it to gain insight into the yearly water peak, in this case
. EVS is a commonly used method for estimating flood events and it has many other applications.
The stochastic method
With the stochastic method, we say that a whole plethora of explanatory variables can cause extremely high water levels. For example, if the water level in a river is extremely high, just a bit of rain can cause a flood. The same goes for if a pump fails: if water cannot be pumped out of a polder, canals might overflow, even without rain.
These (continuous) explanatory variables are discretized in different classes, after which every class gets a representative value. The goal being that every possible situation that might happen, gets simulated in D-Hydro. Every class gets allocated a probability, based on historical data from a case study area.
The advantage of this method is that we don’t just look at rainfall events to predict floods, but also look at other contributing factors. With enough historical data, we can add almost any contributing variable, like how high the groundwater level is, the water levels in neighboring rivers and lakes, and lastly also if certain pumping stations fail.
What does the result look like?
Both methods ideally lead to the same result, which is a graph with on the logarithmical x-axis our return periods (the
we are after), and on the y-axis the water level (the
we seek). In Figure 4, the result is shown for one area (compartment OP-1-A) in the Noordse Buurt and Westveense polder, our case study region. In every of the thirty compartments in our case study, the timeseries method lead to higher predicted water level compared to the stochastic method. The average difference over all compartments was 4 to 9 centimeters, with
ranging from 25 to a 100 years. The maximum difference was over 28 centimeters.

Figure 4: Results of the stochastic method (red) and the time series method (blue) for compartment OP-1-A in the Noordse Buurt and Westveense polder.
Now knowing that these two methods lead to varying results, it is interesting to know why we get these varying results. In turn, this could help the water boards make more informed decisions in what measures to take and when. During my research at Waternet, I investigated multiple explanations for these differences, about which I write in my thesis Schatten van wateroverlast, een vergelijking tussen statistiek vóóraf en statistiek áchteraf in de Noordse Buurt en Westveense polder (De Beus, 2025). As you can tell, it is written in Dutch.
My experience at Waternet
As an applied mathematician, I never imagined myself suddenly doing my thesis at a company which regulates and takes care of our water supply and systems. It is a fun example of how mathematics is almost everywhere! I found myself in a team of hydrologists, which is a whole other area of expertise, I thought. However, all models they use (including D-Hydro) were based on, as you might guess, mathematical models and formulas. Furthermore, these two methods I researched are largely mathematical problems. The hydrological part is mapping the water system in D-Hydro, and knowing how to take correct measures.
The models I implemented were immediately used for other case study areas. Through my research I hoped to give better insight on how and when to implement which method. Most importantly, I wanted to shed light on some of their drawbacks and when these drawbacks influence the results.
Cover photo taken from Orange County Register, June 19th, 2008






































































