Presentation is loading. Please wait.

Presentation is loading. Please wait.

A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical.

Similar presentations


Presentation on theme: "A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical."— Presentation transcript:

1 A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical University of Munich, 24.11.2003

2 The project Title: “Establishing the engineering and scientific bases on flood risk assessment, development of new methods of flood frequency and risk estimation” (one of the approximately 100 national research and development programs accepted in 2001). Participants: Water Resources Research Centre, three universities, regional water directorates.

3 Main reason for it being proposed There were several major flood waves on river Tisza in the last decade.

4 Hungary

5 Current situation Last major analysis of flood data was done 30 years ago. During the dry period 1960-1998, the rivers were narrowed at several places. There is a natural tendency for the riverbed to deepen. The current flood estimates are based on water level data rather than the streamflow time series.

6 Water-level data example: Vásárosnamény At least two observations per day for each station (there are approx. 50 of them) for 100 years. Reduction: one observation per day.

7 Main questions Return level estimators (extreme value analysis) Time series methods for daily observations (in order to run simulations), to be presented in the next talk.

8 Analysis of extreme values Probably the most important part Classical methods: based on annual maxima Peaks-over-threshold methods: utilize all floods higher than a given (high) threshold. Multivariate modelling –Bayesian approach (dependence among parameters) –Joint behaviour of extremes

9 Extreme-value distributions Letbe independent, identically distributed random variables. If we can find norming constants a n, b n such that has a nondegenerate limit, then this limit is necessarily a max-stable or so-called extreme value distribution. X 1, X 2,…,X n [max(X 1, X 2,…, X n )-a n ]/ b n

10 Characterisation of extreme-value distributions Limit distributions of normalised maxima: Frechet:(x>0) is a positive parameter. Weibull:(x<0) Gumbel: (Location and scale parameters can be incorporated.)

11 Another parametrisation The distribution function of the generalised extreme-value (GEV) distribution:  : location,  : scale,  : shape parameters;  >0 corresponds to Frechet,  =0 to Gumbel  <0 to Weibull distribution if

12 Example: annual maxima of water levels Vásárosnamény Záhony

13 The fitted models to water level data Vásárosnamény Záhony, right-endpoint: 940 cm778 cm

14 Check the conditions Are the observations (annual maxima) – independent? It can be accepted for most of the stations. –identically distributed? Check by comparing different parts of the sample fitting models, where time is a covariate –follow the GEV distribution?

15 A typical example (water level data, Vásárosnamény ) 0.168 0.47

16 Tests for GEV distributions Motivation: limit distribution of the maximum of normalised iid random variables is GEV, but –the conditions are not always fulfilled –in our finite world the asymptotics is not always realistic Usual goodness-of-fit tests: –Kolmogorov-Smirnov –χ 2 Not sensitive for the tails

17 Alternatives Anderson-Darling test: Computation: where z i =F(X i ). Sensitive in both tails. Modification: (for maximum; upper tails). Its computation:

18 Further alternatives Another test can be based on the stability property of the GEV distributions: for any m  N there exist a m, b m such that F(x)=F m (a m x+b m )(x  R) The test statistics: Alternatives for estimation: To find a,b which minimize h(a,b) (computer- intensive algorithm needed). To estimate the GEV parameters by maximum likelihood and plug these in to the stability property.

19 Limit distributions Distribution-free for the case of known parameters. For example: where B denotes the Brownian Bridge over [0,1]. As the limits are functionals of the normal distribution, the effect of parameter estimation by maximum likelihood can be taken into account by transforming the covariance structure. In practice: simulated critical values can also be used (advantage: small-sample cases).

20 Power studies For typical alternatives, there is no major difference between the test A-D and B. The power of h very much depends on the shape of the underlying distribution. The probability of correct decision: n Test 100200400100200 400 Distr.NBexpNormal B 0.200.460.670.500.900.110.25 A-D 0.270.510.900.580.960.090.24 h 0.070.170.680.200.420.150.28

21 Applications For specific cases, where the upper tails play the important role (e.g. modified maximal values of real flood data), B is the most sensitive. When applying the above tests for the flood data (annual maxima; windows of size 50), there were a couple of cases when the GEV hypothesis had to be rejected at the level of 95%. Possible reasons: changes in river bed properties (shape, vegetation etc).

22 An example for rejection: Szolnok water level, 1931-80

23 Further investigations Confidence bounds should be calculated, possible methods –based on asymptotic properties of maximum likelihood estimator –profile likelihood –resampling methods (bootstrap, jackknife) –Bayesian approach Estimates for return levels

24 Confidence intervals For maximum likelihood: –By asymptotic normality of the estimator: where is the (i,i)th element of the inverse of the information matrix –By profile likelihood For other nonparametric methods by bootstrap.

25 Profile likelihood One coordinate of the parameter vector is fixed, the maximization is with respect the other components: l(  ) is the log-likelihood function;  =(  i,  -i ) Let X 1,…,X n be iid observations. Under the regularity conditions for the maximum likelihood estimator, asymptotically (a chi-squared distribution with k degrees of freedom, if  i is a k-dimensional vector).

26 Use of the profile likelihood Confidence interval construction for a parameter of interest: where c  is the 1-  quantile of the  1 2 distribution. Testing nested models: M 1 (  ) vs. M 0 (the first k components of  =0). l 1 ( M 1 ), l 0 (M 0 ) are the maximized log-likelihood functions and D:=2{l 1 ( M 1 )- l 0 (M 0 )}. M 0 is rejected in favor of M 1 if D>c  (c  is the 1-  quantile of the  k 2 distribution).

27 Return levels z p : return level, associated with the return period 1/p (the expected time for a level higher than z p to appear is 1/p): The quantiles of the GEV: where Remark: the probability that it actually appears before time 1/p is more than 0.5 (approx. 0.63 if p is small) if   0 if  = 0

28 Backtests: return level estimators based on parts of the dataset

29 Investigation of the backtest Too many floods above the estimated level (mean=3.95 for the first 21 years) - simulation studies confirm this being a significant deviation from the iid case. Linear trend in the mean: m=616+1.2(t-50) the other parameters are supposed to be constant. Estimated endpoint for 2000: 960 cm (actual observed value, not in the sample : 970 cm).

30 Peaks over threshold methods If the conditions of the Fisher-Tippett theorem hold, the conditional probability of X-u, under the condition that X>u, can be given as if y>0 and, where H(y) is the so called generalized Pareto distribution (GPD).  is the same as the shape parameter of the corresponding GEV distribution.

31 Peaks over threshold methods Advantages: –More data can be used –Estimators are not affected by the small “floods” Disadvantages: –Dependence on threshold choice –Original daily observations are dependent; declustering not always obvious (see Ferro- Segers, 2003 for a recent method).

32 Inference Similar to the annual maxima method: –Maximum likelihood is to be preferred –Confidence bounds can be based on profile likelihood –Model fit can be analyzed by P-P plots and Q-Q plots or formal tests (similar to those presented earlier) –Return levels/upper bounds can be estimated Our results for the flood data: very much the same as for the GEV.

33 GPD fit: Vásárosnamény, water level shape=-0.51, estimated upper endpoint=940 cm the upper endpoint of its 95% conf. int.: 1085 cm

34 Return level estimators by parts of the dataset: Vásárosnamény

35 Some references 1.Ferro, T. A.- Segers, J. (2003): Inference for clusters of extreme values. Journal of Royal Statistical Soc. Ser. B. 65, p. 545-556. 2.Kotz, S. – Nadarajah, S. (2000): Extreme Value Distributions. Imperial College Press. 3.Zempléni, A. (1996): Inference for Generalized Extreme Value Distributions Journal of Applied Statistical Science 4, p. 107-122. 4.Zempléni, A. Goodness-of-fit tests in extreme value theory. (In preparation.)


Download ppt "A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical."

Similar presentations


Ads by Google