Home
DETECTION OF TREND AND NORM VIOLATION
Contents
1. must be consecutive The user will have the choice in P4 to a use the seasonal means in place of the selected intermediate low frequency years b truncate the selected years c do nothing When b is selected a level study stepwise before and after the truncation is automatically chosen by the program using the most appropriate test 2 3 4 5 6 7 17 Eliminate certain months at the extremes The user is allowed to choose a smaller analysis period than that defined by the input file The values which are not part of the newly defined period will be eliminated from the file created by P1 Therefore the rest of the program will treat this series as if the values outside the new period did not exist Return to frequency table before elimination This option permits the user to return to the original period if he decides not to keep the newly created one See the complete sampling frequency table up to now This option allows the user to see the original table This is useful when defining a new period Non representative distribution exit without test This option allows the user to exit the rest of the program if the sampling was too little or irregular The file SYNTHESE P6 contains the preliminary information about the series as well as the monthly frequencies table 3 1 Several criteria in the program s oatput allow the user to detect a sampling that was too low or too irregul
2. Although the program can be used on a system with two floppy drives it was designed to be used with a hard disk and is therefore more efficient in this configuration The program in its original form accepts data of maximum length 1300 1 3 General flow chart DATA ACQUISITION ACQUISITION 1 DIVISION OF DIV THE FILES SERIES P1 PREPARATION GRAPHIC ANALYSES ANALYSIS OF THE DATA MEASUREMENT CREATION OF A WORK SERIES PREPARATION FOR TESTS P5 SUMMARY VIOLATION STUDIES P7 P8 CLEAN UP OF TEMPORARY FILES TM STOP Im ii 2 Utilities allow more efficient use of the program several utilities included following is a brief description of these companion programs 2 1 System configuration CONF EXE As the program is designed to work on different systems CONF EXE allows the user to adapt the program to his particular system characteristics This program asks questions about the screen resolution color modes graphic symbols to use and the definition of the interfaces The programs can be used with CGA 640 x 200 EGA 640 x 350 and ATT 640 x 400 They can also function in 320 x 200 mode but certain titles will be superimposed as the corresponding 40 x 25 text mode does not leave much room for adding comments to the graphs CONF EXE creates the file SETUP PC which must be present when the programs are used N B CONF EXE must be run first so that the oth
3. threshold Once the sub populations are defined the user must choose a norm or reference Three norms are available a A general or individual norm which should never be exceeded It applies to each value of the populations used b monthly norm which is applied to the monthly means of the values used c An individual norm which can be exceeded once per month It is applied to each value of the populations used the highest value if exceeding the norm for each month being eliminated 33 11 3 Analysis Once the form of the analysis has been chosen the program calculates prints the results for each sub population individual or monthly number of values present _ mean standard deviation median number of violations percentage of violations A table then displays the statistics concerning the duration of the violations in each of the sub populations duration of violation and associated frequency mean duration of the violations associated standard deviation A series of graphs is then displayed which compare the distributions and cumulative distributions of the two chosen populations This is followed by graphs which compare the violations of each sub population with values which exceed the norm plotted relative to the chosen norm Finally a graph showning the duration of successive violations is displayed with program P6 program P7 writes the results of the analyse
4. K gt lt qi 1 DETECTION OF TREND AND NORM VIOLATION USER S MANUAL DETECT and EXCEED Version 2 by Daniel CLUIS Mars 1989 INRS Eau 0 Box 7500 Sainte Foy Quebec Canada G1V 4C7 TABLE CONTENTS Page INTRODUCTION Rr dump ew pna gay teme 1 WARNING Dem 2 1 General remarks T 3 1 1 System requirements 3 1 2 General information sene mensenn Slee ra 3 1 3 General flow chart 4 2 Utilities wed AN GA WA Sele eS 5 2 1 System configuration 5 2 2 Data acquisition awe dta RE E Rum oe Seba a 5 2 3 File manipulation wss terde a uu Race sua kupa 6 So Bateh fil s SIRE qu dad os ss dua 8 4 Series preparation 1 9 Graphic analyses 2 11 asl Graphs available ns inenten daneen e RN OR edele 14 Series evaluation P3 EXE 15 6 1 Graphs and tables available 19 7 Work series creation and structure P4 EXE 22 7 1 Tables available 24 8 Test preparation JP Onde ds irri iade D Tere 25 8 1 Graphs
5. REGULAR SAMPLING As shown in table 3 1 certain questions can be asked if there is irregular sampling 1 If at least two consecutive months were not sampled user is allowed N B to truncate certain consecutive months It is not possible to eliminate months which are not adjacent After months have been eliminated the only choice of interval which will keep this truncation is one value per month for the non truncated months Any other selection will not take into account the previous truncation In using this software it became evident that the criteria should be a little less restrictive It is now possible to truncate months if there are two consecutive months with two values or less 1 16 least year has less than 4 values It is possible to limit later analysis The choices are presented in the following menu As the number of values for certain year s is small you may limit the analysis 1 Eliminate certain intermediate years 2 Eliminate certain months at the extremes 3 Return to frequency table before elimination 4 See the complete table up to now 5 Non representative distribution exit without test 6 Normal continuation of program 7 Help A brief description of the different options follows Eliminate certain intermediate years The user is allowed to select certain intermediate years with low frequencies for later analysis The years which are kept for this analysis
6. ST SEASONALITY DISPLAY FINAL LEVELS DISPLAY OF MEAN SLOPE FOR s 1 5 THE GLOBAL BEFORE AND SERIES AND EACH AFTER THE STEP BEFORE AND AFTER THE STEP FLOW CHART OF PROGRAM P6 DISPLAY OF GENERAL INFORMATION ABOUT THE SERIES _ DISPLAY OF XT OT 5 05 s 1 S T oui SEASONALITY no no HAS A TREND BEEN DETECTED BY THE TEST yes i DISPLAY Or STEPWISE TREND MONOTONIC TREND SEASONALITY DISPLAY OF SLOPE INITIAL DISPLAY SEASON s 1 S INITIAL AND FINAL LEVELS MEAN OF THE GLOBAL SERIES STANDARD DEVIATION SEASONAL MEAN SEASONAL STANDARD DEVIATION NUMBER OF SEASONS FLOW CHART OF PROGRAM P7 P8 CHOICE OF SUB POPULATIONS dates months discharges CHOICE OF A NORM individual monthly once per month STATISTICAL ANALYSIS OF THE SUB POPULATIONS STUDY OF THE FREQUENCY INTENSITY AND DURATION OF THE VIOLATIONS GRAPHS COMPARED FOR THE SUB POPULATIONS ANOTHER TYPE OF NORM TO ANALYSE
7. al days 1 3 7 15 month 1 2 3 amp 6 12 replace missing values using one of three methods a temporal interpolation b seasonal mean c concentration discharge relationship At this stage a complete series of equidistant values has been generated and saved in files with extensions TMC or TML depending on whether they are concentrations mass loadings The last part of this program determines if there is persistence and if there is whether it is Markovian GENERAL REMARKS GENERATION OF MISSING VALUES The choice of a work interval which is too small will result in the creation of fictitious data generation of missing values It is important not to fill in too many intervals with the available methods seasonal means interpolation concentration discharge relationship order to make the user aware of this situation a warning is issued when more than 20 of the intervals will be synthetised using other data The user can then choose another interval so as to reduce the number of intervals without data or he can continue knoving that the chosen work series contains a large percentage of fictitious data in certain cases the sampling may have been very irregular making a better choice of interval difficult 23 METHODS GENERATING DATA FOR EMPTY INTERVALS The three methods offered for the creation of data are Temporal interpolation Use of the mean value of the parameter taken t
8. ameter are plotted oh graphs 3 3 and 3 4 The symbols used 1 9 represent the last digit of the year while the stars connected by straight lines represent the means of all the values for the same month This graph allows the user to see how the data might possibly be regrouped into homogenous seasons Graph 3 5 plots the logs of the concentrations against the logs of the discharges allowing the user to detect any relationship which might be present between the variables as well as completing the regression analysis presented in table 3 4 20 Tables Table 3 1 presents the monthly frequencies of the observation for each year This allows the user to select an appropriate interval for creating an equidistant series P4 EXE When there is sufficiently regular sampling the mean number of observations per year make the choice of a frequency more easy This selection should not give rise to too many empty intervals Table 3 2 presents the results of the analysis of variance on the equality of the monthly means Each observation of the original series represents one replicate for the month during which it was taken The results are presented in the usual form of ANOVA tables d f number of degrees of freedom 55 sum of squares ms mean square F value of the statistic for the test H Du u Wyo H the monthly means are not equal where u is the mean of the 1 ith month For more information on ana
9. ar Normal continuation of program This option allows the user to return to the normal execution of the program as soon as the data manipulations are finished The user may also avoid any treatment of years with low frequencies by choosing this option as soon as the menu appears Help Gives a bit of help concerning the use of option 1 18 The analysis of variance used here is only a preliminary step the detection of a trend as it serves only to detect seasonality The analyses of variance used in this program use only one value per interval season for each year If the original sample contained more than one observation in the interval the mean of these values is used As little experimental planning was done so as to have optimum results the analysis of variance must be done with the available data and conclusions validated when they don t seem appropriate To validate the use of the analysis of variance a test of the equality of the variances BARTLETT is performed before the results are printed and a warning issued if the equality of the variances is rejected SEASONALITY The analysis of variance is used to construct seasons having significantly different means Firstly the monthly means are tested to see if they are significantly different If they are then there is seasonality The user is allowed to regroup the data so that larger seasons are analyzed this option 15 not possible if months
10. available Q 28 9 LOSES APP E 29 10 Analysis summary 6 30 10 1 Output of results 31 11 Norm violation 7 8 EXE 11 1 Definition of the two sub populations 11 2 Choice of norm or threshold 11 3 Analysis oos pe uu uu ial ope APPENDIX Program flow charts for 1 to 7 32 32 32 33 INTRODUCTION This software package uses non parametric methods to detect trends in water quality data Input data may be any compatibly structured temporal series The program is easy to use due to its interactive and graphic interface The principal parts of the program as follows Part 1 Part 2 Part 3 Part 4 Part 5 Test Part 6 Part 7 Reading of the measured concentrations mass loadings and discharges Graphic representation of the original data Analysis of sampling frequency elimination of periods without data months and or years detection of possible seasonality and a concentration discharge relationship Choice of a work interval and method to replace missing data detection of any persistence Display of inertia graphics Double Mass and CUSUM function from which the type of trend monotonic or step may be determined as well a
11. bably because the interval chosen was too narrow frequency too high On the other hand few missing values may mean that the interval was well chosen in the case of regular sampling or that the chosen interval was too large low frequency It should be noted that this table will not be presented if there were missing values no empty intervals Table 4 2 displays the results of the analysis of persistence done on the equidistant series The coefficients of correlation for the first to sixth order and their associated standard deviations are presented From this the user can see the coefficients that were significantly different from 0 as well as the most probable structure of the persistence as determined by the program Three different structures can be identified In the first there is no persistence the first order autocorrelation coefficient is not significantly different from 0 The second structure is Markovian persistence p1 is significantly different from 0 and the second order partial auto correlation coefficient is not significantly different from 0 Finally the third is non Markovian persistence pl and the second order partial auto correlation coefficient are significantly different from 0 A coefficient is considered significantly different from 0 if its value is at least 2 times greater than its standard deviation 25 8 Test preparation 5 This program gathers together all the inf
12. duced by an effect of changes in discharges Therefore the user should look for a different pattern than that of the Double Mass of parameter vs time CUSUM FUNCTION The CUSUM function should be used in the same way as the Double Mass curves as far as the change of slope of successive points is concerned The CUSUM function is calculated as follows t CUSUM X E x j x E Ser d the CUSUM is graphed as a function of time t What to look for Th CUSUM function cumulative sums in effect rotates the mean lines of graphs 2 2 and 2 3 so that they are brought to the horizontal The deviations from the general mean line are therefore much more visible as they determine the scale of the graph in opposition to the mean line as in the double mass curves This graph reveals if the curve intersects the y 0 axis very often in that case there is probably no significant trend if there are departures on only one side of the curve indicating a probable trend if the curve is parabolic suggesting a monotonic linear trend if there are discontinuous lines typical of stepwise trends 14 5 1 Graphs available The program can produce up to eight different graphs although only one is always displayed It is graph 2 1 which shows the temporal evolution of the parameter and permits the user to see any possible outliers This graph is displayed after each outlier elimination Graphs 2 2 to 2 5 show different Double Ma
13. e the same type Graph 5 2 is displayed when the mass loading is analyzed instead of the concentration Graph 5 3 displays the CUSUM curve of the parameter concentration in function of time The equidistant series is used for this graph as well This graph allows the user to detect break points which can serve as separation points in the case of a Mann Whitney test for the detection stepwise trend Graph 5 4 is displayed when mass loading is analyzed instead of concentration 29 9 Tests Twelve tests available to the user The majority of these tests are classic non parametric tests modified so that they take into account seasonality and or persistence Six programs allow the different tests to be run on regrouped data so that each program runs the test for the case where persistence is present and a corresponding test where persistence is absent Table 9 1 displays the number and name of each test as well as the program where it is located Table 9 1 Tests available Program name Test number Test name 1 MANN WHITNEY MW EXE 2 MANN WHITNEY LETTENMAIER MW EXE 3 MANN WHITNEY SEASONALITY MWS EXE amp MANN WHITNEY SEAS LETTENMAIER MWS EXE 5 KENDALL KEN EXE 6 SPEARMANN LETTENMAIER SP EXE 7 KENDALL SEASONALITY 5 8 HIRSCH AND SLACK 5 _ 9 FOSTER AND STUART 1 FS EXE 10 FOSTER AND STUART 2 FS EXE 11 KENDALL S TAU 12 SPEARMAN SP EXE All the information on the constructi
14. either method will work while the criteria for model adjustment at the end of P6 can be used to choose the most appropriate test The CUSUM functions can be very useful from an exploratory point of view when making such a choice In fact a stepwise trend results in a CUSUM of the form two models can be identified from these graphs making the choice easier 27 DOUBLE MASS AND CUSUM These graphs should be considered in the same way as in P2 The data however are now equidistant permitting a more efficient study of the graphs It is important to use the graphs in a complementary fashion the double mass graph gives a idea of the amplitude of the possible trend while the CUSUM graph gives a better idea of the changes in the slope In fact the height of the Double Mass is defined by the extreme right of the cumulated values and large deviations from that line are a hint for the significance of the trend The height of the CUSUM graph is defined by the amplitude Therefore it will always appear that there is a significant trend so the CUSUM must be used to appreciate changes in slopes ANOTHER TEST In general the test suggested above is the most appropriate but in certain cases the user may want to compare the results with those of other tests When a Markovian persistence is detected the suggested tests use the LETTENMAIER correction for Markovian persistence The user may want to co
15. enus and explications permitting him to locate the date in the file Note that if there is no date the programs can not be used even if DIVISE EXE worked This option is included for the external use of the program when the data are equidistant identify the column containing discharges if one exists transform the concentrations to mg l the charges to Kg day and the discharges to m sec so that results from different parts of the program will be compatible permit the inclusion of dates if desired the output files dates are mandatory however for analysis with the programs Finally the user is asked for output file names It is not necessary to create files for all the parameters in the input file For example only one parameter and one output file may be chosen 3 Batch files So that the different parts of the programs can run rapidly and automatically a series of nested batch files are used They include EXCEED BAT DETECT BAT PP1 BAT PP5 BAT The file EXCEED BAT permits an analysis of violations calling successively 1 P2DEP 7 and P8 The file DETECT BAT permits a complete trend analysis Pl P6 Each of the other programs allows the user to restart at a different spot in the programs for example PP2 allow the user to restart at P2 However the preceeding analysis must have been run up to one level farther along than the new starting point For example the program can be
16. er programs can operate correctly P This program permits a simple acquisition of the data and creates an output file which is easily readable by the file division program This program also permits the acquisition of a single series of data which can then be read by Pl The user is guided by menus and questions which permit him to 6 define the presence absence of dates the presence of dates 15 mandatory for the analysis performed by the programs eliminate badly recorded data append data to an already existing file DIVISE EXE NAQUADAT and industrial files contain data on concentrations and mass loadings pertaining to many parameters This program divides these files into files containing only one parameter To use the program the user types DIVISE with the disk containing the program DIVISE EXE in the drive The user is first asked to give the name of the file to be divided DATA DAT The user then 15 asked to choose one of the three types of files which can be used 1 Norma1 NAQUADAT file type 2 NAQUADAT file type with a lt CR gt after 80 characters 3 Industrial file type As the two first file types have very spacial uses the choice industrial will generally be chosen by the user This last option allows the treatment of nearly all file types which do not come from NAQUADAT The user is asked to number the columns in the input file including the date He is then guided by m
17. everal distinct outputs available according to the presence of seasonality and the type of trend studied All output in text mode is written to the screen and a file SYNTHESE P6 at the same time each time it is run results are appended to the end of this file The only difference between what is seen on the screen and what is written to the file SYNTHESE P6 is the screen display of the graphs showing the quality of the adjustment for the type of trend chosen none monotonic or stepwise In the case where there is no trend only the general mean is plotted as a straight line 11 NORM VIOLATION 7 AND P8 EXE This program studies the frequency intensity and the duration of those values of a series concentrations or mass loadings which exceed an environmental norm or regulation It also allows the comparison of two sub populations from the studied series The definition of the two sub populations can be determined after completion of the preceding program DETECT before and after the first operation of a treatment plant for example or completely arbitrarily recent data vs old data summer months vs winter months high flow data vs low flow data In any case it is only necessary to run the preparation program 11 1 Definition of the sub populations The sub populations are divided by boolean intersections according to dates start end specific months or classes of discharge 11 2 Choice of a norm or
18. have been truncated The user regroup the data as many times as he wishes however only the last one will be used If the user does not regroup 12 seasons of one month each will be used The use of seasons permits the use of tests which take into account the presence of cycles in the data However the use of seasonality tests with no seasonality will result in a loss of power as compared to non seasonality tests C Q RELATIONSHIP Another way to estimate the values for empty intervals is to look for a is a strong relationship between concentration and discharge 19 proposed relationship is of following form rating curve Firstly the following regression is carried out ln C a b 1n Q which results in the initial units of a and b b This model is not the best for the original base but it is simple and gives a good idea of the strength of the relationship between the concentration and discharge 6 1 Graphs and tables available The program contains 5 graphs and 6 tables However they will not all be used in any one session as in many cases there is a different graph or table for either mass loading or concentration A Graphs Graphs 3 1 and 3 2 present the temporal evolution of the parameter in question although only one will be displayed according to whether mass loading or concentration is being analyzed The monthly means for each year for the chosen par
19. he same period the other years Use of concentration discharge relationship Interpolation is used when there is no seasonality but there is persistence It should be noted however that the use of this method will increase the persistence The use of the mean value for the interval during the other years is done when seasonality is present and when representative means of each of the intervals is used The concentration discharge method is the preferred method but it is not very often that the relationship is strong enough to allow a valid estimate for the empty interval N B When the number of observation is low it may be better to choose the second method means so as not to create persistence before it is studied In fact there is a difficulty in the analysis equidistant data should be used for the analysis of persistence lent a preliminary persistence analysis could validate the interpolation as the method for data generation 24 7 1 Tables available The program has two tables Table 4 1 displays the number of missing values per interval the number of intervals is the number of complete intervals in a year Intervals are numbered from the beginning of the calendar year if there has been truncation The number of missing values for an interval is the number of years for which there was no observation available for that interval If the number of missing values is high for all the intervals it is pro
20. lyses of variance and their associated hypotheses refer to NETER and WASSERMAN 1974 Applied Linear Statistical Models Irwin Homewood 842 pp Seasonality is present if the test shows that the monthly means are significantly different When the equality of the means is rejected the user has the option of regrouping the months so as to construct seasons which may be more appropriate Table 3 2b presents the results of the analysis of variance of the data regrouped by the user The table is the same as table 3 2 except that the corresponding test is Ho u where k is the number of seasons defined when the data where regrouped Tables 3 3 and 3 3b are used if the parameter analyzed was mass loading instead of concentration Table 3 4 presents the results of the regression in the form of the relationship C first results variance and mean the natural logs of the concentration and discharge The parameter estimators a and b are 21 ale obtained from the result of the regression a b In Q transforming a gt and b b The percentage of the variance explained gives an estimate of the strength of the relationship FISHER s test is also used to determine if the relationship is significant and to allow a valid estimate of the concentrations from discharges 22 7 Work series creation and structure P4 EXE This part of the program carries out the following _ choice of an interv
21. mpare the results obtained with this adaptation with those obtained without it to see if there is a difference in trend detection For various reasons the analysis of seasonality may not be convincing due to the presence of outliers which make the distribution of the data non normal the equality of the variances may be rejected In this case the user may want to compare the results of seasonal tests with those of non seasonal tests The agreement of several tests permits the validation of the conclusions despite factors such persistence and seasonality When there is no agreement the user must use his best judgement to determine the best test to use in the face of the effects of seasonality or persistence It may also be advisable to choose another test if the suggested test is seasonal and there are few observations This is especially important in the case of the MANN WHITNEY seasonality test as its power is not known and the data used must have been taken before and after a separation 28 8 1 Graphs available The program contains four graphs although only two will be displayed for any analysis according to whether the user has chosen to analyze concentrations or mass loadings Graph 5 1 displays the double mass curve of the parameter concentration as a function of time This graph is made from the equidistant series and it is this difference that distinguishes it from graph 2 2 The trend structures however ar
22. nd 2 3 present such Double Mass curves The line that starts at 0 0 and goes to the upper right hand corner of the graph is called the general mean line Such graph should be regarded as an set of changing slopes Thus f a group of points seems to form a straight line with a slope greater than that of the mean line one can conclude that the mean of these points is greater than that of the mean in general At the same time a slope less than that of the mean line means that the mean of these points is less than that of the general mean These types of curves are therefore very useful for detecting trends in the means Other lines above and below the mean line can be seen in graphs 2 2 and 2 3 These rails represent two standard deviations from the mean line and each is calculated using only the points associated with its side If these rails are far from the mean line it suggests large variations in the data and allows the detection of large differences between the Double Mass curve and the mean line The rail is not distinguishable from the mean line if all the points are situated on only one side of it If there is no trend present the points of the Double Mass curve are situated on both sides of the mean line in a random fashion B Double mass of parameter vs discharge The principle of these Double Mass curves is the same as for parameter vs time However it allows to see if a detected trend vs time could not have been intro
23. of files which will be read correctly example 1 79 04 19 79 05 10 79 06 07 79 06 29 example 2 00QUO2MC920279 04 191440EST 805 000002 920279 05 191550 5 805 000002 920279 06 071200 5 805 000002 920279 06 293400 5 805 000002 920279 08 281300 5 805 example 3 79 04 19 79 05 10 79 06 07 79 06 29 From the example it is clear that the spaces left blank not the fields are not read This format was chosen because it conforms with that used by NAQUADAT UNITS So that quantities agree with those used by the programs concentration should 25 25 25 25 25 900000 300000 900000 000000 900000 400000 200000 200000 100000 900000 300000 900000 000000 212341 198743 207432 198672 be in mg l mass loadings in kg day and discharges in m sec DISCHARGE FILE If discharges are used they must be present in the original discharge file and in the file described above 5 Graphic analysis This file reads the previously created work file and carries out a graphic analysis Initially it presents the temporal evolution of the parameter under study and suggests the elimination of maxima and minima which if not rejected as outliers could bias the graphic interpretation It should be noted that for the trend detection itself the choice of non parametric methods limits the impact of these values The pr
24. ogram then asks the user if he wants the following plotted Double Mass curves B CUSUM function GENERAL REMARKS OUTLIERS Outliers are eliminated so as not to bias the results of the graphic analysis as well as the different parametric analyses carried out in other parts of the program For the trend detection the non parametric tests used yield stable results even if outliers are present Once such a value has been eliminated it is no longer accessible by the rest of the program and the only way to get it back is to start again with Pl using the original file DOUBLE MASS CURVES Double mass curves show using accumulated sums parameter evolution For a double mass curve of the parameter vs time we have on the ordinate time t t 2 x where Xt is the value of the parameter at time t the t 0 abscissa we have the ranks of time t expressed as number of days since the first measured observation In the case of a double mass curve of parameter vs discharge on the abscissa we would have at time t 2 Q The ordinate is t 0 calculated as in the previous example 12 the observations non equidistant the Double Mass should be used here as exploratory method for detecting trends Their use in 5 with equidistant data will be more representative of the nature of the phenomenon studied What to look for A Double mass of parameter vs time Graphs 2 2 a
25. on and application of these tests is in the detailed methodological report of the programs Technical specifications are presented for each 10 Analysis summary 6 This program summarises all the results obtained for the series and presents for the parametric tests the dates of the changes the levels of the parameter studied as well as the slopes of the trends GENERAL REMARKS CRITERIA ADJUSTMENT The criteria adjustment and the graphic study at the end of P6 allow the user to judge how well the trend model used stepwise or monotonic was suited to the data The smaller the root mean square error RMSE the better the adjustment The elimination of extreme values in P2 will generally reduce the RMSE but should not be used for this purpose VS the trend detection tests used this software package all parametric and the analyses in P6 are all parametric there can occasionally be disagreement between results In such cases the presence of extreme values is usually the cause of the contradictory results The user can restart the program at P2 and eliminate the extreme values if agreement of the results is desired Such a phenomenon is possible because of the weak influence of outliers on non parametric tests compared to parametric tests It is always preferable to give more weight to the results obtained with the non parametric results 31 S
26. ormation about the work series that has been determined by the preceding programs interval seasonality persistence length etc It then determines which part of the series to analyze if there is a monotonic or step trend and suggests the appropriate test according to the following decision tree DECISION TREE TREND PERSISTENCE SEASONALITY APPROPRIATE TEST Monotonic gt Markovian no seasons Lettenmaier Spearman trend persistence with seasons Hirsch and Slack No no seasons Spearman Kendall persistence with seasons Kendall seasonality Stepwise Markovian no seasons Lettenmaier Mann Whitney trend persistence with seasons Hirsch and Slack No no seasons Mann Whitney persistence with seasons Kendall seasonality The test chosen by the user is carried out and the results displayed GENERAL REMARKS MONOTONIC OR STEPWISE The choice of the detection of a monotonic or stepwise trend can be made according to the knowledge the user has about the series under study The objective of the analysis also helps in making the choice For example if one wants to study the impact of the opening a treatment plant it would be best to choose detection of stepwise trend with the separation on either s de of the opening date of the plant For those cases where the changes more gradual such as the acid rain effect or changes of land uses in a watershed the monotonic trend detection should be used When the trend is very strong
27. restarted at PP3 after a complete run of the program but it could not be started at that point if the execution had stopped at Pl This amp Series Preparation P1 EXE program carries out the following operations completely reads the input file identification of the discharges if they exist elimination of dates where the studied parameter in question has not been analyzed replacement of the sampling date with the number of days since the beginning of the sampling period creation of a second quality parameter concentration or mass loading from the discharge series if it exists creation of a work file TMP for later use General remarks INPUT FILE The input file can contain mass loadings or concentrations The data files supplied with the program contain only concentrations The input file must be the following FORTRAN format 12X 12 1X 12 1X 12 16X 12 6 F12 6 which means 12 spaces a 2 digit integer containing the year a space a 2 digit integer containing the month a space a 2 digit integer containing the day 16 spaces a 12 digit real number with 6 decimal places containing the charge or concentration a 12 digit real number with 6 decimal places containing the discharge As this last number is optional blank spaces should be used if there are no discharges in the file programs be modified to accept another type of data structure The following are examples
28. s the date of any change Recommendation of the most appropriate non parametric test considering the structure of the present series Execution of test and diagnostic Summary of the data characteristics and the options chosen as well as parametric interpretation of the results slope of the trend date of the change initial and final levels Analysis of norm violation A flow chart for each of these parts is included in the Appendix with all statistical programs of this type the user is by his choices responsible for the validity of the results obtained Warnings are issued by the program at the various critical steps Because of changing sampling schemes the choice of an appropriate work interval requires both that the data collected accurately reflect the phenomenon studied and that the chosen interval does not create too much fictitious data 3 l General remarks The program requires an IBM compatible PC XT or AT model personal computer running DOS 2 0 or higher It should have 512 K RAM at least CGA EGA video card monochrome or color monitor capable of displaying graphics 2 floppy drives or a hard disk and one disk drive 360 Kb 5 1 2 in double sided double density a dot matrix printer capable of producing graphics an 8087 or 80287 co processor is not required but greatly increases calculation speed 1 2 General information
29. s to a file In this case it writes to SYNTHESE P7 and as before each time the program is run results are appended to the end of the file In addition during the execution of P7 graphs are displayed on the screen and will be printed if the user has selected this option in Pl FLOW CHART PROGRAM 1 READING OR L DO DISCHARGES EXIST READING OF DISCHARGES CREATION OF THE SECOND DATA FILE CREATION OF FILE TMP ANALYSIS OF 0 CONCENTRATION 1 MASS LOADING L T FLOW CHART OF PROGRAM P2 GRAPH OF SERIES VS TIME REJECT OF MAX AND OR MIN AS OUTLIERS V 00 ON Ut Q N CONCENTRATIONS MASS LOADINGS TIME N _ il DOUBLE MASS C vs T DOUBLE MASS L vs T DOUBLE MASS C vs Q DOUBLE MASS L vs Q C vs T STOP SERIES 5 TABLE OF MONTHLY FREQUENCIES FLOW CHART OF PROGRAM P3 ASK STARTING TABLE OF MISSING VALUES PERSISTENCE FLOW CHART PROGRAM 5 DOUBLE MASS CURVE CUSUM CURVE TREND TYPE STEPWISE OR MONOTONIC SERIES DIVISION no ASK FOR RANKS OR DATES FOR DIVISION SUGGESTED TEST ACCORDING TO TYPE OF TREND PERSISTENCE AND SEASONALITY USE OF THE TEST TO USE APPROPRIATE TEST yes WRITING OF INFORMATION TO IDENT TMP FOR USE BY TE
30. ss curves called by the menu options 1 2 3 and 4 These graphs allow the user to detect large deviations on one side of the line representing the general mean from the lower left to the upper right these deviations being important in the detection of trends Graphs 2 6 to 2 8 the CUSUM curves allow the user to see in the form of a trend the concentration and mass loadings of the parameter as well as its associated discharge These graphs are menu options 5 6 and 7 It should be noted that when there are no discharge available many graphs are not available If in a series without discharges the parameter measured was concentrations graphs 2 3 2 4 2 5 2 6 and 2 8 will not be available whereas if mass loadings was the measured parameter graphs 2 2 2 3 2 4 2 5 2 6 and 2 7 will not be available 15 6 Series evaluation This program accomplishes the following tasks display of the temporal evolution establishment of monthly sampling frequency table This permits a preliminary evaluation of the work interval A subsequent question allows the elimination of entire months where sampling did not take place eg winter seasonality analysis on a monthly basis by ANOVA The resulting graphs of monthly means allow the user to determine a suitable regrouping of the months the detection of a significant concentration discharge relationship which can be used later if the user wishes GENERAL REMARKS IR
Download Pdf Manuals
Related Search
Related Contents
Manual PDF Klipsch R-12SW subwoofer HD09 MANUAL(英)-FOXTECH View S-4611 User Manual Husqvarna EZF 3417/ 965879301 Lawn Mower User Manual ROLINE DisplayPort Cable, DP M - DVI M 2 m KX-VC600 / KX-VC300 CN English Service Manual IM_5CCY-0411_ABQ-C_ES you can you can Copyright © All rights reserved.
Failed to retrieve file