Water Quality Parameter Prediction In Stream Flow Environmental Sciences Essay

In This paper, the truth public presentation of preparation, proof and anticipation of monthly H2O quality parametric quantities is discussed when using Adaptive Neuro-Fuzzy Inference System ( ANFIS ) . This theoretical account analyze historical informations were generated through uninterrupted monitoring Stationss of H2O quality parametric quantities ( dependent variables ) of Johor River in order to copy their secondary property ( independent variables ) . However, the informations originating from supervising Stationss and experiment may be polluted by noise signals owing to systematic mistakes and random mistakes. This noisy informations frequently makes the predict undertaking comparatively hard. In order to counterbalance for this augmented noise the primary aim of this manuscript is to develop technique that can heighten the truth of H2O quality anticipation ( WQP ) . Therefore, this survey suggests an augmented Wavelet – Neuro-Fuzzy ( WDT-ANFIS ) based informations merger faculty for WQP. The efficiency of the faculties was examined to foretell critical parametric quantities that affected due to the urbanisation around the rivers. The parametric quantities have been studied ; electrical CONDuctivity ( COND ) , entire dissolved solids ( T.D.S ) and TURBidity ( TURB ) . Results showed that the optimal degree of truth was achieved by doing the length of the cross-validation equal to one fifth of the information record. Furthermore, WDT-ANFIS faculty outperformed the ANFIS faculty with important betterment in anticipation truth. This consequence indicates that the proposed attack is fundamentally an attractive option, offering a comparatively fast algorithm with good theoretical belongingss to de-noise and predict the H2O quality parametric quantities. This new technique will be valuable to help determination shapers in describing the position of H2O quality, probe of spacial and temporal alterations.


Water quality mold is the footing of H2O pollution control undertaking. It predicts the inclination of H2O quality assortment harmonizing to the current H2O environment quality status, transportation and transmutation regulation of the pollutants in the river basin.

In H2O quality mold, [ 1 ] reported that the turbidness was related closely to both its respiration rate and the H2O temperature. Model users will see sunlight strength fluctuation within the H2O column when imitating the eutrophication phenomenon [ 2 ] .These illustrations reflect that human intelligence uses bing cognition to cut down the figure of picks in order to raise the effectivity of theoretical account use. Each clip, they tend to change simply one or two parametric quantities. This is because if they modify many parametric quantities at the same clip, they may easy acquire lost in the use way. However, due to the size and heterogeneousness of the set possible input variables, and because of the nonlinear nature of the job, conventional methods are non assuring. To this terminal, unreal intelligence techniques ( AI ) techniques are capable to mime this behaviour every bit good as to complement the lack.

Recently, AI has been accepted as an efficient alternate tool for mold of complex non-linear systems. The theoretical accounts normally do n’t see the internal mechanism but build theoretical accounts to foretell H2O quality via the relationship between inputs and end products.

At present, unreal intelligence techniques ( AI ) have been used intensively for anticipation in a figure of water-related countries, including H2O resource survey [ 3 ] , oceanology [ 4 ] , and air pollution [ 5 ] .

The above survey attempts were usually based on an premise that the informations to be used should be dependable and accurate. However, the informations originating from probe and experiment may be polluted by noise signals due to the subjective and/or nonsubjective mistakes [ 6 ] . For illustration, the experiment mistakes may be resulted from measuring, reading, recording, and external conditions. Since these noisy signals are likely to falsify the consequences of theoretical accounts, it is a must to take them ( that is, signal denoising ) before utilizing any original informations.

Signals can be denoised through the application of a set of additive filters [ 7 ] . However, one job of these filters is that they are more appropriate in additive systems than nonlinear systems. In add-on, Fourier analysis technique ( FAT ) is a classical tool for cut downing noises, but it is merely suited for denoising data/signals incorporating steady noises. Due to the noises that are unsteady in real-world instances, its application is still limited. To get the better of the jobs of traditional denoising techniques, more sophisticated techniques such as wavelet de-noising technique ( WDT ) has been proposed.

WDT is utile for denoising multi-dimensional spatial/ temporal signals incorporating steady/unsteady noises. It has been widely applied to technology systems for patterns acknowledgment and cognition find [ 8 ] and [ 9 ] .

However, few of these surveies were applied to H2O quality direction systems, where the H2O quality monitoring informations demands to be used for parametric quantity anticipation [ 10 ] . In this survey a WDT is proposed for cut downing noises induced by complex uncertainness.

As mentioned before, there are many different types of AIs techniques among them unreal nervous webs ( ANN ) and adaptative neuro-fuzzy illation systems ( ANFIS ) have late gained attending in literature. Although ANN is rather powerful for patterning assorted existent universe jobs, it besides has its defects. If the input informations are less accurate or equivocal, ANN would be fighting to manage them and a fuzzed system such as ANFIS might be a better option. ANFIS theoretical account shows significantly higher truth and dependable in term of anticipation than ANN [ 11 ] .

In this article, ANFIS faculty is proposed to foretell WQP at Johor River Basin. Furthermore, an augmented WDT-ANFIS based informations merger faculty for WQPP will present. In add-on, comprehensive comparing analysis is carried out between ANFIS and DWT-ANFIS to measure the public presentation that achieved after take the noising from the informations.



Johor is the 2nd largest province in Peninsula Malaysia with an country of 18,941 km2. Johor River considers the chief river in Johor. The river flows in a approximately north-south way and empties into the Johor. The H2O quality of Johor River has been deteriorated with increasing degrees of assorted pollutants. Besides, it persists to be silted and chocked by rubbish and wastes as a effect of storage of enforcement by local-authorities. These contaminations finally flow into Johor River Estuaries, which are rich wonts that provide engendering and feeding countries for fish and domestic fowl.

In this research, a survey of ANN patterning to foretell entire dissolved solids, electrical conduction and turbidness in Johor River basin is presented. These H2O quality parametric quantities were measured 60 samples within continuance 1998-2007 at chief watercourse as shown in Fig. 1.


In this survey, the H2O quality parametric quantities of involvements are entire dissolved solids ( T.D.Solids ) , electrical conduction and turbidness due to their importance in the Johor River and its feeder. The information appears that conduction is extremely affected due to cumulative consequence of urban land usage from upstream in the survey country. While, high concentration of dissolved solids were found in the survey country and caused H2O balance jobs for aquatic beings. Furthermore, the turbidness values exceed 300A NTU ( Nephelometric Turbidity Units ) . This poses serious jobs for the H2O intervention station located near the river.

There are many parametric quantities more of import than what we selected in our survey such as COD, BOD and DO. But the scope of BOD harmonizing to the DOE monitoring station in the survey country was found between ( 1-2 ) which indicate there is non much organic waste nowadays in the H2O. Same scenario was observed in COD information which is ranged ( 10-15 ) . Therefore, this manuscript focuses on critical parametric quantities that affected due to the urbanisation around the rivers.


Adaptive neuro-fuzzy illation system ( ANFIS ) , foremost was proposed by Jang in 1993 [ 12 ] , can accomplish a extremely nonlinear function and it is superior to common additive methods in bring forthing nonlinear clip series [ 13 ] . Throughout this research, it was considered the ANFIS architecture for the first order Sugeno fuzzy theoretical account [ 14 ] . The ANFIS is a multilayer provender forward web which uses nervous web larning algorithms and fuzzed logical thinking to map an input infinite to an end product infinite [ 15 ] . Assuming the fuzzy illation system under consideration has two inputs, ten and Y, and one end product, degree Fahrenheit for a first-order Sugeno fuzzy theoretical account, a common regulation set with two fuzzy if.then regulations can be expressed as:

Rule 1: If x is A1and Y is B1 so f1=p1 x+q1 y+r1 ( 1 )

Rule 2: If x is A2 and Y is B2 so f2=p2 x+q2 y+r2 ( 2 )

Where A1, A2 and B1, B2 are the rank maps ( medium frequency ) for inputs x and y, severally ; pi, chi and Rhode Island ( one = 1 or 2 ) are additive parametric quantities in the attendant portion of the first-order Sugeno fuzzy theoretical account. The corresponding tantamount ANFIS architecture is showed in Fig. 2, where nodes of the same bed have similar maps. ANFIS consists of five beds as follow:

Optimization Scheme of ANFIS Module Parameters

For each of WQP we used same architectures that presented in Fig. 2.Where, three inputs ( suspended solids, pH and temperature ) were used to foretell the TURB. One input ( T.D.S ) was used to foretell the COND, while same faculty used to foretell the T.D.S after utilized the COND as input.

It is to be noted that there is no analytical method to find the optimum figure of MFs. The optimum figure of MFs is normally determined heuristically and verified by experimentation. Hence, the figure of MFs is selected in test and mistake footing. In the interim, it is noted that we have tried four types of rank map: ( a ) triangular, ( B ) trapezoidal, ( degree Celsius ) gaussian, and ( vitamin D ) bell-shaped to build the fuzzed Numberss. After a big figure of tests, as a consequence bell-shaped distributed rank map compared with the others have obtained the minimal comparative mistake. Table 1 illustrates the figure and the types of MFs that adopted in this survey to make faculties.The ANFIS faculty is trained until making certain minimal mistake or after finishing certain figure of developing era. In this survey, the less no. of loop was introduced in order to devour the clip Fig 3 depicts the alteration in Root Mean Square Error ( RMSE ) for the ANFIS faculty during preparation and cross- proof. It is obvious from the figure that the faculty reaches the end 300 era.


Wavelet analysis represents the following logical measure after short-time Fourier transforms ( STFT ) . It is based on a windowing technique with variable-sized parts. Wavelet transform ( WT ) allows the usage of long clip intervals where we want more precise low frequence information, and shorter parts where we want high frequence information [ 16 ] . In general, the major advantage offered by ripples is the ability to execute local analysis ; that is to analyse a localised country of a larger signal. The discrete-time WT of a clip domain signal is given as [ 10 ] :

( 3 )

Where, is the female parent ripple while and are, severally, the grading and switching indices. The grading gives the DWT logarithmic frequence coverage in contrast to the unvarying frequence coverage of the STFT. This analysis method so consists of break uping a signal into constituents at several frequence degrees, which are related by powers of two ( a dyadic graduated table ) [ 16 ] . The filtrating attack to multi-resolution WT is to organize a series of half-band filters that divide a spectrum into a high frequence set and a low frequence set. It is formulated on a grading map or low-pass filter ( LP ) and a ripple map or high-pass filter ( UP ) [ 17 ] . Wavelet Multi-resolution analysis ( WMRA ) builds a pyramidic construction that requires an iterative application of grading and ripple maps to low-pass and high base on balls filters, severally. These filters ab initio act on the full signal set at the high frequence ( small-scale values ) foremost and bit by bit cut down the signal set at each phase. As in Fig.4, the high-frequency set end products are represented by the item coefficients ( Dl, D2, D3 ) , and the low-frequency set end products are represented by the estimate coefficients ( Al, A2, A3 ) .

Optimum parametric quantity choice for wavelet de-noising of WQP informations

When utilizing ripples to de-noise WQP information, there are many factors that must be considered. Examples of these picks are which ripple, degree of decomposition, and thresholding methods to utilize. MATLAB provides several households of ripples including the Morlet, Mexican chapeau, Meyer, Haar, Daubechies, Symlets, Coiflets and Spline biorthogonal ripples and provides farther certification about these ripple households [ 18 ] . In order to obtain perfect Reconstruction consequences, merely extraneous ripples will be considered. The extraneous ripple transform has certain benefits. It is comparatively concise, allows for perfect Reconstruction of the original signal and is non comparatively hard to cipher. The two common methods of thresholding a signal are soft thresholding and difficult thresholding which are used in the MATLAB ripple tool chest [ 9. M. Misiti, Y. Misiti, G. Oppenheim and J.-M. Poggi. Wavelet Toolbox: For Use With, The Math Works Inc ( 1996 ) .18 ] . Although difficult thresholding is the simplest method, soft thresholding can bring forth better consequences than difficult thresholding. Therefore, the soft thresholding was adopted in this survey. There are four threshold choice regulations that are available to utilize with the ripple tool chest [ 9. M. Misiti, Y. Misiti, G. Oppenheim and J.-M. Poggi. Wavelet Toolbox: For Use With, The Math Works Inc ( 1996 ) .18 ] are shown in table 2. These threshold choice regulations use statistical arrested development of the noisy coefficients over clip to obtain a non-parametric appraisal of the reconstructed signal without noise. Merely the Sqtwolog was investigated in this survey which this method uses a fixed signifier threshold, which consequences in minimax public presentation multiplied by a factor proportional to logarithm of the length of the signal. Sing to the degree of decomposition, in this manuscript we reached the decision that a decomposition degree of 4 produced sensible consequences after utilised test and mistake method for all faculties.


Cross proof is a exemplary rating method that provides an indicant of how good the scholar will make when it is asked to do new anticipations for information it has non already seen. One manner to get the better of this job is to non utilize the full informations set when developing a scholar. Some of the information is removed before preparation Begins. Then when preparation is done, the informations that was removed can be used to prove the public presentation of the erudite theoretical account on “ new ” information. This is the basic thought for a whole category of theoretical account rating methods called cross proof.

The training/validation informations split can hold a important impact on the consequences of the theoretical accounts [ 19 ] . Several methods for implementing the cross-validation theory were proposed in the literature ; nevertheless, the kernel of all these methods is similar. Among these different techniques the hold-out method is adopted in this survey due to its simpleness. Most of other methods ( like k-fold and Leave-one-out ) are computationally expensive.

The hold-out method is the simplest sort of cross-validation. The information set is partitioned into two sets, called the preparation set and the testing set, without any peculiar pick of the divider. As shown in Fig. 5, the map approximator is trained to suit a map utilizing the preparation set merely. Then the map approximator is used to foretell the end product values for the information in the testing set.

Elect optimal length of informations for Cross Validation

The job with using the cross-validation attack in our survey is choosing the length of the information set utilised. It is of import for this choice to be representatives for characteristics of both developing informations set and the expected information set in the anticipation procedure. Different length of the cross-validation informations set of one ten percent, one fifth and one tierce of the information records has been examined Fig. 6.


Since the H2O parametric quantities were truthfully monitored over these ten old ages, the public presentation of the proposed faculties can be examined and evaluated. The public presentations of the faculties are evaluated harmonizing to Mean Absolute Error MAE.

( 4 )

Where m is the figure of proving sample, pi ( trial ) is the theoretical account end product for each parametric quantity, Ti is the existent value for each parametric quantity, I is the parametric quantity index. In this survey, for a given set of ANFIS parametric quantities, three web architectures were developed for each parametric quantity in order to obtain the optimum length of cross-validation informations set that provides the high degree of anticipation truth. The table 3 verifies that, taking one tierce of the information records was non sufficient.The chief ground for this is there was non adequate information record for the preparation procedure, which made it hard to make the mistake end. On the other manus, when one ten percent of the information was used for the cross proof procedure, the high degrees of mistake were observed. Choosing one ten percent of informations records lead to failing in observing the characteristics of the expected information set in anticipation procedure.

It appears from the tabular array that the MAE was lessening to the about half for the ANFIS faculty that adopted to foretell the T.D.S. after using the one fifth of the information record comparing with cross-validation length equal to tierce.

Furthermore, the MAE for the ANFIS that used to foretell the TURB equal to 36.54 when cross-validation equal to one ten percent, while truth ANFIS reduces this mistake to 20.52 after cross proofs equal to one fifth of information record was introduced. Same scenario was obtained in the faculty that used to foretell the COND. As complete, the optimal degree of truth was achieved by choosing the length of the cross-validation equal to one fifth of the information record.

After select the optimum length of cross-validation, a comparing between the ANFIS faculties and WDT-ANFIS faculties will present in order to analyze the effectivity of the proposed faculties to foretell WQP. Fig.7 demonstrates the public presentation of the WDT-ANFIS during preparation and cross-validation procedure. It ‘s obvious in Fig. 7 that the public presentation end of was achieved in less than 100 eras, while the same end could non be achieved in Fig. 3. This consequence depicts that the WDT-ANFIS capable to devour the clip.

Apparently from Fig. 8 that the WDT-ANFIS based faculty outperformed the ANFIS and was able to supply betterment in anticipation truth of TURB with MAE equal to 0.1. While, inefficient public presentation was observed harmonizing to ANFIS faculty when the MAE was exceed 20. It is apparent with the addition of noise strength ; the WQP will hold more accurate anticipation value through the informations de-noised by WDT than those without de-noising. This indicates the high quality of WDT in cleaning the information.

Apparently, the ANFIS faculty that used to foretell the T.D.S was able to accomplish acceptable consequence after using natural information. This consequence figures out that the ANFIS faculty can efficaciously capture nonlinearity in input/output function. More sweetenings in anticipation of T.D.S were achieved after using WDT-ANFIS. Where the truth was betterment eight times comparing with the ANFIS faculty.

Albeit the mistake while proving is consider comparatively high comparing with preparation and cross- proof stage, achieve a MAE do n’t transcend 3.4 which is consider best consequence in predict the T.D.S. This consequence showed that the WDT-ANFIS faculty could be considered as the appropriate mold technique for anticipation such WQP.

It can besides be observed that the WDT-ANFIS faculty outperformed the ANFIS faculty and was able to supply betterment in anticipation truth for COND. It can be noticed that the WDT-ANFIS was capable of accomplishing high degree of truth in anticipation phase equal to 2.2, while inefficient public presentation was observed harmonizing to the ANFIS faculty when the per centum of MAE exceed 25.

Over all, in this research, the WDT-ANFIS can hence be declared as the best web architecture because it outperforms ANFIS. These consequences show that the WDT-ANFIS theoretical account was non merely capable of bettering the truth of WQPP but the theoretical account besides was capable of capturing the temporal forms of the H2O quality which allowed it to supply important sweetening in anticipation. As a consequence, ANFIS module become more capable to capture the moral force and the complex processes that hidden in the informations itself for WQP after augmented it with WDT.

The above observation is farther illustrated in Fig.9 demoing the public presentation of WDT-ANFIS faculties while preparation, cross-validation and anticipation procedure for the WQP. It is obvious from the Fig. 9 ( a ) that the proposed faculty that adopted to foretell the COND was capable to placing the complex non-linear relationship between input and end product informations where merely one fifth of records were used during anticipation. It is clear from the figure that the maximal per centum mistake for all the trained and cross-validation records is merely 1.85 % and 1.2 % severally. While, the maximal per centum mistake for all the predicted records is merely 3.9 % . This consequence reveals that the proposed faculty able to imitating the existent behaviour of the COND in the H2O organic structure.

In order to salvage clip and avoid making independent faculty to each parametric quantity, old faculty that utilized to foretell the COND will follow and examined to foretell the T.D.S. it can be noticed from Fig. 9 ( B ) that using one parametric quantity as input does non let the WDT-ANFIS faculty to hold the exposure to enough input kineticss and the chance to be suitably evaluated during cross-validation procedure. Consequently, the mistake degrees during the anticipation procedure may get down to be larger than those obtained during transverse proof. As a consequence, the system may non be able to keep the same degrees of truth for both the cross proof and anticipation procedure. Even though the degree of mistakes in the anticipation phase is comparatively high, moderately good consequence was obtained for the anticipation of T.D.S where maximal per centum mistakes during anticipation procedure did non transcend 10 %

Ali et at. [ 20 ] adopted RBF-NN to foretell the TURB at Johor River Basin. High degree of mistake was obtained due to one parametric quantity was used as input to the faculty. In this portion of this paper, three parametric quantities were introduced as input to the faculty after investigated the correlativity between the inputs and end product. Fig.9 ( degree Celsius ) shows the faculty public presentation while preparation, cross proof and anticipation for TURB. It can detect that the maximal mistake in preparation procedure is falling within 0.05 % which indicates that the theoretical account able gaining control the relationships between input/output. Although the mistake of anticipation phase equal to four times the mistake in cross-validation, the faculty provides high degree of truth comparing with the other faculties that used to foretell the COND and T.D.S where the maximal per centum mistake for all the predicted records is merely 0.21 % .


Adaptive Neuro-Fuzzy Inference System ( ANFIS ) can cover with extremely variable, lingual, obscure and unsure informations or cognition. Therefore, this manuscript employs ANFIS theoretical account as a feasible agencies to foretell three H2O quality parametric quantities ; Electrical CONDuctivity ( COND ) , Total Dissolved Solids ( T.D.S ) and TURBidity ( TURB ) at Johor River basin, Malaysia. However, legion incompatibilities in information records are observed in the informations that mensurating by Department of Environment ( DOE ) monitoring Stationss owing to systematic mistakes, random mistakes and mistakes associated with informations entry. This blunt mismatch generate noise corrupted the records of the existent value of the parametric quantities. Therefore, the truth of the ANFIS is significantly affected by such mistakes nature and forms of the monitored informations. In order to predominate over these insufficiencies, ANFIS theoretical account is augmented with Wavelet De-noising Technique ( WDT ) to better the truth. The consequence shown that the WDT can be applied successfully and lend to heighten the truth of H2O quality anticipation by synthesis it with ANFIS algorithm.


The writers wish to thank Department Of Environment for supplying the needed informations for developing this research and to Dr. Sundarambal Palani for her penetration and counsel throughout this research. This research was supported by the research grant for the 2nd and 3rd writers from University Kebangsaan Malaysia UKM-GUP-PLW-08-13-308.