Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System

In recent years, advancements in artificial neural networks (ANNs) have played a pivotal role in solving complex problems across various domains. However, traditional ANNs often face challenges in capturing intricate features, especially when dealing with non-linear data patterns. To address these limitations, the ensemble deep random vector functional link (edRVFL) neural network has emerged as a promising solution. This blog delves into the recent paper titled "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System," which introduces a novel approach that combines the strengths of edRVFL and fuzzy inference systems (FIS) to enhance feature learning capabilities.

Introduction to Neural Networks and RVFL

Neural networks have been at the forefront of machine learning due to their ability to model complex relationships between inputs and outputs. These networks aim to approximate a target function by learning a hypothesis function within a hypothesis space . The goal is to minimize the generalization error, i.e., the difference between the predicted output and the true output. Traditional neural networks rely on backpropagation (BP) to optimize parameters, which often leads to challenges such as slow convergence, susceptibility to local optima, and high computational demands.

To overcome these challenges, randomized neural networks (RNNs) were introduced. Unlike BP-based networks, RNNs fix certain parameters during training and compute output layer parameters using iterative processes or closed-form solutions. Among RNNs, the random vector functional link (RVFL) network stands out. RVFL is a shallow feed-forward network characterized by randomly initialized hidden layer parameters that remain fixed during training. This architecture allows for efficient learning with fewer adjustable settings, making it a practical choice for various applications.

However, the shallow RVFL model has limitations, such as its inability to effectively capture intricate hidden relationships due to its reliance on a single hidden layer. To address these limitations, the deep RVFL (dRVFL) and ensemble deep RVFL (edRVFL) networks were proposed. The dRVFL introduces multiple hidden layers, enabling the capture of more complex relationships, while the edRVFL leverages ensemble learning to enhance stability and effectiveness.

The Role of Fuzzy Inference Systems (FIS)

Fuzzy logic provides a mathematical framework for handling uncertainty and imprecise information. Within this framework, the fuzzy inference system (FIS) employs fuzzy logic principles to emulate human-like reasoning through the integration of fuzzy sets, linguistic variables, and a set of rules. The fundamental components of FIS include fuzzification, rule evaluation, and defuzzification. Fuzzification transforms precise input values into fuzzy sets, rule evaluation applies conditional statements to generate fuzzy output values, and defuzzification converts these fuzzy outputs into precise outcomes.

FIS has found applications in various domains, including control systems, pattern recognition, finance, and classification. Given the dynamic nature of feature enhancement capabilities in FIS, the paper proposes integrating FIS with the edRVFL architecture to create a hybrid model—edRVFL-FIS.

The Proposed edRVFL-FIS Model

The paper introduces the ensemble deep random vector functional link neural network based on a fuzzy inference system (edRVFL-FIS). The core idea behind this model is to leverage the capabilities of both deep learning and fuzzy logic to produce rich feature representations for training the ensemble model. The edRVFL-FIS model incorporates diverse clustering methods, including R-means, K-means, and Fuzzy C-means, to establish fuzzy layer rules, resulting in three model variations: edRVFL-FIS-R, edRVFL-FIS-K, and edRVFL-FIS-C.

Architecture of the edRVFL-FIS Model

The architecture of the edRVFL-FIS model is modular and structured, allowing for a systematic processing of data. Input samples undergo a fuzzification process, forming a fuzzy layer. This layer generates enhanced fuzzified features by combining original input features with fuzzy features. Each hidden layer in the model operates as an independent base model within the ensemble framework. These base models utilize both hidden and defuzzified features to make predictions.

Each base model in the edRVFL-FIS framework can be viewed as an RVFL model that incorporates distinct input features. The ensemble structure leverages diverse supervised defuzzified features for each base model, enhancing the richness of feature extraction during output prediction.

Mathematical Framework of edRVFL-FIS

To understand the edRVFL-FIS model, let's dive into its mathematical framework. The model's architecture is defined by several key equations that describe the formation of hidden layers, fuzzification, defuzzification, and output prediction.

Formation of Hidden Layers:
The first hidden layer is calculated as:

where is the input matrix, is the fuzzified matrix, is the randomly initialized weights matrix, is the bias vector, and is the activation function.

For subsequent hidden layers (where ):
Output Calculation:
The output of the first hidden layer is computed as:

where is the unknown weight matrix, and is the defuzzified matrix.

For the -th hidden layer:
Fuzzification:
The fuzzification process generates a fuzzy layer matrix by fuzzifying each training sample using the TSK FIS. The weighted fire strength of the -th fuzzy rule is given by:

where is the membership function corresponding to the fuzzy set .
Defuzzification:
The defuzzification process converts the fuzzified vectors into defuzzified outputs that contribute to the final output layers. The defuzzified output of a sample is defined as:

where are the parameters used in the THEN part of the fuzzy inference system.
Final Output:
For an unknown sample , the output of the -th base model is:

The final output is obtained by combining the outputs of all base models through majority voting:

Experimental Results and Analysis of the edRVFL-FIS Model

The paper titled "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System" conducts a series of experiments to validate the performance of the proposed ensemble deep random vector functional link neural network based on a fuzzy inference system (edRVFL-FIS). This section provides a detailed overview of these experiments, highlighting the datasets used, the compared models, the experimental setup, and the results.

1. Datasets

The experiments were conducted using two main categories of datasets:

Category-1 UCI Datasets: These datasets include relatively smaller datasets from the UCI Machine Learning Repository, with sample sizes ranging from 100 to 1000.
Category-2 UCI Datasets: These datasets also come from the UCI repository but feature larger sample sizes, ranging from 1000 to 10,000.
NDC Datasets: The NDC datasets, generated using David Musicant’s NDC Data Generator, are large-scale datasets with sample sizes ranging from 10,000 to 1 million.

Each dataset varies in terms of domain, size, and complexity, providing a comprehensive evaluation of the model's performance.

2. Compared Models

The performance of the proposed edRVFL-FIS models was compared against several baseline models, including both shallow and deep variants of randomized neural networks (RNNs). The models compared in the experiments include:

RVFL: Standard shallow RVFL model.
ELM: Extreme Learning Machine, a fast learning algorithm for single-layer feedforward neural networks.
BLS: Broad Learning System, which expands the feature space with various feature mapping units.
H-ELM: Hierarchical Extreme Learning Machine, which enhances ELM by stacking multiple layers.
dRVFL: Deep RVFL model with multiple hidden layers.
edRVFL: Ensemble deep RVFL model, which leverages ensemble learning techniques.
Fuzzy BLS: A neuro-fuzzy-based variant of the Broad Learning System.
edEGERVFL: Ensemble deep extended graph embedded RVFL, which incorporates graph structures in the feature space.

The proposed models evaluated were:

edRVFL-FIS-R: edRVFL-FIS model with R-means clustering.
edRVFL-FIS-K: edRVFL-FIS model with K-means clustering.
edRVFL-FIS-C: edRVFL-FIS model with Fuzzy C-means clustering.

3. Experimental Setup

The experiments were conducted with a focus on evaluating the classification accuracy, standard deviation, ranking, and statistical significance of the models' performance. The models were trained and tested on each dataset, and their performance was compared using several statistical tests, including the Friedman test and the Wilcoxon-signed-rank test. These tests were employed to determine the statistical significance of the differences in performance between the proposed models and the baseline models.

The key metrics used for comparison were:

Classification Accuracy: The percentage of correct predictions made by the model.
Standard Deviation: The variability in model accuracy across different runs.
Average Rank: The average ranking of the model across all datasets, with lower ranks indicating better performance.

4. Results on Category-1 UCI Datasets

Table 1 presents the classification accuracies of the proposed edRVFL-FIS models and the baseline models on Category-1 UCI datasets. The proposed models consistently outperformed the baseline models, with edRVFL-FIS-R achieving the highest average accuracy of 82.08%, followed by edRVFL-FIS-C and edRVFL-FIS-K.

Table 1: Classification Accuracies on Category-1 UCI Datasets

Dataset	RVFL	ELM	BLS	H-ELM	dRVFL	edRVFL	Fuzzy BLS	edEGERVFL	edRVFL-FIS-R	edRVFL-FIS-K	edRVFL-FIS-C
Balance Scale	98.24	98.24	98.08	91.68	95.68	95.36	98.40	97.76	98.56	98.56	98.88
Breast Cancer Wisconsin	89.42	89.71	87.70	87.56	87.99	87.85	91.14	89.13	89.85	89.42	89.85
Dermatology	97.00	97.26	97.81	97.26	96.99	97.27	97.54	96.99	97.54	97.81	97.82
Echocardiogram	83.90	83.90	86.21	84.67	84.67	84.67	81.62	80.11	86.98	86.21	86.21
Ecoli	60.61	60.91	60.92	59.43	60.32	61.21	60.02	56.73	61.51	60.91	61.22
Energy Y1	88.40	88.53	88.28	88.27	89.58	90.36	89.32	88.67	91.92	92.32	92.32
Energy Y2	91.15	91.66	89.84	88.01	90.62	90.49	91.40	87.63	92.32	92.19	92.58
Flags	52.60	51.59	53.64	53.13	53.12	52.06	52.65	56.17	54.14	54.68	54.66
Heart Switzerland	44.87	46.43	48.13	44.07	44.80	43.20	44.80	50.47	49.70	47.37	47.33
Iris	65.33	65.33	76.00	62.67	74.00	72.00	71.33	74.67	74.67	76.00	76.00
LED Display	72.70	72.60	72.10	73.40	72.30	72.30	73.10	71.90	72.90	73.20	73.10
Lymphography	84.39	84.41	85.72	86.46	85.72	83.72	85.70	85.79	87.13	87.08	86.41
Mammographic	79.71	79.09	78.67	78.36	79.92	79.40	79.92	79.40	79.92	80.23	80.23
Pittsburgh Bridges REL	59.05	63.19	64.14	51.81	64.24	61.43	63.38	75.24	72.95	69.00	64.10
Pittsburgh Bridges TOR	86.14	86.14	90.14	86.14	86.14	86.14	89.14	91.14	90.19	86.14	89.14
Statlog Heart	80.74	80.74	82.22	79.63	81.48	81.85	81.11	80.74	81.85	83.33	83.33
Statlog Vehicle	81.68	82.03	82.39	78.72	82.74	82.15	78.84	79.91	82.27	82.74	83.57
Tic Tac Toe	89.97	91.00	97.60	65.31	99.06	99.06	82.45	99.06	99.16	98.33	99.37
Zoo	92.00	93.00	96.00	92.00	94.00	94.00	95.00	96.00	96.00	95.00	96.00
Average Accuracy	78.84	79.25	80.82	76.24	80.18	79.71	79.31	80.92	82.08	81.61	81.69
Average Rank	8.08	7.58	5.58	9.03	6.66	7.42	6.47	6.42	2.84	3.39	2.53

Explanation of Results in Table 1:

RVFL, ELM, BLS, H-ELM, dRVFL, edRVFL, Fuzzy BLS, edEGERVFL: These are the baseline models against which the proposed edRVFL-FIS models are compared.
edRVFL-FIS-R, edRVFL-FIS-K, edRVFL-FIS-C: These represent the proposed ensemble deep random vector functional link neural network models using R-means, K-means, and Fuzzy C-means clustering, respectively.
Bold values: Indicate the highest accuracy for a given dataset.

The proposed models (edRVFL-FIS variants) consistently show higher average accuracies and lower average ranks compared to the baseline models, indicating their superior performance across a range of datasets.

The Wilcoxon-signed-rank test was used to compare the performance of the proposed models with baseline models. The p-values indicate that the proposed models significantly outperform the baseline models.

Table 2: Wilcoxon-signed-rank Test on Category-1 UCI Datasets

Baseline Model	p-value (edRVFL-FIS-R)	Null Hypothesis	p-value (edRVFL-FIS-K)	Null Hypothesis	p-value (edRVFL-FIS-C)	Null Hypothesis
RVFL	0.00014	Rejected	0.00021	Rejected	0.00014	Rejected
ELM	0.00014	Rejected	0.00041	Rejected	0.00014	Rejected
BLS	0.00727	Rejected	0.03689	Rejected	0.00344	Rejected
H-ELM	0.00026	Rejected	0.00027	Rejected	0.00022	Rejected
dRVFL	0.00045	Rejected	0.00078	Rejected	0.00021	Rejected
edRVFL	0.00021	Rejected	0.00041	Rejected	0.00014	Rejected
Fuzzy BLS	0.00053	Rejected	0.00300	Rejected	0.00045	Rejected
edEGERVFL	0.00206	Rejected	0.01842	Rejected	0.00565	Rejected

Explanation of Table 2:

p-value: The p-value represents the probability of obtaining the observed results assuming the null hypothesis is true. A lower p-value indicates stronger evidence against the null hypothesis.
Null Hypothesis: The null hypothesis in this context is that there is no significant difference in the performance of the proposed edRVFL-FIS models and the baseline models.

In Table 2, all the p-values are below 0.05, leading to the rejection of the null hypothesis in each case. This indicates that the proposed edRVFL-FIS models (R, K, and C variants) significantly outperform all the baseline models across the Category-1 UCI datasets.

5. Results on Category-2 UCI Datasets

The Category-2 UCI datasets, which include larger datasets, further validated the robustness of the proposed models. Table 3 shows that the edRVFL-FIS models continued to outperform the baseline models, with edRVFL-FIS-C achieving the highest average accuracy of 72.32%.

Table 3: Classification Accuracies on Category-2 UCI Datasets

Dataset	RVFL	ELM	BLS	H-ELM	dRVFL	edRVFL	Fuzzy BLS	edRVFL-FIS-R	edRVFL-FIS-K	edRVFL-FIS-C
Car	71.47	70.60	72.39	70.01	70.01	70.01	73.20	72.51	72.04	70.01
Cardiotocography (10 classes)	70.46	70.04	65.85	69.19	70.04	70.56	65.01	70.93	69.95	71.12
Cardiotocography (3 classes)	86.22	86.08	85.84	85.14	86.31	86.27	86.13	85.89	86.88	86.83
Chess KRvKp	79.20	78.98	77.91	81.70	85.64	81.98	70.12	81.14	81.42	84.83
Contraceptive Method Choice	40.52	41.14	41.00	39.10	41.07	40.65	40.32	41.33	41.87	42.01
Mushroom	99.86	99.91	98.65	95.04	99.88	99.72	99.77	99.91	100	100
Titanic	77.10	77.10	77.92	77.33	77.33	77.10	77.33	77.92	77.92	77.10
Wine Quality Red	59.66	59.91	60.17	58.91	59.48	60.16	59.60	60.79	60.41	60.67
Yeast	56.74	56.94	56.81	56.20	57.75	57.95	56.20	57.95	58.02	58.29
Average Accuracy	71.25	71.19	70.73	70.29	71.94	71.60	69.74	72.04	72.06	72.32
Average Rank	6.50	6.06	6.44	8.44	5.28	5.67	7.11	3.39	3.06	3.06

Explanation of Table 3:

RVFL, ELM, BLS, H-ELM, dRVFL, edRVFL, Fuzzy BLS: These are the baseline models against which the proposed edRVFL-FIS models are compared.
edRVFL-FIS-R, edRVFL-FIS-K, edRVFL-FIS-C: These represent the proposed ensemble deep random vector functional link neural network models using R-means, K-means, and Fuzzy C-means clustering, respectively.
Bold values: Indicate the highest accuracy for a given dataset.

The proposed edRVFL-FIS models (R, K, and C variants) generally achieve higher classification accuracies across the datasets, reflecting their ability to better capture the underlying data patterns compared to the baseline models. The edRVFL-FIS-C model, in particular, shows the highest average accuracy and ranks well across the datasets.

6. Results on NDC Datasets

The NDC datasets were used to evaluate the performance of the proposed models on large-scale data. These datasets ranged from 10,000 to 1 million samples. As shown in Table 4, the proposed edRVFL-FIS-K model achieved the highest average accuracy of 97.93%, demonstrating its effectiveness in handling big data.

Table 4: Classification Accuracies on NDC Datasets

Dataset	RVFL	ELM	BLS	H-ELM	dRVFL	edRVFL	edRVFL-FIS-R	edRVFL-FIS-K	edRVFL-FIS-C
NDC-10K	94.15	93.85	92.40	86.50	97.35	97.55	97.65	98.30	98.00
NDC-25K	95.15	95.25	92.65	85.45	97.15	97.05	97.10	98.20	97.80
NDC-50K	94.15	94.20	92.40	85.50	97.50	96.80	97.90	97.80	97.65
NDC-75K	94.05	94.00	92.05	86.00	97.20	96.95	97.80	97.95	97.60
NDC-100K	93.65	93.70	92.50	85.15	97.15	97.50	97.20	97.85	97.50
NDC-500K	94.25	94.00	92.35	86.25	97.20	97.25	97.05	98.00	97.60
NDC-1M	93.40	93.50	91.55	85.55	96.95	97.00	97.45	97.40	97.70
Average Accuracy	94.11	94.07	92.27	85.77	97.21	97.16	97.45	97.93	97.69
Average Rank	6.57	6.43	8.00	9.00	4.29	4.07	3.00	1.43	2.21

The results from these tables indicate that the proposed edRVFL-FIS models consistently outperform traditional models across different datasets, showcasing their ability to generalize effectively across small, medium, and large-scale datasets. The incorporation of fuzzy inference systems within the deep RVFL framework allows the models to capture more intricate patterns, leading to superior performance.

7. Statistical Significance

To ensure that the observed differences in performance are statistically significant, the authors employed the Friedman test followed by the Wilcoxon-signed-rank test. The Friedman test was used to rank the models based on their performance across all datasets, while the Wilcoxon-signed-rank test compared the proposed models against each baseline model in a pairwise manner.

The Friedman test results confirmed that the differences in rankings were significant, leading to the rejection of the null hypothesis (no difference between the models). The Wilcoxon-signed-rank test further supported these findings, showing that the proposed edRVFL-FIS models significantly outperformed the baseline models.

Macro level conclusion of the Experiments

The paper presents extensive experimental results to demonstrate the efficacy of the proposed edRVFL-FIS models. The models were evaluated on benchmark datasets from the UCI repository and the NDC datasets, which vary in size and complexity. The proposed models consistently outperformed existing baseline models in terms of accuracy, standard deviation, and ranking.

For instance, on Category-1 UCI datasets (sample sizes ranging from 100 to 1000), the edRVFL-FIS-R model achieved an average accuracy of 82.08%, while the edRVFL-FIS-C and edRVFL-FIS-K models achieved 81.69% and 81.61%, respectively. These results were further validated using statistical tests such as the Friedman and Wilcoxon-signed-rank tests, which confirmed the statistical superiority of the proposed models over baseline models.

Similarly, on larger NDC datasets (with sample sizes ranging from 10,000 to 1 million), the edRVFL-FIS-K model achieved the highest average accuracy of 97.93%, followed by edRVFL-FIS-C (97.69%) and edRVFL-FIS-R (97.45%). These results underscore the robustness of the proposed models in handling large and complex datasets.

Conclusion and Future Directions

The proposed edRVFL-FIS model represents a significant advancement in the field of neural networks by integrating the strengths of deep learning and fuzzy logic. The use of fuzzy inference systems within the edRVFL framework enables the model to capture intricate patterns and relationships in data, resulting in superior performance across various datasets.

The paper's findings suggest that the proposed edRVFL-FIS models offer a promising solution for addressing the limitations of traditional neural networks, particularly in scenarios involving complex and non-linear data patterns. Future research could explore further enhancements to the model, such as

the incorporation of more advanced clustering techniques or the application of the model to other domains.

In conclusion, the edRVFL-FIS model represents a powerful tool for data-driven decision-making, offering a robust and versatile approach to feature learning and prediction.

References

Sajid, M., Tanveer, M., & Suganthan, P. N. (2024). "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System." IEEE Transactions on Fuzzy Systems. DOI: 10.1109/TFUZZ.2024.3411614.
Huang, G. B., Zhu, Q. Y., & Siew, C. K. (2006). "Extreme Learning Machine: Theory and Applications." Neurocomputing, 70(1-3), 489-501. DOI: 10.1016/j.neucom.2005.12.126.
Chen, C. P., & Liu, Z. (2017). "Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture." IEEE Transactions on Neural Networks and Learning Systems, 29(1), 10-24. DOI: 10.1109/TNNLS.2017.2671881.
Zhou, J., Zhang, L., & Tang, C. (2017). "Hierarchical Extreme Learning Machine With Applications in EEG Classification." IEEE Transactions on Cybernetics, 48(12), 3291-3303. DOI: 10.1109/TCYB.2017.2771492.
Shi, L., Tanveer, M., & Kumar, P. (2021). "Deep Random Vector Functional Link Networks." Neural Computing and Applications, 33(11), 5925-5937. DOI: 10.1007/s00521-020-05383-5.
Tanveer, M., Tiwari, A., & Suganthan, P. N. (2022). "Ensemble Deep Extended Graph Embedded Random Vector Functional Link Network." IEEE Transactions on Neural Networks and Learning Systems, 33(10), 5247-5261. DOI: 10.1109/TNNLS.2021.3134696.
Zadeh, L. A. (1965). "Fuzzy Sets." Information and Control, 8(3), 338-353. DOI: 10.1016/S0019-9958(65)90241-X.
Takagi, T., & Sugeno, M. (1985). "Fuzzy Identification of Systems and its Applications to Modeling and Control." IEEE Transactions on Systems, Man, and Cybernetics, SMC-15(1), 116-132. DOI: 10.1109/TSMC.1985.6313399.
Sajid, M., Tanveer, M., & Suganthan, P. N. (2024). Ensemble deep random vector functional link neural network based on fuzzy inference system. IEEE Transactions on Fuzzy Systems. https://doi.org/10.1109/TFUZZ.2024.3411614