In recent years, advancements in artificial neural networks (ANNs) have played a pivotal role in solving complex problems across various domains. However, traditional ANNs often face challenges in capturing intricate features, especially when dealing with non-linear data patterns. To address these limitations, the ensemble deep random vector functional link (edRVFL) neural network has emerged as a promising solution. This blog delves into the recent paper titled "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System," which introduces a novel approach that combines the strengths of edRVFL and fuzzy inference systems (FIS) to enhance feature learning capabilities.
Neural networks have been at the forefront of machine learning due to their ability to model complex relationships between inputs and outputs. These networks aim to approximate a target function
To overcome these challenges, randomized neural networks (RNNs) were introduced. Unlike BP-based networks, RNNs fix certain parameters during training and compute output layer parameters using iterative processes or closed-form solutions. Among RNNs, the random vector functional link (RVFL) network stands out. RVFL is a shallow feed-forward network characterized by randomly initialized hidden layer parameters that remain fixed during training. This architecture allows for efficient learning with fewer adjustable settings, making it a practical choice for various applications.
However, the shallow RVFL model has limitations, such as its inability to effectively capture intricate hidden relationships due to its reliance on a single hidden layer. To address these limitations, the deep RVFL (dRVFL) and ensemble deep RVFL (edRVFL) networks were proposed. The dRVFL introduces multiple hidden layers, enabling the capture of more complex relationships, while the edRVFL leverages ensemble learning to enhance stability and effectiveness.
Fuzzy logic provides a mathematical framework for handling uncertainty and imprecise information. Within this framework, the fuzzy inference system (FIS) employs fuzzy logic principles to emulate human-like reasoning through the integration of fuzzy sets, linguistic variables, and a set of rules. The fundamental components of FIS include fuzzification, rule evaluation, and defuzzification. Fuzzification transforms precise input values into fuzzy sets, rule evaluation applies conditional statements to generate fuzzy output values, and defuzzification converts these fuzzy outputs into precise outcomes.
FIS has found applications in various domains, including control systems, pattern recognition, finance, and classification. Given the dynamic nature of feature enhancement capabilities in FIS, the paper proposes integrating FIS with the edRVFL architecture to create a hybrid model—edRVFL-FIS.
The paper introduces the ensemble deep random vector functional link neural network based on a fuzzy inference system (edRVFL-FIS). The core idea behind this model is to leverage the capabilities of both deep learning and fuzzy logic to produce rich feature representations for training the ensemble model. The edRVFL-FIS model incorporates diverse clustering methods, including R-means, K-means, and Fuzzy C-means, to establish fuzzy layer rules, resulting in three model variations: edRVFL-FIS-R, edRVFL-FIS-K, and edRVFL-FIS-C.
The architecture of the edRVFL-FIS model is modular and structured, allowing for a systematic processing of data. Input samples undergo a fuzzification process, forming a fuzzy layer. This layer generates enhanced fuzzified features by combining original input features with fuzzy features. Each hidden layer in the model operates as an independent base model within the ensemble framework. These base models utilize both hidden and defuzzified features to make predictions.
Each base model in the edRVFL-FIS framework can be viewed as an RVFL model that incorporates distinct input features. The ensemble structure leverages diverse supervised defuzzified features for each base model, enhancing the richness of feature extraction during output prediction.
To understand the edRVFL-FIS model, let's dive into its mathematical framework. The model's architecture is defined by several key equations that describe the formation of hidden layers, fuzzification, defuzzification, and output prediction.
Formation of Hidden Layers:
The first hidden layer
where
For subsequent hidden layers
Output Calculation:
The output of the first hidden layer is computed as:
where
For the
Fuzzification:
The fuzzification process generates a fuzzy layer matrix
where
Defuzzification:
The defuzzification process converts the fuzzified vectors into defuzzified outputs that contribute to the final output layers. The defuzzified output
where
Final Output:
For an unknown sample
The final output is obtained by combining the outputs of all base models through majority voting:
The paper titled "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System" conducts a series of experiments to validate the performance of the proposed ensemble deep random vector functional link neural network based on a fuzzy inference system (edRVFL-FIS). This section provides a detailed overview of these experiments, highlighting the datasets used, the compared models, the experimental setup, and the results.
The experiments were conducted using two main categories of datasets:
Each dataset varies in terms of domain, size, and complexity, providing a comprehensive evaluation of the model's performance.
The performance of the proposed edRVFL-FIS models was compared against several baseline models, including both shallow and deep variants of randomized neural networks (RNNs). The models compared in the experiments include:
The proposed models evaluated were:
The experiments were conducted with a focus on evaluating the classification accuracy, standard deviation, ranking, and statistical significance of the models' performance. The models were trained and tested on each dataset, and their performance was compared using several statistical tests, including the Friedman test and the Wilcoxon-signed-rank test. These tests were employed to determine the statistical significance of the differences in performance between the proposed models and the baseline models.
The key metrics used for comparison were:
Table 1 presents the classification accuracies of the proposed edRVFL-FIS models and the baseline models on Category-1 UCI datasets. The proposed models consistently outperformed the baseline models, with edRVFL-FIS-R achieving the highest average accuracy of 82.08%, followed by edRVFL-FIS-C and edRVFL-FIS-K.
Dataset | RVFL | ELM | BLS | H-ELM | dRVFL | edRVFL | Fuzzy BLS | edEGERVFL | edRVFL-FIS-R | edRVFL-FIS-K | edRVFL-FIS-C |
---|---|---|---|---|---|---|---|---|---|---|---|
Balance Scale | 98.24 | 98.24 | 98.08 | 91.68 | 95.68 | 95.36 | 98.40 | 97.76 | 98.56 | 98.56 | 98.88 |
Breast Cancer Wisconsin | 89.42 | 89.71 | 87.70 | 87.56 | 87.99 | 87.85 | 91.14 | 89.13 | 89.85 | 89.42 | 89.85 |
Dermatology | 97.00 | 97.26 | 97.81 | 97.26 | 96.99 | 97.27 | 97.54 | 96.99 | 97.54 | 97.81 | 97.82 |
Echocardiogram | 83.90 | 83.90 | 86.21 | 84.67 | 84.67 | 84.67 | 81.62 | 80.11 | 86.98 | 86.21 | 86.21 |
Ecoli | 60.61 | 60.91 | 60.92 | 59.43 | 60.32 | 61.21 | 60.02 | 56.73 | 61.51 | 60.91 | 61.22 |
Energy Y1 | 88.40 | 88.53 | 88.28 | 88.27 | 89.58 | 90.36 | 89.32 | 88.67 | 91.92 | 92.32 | 92.32 |
Energy Y2 | 91.15 | 91.66 | 89.84 | 88.01 | 90.62 | 90.49 | 91.40 | 87.63 | 92.32 | 92.19 | 92.58 |
Flags | 52.60 | 51.59 | 53.64 | 53.13 | 53.12 | 52.06 | 52.65 | 56.17 | 54.14 | 54.68 | 54.66 |
Heart Switzerland | 44.87 | 46.43 | 48.13 | 44.07 | 44.80 | 43.20 | 44.80 | 50.47 | 49.70 | 47.37 | 47.33 |
Iris | 65.33 | 65.33 | 76.00 | 62.67 | 74.00 | 72.00 | 71.33 | 74.67 | 74.67 | 76.00 | 76.00 |
LED Display | 72.70 | 72.60 | 72.10 | 73.40 | 72.30 | 72.30 | 73.10 | 71.90 | 72.90 | 73.20 | 73.10 |
Lymphography | 84.39 | 84.41 | 85.72 | 86.46 | 85.72 | 83.72 | 85.70 | 85.79 | 87.13 | 87.08 | 86.41 |
Mammographic | 79.71 | 79.09 | 78.67 | 78.36 | 79.92 | 79.40 | 79.92 | 79.40 | 79.92 | 80.23 | 80.23 |
Pittsburgh Bridges REL | 59.05 | 63.19 | 64.14 | 51.81 | 64.24 | 61.43 | 63.38 | 75.24 | 72.95 | 69.00 | 64.10 |
Pittsburgh Bridges TOR | 86.14 | 86.14 | 90.14 | 86.14 | 86.14 | 86.14 | 89.14 | 91.14 | 90.19 | 86.14 | 89.14 |
Statlog Heart | 80.74 | 80.74 | 82.22 | 79.63 | 81.48 | 81.85 | 81.11 | 80.74 | 81.85 | 83.33 | 83.33 |
Statlog Vehicle | 81.68 | 82.03 | 82.39 | 78.72 | 82.74 | 82.15 | 78.84 | 79.91 | 82.27 | 82.74 | 83.57 |
Tic Tac Toe | 89.97 | 91.00 | 97.60 | 65.31 | 99.06 | 99.06 | 82.45 | 99.06 | 99.16 | 98.33 | 99.37 |
Zoo | 92.00 | 93.00 | 96.00 | 92.00 | 94.00 | 94.00 | 95.00 | 96.00 | 96.00 | 95.00 | 96.00 |
Average Accuracy | 78.84 | 79.25 | 80.82 | 76.24 | 80.18 | 79.71 | 79.31 | 80.92 | 82.08 | 81.61 | 81.69 |
Average Rank | 8.08 | 7.58 | 5.58 | 9.03 | 6.66 | 7.42 | 6.47 | 6.42 | 2.84 | 3.39 | 2.53 |
The proposed models (edRVFL-FIS variants) consistently show higher average accuracies and lower average ranks compared to the baseline models, indicating their superior performance across a range of datasets.
The Wilcoxon-signed-rank test was used to compare the performance of the proposed models with baseline models. The p-values indicate that the proposed models significantly outperform the baseline models.
Table 2: Wilcoxon-signed-rank Test on Category-1 UCI Datasets
Baseline Model | p-value (edRVFL-FIS-R) | Null Hypothesis | p-value (edRVFL-FIS-K) | Null Hypothesis | p-value (edRVFL-FIS-C) | Null Hypothesis |
---|---|---|---|---|---|---|
RVFL | 0.00014 | Rejected | 0.00021 | Rejected | 0.00014 | Rejected |
ELM | 0.00014 | Rejected | 0.00041 | Rejected | 0.00014 | Rejected |
BLS | 0.00727 | Rejected | 0.03689 | Rejected | 0.00344 | Rejected |
H-ELM | 0.00026 | Rejected | 0.00027 | Rejected | 0.00022 | Rejected |
dRVFL | 0.00045 | Rejected | 0.00078 | Rejected | 0.00021 | Rejected |
edRVFL | 0.00021 | Rejected | 0.00041 | Rejected | 0.00014 | Rejected |
Fuzzy BLS | 0.00053 | Rejected | 0.00300 | Rejected | 0.00045 | Rejected |
edEGERVFL | 0.00206 | Rejected | 0.01842 | Rejected | 0.00565 | Rejected |
Explanation of Table 2:
In Table 2, all the p-values are below 0.05, leading to the rejection of the null hypothesis in each case. This indicates that the proposed edRVFL-FIS models (R, K, and C variants) significantly outperform all the baseline models across the Category-1 UCI datasets.
The Category-2 UCI datasets, which include larger datasets, further validated the robustness of the proposed models. Table 3 shows that the edRVFL-FIS models continued to outperform the baseline models, with edRVFL-FIS-C achieving the highest average accuracy of 72.32%.
Dataset | RVFL | ELM | BLS | H-ELM | dRVFL | edRVFL | Fuzzy BLS | edRVFL-FIS-R | edRVFL-FIS-K | edRVFL-FIS-C |
---|---|---|---|---|---|---|---|---|---|---|
Car | 71.47 | 70.60 | 72.39 | 70.01 | 70.01 | 70.01 | 73.20 | 72.51 | 72.04 | 70.01 |
Cardiotocography (10 classes) | 70.46 | 70.04 | 65.85 | 69.19 | 70.04 | 70.56 | 65.01 | 70.93 | 69.95 | 71.12 |
Cardiotocography (3 classes) | 86.22 | 86.08 | 85.84 | 85.14 | 86.31 | 86.27 | 86.13 | 85.89 | 86.88 | 86.83 |
Chess KRvKp | 79.20 | 78.98 | 77.91 | 81.70 | 85.64 | 81.98 | 70.12 | 81.14 | 81.42 | 84.83 |
Contraceptive Method Choice | 40.52 | 41.14 | 41.00 | 39.10 | 41.07 | 40.65 | 40.32 | 41.33 | 41.87 | 42.01 |
Mushroom | 99.86 | 99.91 | 98.65 | 95.04 | 99.88 | 99.72 | 99.77 | 99.91 | 100 | 100 |
Titanic | 77.10 | 77.10 | 77.92 | 77.33 | 77.33 | 77.10 | 77.33 | 77.92 | 77.92 | 77.10 |
Wine Quality Red | 59.66 | 59.91 | 60.17 | 58.91 | 59.48 | 60.16 | 59.60 | 60.79 | 60.41 | 60.67 |
Yeast | 56.74 | 56.94 | 56.81 | 56.20 | 57.75 | 57.95 | 56.20 | 57.95 | 58.02 | 58.29 |
Average Accuracy | 71.25 | 71.19 | 70.73 | 70.29 | 71.94 | 71.60 | 69.74 | 72.04 | 72.06 | 72.32 |
Average Rank | 6.50 | 6.06 | 6.44 | 8.44 | 5.28 | 5.67 | 7.11 | 3.39 | 3.06 | 3.06 |
The proposed edRVFL-FIS models (R, K, and C variants) generally achieve higher classification accuracies across the datasets, reflecting their ability to better capture the underlying data patterns compared to the baseline models. The edRVFL-FIS-C model, in particular, shows the highest average accuracy and ranks well across the datasets.
The NDC datasets were used to evaluate the performance of the proposed models on large-scale data. These datasets ranged from 10,000 to 1 million samples. As shown in Table 4, the proposed edRVFL-FIS-K model achieved the highest average accuracy of 97.93%, demonstrating its effectiveness in handling big data.
Table 4: Classification Accuracies on NDC Datasets
Dataset | RVFL | ELM | BLS | H-ELM | dRVFL | edRVFL | edRVFL-FIS-R | edRVFL-FIS-K | edRVFL-FIS-C |
---|---|---|---|---|---|---|---|---|---|
NDC-10K | 94.15 | 93.85 | 92.40 | 86.50 | 97.35 | 97.55 | 97.65 | 98.30 | 98.00 |
NDC-25K | 95.15 | 95.25 | 92.65 | 85.45 | 97.15 | 97.05 | 97.10 | 98.20 | 97.80 |
NDC-50K | 94.15 | 94.20 | 92.40 | 85.50 | 97.50 | 96.80 | 97.90 | 97.80 | 97.65 |
NDC-75K | 94.05 | 94.00 | 92.05 | 86.00 | 97.20 | 96.95 | 97.80 | 97.95 | 97.60 |
NDC-100K | 93.65 | 93.70 | 92.50 | 85.15 | 97.15 | 97.50 | 97.20 | 97.85 | 97.50 |
NDC-500K | 94.25 | 94.00 | 92.35 | 86.25 | 97.20 | 97.25 | 97.05 | 98.00 | 97.60 |
NDC-1M | 93.40 | 93.50 | 91.55 | 85.55 | 96.95 | 97.00 | 97.45 | 97.40 | 97.70 |
Average Accuracy | 94.11 | 94.07 | 92.27 | 85.77 | 97.21 | 97.16 | 97.45 | 97.93 | 97.69 |
Average Rank | 6.57 | 6.43 | 8.00 | 9.00 | 4.29 | 4.07 | 3.00 | 1.43 | 2.21 |
The results from these tables indicate that the proposed edRVFL-FIS models consistently outperform traditional models across different datasets, showcasing their ability to generalize effectively across small, medium, and large-scale datasets. The incorporation of fuzzy inference systems within the deep RVFL framework allows the models to capture more intricate patterns, leading to superior performance.
To ensure that the observed differences in performance are statistically significant, the authors employed the Friedman test followed by the Wilcoxon-signed-rank test. The Friedman test was used to rank the models based on their performance across all datasets, while the Wilcoxon-signed-rank test compared the proposed models against each baseline model in a pairwise manner.
The Friedman test results confirmed that the differences in rankings were significant, leading to the rejection of the null hypothesis (no difference between the models). The Wilcoxon-signed-rank test further supported these findings, showing that the proposed edRVFL-FIS models significantly outperformed the baseline models.
The paper presents extensive experimental results to demonstrate the efficacy of the proposed edRVFL-FIS models. The models were evaluated on benchmark datasets from the UCI repository and the NDC datasets, which vary in size and complexity. The proposed models consistently outperformed existing baseline models in terms of accuracy, standard deviation, and ranking.
For instance, on Category-1 UCI datasets (sample sizes ranging from 100 to 1000), the edRVFL-FIS-R model achieved an average accuracy of 82.08%, while the edRVFL-FIS-C and edRVFL-FIS-K models achieved 81.69% and 81.61%, respectively. These results were further validated using statistical tests such as the Friedman and Wilcoxon-signed-rank tests, which confirmed the statistical superiority of the proposed models over baseline models.
Similarly, on larger NDC datasets (with sample sizes ranging from 10,000 to 1 million), the edRVFL-FIS-K model achieved the highest average accuracy of 97.93%, followed by edRVFL-FIS-C (97.69%) and edRVFL-FIS-R (97.45%). These results underscore the robustness of the proposed models in handling large and complex datasets.
The proposed edRVFL-FIS model represents a significant advancement in the field of neural networks by integrating the strengths of deep learning and fuzzy logic. The use of fuzzy inference systems within the edRVFL framework enables the model to capture intricate patterns and relationships in data, resulting in superior performance across various datasets.
The paper's findings suggest that the proposed edRVFL-FIS models offer a promising solution for addressing the limitations of traditional neural networks, particularly in scenarios involving complex and non-linear data patterns. Future research could explore further enhancements to the model, such as
the incorporation of more advanced clustering techniques or the application of the model to other domains.
In conclusion, the edRVFL-FIS model represents a powerful tool for data-driven decision-making, offering a robust and versatile approach to feature learning and prediction.
Sajid, M., Tanveer, M., & Suganthan, P. N. (2024). "Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System." IEEE Transactions on Fuzzy Systems. DOI: 10.1109/TFUZZ.2024.3411614.
Huang, G. B., Zhu, Q. Y., & Siew, C. K. (2006). "Extreme Learning Machine: Theory and Applications." Neurocomputing, 70(1-3), 489-501. DOI: 10.1016/j.neucom.2005.12.126.
Chen, C. P., & Liu, Z. (2017). "Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture." IEEE Transactions on Neural Networks and Learning Systems, 29(1), 10-24. DOI: 10.1109/TNNLS.2017.2671881.
Zhou, J., Zhang, L., & Tang, C. (2017). "Hierarchical Extreme Learning Machine With Applications in EEG Classification." IEEE Transactions on Cybernetics, 48(12), 3291-3303. DOI: 10.1109/TCYB.2017.2771492.
Shi, L., Tanveer, M., & Kumar, P. (2021). "Deep Random Vector Functional Link Networks." Neural Computing and Applications, 33(11), 5925-5937. DOI: 10.1007/s00521-020-05383-5.
Tanveer, M., Tiwari, A., & Suganthan, P. N. (2022). "Ensemble Deep Extended Graph Embedded Random Vector Functional Link Network." IEEE Transactions on Neural Networks and Learning Systems, 33(10), 5247-5261. DOI: 10.1109/TNNLS.2021.3134696.
Zadeh, L. A. (1965). "Fuzzy Sets." Information and Control, 8(3), 338-353. DOI: 10.1016/S0019-9958(65)90241-X.
Takagi, T., & Sugeno, M. (1985). "Fuzzy Identification of Systems and its Applications to Modeling and Control." IEEE Transactions on Systems, Man, and Cybernetics, SMC-15(1), 116-132. DOI: 10.1109/TSMC.1985.6313399.
Sajid, M., Tanveer, M., & Suganthan, P. N. (2024). Ensemble deep random vector functional link neural network based on fuzzy inference system. IEEE Transactions on Fuzzy Systems. https://doi.org/10.1109/TFUZZ.2024.3411614