Volume 37, Issue 10 pp. 7857-7887
RESEARCH ARTICLE

Generative and discriminative infinite restricted Boltzmann machine training

Qianglong Wang, Xiaoguang Gao (corresponding author), Kaifang Wan, Zijian Hu

School of Electronics and Information, Northwestern Polytechnical University, Xi'an, Shaanxi, China

Correspondence: Xiaoguang Gao, School of Electronics and Information, Northwestern Polytechnical University, 710129 Xi'an, Shaanxi, China.

Email: [email protected]
First published: 19 May 2022

Abstract

As one of the essential deep learning models, the restricted Boltzmann machine (RBM) is a widely used generative model. By adaptively growing the number of hidden units, the infinite RBM (IRBM) is obtained, which automatically chooses the hidden layer size for a specific task while offering generative capability competitive with the traditional RBM. In this work, a generative model called the Gaussian IRBM (GIRBM) is first proposed to handle practical scenarios involving real-valued data, avoiding the information loss incurred by data discretization. Subsequently, a discriminative IRBM (DIRBM) and a discriminative GIRBM (DGIRBM) are established to solve classification problems by attaching extra label units alongside the input layer. They are motivated by the fact that a discriminative variant of an RBM forms a self-contained classification framework that can outperform some standard classifiers. Remarkably, the proposed models retain generative and discriminative properties simultaneously: they can reconstruct data effectively and serve as self-contained classifiers. Experimental results on image classification (both large and small data sets), text identification, and facial recognition (both clean and noisy) show that the DIRBM and the DGIRBM are superior to several state-of-the-art RBM models in terms of reconstruction error and classification accuracy. Moreover, the proposed models avoid using more hidden units than needed when confronted with data of various sizes, favoring smaller networks. Finally, they behave more robustly than other classic classifiers when dealing with noisy facial recognition.
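The core idea behind a discriminative RBM, as the abstract describes, is to attach label units alongside the input layer so that a single model learns the joint distribution of data and labels, and classification scores each candidate label by its free energy. The following is a minimal NumPy sketch of that idea, not the authors' IRBM (no adaptive growth of the hidden layer, no Gaussian visible units): a one-hot label block is concatenated to the binary visible layer and the joint model is trained with one step of contrastive divergence (CD-1). All names and hyperparameters here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class DiscriminativeRBM:
    """Sketch of a discriminative RBM: label units are appended to the
    visible layer, so the model learns p(x, y) jointly via CD-1."""

    def __init__(self, n_visible, n_labels, n_hidden, lr=0.1):
        self.n_labels = n_labels
        self.lr = lr
        n_in = n_visible + n_labels
        self.W = rng.normal(0.0, 0.01, (n_in, n_hidden))  # joint weights
        self.b = np.zeros(n_in)      # visible + label biases
        self.c = np.zeros(n_hidden)  # hidden biases

    def _augment(self, x, y):
        # Concatenate the data with a one-hot encoding of the labels.
        return np.concatenate([x, np.eye(self.n_labels)[y]], axis=1)

    def cd1_step(self, x, y):
        # Positive phase: hidden probabilities given the clamped data+label.
        v0 = self._augment(x, y)
        h0 = sigmoid(v0 @ self.W + self.c)
        h0_sample = (rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one Gibbs step back to the visible layer and up.
        v1 = sigmoid(h0_sample @ self.W.T + self.b)
        h1 = sigmoid(v1 @ self.W + self.c)
        n = x.shape[0]
        self.W += self.lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b += self.lr * (v0 - v1).mean(axis=0)
        self.c += self.lr * (h0 - h1).mean(axis=0)

    def predict(self, x):
        # Classify by picking the label with the lowest free energy
        # F(x, y) = -b.v - sum_j log(1 + exp(W_j . v + c_j)).
        scores = []
        for y in range(self.n_labels):
            v = self._augment(x, np.full(x.shape[0], y))
            free_energy = -(v @ self.b) - np.log1p(
                np.exp(v @ self.W + self.c)).sum(axis=1)
            scores.append(-free_energy)
        return np.argmax(np.stack(scores, axis=1), axis=1)
```

Scoring each label by free energy is what makes the model a self-contained classifier: no separate classifier head is trained on top of the learned features.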

DATA AVAILABILITY STATEMENT

The data sets generated and/or analyzed during the current study are available at the following websites.

CalTech101 Silhouettes data set: http://people.cs.umass.edu/~marlin/data.shtml

MNIST: http://yann.lecun.com/exdb/mnist/

20 Newsgroups data set: https://cs.nyu.edu/~roweis/data.html

20 Newsgroups data set: http://www.qwone.com/~jason/20Newsgroups/

Olivetti data set: https://cs.nyu.edu/~roweis/data.html

Yale Face: http://cvc.cs.yale.edu/cvc/projects/yalefaces/
