Concurrency and Computation: Practice and Experience

Volume 33, Issue 15 e5281

SPECIAL ISSUE PAPER

Fast dynamic routing based on weighted kernel density estimation

Suofei Zhang,

Corresponding Author

Suofei Zhang

[email protected]

orcid.org/0000-0003-4116-7555

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Suofei Zhang, School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing 210003, China.

Email: [email protected]

Search for more papers by this author

Wei Zhao,

Wei Zhao

Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Xiaofu Wu,

Xiaofu Wu

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Search for more papers by this author

Quan Zhou,

Quan Zhou

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Search for more papers by this author

Suofei Zhang,

Corresponding Author

Suofei Zhang

[email protected]

orcid.org/0000-0003-4116-7555

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Suofei Zhang, School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing 210003, China.

Email: [email protected]

Search for more papers by this author

Wei Zhao,

Wei Zhao

Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Xiaofu Wu,

Xiaofu Wu

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Search for more papers by this author

Quan Zhou,

Quan Zhou

School of Internet of Things, Nanjing University of Post and Telecommunication, Nanjing, China

Search for more papers by this author

First published: 11 April 2019

https://doi.org/10.1002/cpe.5281

Citations: 5

Share a link

Email
Wechat
Bluesky

Summary

Capsules as well as dynamic routing between them are most recently proposed structures for deep neural networks. A capsule groups data into vectors or matrices as poses rather than conventional scalars to represent specific properties of target instance. Based on pose, a capsule should be attached to a probability (often denoted as activation) for its presence. The dynamic routing helps capsule network achieve more generalization capacity with fewer model parameters. However, the bottleneck, which prevents widespread applications of capsule, is the expense of computation during routing. To address this problem, we generalize existing routing methods within the framework of weighted kernel density estimation, proposing two fast routing methods with different optimization strategies. Our methods prompt the time efficiency of routing by nearly 40% with negligible performance degradation. By stacking a hybrid of convolutional layers and capsule layers, we construct a network architecture to handle inputs at a resolution of 64 × 64 pixels. The proposed models achieve a parallel performance with other leading methods in multiple benchmarks.

REFERENCES

1Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. 2012; 25(2): 1097-1105.
Google Scholar
2Lu H, Li Y, Chen M, Kim H, Serikawa S. Brain intelligence: go beyond artificial intelligence. Mob Netw Appl. 2018; 23(2): 368-375.
10.1007/s11036-017-0932-8
Web of Science® Google Scholar
3Li Y, Lu H, Li J, Li X, Li Y, Serikawa S. Underwater image de-scattering and classification by deep neural network. Comput Electr Eng. 2016; 54: 68-77.
10.1016/j.compeleceng.2016.08.008
Web of Science® Google Scholar
4Karpathy A, Toderici G, Shetty S, Leung T, Sukthankar R, Fei-Fei L. Large-scale video classification with convolutional neural networks. In: CVPR '14 Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition; 2014; Columbus, OH.
Google Scholar
5Lu H, Li B, Zhu J, et al. Wound intensity correction and segmentation with convolutional neural networks. Concurrency Computat Pract Exper. 2016; 29(6):e3927.
10.1002/cpe.3927
Web of Science® Google Scholar
6Lu H, Li Y, Uemura T, Kim H, Serikawa S. Low illumination underwater light field images reconstruction using deep convolutional neural networks. Future Gener Comput Syst. 2018; 82; 142-148.
10.1016/j.future.2018.01.001
Web of Science® Google Scholar
7Alain G, Bengio Y. Understanding intermediate layers using linear classifier probes. arXiv preprint arXiv:1610.01644. 2016.
Google Scholar
8Hinton GE, Sabour S, Frosst N. Matrix capsules with EM routing. Paper presented at: 6th International Conference on Learning Representations (ICLR); 2018; Vancouver, Canada.
Google Scholar
9Hinton GE, Krizhevsky A, Wang SD. Transforming auto-encoders. In: Artificial Neural Networks and Machine Learning – ICANN 2011. Berlin, Germany: Springer; 2011; 44-51.
10.1007/978-3-642-21735-7_6
Google Scholar
10Sabour S, Frosst N, Hinton GE. Dynamic routing between capsules. In: NIPS'17 Proceedings of the 31st International Conference on Neural Information Processing Systems; 2017; Long Beach, CA.
Google Scholar
11Russakovsky O, Deng J, Su H, et al. Imagenet large scale visual recognition challenge. Int J Comput Vis. 2015; 115(3): 211-252.
10.1007/s11263-015-0816-y
Web of Science® Google Scholar
12Wand MP, Jones MC. Kernel Smoothing. Boca Raton, FL: CRC Press; 1994.
10.1201/b14876
Google Scholar
13Schwander O, Nielsen F. Learning mixtures by simplifying kernel density estimators. In: Matrix Information Geometry. Berlin, Germany:Springer; 2013; 403-426.
10.1007/978-3-642-30232-9_16
Google Scholar
14Comaniciu D, Meer P. Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell. 2002; 24(5): 603-619.
10.1109/34.1000236
Web of Science® Google Scholar
15Comaniciu D, Ramesh V, Meer P. Real-time tracking of non-rigid objects using mean shift. In: Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. 2; 2000; Hilton Head Island, SC.
10.1109/CVPR.2000.854761
Google Scholar
16Wright SJ. Coordinate descent algorithms. Math Program. 2015; 151(1): 3-34.
10.1007/s10107-015-0892-3
Web of Science® Google Scholar
17Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J Royal Stat Soc: Ser B (Methodol). 1977; 39(1): 1-22.
10.1111/j.2517-6161.1977.tb01600.x
Google Scholar
18LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998; 86(11): 2278-2324.
10.1109/5.726791
Web of Science® Google Scholar
19LeCun Y, Huang FJ, Bottou L. Learning methods for generic object recognition with invariance to pose and lighting. In: CVPR'04 Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 2004; Washington, DC.
Google Scholar
20He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Paper presented at: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. 1; 2016; Las Vegas, NV.
Google Scholar
21Abadi M, Agarwal A, Barham P, et al. Tensorflow: large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467. 2016.
Google Scholar
22Xiao H, Rasul K, Vollgraf R. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747. 2017.
Google Scholar
23Krizhevsky A, Hinton G. Learning Multiple Layers of Features From Tiny Images. Technical Report. Toronto, Canada: University of Toronto; 2009.
Google Scholar
24Carreira-Perpinan MA. Gaussian mean-shift is an EM algorithm. IEEE Trans Pattern Anal Mach Intell. 2007; 29(5): 767-776. https://doi.org/10.1109/TPAMI.2007.1057
10.1109/TPAMI.2007.1057
PubMed Web of Science® Google Scholar
25Coates A, Ng A, Lee H. An analysis of single-layer networks in unsupervised feature learning. In: Proceedings of the 14th International Conference on Artificial Intelligence and Statistics; 2011; Lauderdale, FL.
Google Scholar

Citing Literature

Volume33, Issue15

Special Issue:13th IEEE International Conference on Networking, Architecture, and Storage (NAS2018). Cognitive Computing for Intelligence Web Systems (ISAIR2018)

10 August 2021

e5281

Fast dynamic routing based on weighted kernel density estimation

Summary

REFERENCES

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Fast dynamic routing based on weighted kernel density estimation

Summary

REFERENCES

Citing Literature

References

Related

Information