Volume 40, Issue 1 e13096
ORIGINAL ARTICLE

Effective deep learning based multimodal sentiment analysis from unstructured big data

Swasthika Jain Thandaga Jwalanaiah (Corresponding Author)
Department of Computer Science and Engineering, GITAM School of Technology, Bangalore, India
Email: [email protected]

Israel Jeena Jacob
CSE, GITAM School of Technology, Bangalore, India

Ajay Kumar Mandava
EECE, GITAM School of Technology, Bangalore, India
First published: 09 July 2022

Abstract

As images, memes and Graphics Interchange Format (GIF) animations have come to dominate social feeds, typographic and infographic visual content has emerged as an important component of social media. This multimodal text combines text and image into a visual language of its own, and it must be analysed because it can modify, confirm or grade the polarity of a post's sentiment. The challenge is to use the information in the visual and textual content of image-text posts effectively. This article presents a new deep learning-based multimodal sentiment analysis (MSA) model that handles images, text and multimodal text (images with embedded text). The system comprises a text analytics unit, a discretization control unit, an image analytics unit and a decision-making unit. The discretization unit separates the embedded text from the image using the variant and channel augmented maximally stable extremal regions (VCA-MSERs) technique; the resulting text and image elements are then analysed as discrete inputs by the corresponding text and image analytics units. The text analytics unit uses a stacked recurrent neural network with a multilevel attention and feedback module (SRNN-MAFM) to detect the sentiment of the text. A deep convolutional neural network (CNN) with parallel-dilated convolution and a self-attention module (PDC-SAM) is developed to predict the sentiment of the visual content. Finally, the decision-making unit applies a Boolean framework with an OR function to combine the two outputs and classify each post into three fine-grained sentiment classes: positive, neutral and negative. The proposed work is implemented in Python and evaluated on the STS-Gold, Flickr8k and B-T4SA datasets for text, image and multimodal-text sentiment analysis, respectively. Simulation results show that the proposed method achieves accuracies of 97.8%, 97.7% and 90% for text, image and multimodal sentiment analysis, respectively, outperforming comparable methods.
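To make the PDC-SAM idea concrete, the following is a minimal PyTorch sketch of a parallel-dilated convolution block followed by self-attention. It is an illustration only, not the paper's implementation: the channel sizes, dilation rates (1, 2, 4) and the use of four attention heads are assumptions, and the abstract does not specify how the branches are fused beyond concatenation.

import torch
import torch.nn as nn

class ParallelDilatedBlock(nn.Module):
    """Illustrative sketch of a PDC-SAM-style block: parallel conv branches
    with different dilation rates are concatenated, then multi-head
    self-attention re-weights the fused feature map over spatial positions.
    All hyperparameters here are assumptions, not the paper's values."""

    def __init__(self, in_ch=3, branch_ch=16, dilations=(1, 2, 4)):
        super().__init__()
        # One 3x3 branch per dilation rate; padding=d keeps spatial size fixed.
        self.branches = nn.ModuleList(
            nn.Conv2d(in_ch, branch_ch, kernel_size=3, padding=d, dilation=d)
            for d in dilations
        )
        fused = branch_ch * len(dilations)
        self.attn = nn.MultiheadAttention(embed_dim=fused, num_heads=4,
                                          batch_first=True)

    def forward(self, x):
        # Concatenate the parallel dilated branches along the channel axis.
        feats = torch.cat([torch.relu(b(x)) for b in self.branches], dim=1)
        b, c, h, w = feats.shape
        seq = feats.flatten(2).transpose(1, 2)   # (B, H*W, C) position tokens
        out, _ = self.attn(seq, seq, seq)        # self-attention over positions
        return out.transpose(1, 2).reshape(b, c, h, w)

# Smoke test on a dummy image batch
y = ParallelDilatedBlock()(torch.randn(2, 3, 32, 32))
print(y.shape)  # torch.Size([2, 48, 32, 32])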
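The decision-making unit is described only as a Boolean framework with an OR function over three classes. Below is a hedged Python sketch of one plausible reading of that rule: a non-neutral signal in either modality carries the final polarity, and conflicting non-neutral signals fall back to neutral. The function name and the tie-breaking behaviour are assumptions for illustration, not the paper's exact rule.

def fuse_sentiments(text_label: str, image_label: str) -> str:
    """Combine text and image polarity labels with an OR-style rule.

    Assumption: a non-neutral label in either modality dominates;
    a positive-vs-negative conflict falls back to neutral."""
    if text_label == image_label:
        return text_label
    non_neutral = {l for l in (text_label, image_label) if l != "neutral"}
    if len(non_neutral) == 1:   # OR: one modality supplies the polarity
        return non_neutral.pop()
    return "neutral"            # positive vs. negative conflict

print(fuse_sentiments("positive", "neutral"))   # -> positive
print(fuse_sentiments("negative", "positive"))  # -> neutral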

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article as no new data were created or analyzed in this study.
