ORIGINAL ARTICLE

PooRaa-Agri KG: An agricultural knowledge graph-based simplified multilingual query system

Corresponding Author

Nethraa Sivakumar

[email protected]

orcid.org/0009-0003-4551-7150

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Correspondence

Nethraa Sivakumar, Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India.

Email: [email protected]

Search for more papers by this author

Pooja Srinivasan,

Pooja Srinivasan

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Mrinalini Kannan,

Mrinalini Kannan

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Vijayalakshmi P,

Vijayalakshmi P

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Nagarajan T,

Nagarajan T

Department of Computer Science Engineering, Shiv Nadar University, Chennai, Tamil Nadu, India

Search for more papers by this author

Nethraa Sivakumar,

Corresponding Author

Nethraa Sivakumar

[email protected]

orcid.org/0009-0003-4551-7150

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Correspondence

Nethraa Sivakumar, Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India.

Email: [email protected]

Search for more papers by this author

Pooja Srinivasan,

Pooja Srinivasan

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Mrinalini Kannan,

Mrinalini Kannan

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Vijayalakshmi P,

Vijayalakshmi P

Department of Electronics and Communication Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India

Search for more papers by this author

Nagarajan T,

Nagarajan T

Department of Computer Science Engineering, Shiv Nadar University, Chennai, Tamil Nadu, India

Search for more papers by this author

First published: 24 August 2023

https://doi.org/10.1111/exsy.13434

Share a link

Email
Wechat
Bluesky

Abstract

The current work proposes PooRaa-Agri KG, an agricultural knowledge graph-based simplified multilingual query system that works in real time to provide concise answers for agriculture-based queries. The proposed approach accommodates real-time and low-resource queries in English and Hindi with a novel multi-stage solution consisting of data pre-processing, sentence simplification, triplet extraction, knowledge graph generation, sentence reconstruction, query-to-reconstructed sentence matching, and machine translation as its sub-modules. In this work, a novel combination of rule-based sentence simplification and triplet extraction is carried out resulting in a triplet similarity score of 86.56% for the extracted triplets. This method is superior to the existing triplet extraction method whose triplet similarity score was found to be 60.65%. Further, the proposed work makes use of heuristic rules to reconstruct sentences which when evaluated by human evaluators for meaningfulness and grammar resulted in a score of 3.09/4 and 2.95/4 respectively. To complete end-to-end communication in the proposed system, a similarity-based query answer system is proposed in this work.

Open Research

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

REFERENCES

Affolter, K., Stockinger, K., & Bernstein, A. (2019). A comparative survey of recent natural language interfaces for databases. CoRR, Abs/1906.08990 http://arxiv.org/abs/1906.08990
Google Scholar
Bhatia, G. (2020). keytotext. https://github.com/gagan3012/keytotext
Google Scholar
Chen, Q. (2020). T5: A detailed explanation, medium. https://medium.com/analytics-vidhya/t5-a-detailed-explanation-a0ac9bc53e51
Google Scholar
Diefenbach, D., Lopez, V., Singh, K., & Maret, P. (2018, 06). Core techniques of question answering systems over knowledge bases: A survey. Core Techniques of Question Answering Systems Over Knowledge Bases: A Survey, 55, 529–569. https://doi.org/10.1007/s10115-017-1100-y
10.1007/s10115-017-1100-y
Google Scholar
Dwivedi, P. (2020). Fine tuning a t5 transformer for any summarization task. https://towardsdatascience.com/fine-tuning-a-t5-transformer-for-any-summarization-task-82334c64c81
Google Scholar
Garain, A., Basu, A., Dawn, R., & Naskar, S. (2019, 07). Sentence simplification using syntactic parse trees. https://doi.org/10.1109/ISCON47742.2019.9036207
10.1109/ISCON47742.2019.9036207
Google Scholar
Hermjakob, U. (2002, 01). Parsing and question classification for question answering. https://doi.org/10.3115/1117856.1117859
10.3115/1117856.1117859
Google Scholar
Hogan, A., Blomqvist, E., Cochez, M., d'Amato, C., de Melo, G., Gutiérrez, C., Emilio Labra Gayo, J., Kirrane, S., Neumaier, S., Polleres, A., Navigli, R., Ngonga Ngomo, A., Rashid, S.M., Rula, A., Schmelzeisen, L., Sequeda., J., Staab, S., & Zimmermann, A. (2020). Knowledge graphs. CoRR, abs/2003.02320 https://arxiv.org/abs/2003.02320
Google Scholar
Ji, S., Pan, S., Cambria, E., Marttinen, P., & Yu, P. S. (2020). A survey on knowledge graphs: Representation, acquisition and applications. CoRR, abs/2002.00388. https://arxiv.org/abs/2002.00388
Google Scholar
Johnny, S., & Nirmala, S. J. (2021, Nov 03). Farmer query answering system. SN Computer Science, 3(1), 45. https://doi.org/10.1007/s42979-021-00908-x
10.1007/s42979-021-00908-x
Google Scholar
Junczys-Dowmunt, M., Grundkiewicz, R., Dwojak, T., Hoang, H., Heafield, K., Neckermann, T., Seide, F., Germann, U., Fikri Aji, A., Bogoychev, N., F. T. Martins, A., & Birch, A. (2018). Marian: Fast neural machine translation in C++. CoRR, abs/1804.00344. http://arxiv.org/abs/1804.00344
Google Scholar
Klein, G., Kim, Y., Deng, Y., Senellart, J., & Rush, A. M. (2017). Opennmt: Open-source toolkit for neural machine translation. CoRR, abs/1701.02810. http://arxiv.org/abs/1701.02810
Google Scholar
Kumar, A., & Dinakaran, S. (2021). Textbook to triples: Creating knowledge graph in the form of triples from AI textbook. CoRR, abs/2111.10692. https://arxiv.org/abs/2111.10692
Google Scholar
Lan, W., & Xu, W. (2018). Neural network models for paraphrase identification, semantic textual similarity, natural language inference, and question answering. CoRR, abs/1806.04330. http://arxiv.org/abs/1806.04330
Google Scholar
Liang, S.-Y., Stockinger, K., de Farias, T. M., Anisimova, M. O., & Gil, M. (2021). Querying knowledge graphs in natural language. Journal of Big Data, 8, 3.
10.1186/s40537-020-00383-w
PubMed Google Scholar
Malik, P., & Baghel, A. S. (2016). An improvement in bleu metric for english-hindi machine translation evaluation. In 2016 international conference on computing, communication and automation (iccca) (p. 331–336). https://doi.org/10.1109/CCAA.2016.7813740
10.1109/CCAA.2016.7813740
Google Scholar
Mrinalini, K., Nagarajan, T., & Vijayalakshmi, P. (2018, 12). Pause-based phrase extraction and effective oov handling for low-resource machine translation systems. ACM Transactions on Asian Language Information Processing, 18, 1–22. https://doi.org/10.1145/3265751
10.1145/3265751
Google Scholar
Mrinalini, K., Vijayalakshmi, P., & Nagarajan, T. (2022). Sbsim: A sentence-bert similarity-based evaluation metric for indian language neural machine translation systems. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30, 1396–1406. https://doi.org/10.1109/TASLP.2022.3161160
10.1109/TASLP.2022.3161160
Web of Science® Google Scholar
Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., Zhou, Y., Li, W., & Liu, P. J. (2019). Exploring the limits of transfer learning with a unifie text-to-text transformer. CoRR, abs/1910.10683. http://arxiv.org/abs/1910.10683
Google Scholar
Salunkhe, P., Kadam, A. D., Joshi, S., Patil, S., Thakore, D., & Jadhav, S. (2016). Hybrid machine translation for english to marathi: A research evaluation in machine translation: (hybrid translator). In 2016 international conference on electrical, electronics, and optimization techniques (iceeot) (p. 924–931). https://doi.org/10.1109/ICEEOT.2016.7754822
10.1109/ICEEOT.2016.7754822
Google Scholar
Sangavi, G., Mrinalini, K., & Vijayalakshmi, P. (2016). Analysis on bilingual machine translation systems for english and tamil. In 2016 international conference on computation of power, energy information and commuincation (iccpeic) (p. 245–250). https://doi.org/10.1109/ICCPEIC.2016.7557203
10.1109/ICCPEIC.2016.7557203
Google Scholar
Saravanan, M., Sathish, K., & Prabu, M. (2016). Tamil to english cross lingual information retrieval system for agricultural domain using vsm. International Journal of Innovations in Engineering and Technology (IJIET), 7(4), 281–287.
Google Scholar
Shaalan, K., Hendam, A., & Rafea, A. (2010, 10). An english-arabic bi-directional machine translation tool in the agriculture domain - a rule-based transfer approach for translating expert systems. In (Vol. 340, p. 281–290). https://doi.org/10.1007/978-3-642-16327-234
10.1007/978-3-642-16327-234
Google Scholar
Sitikhu, P., Pahi, K., Thapa, P., & Shakya, S. (2019). A comparison of semantic similarity methods for maximum human interpretability. CoRR, abs/1910.09129. http://arxiv.org/abs/1910.09129
Google Scholar
Sulaiman, N. H., & Mohamad, D. (2012). A jaccard-based similarity measure for soft sets. In 2012 ieee symposium on humanities, science and engineering research. (p. 659-663). https://doi.org/10.1109/SHUSER.2012.6268901
10.1109/SHUSER.2012.6268901
Google Scholar
Tayal, M., Raghuwanshi, M. M., & Malik, L. (2014, 01). Syntax parsing: Implementation using grammar-rules for english language. In (p. 376-381). https://doi.org/10.1109/ICESC.2014.71
10.1109/ICESC.2014.71
Google Scholar
Translator, M. (2019). Neural machine translation enabling human parity innovations in the cloud, microsoft translator blog. https://www.microsoft.com/en-us/translator/blog/2019/06/17/neural-machine-translation-enabling-human-parity-innovations-in-the-cloud/
Google Scholar
Trivedi, P., Maheshwari, G., Dubey, M., & Lehmann, J. (2017, 10). Lc-quad: A corpus for complex question answering over knowledge graphs. https://doi.org/10.1007/978-3-319-68204-422
10.1007/978-3-319-68204-422
Google Scholar
Wang, Q., Mao, Z., Wang, B., & Guo, L. (2017). Knowledge graph embedding: A survey of approaches and applications. IEEE Transactions on Knowledge and Data Engineering, 29(12), 2724–2743. https://doi.org/10.1109/TKDE.2017.2754499
10.1109/TKDE.2017.2754499
Web of Science® Google Scholar
Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M., Macherey, W., Kirkun, M., Cao, Y., Gao, Q., Macherey, K., Klingner, J., Shah, A., Johnson, M., Liu, X., Kaiser, L., Gouws, S., Kato, Y., Kudo, T., Kazawa, H., … & Dean, J. (2016). Google's neural machine translation system: Bridging the gap between human and machine translation. CoRR, Abs/1609.08144 http://arxiv.org/abs/1609.08144
Google Scholar
Yuanzhe, C., Kuang, J., Cheng, D., Zheng, J., Gao, M., & Zhou, A. (2019, 04). Agrikg: An agricultural knowledge graph and its applications. In (p. 533-537). https://doi.org/10.1007/978-3-030-18590-981
10.1007/978-3-030-18590-981
Google Scholar

Volume40, Issue10

December 2023

e13434

PooRaa-Agri KG: An agricultural knowledge graph-based simplified multilingual query system

Abstract

Open Research

DATA AVAILABILITY STATEMENT

REFERENCES

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

PooRaa-Agri KG: An agricultural knowledge graph-based simplified multilingual query system

Abstract

Open Research

DATA AVAILABILITY STATEMENT

REFERENCES

References

Related

Information