Some Insects Are More Equal Than Others: A Comparison of Popular Large Language Model Chatbots' Treatment of Different Insect Groups
昆虫间的不平等——主流大语言模型 (LLM) 对不同昆虫评价的比较研究
Editor-in-Chief & Handling Editor: Ahimsa Campos-Arceiz
ABSTRACT
enBees and butterflies are generally viewed positively in society, whereas other insects, such as wasps or flies, are often underappreciated despite their essential ecological roles. As biodiversity loss continues to pose a major challenge for humankind, it necessitates a re-evaluation of these biases to support the protection of biodiversity as a whole. With large language model (LLM) chatbots becoming increasingly integrated into daily life for information dissemination and education purposes, understanding their inherent biases is vital. In this study, we tested 10 popular Western LLM chatbots using simple prompts to assess how they portray bees and wasps. Butterflies, moths, flies, and mosquitoes were also included for comparison to evaluate broader societal perceptions. Our results show that bees and butterflies are indeed depicted positively by the LLM chatbots and moths somewhat positively, while wasps, flies, and mosquitoes are associated with more negative portrayals. We found that LLMs mirror prevailing human biases toward different insect groups and their perceived importance in nature conservation. Moreover, we demonstrate that LLM chatbots tend to oversimplify insect diversity by predominantly restricting “bees” to honeybees and “wasps” to yellowjackets, thereby neglecting the broader biodiversity that includes wild bees and parasitoid wasps. The chatbots also appeared to favor Nearctic species when recommending conservation priorities. By highlighting these biases and discussing their implications, our research underscores the importance of nuanced science communication and expert involvement in decision-making for nature conservation. Addressing such biases is essential to prevent the reinforcement of public misconceptions and to promote the protection of ecologically indispensable yet less popular insect groups.
摘要
zh蜜蜂与蝴蝶在社会认知中通常获得正面评价。而黄蜂与苍蝇等具有同等生态价值的昆虫群体却长期面临认知偏见。与此同时, 生物多样性持续丧失是人类面临的重大挑战之一, 再评估这些认知偏见对开展全面多样性保护工作具有紧迫性。出于信息传播和科学教育目的, 人们在日常生活中越来越多地使用大语言模型, 因此极有必要了解后者内在的偏见。本研究使用简单提示词测试了西方十个主流大语言模型, 以考察它们如何描绘蜜蜂和黄蜂。我们对蝴蝶、飞蛾、苍蝇和蚊子等昆虫群体进行比较, 以进一步调查社会对其正面或负面的看法。研究结果表明, 大语言模型对蜜蜂和蝴蝶做出了正面描述, 而飞蛾部分正面, 黄蜂、苍蝇和蚊子等则为负面。我们发现大语言模型反映了人类对不同昆虫的普遍偏见及这种理解对自然保护的重要性。此外, 研究结果证实大语言模型因语义窄化而加剧对昆虫多样性的认知缺失, 例如“蜜蜂科” (bees) 单指蜜蜂, “胡蜂科” (wasps) 只指黄蜂, 忽略了生物多样性, 未考虑野生蜂和寄生蜂等科下的大多数物种, 同时在建议优先保护物种时也偏向新北区物种。通过纠正这些偏见并讨论它们的影响, 此次研究强调了缜密的科学知识传播以及专家参与自然保护决策的关键作用。这也有助于未来矫正大语言模型的偏见——这种偏见目前反映并可能加剧社会对不太受欢迎但生态上不可或缺的昆虫群体的认知偏差。
简明语言 摘要
zh蜜蜂与蝴蝶在社会认知中普遍获得正面评价, 而黄蜂和苍蝇等昆虫常受歧视, 尽管后者也在自然界中扮演着重要的生态角色。解决生物多样性持续恶化的问题, 需要我们重新审视此类偏见, 以支持更广泛的自然保护工作。随着大语言模型在信息传播和科学教育的应用日益广泛, 本研究旨在考察它们是否也表现出类似偏见。我们分析了大语言模型关于各种昆虫问题的回答。 研究发现表明, 大语言模型对蜜蜂和蝴蝶的描述是正面的, 对飞蛾的描述偏向正面, 而对黄蜂、苍蝇和蚊子的描述则极为负面。研究结果表明, 大语言模型反映了普遍存在的人类认知偏差。此外, 它们倾向于因语义窄化而加剧对昆虫多样性的认知缺失, 特别是将“蜜蜂科” (bees) 单指蜜蜂, “胡蜂科” (wasps) 只指黄蜂, 同时忽略该科下的其他重要物种。此外, 在建议保护优先物种时也偏向北美本土物种。研究强调了将专家意见纳入自然保护决策的必要性, 以及有效传播所有昆虫的价值以有效保护生物多样性的重要性。
Summary
enBees and butterflies are often seen in a positive light, while insects like wasps and flies are generally less appreciated—despite their critical ecological roles. To address the ongoing decline in biodiversity, it is important to reconsider these biases as a way to support broader conservation efforts. Given the increasing use of AI-driven chatbots for information dissemination and education, our study aimed to examine whether these technologies exhibit similar biases. We analyzed how chatbots respond to queries about various insects. Our findings show that bees and butterflies are described in positive terms, moths somewhat positively, while wasps, flies, and mosquitoes are portrayed extremely negatively. These findings indicate that AI chatbots reflect prevailing human prejudices. In addition, chatbots often oversimplify insect diversity—frequently reducing “bees” to honeybees and “wasps” to yellowjackets while overlooking other significant groups of bees and wasps. The chatbots also showed a preference for species native to North America when suggesting conservation priorities. Our research underscores the need to integrate expert guidance in decision-making for nature conservation and emphasizes the importance of clear, inclusive science communication to support the protection of all insects—not just the popular ones.
-
Practitioner Points
- ∘
LLM chatbots require further evaluation, as they often reflect societal biases—positively portraying bees and butterflies while negatively depicting wasps and flies. They also have a tendency to oversimplify insect diversity.
- ∘
Enhancing tools like Large Language Models (LLMs) by broadening training data is essential to ensure accurate and inclusive representations of biodiversity, thereby improving accessibility and inclusivity in AI-based science education.
- ∘
Science communication and scientific expertise are fundamental in correcting biases in LLM chatbots, helping to reduce the spread of societal prejudices against less popular but ecologically indispensable insect groups, thus supporting more comprehensive biodiversity conservation.
- ∘
实践者要点
zh
-
大语言模型需要得到进一步评估, 因为它们通常反映社会的偏见, 例如对蜜蜂和蝴蝶持正面看法, 而对黄蜂和苍蝇持负面看法, 并且往往过于简化昆虫的多样性。
-
拓宽训练数据而增强大语言模型等工具, 能够极大地确保其准确地、包容性地表征生物多样性, 提高基于人工智能的科学教育的可及性和包容性。
-
科学传播和专业知识对于纠正大语言模型的偏见至关重要, 可减少对不太受欢迎但生态上不可或缺的昆虫群体的社会偏见的传播, 从而更全面地支持生物多样性保护。
1 Introduction
Insects are the most species-rich taxon on Earth, with just over one million described species (Stork 2018). They play essential roles in ecological processes, acting as pollinators, seed dispersers, predators, and recyclers of organic matter (Dangles and Casas 2019). The most recognized ecosystem service provided by insects is pollination, which is crucial not only for food production in agricultural settings (Klein et al. 2007) but also for wild plants as it enables the sexual reproduction, dispersal, and production of seeds in 87.5% of flowering plants globally (Ollerton et al. 2011).
In recent decades, the ongoing loss of biodiversity has emerged as a global challenge with far-reaching consequences for ecosystems and human well-being worldwide (Thomas et al. 2004; Dirzo et al. 2014; Johnson et al. 2017). A landmark study documenting a 76% decline in insect biomass over 27 years in Germany (Hallmann et al. 2017) has brought attention to the decline of insect populations, sparking a wealth of research into trends in insect abundance and diversity.
Recent changes in insect populations vary across taxonomic groups. However, global patterns point to an overall loss of insect diversity driven by a combination of the following factors: (1) habitat loss and degradation through land use changes, such as the conversion to intensive agriculture or urbanization; (2) pollution, particularly from synthetic agrochemicals such as pesticides and fertilizers, as well as light pollution; (3) biological factors, such as invasive species and pathogens; and (4) climate change (Sánchez-Bayo and Wyckhuys 2019; Wagner et al. 2021; Rumohr et al. 2023). Additionally, recent research points to cascading secondary extinctions caused by the loss of different species within ecological communities—an important factor for community structure and stability that is not fully understood but holds far-reaching consequences for future conservation strategies (Kehoe et al. 2021).
Not all biotic and abiotic factors that drive changes in abundance and diversity are fully comprehended, nor are the complex interactions between them. Recently, however, researchers have begun shifting from merely reporting declines in insect populations to advocating for insect conservation, providing actionable knowledge and evidence-based roadmaps for decision-makers (e.g., Forister et al. 2019; Habel et al. 2019; Kawahara et al. 2021). In addition to tangible measures to counteract the prevalent drivers of insect decline, promoting “entomo-literacy”—society's knowledge of insects—has been identified as the foundation for fostering empathy toward insects and an appreciation for their role in ecosystems (Donkersley et al. 2022). Public engagement is key in generating essential endorsement and support for effective conservation measures (Basset and Lamarre 2019; Donkersley et al. 2022).
However, public perception and the consequent endorsement of conservation efforts vary significantly between taxa. Additionally, mainstream media, which are instrumental in educating and shaping public awareness of environmental matters, have been found to focus disproportionally on honey bees as pollinators, regularly framing them as the most important or only relevant pollinators (Smith and Saunders 2016). Wasps, on the other hand, are universally represented negatively in the media, with a focus primarily on their nuisance value and the risks posed to humans through defensive stings rather than highlighting their ecosystem services (Oi et al. 2024). Given the influence of the media on public support for conservation, this bias is detrimental to much-needed biodiversity conservation efforts, which must extend beyond large, familiar, and charismatic species (Novacek 2008) or reactions to only the most tragic news in conservation, such as the extinction of the last individual of a prominent vertebrate species (Fink et al. 2020).
Large language models (LLMs) and their implementation in easy-to-use interfaces have entered our lives (Schramowski et al. 2022; Teubner et al. 2023). These advanced artificial intelligence systems, trained on large datasets, provide an excellent opportunity for learning and teaching purposes across various levels (Kasneci et al. 2023; Yang et al. 2024; Santangeli et al. 2024). Imran and Almusharraf (2024) emphasize that LLMs, in their case Google Gemini, offer multimodal capabilities and personalized learning by processing content from diverse data types and adapting content creation and explanations to individual learning styles and levels. These models can support educators in creating teaching materials and provide real-time feedback to learners, thereby facilitating and enhancing learning (Imran and Almusharraf 2024). This shows that LLMs are powerful tools, proficient in multiple languages, and effective in performing downstream tasks.
However, they are not without limitations. LLMs can make errors in seemingly simple tasks, such as counting individual words in a sentence, and they struggle with effectively understanding and applying knowledge (Gomez et al. 2024; Guo et al. 2024; Gurgurov et al. 2024; Leiter et al. 2024). Moreover, a major problem with LLMs is the inherent biases they adopt, reflecting societal prejudices present in their training data. Efforts to remove these biases have not been entirely successful and may even introduce new biases (Chen et al. 2024; Gallegos et al. 2024; Hofmann et al. 2024; Resnik 2024).
Some researchers have pointed out that biases are introduced into LLM training data, leading to biased outputs, which can perpetuate misinformation and other unethical content, including stereotypes and prejudices that are present in society (Babonnaud et al. 2024; Prakash and Roy 2024). Santangeli et al. (2024) highlight that the quality of training data is essential for generating reliable outputs and note that LLMs can also be used to debunk fake news, helping to mitigate fear or misinformation about animals.
The aim of this study is to compare the responses of major LLM chatbots regarding bees and wasps, with the assumption that bees are generally viewed positively while wasps are perceived negatively, as found by Sumner et al. (2018). Additionally, we test whether similar biases exist in the perception of other hyperdiverse insect groups, in particular when comparing butterflies to moths and flies to mosquitoes. Our aim is to investigate whether LLMs have adopted the human-induced bias toward subjectively charismatic insect groups and to use simple prompts to test whether learners and teachers might be misled into false assumptions when researching bees, wasps, butterflies, moths, flies, and mosquitoes. Specifically, we explore whether LLMS overrepresent charismatic insect groups as more deserving of conservation, potentially sidelining less prominent or less charismatic species.
2 Materials and Methods
2.1 Selection of Target Large Language Model (LLM) Chatbots
The selection of the LLM chatbots used in this study was based on the target groups that we defined as learners and educators with an interest in insects and their conservation but limited prior knowledge. To address this group, we chose the most popular LLM chatbots, which are easily accessible to the public and have models with tens of billions of parameters (Kasneci et al. 2023; Zhao et al. 2023; Helbling et al. 2024). We tested the following 10 large language model chatbots between July 15, 2024, and August 8, 2024, which at the time of the study were exclusively US-based: GPT-3.5 Turbo, GPT-4 Turbo, GPT-4o (all three from OpenAI, San Francisco, USA), Llama 3.1. (Meta, Cambridge, USA), Gemini (Google, Mountain View, USA), Copilot (conversation style set to precise) (Microsoft, Redmond, USA), wikichat: https://wikichat.genie.stanford.edu/ (style set to most factual and not balanced; therefore operating on ChatGPT-4) (Stanford University, Stanford, USA; Semnani et al. 2023), Claude 3.5 Sonnet (Anthropic, San Francisco, USA): https://claude.ai/, and Perplexity Standard and Perplexity Pro (both from Perplexity.ai, San Francisco, USA).
The hypotheses and corresponding prompts used are shown in Table 1. The responses provided by the 10 LLM chatbots to these hypotheses are given in the Supporting Information File S1.
Hypothesis | Prompt |
---|---|
|
List 10 words associated with bees; list 10 words associated with wasps; list 10 words associated with butterflies; list 10 words associated with moths; list 10 words associated with flies; list 10 words associated with mosquitoes. |
|
(Same as for Hypothesis 3) |
|
Based on your training data, please assess how you perceive the following insect taxa on a scale from −5 for strongly negative to +5 for strongly positive: bees; wasps; butterflies; moths; flies; mosquitoes. |
|
List the 10 insect groups, not functional groups, that you consider the highest priorities for conservation efforts. |
|
List the 10 insect species that you consider the highest priorities for conservation efforts. |
2.2 Selection of Insect Groups for the Investigation
The main aim of this study was to compare the responses of major LLM chatbots in relation to bees and wasps, with the hypothesis that bees are generally liked while wasps are often disliked, similar to the way society's responses to these two groups were studied by Sumner et al. (2018). In addition, we also included other insect groups for comparison, some of which are generally liked and others which are generally disliked. To this end, we categorized lepidopterans (which are generally liked) into two groups: butterflies (liked) and moths (disliked). For dipterans (generally disliked), we divided them into two subgroups: Brachycera (disliked) and Nematocera (strongly disliked). Since our study targets learners and teachers with limited prior knowledge in a context where the scientific names of these groups are less familiar, we address Brachycera in the prompts as “flies” and Nematocera as “mosquitoes”.
To obtain a qualitative assessment of each insect group to test Hypothesis 1, we prompted each LLM chatbot to provide 10 words associated with bees, wasps, butterflies, moths, flies, and mosquitoes, respectively.
The output words were corrected into the singular form (e.g., “pollinators” to “pollinator”) and standardized for similar words with the same meaning (e.g., “beauty” to “beautiful”). We then compiled word clouds, in which the size of a word is proportional to its frequency of occurrence for each insect group. Word clouds were generated using the layout algorithm implemented on https://www.jasondavies.com/wordcloud/ to visualize the frequency of occurrence of each word. We selected the following parameters: spiral = rectangular; 4 orientations from −40° to +40°; scale = n.
To evaluate the emotional tone and sentiment toward the predefined list of words related to each insect group, we performed sentiment analyses using R Software. First, words from each list were classified as either positive or negative through an inner join operation using the established “Bing sentiment lexicon” (Hu and Liu 2004). This analysis was then cross-verified using the “NRC Emotion Lexicon” (Mohammad and Turney 2013), which categorizes each word into two sentiment categories (positive and negative) as well as eight basic and prototypical emotions defined by Plutchik (1991). The results of the NRC analysis, which include only those words from the predefined lists that were present in the lexicon and could thus be assigned one or more sentiment categories, are visualized in a bar plot created using “ggplot2” to display the distribution of sentiment types across the predefined words.
To test whether the LLMs have adopted the human bias that favors bees but disfavors wasps and whether there is a similar preference for butterflies over moths and flies over mosquitoes (Hypotheses 2 and 3), the selected LLM chatbots were prompted to rate these insect taxa on a scale of −5 (strongly negative) to +5 (strongly positive) based on their training data. The responses of the 10 LLM chatbots are visualized in boxplots. Due to the small sample size (n = 10) for each taxon, we refrained from further statistical analyses.
To test which insect groups LLMs suggest as vital for conservation (Hypothesis 4), we prompted the LLM chatbots to provide a list of the 10 insects they consider the highest priority for conservation. The LLMs listed these in order, and points were assigned based on their position. The first group received 10 points, the second group one point less, and so on, with the last group receiving one point.
If multiple insect groups were named in the same position (e.g., Hymenoptera: bees, wasps, ants), each group received the same score (e.g., if all were listed in position one, they each received 10 points). If a taxon was mentioned multiple times within the list by a single LLM chatbot, points were awarded for each position (i.e., positions seven and eight would yield a total of seven points, with four points for the 7th position and three points for the 8th (4 + 3)). The points were then summarized for each insect group, and a bar chart was created in Microsoft Excel for Mac (Version 16.78.3).
To test which insect species LLMs suggest as important for nature conservation (Hypothesis 5), we asked each LLM chatbot for the ten insect species they considered most important for conservation efforts. We obtained a total of 100 species, which were reduced to 88 valid species after removing 12 irrelevant species (including non-insect species such as spiders and birds, extinct species, and LLM-generated hallucinations of species that do not exist). Next, we assigned the zoogeographical realms of these 88 species based on the classification from Holt et al. (2013). We conducted a chi-squared goodness-of-fit test to assess whether there is a geographical bias in the occurrence of the insect species provided by the LLM chatbots.
2.3 R Software
Analyses were conducted in R, version 4.3.2 (R Core Team 2023), within the RStudio IDE, version 2023.06.1 + 524 (Posit team 2024). Sentiment analyses were performed using the following packages: “dplyr” (DOI: 10.32614/CRAN.package.dplyr); “ggplot2” (DOI: 10.32614/CRAN.package.ggplot2); “textdata” (DOI: 10.32614/CRAN.package.textdata); “tibble” (DOI: 10.32614/CRAN.package.tibble); “tidytext” (DOI: 10.32614/CRAN.package.tidytext). Boxplots were generated using the base R function “boxplot” The χ2 test was performed using the base R functionality in the “stats” package, with visualizations created using the “graphics” package.
All figures were post-processed using Affinity Publisher 2 (Version 2.5.3).
3 Results
Hypothesis 1.Frequently chosen words for bees relate more to their (ecological) function and are positive, while other insect groups are associated with both positive and negative words.
For each insect group, we received 10 words from each LLM chatbot, resulting in a total of 100 words per taxon (Figure 1). For bees, we obtained a total of 29 different words and 71 duplicates. Seven words were returned in a high frequency (seven or more chatbots): hive (n = 10), honey (n = 10), worker (n = 10), pollination (n = 9), queen (n = 9), colony (n = 7), and nectar (n = 7). Most of these frequently chosen words relate to honeybees or other eusocial species.

For wasps, we received a total of 32 different words and 68 duplicates. Among those, four words occurred in a high frequency: aggressive (n = 10), nest (n = 10), sting (n = 9), and predator (n = 7), with half of these having negative connotations. Other words occurred with moderate frequency (five or more chatbots), including yellow (n = 6), social (n = 5), and paper (n = 5). These most common words associated with wasps primarily refer to aculeate wasps.
For butterflies, we received the highest number of duplicate words, with 25 unique words and 75 duplicates. Six words were returned with a high frequency: metamorphosis (n = 10), wings (n = 10), colorful (n = 9), pollinator (n = 8), caterpillar (n = 7), and nectar (n = 7). Four words appeared with moderate frequency: migration (n = 6), monarch (n = 6), beautiful (n = 5), and delicate (n = 5). All of these words have positive connotations, often referencing the visual attractiveness of butterflies.
For moths, we received 36 different words and 64 duplicates. Four words occurred with high frequency: silk (n = 9), cocoon (n = 8), nocturnal (n = 8), and camouflage (n = 7). Five words appeared with moderate frequency: antennae (n = 6), larvae (n = 6), wings (n = 6), light (n = 5), and night (n = 5). These words are generally associated with the nocturnal lifestyle of moths, though none are particularly negative.
For flies, we obtained 34 different words and 66 duplicates. Four words occurred with a high frequency: buzz (n = 10), maggot (n = 8), pest (n = 8), and wings (n = 7). Other words did not appear in moderate frequency, but terms like annoying (n = 5), garbage (n = 5), housefly (n = 5), and larvae (n = 5) suggest a more negative view of flies, with many words related to their life history or representing negative descriptors.
For mosquitoes, the LLM chatbots provided 32 different words and 68 duplicates. Four words returned with high frequency: bite (n = 10), Malaria (n = 9), blood (n = 8), and larvae (n = 7), showing a strong association with mosquitoes' role in transmitting diseases.
The sentiment analysis (Figure 2) revealed that bees are associated with positive emotions, including anticipation (n = 3). Wasps, on the other hand, are linked to negative emotions such as fear (n = 8) and anger (n = 6). Words associated with butterflies mainly evoke positive emotions, particularly joy (n = 3), while those linked to moths are perceived as slightly negative but elicit both positive and negative emotions in equal measure (n = 1 each). Both flies and mosquitoes are predominantly associated with negative emotions, with flies evoking disgust (n = 10) more than fear or anger (both n = 5). Mosquitoes also trigger disgust and fear in equal proportions (n = 6).

Hypothesis 2.LLMs have adopted the bias inherent to humans that bees are more positive than wasps.
Hypothesis 3.Next to a bias in perceived positivity in bees (liked) and wasps (disliked), there is also a bias in butterflies (liked) and moths (disliked) and in flies (disliked) and mosquitoes (strongly disliked).
Each of the 10 LLM chatbots provided a rating on a scale from −5 (strongly negative) to +5 (strongly positive) for each of the six insect groups (Figure 3). Bees and butterflies were consistently rated very positively, with means of 4.0 and 4.9, respectively (n = 10). In contrast, mosquitoes were rated most negatively, with a mean of −4.5 (n = 10). Wasps and flies were rated as slightly negative, with means of −2.3 and −2.5, respectively (n = 10). Moths were rated slightly positive, with a mean of 1.4 (n = 10).

Hypothesis 4.LLMs are biased toward bees when listing insect groups according to their importance for nature conservation.
We received lists of 10 insect groups from each LLM chatbot, which were classified using the scoring system described above. The points for each insect group were then totaled (Figure 4). Butterflies scored the highest with 96 points, followed closely by bees with 90 points. Dragonflies ranked third with 74 points. Other groups included ants (64 points), moths (62 points), beetles (60 points), damselflies (54 points), grasshoppers (41 points), and crickets (30 points). Several other insect groups were mentioned only sporadically. When comparing the total points of the respective insect orders, Hymenoptera was the most frequently represented, scoring 172 points, closely followed by Lepidoptera with 158 points and Odonata with 128 points. Coleoptera (84 points) and Orthoptera (81 points) were also well represented, while the insect-rich order Diptera received little attention from the LLM chatbots, scoring only 16 points.

Hypothesis 5.LLMs are geographically biased toward Nearctic species, according to their importance for nature conservation.
Our data set contained 43 individual species (subspecies were treated as the nominotypical taxon), and we analyzed their distribution across various zoogeographical realms. Some species occur in more than one realm, resulting in a final data set of 67 taxa-occurrence combinations. Notably, the Nearctic region is heavily overrepresented, with 23 occurrences compared to the expected 6.7 if all realms were represented equally (Figure 5). To a lesser extent, the Palearctic region (9 occurrences) also exceeds the expected value. In contrast, regions such as the Sino-Japanese and Indomalayan realms are underrepresented, with only two occurrences each.

We compared the observed frequencies for each realm against the expected uniform distribution. The observed frequencies differ significantly from the expected values (χ2 = 49.866, p < 0.001), indicating that the distribution of species is not uniform across the zoogeographic realms, with certain regions being overrepresented or underrepresented.
4 Discussion
Our study reveals several biases in the perception and categorization of insect groups by LLMs. We found a strong trend toward taxonomic simplification, which leads to an unbalanced representation of insect diversity, a geographic bias favoring Nearctic species for suggested conservation priorities, and, most notably, a difference in sentiment toward specific insect groups. In this section, we discuss the origins of these biases, their implications, and the necessary steps to overcome them.
4.1 Origins of Insect-Related Biases in LLM Chatbots
The word associations provided by the LLM chatbots for different insect groups highlight a clear bias that reflects common human perceptions, similar to those found by Sumner et al. (2018) in their questionnaire study. Our research confirms that bees are strongly associated with positive and ecologically significant terms, especially those emphasizing their role in ecosystems, particularly pollination services, which are greatly appreciated in society. Similarly, butterflies are predominantly associated with positive terms that reflect their cultural and aesthetic appeal, symbolic power for transformation and beauty, and ecological importance (Sumner et al. 2018; Iriani et al. 2021). The universal appreciation of butterflies may also be attributed to the fact that they do not evoke fear or disgust, unlike other insect groups (Gerdes et al. 2009; Schlegel and Rupf 2010).
Interestingly, the perception of moths was slightly positive despite the negative impact that the larval stages of numerous prominent pest species within this group have on agriculture and forestry (e.g., Sree and Varma 2015) and the adverse reactions caused by the bristles of some caterpillars (Ellis 2021). Outside agricultural settings, the public perception of moths is based mostly on their adult life stages, which are most noticeable at night when attracted to light sources. In these settings, moths are less likely to trigger strong negative emotions such as fear or disgust compared to wasps or mosquitoes, which are more likely to be associated with bites or disease transmission. Neither are moths associated with particularly repelling habitats such as feces or carcasses as compared to flies.
In contrast, the other insect groups—particularly wasps, flies, and mosquitoes—are associated with both neutral and negative terms, often emphasizing harmful aspects or the risk of diseases. Both bees and wasps are perceived as dangerous due to their capacity to sting, which can cause fatal anaphylactic reactions in hypersensitive individuals (Fitzgerald and Flood 2006; Gerdes et al. 2009). Yet, wasps are generally perceived as more negative than bees, partly due to their representation in the media, which typically portrays wasps as aggressive and focuses on the nuisance or the dangers posed by wasp stings rather than their ecological benefits (Sumner et al. 2018; Oi et al. 2024). Flies have long been depicted in literature and folklore as symbols of sin, moral failure, and pestilence. Consequently, they are perceived as pests or annoying rather than beneficial insects (Sumner et al. 2018; Santos et al. 2023). Flies and mosquitoes are also perceived by society to cause disease, death, and discomfort (Kellert 1993; Bhattacharya et al. 2016).
The consistent positive ratings for bees and butterflies and the negative ratings for wasps, flies, and mosquitoes by LLMs reflect biases present in human society that are then transmitted through the training data to these models (Bender et al. 2021; Bommasani et al. 2021. These biases favor more charismatic or aesthetically appealing species, which attract more public interest and empathy. Such biases, in turn, influence conservation priorities and public policy, leading to a skewed focus on species that are more popular or appealing (Schlegel and Rupf 2010; Ducarme et al. 2013; Soga and Gaston 2016; Courchamp et al. 2018).
Our study also reveals a clear trend toward taxonomic simplification. Many of the words LLMs associate with bees, such as “apiary,” “colony,” “hive,” “honey,” “queen,” “swarm,” “wax,” and “worker,” refer specifically to honeybees (Apis sp.) Their predominantly positive perception and their prioritization for conservation, suggested by all 10 LLM chatbots, can be attributed to a combination of cultural, economic, and ecological factors. For millennia, the mutualistic relationship between honeybees and humans has been documented in human culture, with historical references in early cave art, ancient Egyptian carvings and hieroglyphs, folklore, and cultural and religious symbols (Ransome 2004; Prendergast et al. 2021). Their economic value is well recognized today, with beekeeping products and their indispensable pollination services. Furthermore, the “save the bees” movement, triggered by reports of colony collapse disorder in the late 2000s, has heightened public awareness and contributed to the prioritization of honeybees in conservation efforts (van Engelsdorp et al. 2009; Suryanarayanan and Kleinman 2016). The plethora of scientific literature and popular science articles surrounding honeybees is another factor that presumably led to the LLMs' recommendation for honeybees as a top priority for conservation.
Additionally, we found a significant geographic bias toward Nearctic species in the LLMs conservation prioritization. The results of our study show that LLMs disproportionately recommend species occurring in the Holarctic region, especially in the Nearctic, with a significant underrepresentation of the Global South. This overemphasis on Nearctic species reflects the bias inherent in the LLMs' training data, which may disproportionately come from Western scientific literature and databases, as the dominance of Western perspectives often leads to an uneven global representation of biodiversity (Martin et al. 2012; Amano and Sutherland 2013).
4.2 Implications of Insect-Related Biases in LLM Chatbots
In light of the ongoing loss of biodiversity and the anticipated increase in the use of LLMs as sources of information, it is important for us to address the biases identified in this study. As LLMs become more integrated into educational platforms, these biases will likely entrench themselves in the perceptions of learners and teachers who rely on these AI systems as a source of information. Reinforcing these human biases through generative artificial intelligence can profoundly impact conservation efforts.
The LLMs' bias in conservation prioritization, coupled with their trend toward taxonomic simplification, may not only divert attention and resources away from less charismatic but equally important groups (Soga and Gaston 2016; Courchamp et al. 2018; Verma et al. 2023) but also diminish public awareness and support for the conservation of these taxa, which are often perceived as pests rather than essential components in ecosystems (Cardoso et al. 2011; Eisenhauer et al. 2019; Raguso 2020). This imbalance is particularly troubling when neglected groups already face steeper declines, as evidenced by recent studies showing that insect population declines differ substantially across taxonomic groups (Sánchez-Bayo and Wyckhuys 2019). An illustrative case of this imbalance is the marked underrepresentation of Diptera in conservation programs, despite flies and mosquitoes being extremely species-rich and biologically diverse. These groups also occupy key ecological roles as pollinators, biocontrol agents, indicator species, and as food sources for numerous insectivorous animals (Adler and Courtney 2019; Dunn et al. 2020; Orford et al. 2015; Raguso 2020).
Similarly, the LLMs' tendency to associate wasps with aculeate wasps, especially the social Vespidae, overlooks the broader ecological roles of the group. While aculeate wasps are undoubtedly important for ecosystems worldwide—acting as predators, pollinators, decomposers, biological indicators, and seed dispersers (Brock et al. 2021)—focusing only on Aculeata ignores approximately 70% of wasps, including Symphyta and various hyperdiverse parasitoid wasp lineages (Aguiar et al. 2013). Despite the irrefutable significance of parasitoid wasps as biological control agents in agriculture and ecosystem maintenance (Narendran 2001; Wang et al. 2019), these insects are still largely overlooked by the public and underrepresented in scientific research (Sumner et al. 2018). Moreover, parasitoid wasps are highly susceptible to ecosystem changes due to their high trophic level and specialization (Shaw and Hochberg 2001; Shaw 2006). They are also more susceptible to pesticides and less variable in response to pesticide exposure than predatory arthropods (Theiling and Croft 1988).
In the worst-case scenario, following LLMs' suggested conservation priorities could inadvertently lead to the implementation of measures that ultimately harm biodiversity. For example, prioritizing the conservation of honeybees (Apis sp.), a genus of approximately 11 species (Michener 2007), would neglect the vast diversity of wild bees, which includes over 20,000 described species (Michener 2007). Many wild bee species are solitary and are therefore only partially affected by the drivers that cause colony collapse disorder in eusocial honeybees. More concerning still is the mounting evidence that beekeeping practices can negatively affect pollen availability and contribute to the spread of parasites and pathogens, further straining wild bee populations, especially in areas where honeybees are kept at high densities or in nonnative ranges (Mallinger et al. 2017; Geldmann and González-Varo 2018; Valido et al. 2019; Iwasaki and Hogendoorn 2022; MacInnis et al. 2023).
Lastly, the geographical bias identified in our study could seriously hamper global conservation efforts. The underrepresentation of species from zoogeographical regions in the Global South could lead to the neglect of critical species that are essential to biodiversity and the ecological balance of these areas. Such neglect exacerbates global inequality, particularly in many biodiversity hotspots that are characterized by exceptional concentrations of endemic species and are undergoing significant habitat loss. These biodiversity hotspots often overlap with areas of human poverty (Fisher and Christopher 2007), making them further neglected in scientific research and conservation efforts (Meyer et al. 2015).
4.3 Overcoming Biases for Education and Science Communication
Addressing the biases inherent in LLMs is crucial for ensuring their reliability as tools for both conservation and public education. Our findings underscore the challenge of sentiment biases, taxonomic simplification, and geographic bias. These biases mirror and potentially exacerbate existing human prejudices, thereby skewing public perception and influencing conservation priorities (Schlegel and Rupf 2010; Ducarme et al. 2013; Soga and Gaston 2016; Courchamp et al. 2018). To mitigate these biases, it is essential to improve the quality and diversity of the training data used for LLMs. Strategies such as hyperparameter tuning (Ghosal et al. 2024), incorporating a broader and more balanced data set, and actively supplementing otherwise inaccessible sources (specifically “dark texts” as described by Page 2016) in training datasets can help create more fact-based models. Furthermore, ongoing collaboration between taxonomists, ecologists, data scientists, and conservationists can refine these models to better align with the realities of real-world conservation conditions. Utilizing enhanced models responsibly can support more balanced education efforts and help shift policy toward a more integrative and holistic approach to biodiversity conservation.
LLMs hold the potential to revolutionize science education by offering interactive and adaptive learning experiences (Li et al. 2024). Teachers can leverage these tools to deepen their students' understanding of complex ecological interactions and convey the importance of diverse insect groups in ecosystem health and resilience. Furthermore, LLMs can be used to tailor educational content to diverse learning styles and educational backgrounds, thus broadening accessibility and inclusivity in science education (Alqahtani et al. 2023). By improving the quality and diversity of training data and incorporating the latest ecological research findings, LLMs can serve as dynamic educational resources that evolve with new scientific discoveries. Such advancements not only enhance the educational experience but also foster critical thinking and informed decision-making among students, encouraging a more informed and proactive endorsement of biodiversity conservation.
5 Conclusion
In times of severe biodiversity crisis, it is essential to recognize the role of education and science communication in shaping public perception and, consequently, public support for conservation efforts. Given our reliance on intact ecosystems and the ecosystem services they provide, the conservation of biodiversity on all levels—from genetic diversity to ecosystem diversity—should concern everyone. Therefore, enhancing everyday tools like LLMs is of utmost importance. Ultimately, these tools should reflect a more accurate and inclusive picture of biodiversity and facilitate accessibility and inclusivity in science education.
Inspiring and educating the public to endorse and contribute to the successful conservation of biodiversity as a whole requires moving beyond sensationalist media tactics. Instead, we must embrace communication and education that emphasizes the ethical, aesthetic, economic, and ecological value of all insects. By fostering a deeper appreciation for their profound role in our world, we can inspire a more sustainable and informed approach to biodiversity conservation (Ehrlich and Wilson 1991; Novacek 2008; Oi et al. 2024).
Author Contributions
Marina Moser: conceptualization, investigation, writing – original draft, methodology, validation, visualization, writing – review and editing, software, formal analysis, data curation. Lars Krogmann: funding acquisition, project administration, supervision, resources, writing – review and editing, validation. Dominic Wanke: project administration, supervision, data curation, formal analysis, software, writing – review and editing, visualization, validation, methodology, writing – original draft, investigation, conceptualization.
Acknowledgments
We are grateful to the reviewers for their insightful comments and suggestions, which have greatly enhanced the clarity of the discussion and strengthened the conclusions of this study. We are grateful to the LLM chatbots for their pleasant cooperation. This study was partially supported by the Federal Ministry of Education and Research of Germany (Bundesministerium für Bildung und Forschung, BMBF), Berlin, Germany, through the project “GBOL III: Dark Taxa” (funding reference: FKZ 16LI1901C). Funding was provided by the State Museum of Natural History Stuttgart (SMNS).
Ethics Statement
The authors confirm that the ethical policies of the journal, as noted on the journal's author guidelines page, have been adhered to.
Conflicts of Interest
The authors declare no conflicts of interest.
Permission to Reproduce Material From Other Sources
All content and figures in this manuscript are original and have not been reproduced from other sources.
Open Research
Data Availability Statement
The data that supports the findings of this study are available in the Supporting Information File S1 of this article.