Volume 62, Issue 9 pp. 540-556
REVIEW ARTICLE
Open Access

Machine learning in computational histopathology: Challenges and opportunities

Michael Cooper

Department of Computer Science, University of Toronto, Toronto, Ontario, Canada

University Health Network, Toronto, Ontario, Canada

Vector Institute, Toronto, Ontario, Canada

Zongliang Ji

Department of Computer Science, University of Toronto, Toronto, Ontario, Canada

Vector Institute, Toronto, Ontario, Canada

Rahul G. Krishnan (Corresponding Author)

Department of Computer Science, University of Toronto, Toronto, Ontario, Canada

Vector Institute, Toronto, Ontario, Canada

Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada

Correspondence

Rahul G. Krishnan, Department of Computer Science and Laboratory Medicine and Pathobiology, The University of Toronto, Toronto, Ontario M5S 3H5, Canada.

Email: [email protected]
First published: 14 June 2023

Michael Cooper, Zongliang Ji, and Rahul G. Krishnan contributed equally to this study.

Abstract

Digital histopathological images, high-resolution images of stained tissue samples, are a vital tool for clinicians to diagnose and stage cancers. The visual analysis of patient state based on these images is an important part of the oncology workflow. Although pathology workflows have historically been conducted in laboratories under a microscope, the increasing digitization of histopathological images has led to their analysis on computers in the clinic. The last decade has seen the emergence of machine learning, and deep learning in particular, as a powerful set of tools for the analysis of histopathological images. Machine learning models trained on large datasets of digitized histopathology slides have enabled automated prediction and stratification of patient risk. In this review, we provide context for the rise of such models in computational histopathology, highlight the clinical tasks they have found success in automating, discuss the various machine learning techniques that have been applied to this domain, and underscore open problems and opportunities.

1 INTRODUCTION

Anatomic pathology has undergone many important evolutions over the past centuries, from manual examination with bright-field microscopes to whole-slide imaging (WSI), computer vision and image analysis techniques, high-throughput molecular sequencing technologies, and now artificial intelligence (AI). Since the development of the first microscope by Hans and Zacharias Janssen in 1590,1 the microscope has been an important driving force for many discoveries in pathology, including the first microscopic analyses of human tissue by Marcello Malpighi,2 Anton van Leeuwenhoek,3 and Johannes Muller4 in the 17th through 19th centuries, the theory of cell biology and cancer origin proposed by Rudolf Virchow in 1855,5, 6 and the concept of recording histopathological characteristics (e.g., features) to make diagnoses, such as the Reed–Sternberg cell for Hodgkin lymphoma in 1898/1902.7-10 Computer-based analyses did not emerge until 1966, when Prewitt and Mendelson used image analysis algorithms to extract quantitative features for cell subtyping in blood smears.11 The 1990s and the following decade marked a pivotal period in which commercial slide scanners were developed that could digitize histology slides into high-resolution WSIs,12-16 alongside computer-aided diagnosis (CAD) systems that implemented early machine learning (ML) techniques using handcrafted cell and tissue features from WSIs.17-24 In 2010, Dundar et al. presented the first formulation of multiple instance learning (MIL) for cancer subtyping in WSIs based on Haralick texture features, a framework that is still widely utilized today for performing weakly supervised cancer diagnosis at scale without needing detailed pathologist annotations.25

Over the past half decade, the emergence of AI and ML, via deep learning, has been the latest driving force for advancements in pathology.26 Following the 2012 success of convolutional neural networks (CNNs) in the ImageNet Large Scale Visual Recognition Challenge,27-29 similar open challenges such as the CAMELYON30, 31 and PANDA32 challenges were created for lymph node metastasis detection and Gleason grading in prostate cancer, respectively, leading to important breakthroughs showing that CNN- and MIL-based classification systems could surpass pathologist-level performance on these diagnostic tasks.33 Given a large enough repository of diagnostic WSIs (n > 100),34 deep learning can be used to formulate and solve new clinical tasks beyond human pathologist capabilities, such as prediction of the metastatic origin of a cancer,35 cancer prognostication,36-40 and microsatellite instability (MSI) prediction.41-43 Looking beyond supervised learning applications via CNNs and MIL, the development of other techniques such as generative AI modeling,44-46 geometric deep learning,47, 48 unsupervised learning,49, 50 and multimodal deep learning51 may soon enable new clinical capabilities that could enter pathology and laboratory medicine workflows, such as virtual staining,52-54 elucidating cell and tissue interactions,55 untargeted biomarker discovery,56, 57 and data fusion with genomics and other rich biomedical data streams.58-61

Though the direct application of many deep learning techniques may appear to work "out-of-the-box" and have emergent capabilities in computational pathology (CPATH), a variety of technical challenges limit their clinical translation, deployment, and commercialization. Compared to natural images, WSIs can be a much more challenging computer vision domain due to the multi-pyramidal, gigapixel resolutions of the WSI digital format, which adds cost in annotating regions of interest, storing data, and running image analysis pipelines. For slide-level cancer diagnosis tasks, established approaches can overcome these limitations, such as employing a pretrained CNN to pre-extract features from nonoverlapping tissue patches (e.g., 256 × 256 pixels at 20× or 40× magnification) in the WSI, and then inputting the pre-extracted features into a downstream MIL framework.34 However, depending on the task, choices regarding patch resolution, patch magnification, and the spatial ordering of patch features can strongly influence model performance. Many slide-level tasks in computational pathology also suffer from small support sizes, especially in rare diseases, which constrains approaches to be lightweight and data-efficient. The effect of how genomics, ancestry, self-reported race, and other environmental factors can manifest as biases within pathology data is understudied and may have important implications when such models are deployed across diverse populations.62 Lastly, as many of these advances in computational pathology stem originally from advances made elsewhere in ML, computer vision, and deep learning, limitations of those previous methods may persist in their current application to pathology.

Computational pathology has received significant coverage from previous reviews and perspectives. References 7, 12, 63-66 cover early and historical overviews of digital and computational pathology. References 67-70 provide comprehensive overviews of current progress made in computational pathology. References 71-74 provide clinical perspectives on how AI developments can be translated to the clinic, with other reviews focusing on specific aspects such as MIL architectures,75 fairness,62 and individual cancer types.75 In this review, we organize a technical overview of current deep learning applications to pathology, disentangling the clinical tasks from the key methodological tools in ML used to solve them. We highlight several open opportunities for CPATH in the context of surrounding discussions on fairness, equity, interpretability, and the rise of large language models (LLMs). Figure 1 presents a visual overview of this review, enumerating the surveyed models across cancer types, ML architectures, learning strategies, and their trends over time.

Figure 1. A Sankey diagram of the experiments presented in the papers reviewed in this article. Each unit of height in the diagram represents a single experiment, comprising an anatomical region of the body, pathological task, machine learning methodology, and year of publication. This diagram allows visualization of the frequency of interactions between these characteristics of each experiment. Specifically, this figure highlights the increase in popularity of self-supervision, attention, and graph-based architectures in recent years, and the continued popularity of convolutional architectures and multiple-instance learning algorithms in this space.

2 CLINICAL TASKS IN COMPUTATIONAL HISTOPATHOLOGY

Digitized WSIs are routinely read by pathologists to extract insights for diagnosing the current and future state of a patient. Even for small biopsy or surgical specimens, whole-slide histopathological images are typically mega- or gigapixel in size. With the development of digital scanners, high-performance computers, graphics processing units (GPUs), and larger hard-drive storage, research has studied how computation can aid the analysis of digitized images using tools from ML. Following the success of deep learning for classifying natural images,29 numerous methods have been developed to tackle problems like image classification, object detection, and image segmentation.29, 76-80 Drawing insights from these methods, new deep learning approaches have been designed for automating tasks of clinical interest using histopathological images. Here, we highlight some of the representative tasks for which ML has been leveraged.

For many of these studies, the data used to train models were curated and, in some cases, made available to the public. Many studies also leveraged The Cancer Genome Atlas (TCGA)81 for training and validation of ML models. Challenge datasets, publicly available WSIs and labels released with the goal of encouraging researchers to build and study ML models, have also played an important role in the advancement of computer vision methods for CPATH. Some prominent examples of challenges based on WSIs include the CAMELYON dataset82 to identify breast cancer metastases in sentinel lymph nodes, the TUPAC dataset83 to predict tumor proliferation, and the PANDA challenge32 for Gleason grading of prostate cancer. At the patch level, the CRC-100K dataset for colorectal cancer,84 and the BreastPathQ85 and ICIAR-BACH86 datasets for breast cancer, have proved valuable for the study of different ML methods in CPATH.

2.1 Classification

The winning solution29 of the 2012 ImageNet challenge28 showed the capability of deep neural networks to accurately classify natural images given large, labeled datasets. Given histopathological WSIs, the most common task with which an ML model can help is to determine whether the scanned tissue possesses abnormalities of interest. According to cancer data released by the World Health Organization, the five most common cancer types are breast cancer, lung cancer, colon cancer, prostate cancer, and stomach cancer.87 Prior work in classification uses the word "detection" for certain diseases or disease subtypes. The task of detection in CPATH is different from the task of object detection in computer vision, where images are significantly smaller and often contain a small number of objects. In CPATH, due to the limited availability of bounding-box or position-level annotations, only a few studies88, 89, 83, 90, 91 perform fine-grained detection of mitotic events. Consequently, the vast majority of research methods focus on predicting the prevalence of clinically relevant patient- or slide-level outcomes, posed as a classification task. The intended goal of such predictive systems can range from automation of prediction in low-resource settings to risk stratification for organizing clinical workflows.

For breast cancer, a key task is understanding whether the WSI contains mitotic or non-mitotic cells,89, 92-102 or tumorous cells.103, 102 Studies have explored the use of ML to distinguish breast cancer WSIs between normal, benign, carcinoma in situ (CIS), and breast invasive carcinoma (BIC).104 Classification has also been used to identify specific subtypes of immunostaining for estrogen, progesterone, and Her2 receptors (ER/PR/Her2),105, 106 ductal or lobular and basal-like or non-basal-like subtypes, and different tumor grades.105 Two-stage classification, first predicting whether a given WSI tile contains a tumor and then identifying whether the tumor image patch contains tumor-infiltrating lymphocytes (TILs), has also been explored.103 For lung cancer, an important goal is cancer subtyping into adenocarcinoma (LUAD) or squamous cell carcinoma (LUSC)38, 107 and distinguishing genetic mutations within subtypes.38 Research has studied the identification of histologic subtypes like lepidic, acinar, papillary, micropapillary, and solid108-110 and the categorization of WSIs as PD-L1-positive or PD-L1-negative.111 For colon cancer, in addition to cancer or non-cancer WSI classification,36, 43, 112-115 researchers have developed methods for classifying colon cancer cell subtypes into a 4-class categorization112, 116, 117 and a 9-class (adipose, background, debris, lymphocytes, mucus, smooth muscle, normal colon mucosa, cancer-associated stroma, and COAD epithelium) categorization.84, 118 For prostate cancer, research has focused on automating Gleason grading and determining whether the image represents a benign or malignant prostate biopsy sample.119-121 For stomach cancer, normally studied in conjunction with colon cancer, deep learning methods categorize patches as tumor versus non-tumor and classify each WSI tile as adenocarcinoma, adenoma, or nonneoplastic.122-124 Finally, for kidney cancer, some studies attempt to distinguish whether a given WSI tile contains kidney cancer cells125, 126 and categorize kidney tissue into 10 different classes including glomeruli, sclerotic glomeruli, empty Bowman's capsules, proximal tubuli, distal tubuli, atrophic tubuli, undefined tubuli, capsule, arteries, and interstitium.127

CPATH has made inroads into predictive problems for diseases in the brain, liver, and skin. For brain gliomas, researchers have used ML for grading WSIs and identifying whether the tissue morphology indicates an IDH1 mutation.128-130 For liver cancers,131 research has studied classification in the context of 2-class ballooning, 3-class inflammation, 4-class steatosis, and 5-class fibrosis grading,132 discriminated between two subtypes of primary liver cancer (hepatocellular carcinoma and cholangiocarcinoma), and built tools to assist real clinical workflows.133 For bladder cancer, studies have focused on grading cancers using WSIs.134, 135 For skin cancer, a key task is identifying whether a given melanoma will recur based on a WSI.136, 137 Mesothelioma tissue WSIs have been used to build classifiers that distinguish transitional mesothelioma (TM) from non-TM tissue.138, 139 Finally, research has studied tumor versus non-tumor and TIL classification across different cancer types.33-35, 140-143

2.2 Segmentation

While classification of clinical outcomes can provide utility at a slide level, ML has also been used to highlight the exact location and boundary of an abnormality in the WSI. This problem is typically posed as one of image segmentation. Pixel-level labels are required for models to perform segmentation tasks on histopathological images. However, pixel-level labels are hard to obtain since they require pathologists to use software to draw boundaries around different types of tissue. Despite the effort required, researchers have made great progress in developing methods to automatically identify objects of interest in histopathological images.

Due to the size of the WSI and the computational requirements of deep learning models, most methods split WSIs into patches, each with a labeled segmentation mask (a binary mask indicating which pixels represent regions of interest) for a model to learn from. For colon cancer, research has studied the problem of segmenting glands90, 115, 144-146 and identifying different classes of tissues.147 For kidney cancers, segmenting glomeruli126, 148 or different subtypes of kidney tissues127, 149 are key tasks of interest. Researchers have also made progress on segmenting normal and abnormal regions from histopathological images for breast,90, 99, 150 lung,110, 151 bladder,134 stomach,123 and prostate90 cancers.
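
To make the patch-level setup concrete, the following minimal sketch (in PyTorch) shows a soft Dice loss of the kind commonly used to train segmentation models against binary patch masks; the tensor shapes are illustrative assumptions rather than details from any specific study reviewed here.

import torch

def soft_dice_loss(logits, target, eps=1e-6):
    # logits: (batch, 1, H, W) raw model outputs; target: (batch, 1, H, W) binary masks.
    probs = torch.sigmoid(logits)
    intersection = (probs * target).sum(dim=(1, 2, 3))
    union = probs.sum(dim=(1, 2, 3)) + target.sum(dim=(1, 2, 3))
    dice = (2 * intersection + eps) / (union + eps)   # per-patch Dice coefficient
    return 1 - dice.mean()                            # loss decreases as masks overlap

# Illustrative usage with random stand-in tensors.
logits = torch.randn(4, 1, 256, 256)
masks = (torch.rand(4, 1, 256, 256) > 0.5).float()
loss = soft_dice_loss(logits, masks)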

2.3 Survival analysis

The successful prediction of patient outcomes such as mortality, and of characteristics of the disease trajectory such as progression-free survival, can help oncologists plan treatments and assess individual patient disease severity. Histopathological images capture proxies of genetic abnormalities, tumor burden, and subtype, all of which can inform this task. There are several studies that model patient trajectories using pathology data36, 39, 118-121, 124, 137 and others that blend clinical data with additional modalities such as demographic or genomic data.40, 152-154 In most biomedical studies, the time-to-event for many patients is not recorded due to loss of patients to follow-up. Consequently, data often contain the last-observed time points for patients rather than their actual event times. Such data are referred to as (right) censored, and the tools used to predict event time, typically time of death, from such data fall under the umbrella of survival analysis.

Since neural networks were first combined with classical tools in survival modeling,155, 156 researchers have studied their utility for CPATH problems in colon,36, 39, 43, 61, 118, 124 mesothelioma,138 brain,157 breast,55, 105, 158 lung,55, 152, 159, 160 prostate,119-121, 140 uterine,55 and kidney55, 125, 140, 161 cancers. Researchers have also explored the application of survival analysis to datasets comprising multiple cancer types at the same time.37, 41, 55, 60, 140, 152, 162, 163
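
To illustrate how neural networks are combined with classical survival tools, the sketch below computes the negative Cox partial log-likelihood over model-predicted risk scores for right-censored data; it uses a simple Breslow-style handling of ties and is a simplified illustration, not the implementation of any particular work cited above.

import torch

def cox_partial_nll(risk, time, event):
    # risk:  (n,) model-predicted log-risk scores
    # time:  (n,) observed follow-up times
    # event: (n,) 1 if the event (e.g., death) was observed, 0 if censored
    order = torch.argsort(time, descending=True)        # sort so each risk set is a prefix
    risk, event = risk[order], event[order]
    log_cum_hazard = torch.logcumsumexp(risk, dim=0)    # log sum over patients still at risk
    partial_ll = (risk - log_cum_hazard) * event        # only observed events contribute
    return -partial_ll.sum() / event.sum().clamp(min=1)

# Illustrative usage with random stand-in data.
risk = torch.randn(8)
time = torch.rand(8) * 60.0
event = torch.randint(0, 2, (8,)).float()
loss = cox_partial_nll(risk, time, event)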

2.4 Counting

Clinicians are also interested in having detailed measurements of the sampled tissue at the cellular level. Counting mitotic cells is a typical clinical task for pathologists since the number of mitotic cells is a key factor in determining cancer grade. Research efforts have been made in counting mitotic cells in breast cancer, lymphocytes in breast cancer WSIs, centroblasts in follicular lymphoma WSIs, and plasma cells in bone marrow image patches.41, 64, 83, 164 Counting cells typically requires cell segmentation; a variety of attempts have been made at cell segmentation on breast cancer,165-168 colon cancer,146, 169 and bladder cancer.170 Research attempts have also been made at counting neuroendocrine tumor (NET) cells within the gastrointestinal tract and pancreas.171 Cell nuclei segmentation and counting has also been developed for grading squamous epithelium cervical intraepithelial neoplasia (CIN).172 Quantification of immunohistochemical labelling (e.g., MIB1/Ki-67 proliferation) is also an important prognostic application.173-175 Although supervised cell segmentation requires tremendous effort to obtain fine-grained annotations, several cell segmentation methods79, 176, 177 have obtained promising results. Several unsupervised cell segmentation methods have also been developed over the past decade.178-180

There has also been research studying the use of ML for tasks that can form the basis of future clinical workflows. Many WSIs from tissue samples come without labels, and obtaining such labels may be hard or impossible. Consequently, unsupervised ML methods like clustering have been deployed to obtain insights from histopathological images. Given image patches from WSIs, researchers first extract features, or representations, using preexisting predictive models. These are then clustered to automate the identification of subgroups in lung,159 brain,159 breast,98, 181 and colon cancer113 WSIs. In addition, research has begun to use ML to learn associations between gene expression and pathological images,110, 130, 182 perform stain normalization for histopathological images,106, 183-186 generate synthetic data,106, 130, 148, 151, 170 compress images,187 and automate histopathological captioning and diagnosis generation.135

3 LEARNING STRATEGIES FOR COMPUTATIONAL HISTOPATHOLOGY

The goal of an ML model developed to tackle the tasks in Section 2 is to generalize well, that is, it must operate in regimes outside of those in which it was trained. Deep learning models learn functions that transform high-dimensional inputs like images to numbers representing event time (for regression and survival analysis), class probabilities (for classification), or other images (for segmentation). A deep learning model does so via the intermediate step of first translating inputs into representations,188 vectors of numbers whose entries serve as a compressed store of the information content in the input. As the model is trained, the network learns to produce representations that capture the appropriate structure for the predictive task at hand, so that they can be readily converted to the desired output. The choice of function used in a model can vary; the simplest kind of function in deep learning is a multilayer perceptron, in which each layer of the network comprises a linear transformation followed by a nonlinearity. In CNNs, the function being learned is represented by compositions of convolution operations, a network architecture that exhibits a degree of spatial invariance that makes it useful for modeling pixels. Transformers189 comprise stacked attention layers, in which learned weights over the previous layer's representations (often visualized as a heat map) are used to compute the next layer. The parameters in a deep learning model are trained by solving an optimization problem, typically the maximization of a proxy for the accuracy of a model (or the minimization of a loss function) on a labeled training set. The triplet of neural architecture, loss function, and dataset plays a large role in the degree to which a model generalizes. Most deep learning models are trained on GPUs, whose hardware constraints often inform the form, complexity, and size of the model, loss function, and dataset.
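
To ground these ideas, the minimal sketch below performs a single training step of a small multilayer perceptron that maps hypothetical 1024-dimensional patch representations to class probabilities under a cross-entropy loss; all dimensions and data are placeholders.

import torch
from torch import nn

# Illustrative dimensions: 1024-d representations, 2 diagnostic classes.
model = nn.Sequential(nn.Linear(1024, 256), nn.ReLU(), nn.Linear(256, 2))
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

features = torch.randn(32, 1024)        # stand-in for a batch of representations
labels = torch.randint(0, 2, (32,))     # stand-in for the corresponding labels

logits = model(features)                # representations -> class scores
loss = loss_fn(logits, labels)          # proxy objective minimized during training
optimizer.zero_grad()
loss.backward()                         # gradients via backpropagation
optimizer.step()                        # parameter update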

CPATH has characteristics that make the straightforward adoption of tools from computer vision challenging. The cellular patterns in tumor microenvironments are often highly complex and require expertise to properly annotate, making it difficult to obtain a large amount of labeled data for training deep learning models. Histopathology images are large, requiring substantial computational resources and GPU memory to build predictive models. The patterns of interest are often small in size and low in contrast, making it difficult for deep learning models to accurately detect and segment these objects. Finally, images can be affected by various sources of noise and artifacts, such as staining variations. Over the years, these challenges have informed the various technical approaches researchers have adopted to build predictive systems. In what follows, we highlight three learning strategies and two neural network architectures that have found success in predictive applications in CPATH. Although they are by no means mutually exclusive or exhaustive (indeed, many state-of-the-art methods mix and match among them), we have chosen to highlight the following methods as those that have seen recent success tackling some of the unique challenges of predictive modeling in this domain.

3.1 Multiple-instance learning

Deep learning models require that the input (in this case, images) be small enough to load onto GPU memory. The need for MIL arises because, as of 2023, a single histopathology image would exceed the available memory of a GPU. A popular approach is to decompose a large histopathology image into patches and treat the resulting collection of patches as a bag, all of whose members are assigned the same slide-level label.

MIL190, 191 provides a class of methods for detecting whether a collection of objects contains one or more objects of interest. The input to the model consists of a collection of objects, each of which may be positive (of interest), or negative (not of interest). The aim of MIL is to determine which objects are positive and negative, based only on collection-level labels. MIL provides a practical means to make efficient use of training data with coarse labels. Rather than requiring pixel-wise annotations, methods in MIL allow for learning from slide- or region-wise annotations, which are significantly cheaper and easier to obtain.
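
A minimal sketch of this setup is shown below: patch features for one slide form a bag, a permutation-invariant pooling operator (here, a simple mean) aggregates them, and a single slide-level label supervises the whole bag. The dimensions are illustrative, and learned attention-based pooling operators (Section 3.4) are typically used in place of the fixed mean.

import torch
from torch import nn

class MeanPoolMIL(nn.Module):
    # Bag-level classifier: embed patches, pool, predict a slide-level label.
    def __init__(self, in_dim=1024, hidden=256, n_classes=2):
        super().__init__()
        self.embed = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
        self.head = nn.Linear(hidden, n_classes)

    def forward(self, bag):              # bag: (n_patches, in_dim)
        h = self.embed(bag)              # per-patch embeddings
        pooled = h.mean(dim=0)           # permutation-invariant bag embedding
        return self.head(pooled)         # slide-level logits

bag = torch.randn(500, 1024)             # one slide = a bag of 500 patch features
slide_logits = MeanPoolMIL()(bag)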

Small-scale experiments applying MIL to CPATH have taken place as early as 2012. ccMIL112 extends the MIL algorithm by introducing a smoothness prior over proximal image patches. After training on a small dataset consisting of 83 slides (53 cancerous, 30 noncancerous), ccMIL obtains image segmentation performance comparable to a model trained directly on pixel-wise annotations (F-measure of 0.7 for ccMIL, 0.72 for pixel-wise annotations). ccMIL obtains 0.997 AUC on binary classification of images into cancerous or noncancerous, and 0.965 and 0.970 AUC, respectively, on multitask classification of images into different subtypes of cancer. These results outperformed contemporary methods including multiple-instance support vector machines,192 multiple-instance boosting,193 and multiple-clustered instance learning (MCIL).194

One of the first large-scale MIL models was published in 2019.33 In this study, the authors collected a dataset of 44 732 whole slide images from 15 187 patients, corresponding to biopsy samples spanning diagnoses of prostate cancer, basal cell carcinoma, and breast metastases. These images were not specially curated for clarity and contained artifacts that would be found on raw pathology slides. In binary classification of slides as cancerous or noncancerous, their MIL model achieved an AUC of 0.991 on detecting prostate cancer, 0.988 on detecting basal cell carcinoma, and 0.966 on detecting breast metastases. In evaluating generalization performance to external data, the authors conclude that weak supervision (e.g., slide-level labels) on larger datasets leads to better generalization than strong supervision (e.g., pixel-wise labels) on small datasets.

3.2 Transfer learning

The largest datasets used to train computer vision models contain millions of labeled training examples. By contrast, the largest CPATH datasets may only have tens of thousands of labels (at the WSI level). Large, labeled datasets are an important way by which a deep learning model learns useful representations; however, datasets of such sizes may not be feasible in CPATH. Consequently, many researchers turn to a technique known as transfer learning.

Transfer learning refers to the class of methods that leverage parameters from a model trained on a prior task (also called an "upstream task") as a starting point for learning an eventual task of interest (also called the "downstream task"). In computer vision, certain parameters within a model often transfer well between imaging modalities and tasks. The weights of the first few layers of a deep CNN, for example, often learn edge detection capabilities,195 a capability that is useful across many computer vision domains. It is therefore typical for a computer vision model that has been pretrained on an upstream image classification or regression task to learn a novel data distribution more efficiently, from fewer samples, than a model trained from scratch.

ImageNet,27, 28 a large-scale dataset of natural images, is a popular dataset for pretraining image recognition models, and ImageNet-pretrained models are readily available as starting points for downstream image tasks. Despite clear visual differences between ImageNet's natural images and the fine-grained images of tissue morphology found in histopathology, pretrained models like VGG,196 ResNet,77 AlexNet,29 GoogLeNet/Inception,197, 198 remain effective starting points for building predictive models in computational histopathology (VGG,36, 84, 93 ResNet,125 AlexNet,93, 199 GoogLeNet/Inception93, 95). Occasionally, these pretrained models are used in combination, as in the ensemble approach of Reference 200: this system leverages pretrained Inception-V3,198 Xception,201 VGG-16196 and ResNet-5077 as individual networks in a weighted voting scheme for binary classification of histopathological slides. Another study202 provides a comparative analysis of different ImageNet-pretrained models within the context of colorectal cancer histopathology slide segmentation, finding that a DenseNet-121203 feature extractor, paired with a LinkNet204 segmentation architecture, is the most promising approach among the pretrained models they evaluated, which spanned DenseNets, Inception Networks, MobileNets,205 ResNets, and VGG architectures.
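
A common pattern, sketched below, is to use an ImageNet-pretrained ResNet-50 from torchvision as a frozen feature extractor over tissue patches; the patch size and the weight identifier are assumptions about the software environment rather than details from the works cited above.

import torch
from torchvision import models, transforms

# Load an ImageNet-pretrained ResNet-50 and drop its classification head.
backbone = models.resnet50(weights="IMAGENET1K_V1")
backbone.fc = torch.nn.Identity()          # backbone now emits 2048-d features
backbone.eval()

normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],   # ImageNet statistics
                                 std=[0.229, 0.224, 0.225])

patches = torch.rand(16, 3, 256, 256)      # stand-in for 16 RGB tissue patches in [0, 1]
with torch.no_grad():
    features = backbone(normalize(patches))   # (16, 2048) pre-extracted patch features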

Recent work206 compares the performance of models pretrained on ImageNet with those pretrained using self-supervised learning (Section 3.3, Reference 207) or multitask learning208 on histopathology data. On binary classification of slides as cancerous or noncancerous on a dataset containing 413 WSIs from duodenal biopsies, they find that the self-supervised encoders achieve a greater AUC than those pretrained on ImageNet. Moreover, they observe no discernible relationship between a model's performance on ImageNet and its subsequent performance on the downstream task. This presents a challenge for researchers and practitioners attempting to leverage transfer learning in practice, as these results suggest that there presently exists no pretraining heuristic that can accurately predict the performance of a fine-tuned ImageNet model on a downstream CPATH classification task.

3.3 Self-supervised learning

Deep learning models are trained to maximize the accuracy of the model on a training set. This requires access to labels. Self-supervised learning refers to the class of methods that allow models to learn features of images relevant to a task without access to underlying labels. The key insight that self-supervised learning leverages is that one can often use domain knowledge to create pseudo-labels or learning objectives which, when optimized, yield informative representations of images.
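
As one concrete illustration of pseudo-label construction, the sketch below implements a classic pretext task, rotation prediction, in which each patch is rotated by a random multiple of 90 degrees and a model is trained to predict the rotation; this task is offered only as an example of a self-supervised objective and is not one of the specific methods discussed below.

import torch

def rotation_pretext_batch(patches):
    # patches: (n, 3, H, W). A model trained to predict the rotation label can learn
    # morphology-aware features without any pathologist annotations.
    labels = torch.randint(0, 4, (patches.size(0),))          # pseudo-labels: 0/90/180/270 degrees
    rotated = torch.stack([torch.rot90(p, int(k), dims=(1, 2))
                           for p, k in zip(patches, labels)])
    return rotated, labels

patches = torch.rand(8, 3, 256, 256)        # stand-in tissue patches
rotated, pseudo_labels = rotation_pretext_batch(patches)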

3.3.1 Contrastive learning

Contrastive learning performs self-supervised learning by assigning pairs of instances in the data to be "positive pairs" or "negative pairs" and learning a representation such that positive pairs have similar representations and negative pairs do not. Methods of contrastive learning in CPATH vary based on how positive and negative pairs are assigned. Reference 183 selects a collection of "reference images" and, for each reference image, produces a batch of other image samples that have been stain-normalized with respect to the reference. Positive pairs are different stainings of the same image, while negative pairs are any two distinct images. This method produces a model that learns to be agnostic to stain variation across slides. Reference 209 defines a positive pair of patches as those which are spatially proximal, and a negative pair as those which are spatially distant; they find that a ResNet-18 pretrained using their contrastive objective outperforms an ImageNet-pretrained and NPID-pretrained210 network on tumor tile retrieval in the CAMELYON-16 dataset.

Two popular means of contrastive self-supervised learning include SimCLR50 and MoCo.211 Reference 207 finds that, across 57 datasets, a ResNet with SimCLR self-supervision improves classification and segmentation performance beyond random initialization or ImageNet pretraining, though these performance gains diminish with access to additional data. Reference 212 modifies MoCo v3213 to obtain more positive pairs and finds that a joint CNN- and transformer-based189 architecture pretrained in this fashion achieves state-of-the-art performance on five tasks across nine CPATH datasets.
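
The sketch below shows a minimal SimCLR-style InfoNCE objective over a batch of embedded positive pairs (e.g., two stain-normalized or spatially proximal views of the same region); the temperature and batch construction are illustrative choices rather than settings from the cited works.

import torch
import torch.nn.functional as F

def info_nce_loss(z1, z2, temperature=0.1):
    # z1, z2: (n, d) embeddings of the two members of each positive pair.
    # All non-matching rows in the batch act as negatives.
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature        # (n, n) scaled cosine similarities
    targets = torch.arange(z1.size(0))        # positives lie on the diagonal
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

z1, z2 = torch.randn(64, 128), torch.randn(64, 128)   # stand-in embeddings of paired views
loss = info_nce_loss(z1, z2)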

3.3.2 Non-contrastive learning

Contrastive learning requires both positive and negative pairs. Methods in non-contrastive learning, on the other hand, remove the requirement for negative pairs of images. Three modern, popular methods of non-contrastive learning include BYOL,214 SimSiam,215 and DINO.216 Reference 217 develops a hybrid CNN- and Transformer-based architecture pretrained using BYOL. The resulting model outperforms modern state-of-the-art vision models, including vision transformers (ViTs),218 T2T-ViT-24,219 VT-ResNet,220 and BoTNet-50.221 Reference 222 applies ViTs pretrained using DINO to several tasks, including cancer subtyping and colorectal tissue phenotyping. Although an ImageNet-pretrained ResNet presents a strong baseline for their task, self-supervision with DINO typically outperforms both the transfer-learned approach and the contrastive SimCLR method. Current research suggests that self-supervised learning typically outperforms ImageNet-pretrained architectures on computational histopathology tasks, although there does not yet appear to be a single dominant method of self-supervision within the context of CPATH.223 The advantages of self-supervised learning appear to be most pronounced when a limited quantity of labeled data is available for domain-specific transfer learning.

3.4 Neural attention with transformers

Attention is a neural network mechanism expressing the inductive bias that context determines the importance of each input variable to the current layer of the network. In computer vision, attention learns from surrounding pixels the importance of each pixel to the current layer of computation. Empirically, this inductive bias proves an effective assumption in the context of natural language and image processing. Transformers, neural networks constructed using stacked neural attention layers, have set the new standard in natural language processing tasks,189 while ViTs approach state-of-the-art results on vision tasks with significantly fewer parameters than convolutional networks.218 One of the key strengths of the attention mechanism is interpretability: by visualizing the attention weights over each pixel as a heat map, a user can interpret the learned relative importance of each pixel to the ultimate prediction task (e.g., as in Reference 222). This allows domain experts to conduct post hoc analysis of the learned model to assess whether it has learned the right signal for the task at hand.

Much of the success of attention in CPATH has leveraged attention mechanisms as the pooling function in MIL.224, 225 In the former work,224 the authors use an attention mechanism as the pooling operator for MIL instead of a fixed pooling function. In a classification task on breast226 and colon cancer116 data, their gated attention-based MIL approach achieved higher image-level binary classification accuracy, precision, recall, F-score, and AUC than either instance-wise or embedding-wise MIL approaches. In the latter work,225 the authors propose DeepAttnMISL, a MIL-based survival model that employs an attention layer over the representation of patches to pool them for MIL. Clustering-constrained attention multiple-instance learning (CLAM)34 uses attention in combination with clustering to perform MIL; by using attention weights as pseudo-labels, it stratifies patches of the WSI into different clusters. Reference 227 develops a transformer-based self-attention189 mechanism for histopathological images by exploiting hierarchical structure present in the images. The hierarchical image pyramid transformer (HIPT) first applies a ViT model at the cell level, then at the patch level, and finally at the region level, with the output of each representation feeding into the subsequent model in the hierarchy. HIPT leverages the strengths of image transformers218 without incurring intractability in the computation of the attention weights due to the size of the input image. In slide-level classification and survival prediction of H&E-stained slides from TCGA, HIPT outperforms other weakly supervised techniques, including DeepAttnMISL and the graph neural network (GNN)-based GCN-MIL.228
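
A minimal sketch of gated attention pooling in the style described above is given below; the feature and attention dimensions are illustrative, and the returned attention weights are the quantities typically visualized as slide-level heat maps.

import torch
from torch import nn

class GatedAttentionPool(nn.Module):
    # Gated attention pooling over patch embeddings (in the spirit of Reference 224).
    def __init__(self, in_dim=1024, attn_dim=256):
        super().__init__()
        self.V = nn.Linear(in_dim, attn_dim)
        self.U = nn.Linear(in_dim, attn_dim)
        self.w = nn.Linear(attn_dim, 1)

    def forward(self, h):                    # h: (n_patches, in_dim)
        scores = self.w(torch.tanh(self.V(h)) * torch.sigmoid(self.U(h)))   # (n, 1)
        attn = torch.softmax(scores, dim=0)  # per-patch attention weights
        bag = (attn * h).sum(dim=0)          # attention-weighted bag embedding
        return bag, attn.squeeze(-1)         # weights can be overlaid as a heat map

h = torch.randn(500, 1024)                   # pre-extracted features for one slide
bag_embedding, attention_weights = GatedAttentionPool()(h)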

3.5 Graph neural networks

GNNs are a class of neural networks that encode a relational inductive bias: the assumption that properties of, and relationships between, discrete entities in the data are important to the overall prediction task.229 To do so, a GNN represents each data sample as a set of nodes and edges and learns a representation of this graph that most readily supports the overall prediction task. In the context of computational histopathology, nodes in the graph typically correspond to patches sampled from a slide. We can therefore group and compare GNN-based methods by the way in which the edges of each graph are assigned to their corresponding nodes.

3.5.1 Node feature similarity

Some methods construct a graph out of each instance by placing an edge between nodes that are sufficiently similar to each other. This class of methods includes DeepGraphSurv,156 one of the first graph CNNs designed for the task of survival prediction from a WSI. After sampling patches from the WSI (which act as nodes in the graph), DeepGraphSurv extracts 128-dimensional node features via an ImageNet-pretrained neural network. Edges are then added between pairs of nodes with a sufficiently small Euclidean distance in representation space. The authors find that on three survival datasets (glioblastoma multiforme and lung squamous cell carcinoma slides from TCGA,81 and slides from the National Lung Screening Trial230), DeepGraphSurv's incorporation of topological relationships between patches yields a substantial gain in concordance231 when compared to competing models of survival. Rather than relying solely on node feature similarity, Reference 232 introduces the idea of an adjacency learning layer, which incorporates both global context and node features to learn the adjacency matrix of the graph during training. This approach set a new standard of accuracy for binary classification on the MUSK1 MIL dataset233 (92.6% accuracy), and for cancer subtyping (lung adenocarcinoma vs. lung squamous cell carcinoma) on WSIs from TCGA81 (89% accuracy).
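
The sketch below illustrates how such a feature-similarity graph might be constructed from pre-extracted patch features, followed by a single mean-aggregation message-passing step; the neighborhood size and feature dimensions are illustrative assumptions rather than the design of any cited model.

import torch

def knn_graph_message_pass(features, k=8):
    # features: (n_patches, d) pre-extracted patch representations.
    # Edges connect each patch to its k nearest neighbors in feature space.
    dists = torch.cdist(features, features)                  # (n, n) Euclidean distances
    knn = dists.topk(k + 1, largest=False).indices[:, 1:]    # drop the self-match
    adj = torch.zeros_like(dists)
    adj.scatter_(1, knn, 1.0)                                # adjacency of the k-NN graph
    adj = ((adj + adj.t()) > 0).float()                      # symmetrize
    deg = adj.sum(dim=1, keepdim=True).clamp(min=1)
    return (adj @ features) / deg                            # neighborhood-averaged node features

features = torch.randn(200, 128)          # e.g., 128-d node features for 200 patches
smoothed = knn_graph_message_pass(features)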

3.5.2 Node spatial location

In this paradigm, nodes in the GNN are typically represented by patches, while the graphical structure is produced based on which nodes are spatially proximal. This encodes the inductive bias that the presence of a feature in one location (e.g., a cancerous cell) is likely to inform the presence of that same feature in other nearby locations. HGSurvNet157 is a GNN-based method that performs survival prediction from WSIs. To do so, it constructs two hypergraphs and performs multi-hypergraph learning over them to produce a downstream survival prediction. One of the hypergraphs contains edges determined by the feature similarity of nodes, as determined by feature extraction via an ImageNet-pretrained model, while the other contains edges determined by spatial proximity to other nodes on the slide. Training this architecture using the Cox partial likelihood objective234 yields a survival model that outperforms competing methods like graph convolutional networks48 and DeepGraphSurv, achieving a concordance of 0.6730 on the LUSC dataset,235 0.6726 on the GBM dataset,235 and 0.6901 on the NLST dataset.230 Instead of using patches as nodes in the graph, Reference 236 uses cell nuclei as the nodes, then connects nuclei that are sufficiently proximal. Applying attention-based robust spatial filtering on this graph yields near-state-of-the-art performance on subtype classification of breast cancer86 and Gleason grading of prostate cancer237 and admits interpretable attention maps in which well-attended nuclei correlate strongly with the presence of a cancerous cell. Graphs based on node spatial location have also been used to improve the tractability of the learning problem: Slide Graph238 constructs a graph in which nodes are cell nuclei, with edges placed between proximal nuclei. This approach provides a scalable means to efficiently capture cellular structure across a WSI. In evaluation on a dataset of breast cancer pathology slides from TCGA, this method achieved a 0.73 AUC on HER-2 status prediction (with the next closest baseline124 achieving 0.68), and a 0.75 AUC on PR status prediction (with the next closest baseline42 achieving 0.73).

3.5.3 Patch spatial location with superpixel node features

Reference 239 presents SegGini, a graph isomorphism network that leverages superpixel node features to perform weakly-supervised semantic segmentation of histopathological slides. It does so by way of node classification, wherein superpixel nodes are each classified into segmentation regions. A key advantage of this method is its ability to operate under inexact labels and partial annotation, and in evaluation on one prostate tissue microarray dataset240 and one prostate WSI dataset,241 SegGini performs state-of-the-art segmentation as measured by the Dice score, outperforming a human clinician on the first dataset.

4 CHALLENGES AND OPPORTUNITIES

Despite enormous progress over the last decade in computational histopathology, there remain several open challenges in the translation of ML tools in computational histopathology from research laboratories into assistive software tools for clinicians.

4.1 Bias, fairness, and equity

The interactions that patients have with the medical system can manifest as patterns within their clinical data. These interactions can bias the clinical data242 used to train ML models and subsequently result in biased models with a diminished ability to generalize. Reference 243 provides an overview of the different kinds of biases that can arise in medical imaging, highlights their statistical implications for building predictive models, and provides insights into how tools from causal inference can prove valuable in bias mitigation.

The naive training of ML models to maximize average accuracy on a predictive task has been found to yield models that are prone to bias among subpopulations within the data. This is because populations represented in real-world datasets are diverse, and average accuracy on a held-out set may not reflect the nuances of how the model will perform on members of various subpopulations during deployment.62, 244 In predictive systems for chest x-rays, Reference 245 demonstrates disparities in true predictive rates across subgroups based on protected attributes such as patient sex, age, race, and insurance type, across various state-of-the-art classifiers. Understanding the fairness and equity of predictive models is crucial to engender trust in deployed systems. Indeed, Reference 246 provides a simple theoretical model wherein repeated subgroup disparities in predictive systems can result in less frequent engagement by a minority group, increasing disparities in predictive outcomes.

One notion of bias and fairness that has recently been explored in computational histopathology is the implicit dependence that predictive models may have on the hospital from which the digitized slides are obtained. Reference 247 devises a method to reduce this dependence in features extracted from histopathology images by explicitly encouraging the model, via an evolutionary strategy, not to capture patterns that are indicative of hospital identity. Reference 248 develops a learning strategy for predictive models to decrease variation in predictive performance among hospitals. In addition to variation across hospitals, the various hardware technologies (e.g., scanners) that form the image procurement pipeline can also yield variation in predictive performance. Reference 249 evaluates the effect of this heterogeneity on the task of lymph node segmentation and shows that slide color normalization, model fine-tuning, and domain adversarial learning are promising means of accounting for such discrepancies.

One ripe area of opportunity in this space would be the development of benchmark datasets to quantify the bias associated with the various models and methods in this space. In medical imaging, for example, there have been several lines of work highlighting the need to better understand the fairness of algorithms in health care. Reference 250 introduces a benchmark of medical images with paired annotations of sensitive attributes such as age, race, and sex, with a view to studying how different algorithms for bias mitigation perform across a suite of different fairness metrics in chest x-rays, fundus imaging, MRIs, CT scans, and dermatology images. Because this dataset presents a means to quantify the bias and fairness of a given approach, these data have opened the door to applying methods from the ML fairness literature to medical image analysis. A similar initiative across the clinical tasks considered in histopathology image analysis would enable a better understanding of the limitations of the current set of tools across the gamut of clinical tasks considered in Section 2.

4.2 Heterogeneity of predictive outcomes

Digitized histopathology images can exhibit variation that is dependent on the tumor microenvironment,251 the stage of the disease,252 the stain used in the image,253 and the patient's individual characteristics.254 This results in observer variability arising from the subjective, manual interpretation of these images by pathologists. The manual annotations of these images, in many cases, form the labels used to train CPATH models. Many ML models assume that the noise present in the labels is homogeneous across instances; violation of this assumption can exacerbate bias in the resulting model. Reference 255 studies the effect of instance-dependent noise in neural network models, showing that low-frequency noisy labels (such as those coming from a minority subpopulation) are more likely to be misclassified. A detailed study of the effects of label noise in computational histopathology, its origin, and mechanisms to mitigate its effects on generalization represents a promising area of future study and would further improve trust in CPATH models.

4.3 Multimodal integration and the need for interpretability

The treatment and care of patients suffering from cancers involves clinical decision making based on a variety of modalities. Integrating and harmonizing these data from electronic medical records and clinical trials opens new avenues for ML to ask novel research questions, such as individual risk prognostication and biomarker identification.256, 257 For example, Reference 182 combines clinical biomarker data with histopathology images from the NRG Oncology phase III randomized clinical trials to predict outcomes such as metastases and survival in prostate cancer. Reference 258 integrates clinical and genomics data, computed tomography scans, and programmed death ligand-1, PD-(L)1, immunohistochemistry slides to predict individual response to PD-(L)1 blockade. For multimodal fusion of multiple stains, Reference 61 integrates H&E with multiplexed IHC to predict response to neoadjuvant therapy in rectal cancer patients, and Reference 259 integrates H&E, periodic acid–Schiff, trichrome, and Jones' stains for kidney allograft rejection assessment.

In general, there remains a paucity of public data pairing anonymized patient characteristics with digitized pathology slides that would allow for the development and testing of multimodal approaches in the absence of preexisting collaborations between clinical centers and research labs. The publication of such data would represent a promising foundation on which future work exploring different techniques of multimodal integration could be built. Additionally, the success of contemporary multimodal predictive models in pathology raises the question of whether these successes can translate into a scientific understanding of the biological and mechanistic insights that drive the predictive signal. There is thus a need to develop tools to interpret the complex, multimodal signals captured by such models.

Interpretability in ML is a complex problem with its own rich literature. Neural networks are susceptible to problems such as shortcut learning,260 wherein the model latches onto a signal that is predictive of a proxy for the outcome in the training set, rather than the outcome itself. The problem is that when deployed in new scenarios where the proxy is absent, the model will no longer generalize well. Research in ML has developed tools for interpretability that are model-based and model-agnostic. Among the former, tools such as GradCAM261 leverage the gradients of the output with respect to the inputs of a neural network, and GNN-explainer262 identifies the minimal subgraph that correctly explains the prediction of a GNN. The latter class of methods is model-agnostic and includes tools such as SHAP263 and LIME.264
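
As an illustration of the model-based family, the sketch below computes a GradCAM-style saliency map with PyTorch hooks on a standard CNN backbone; it is a simplified sketch under assumed inputs, not the implementation used in the works cited here.

import torch
from torchvision import models

# Minimal GradCAM-style sketch: weight the last convolutional feature maps by the
# spatially averaged gradients of the target class score.
model = models.resnet50(weights="IMAGENET1K_V1").eval()
activations, gradients = {}, {}

def fwd_hook(_, __, output):
    activations["maps"] = output

def bwd_hook(_, __, grad_output):
    gradients["maps"] = grad_output[0]

layer = model.layer4[-1]                      # last convolutional block
layer.register_forward_hook(fwd_hook)
layer.register_full_backward_hook(bwd_hook)

patch = torch.rand(1, 3, 256, 256)            # stand-in tissue patch
score = model(patch)[0].max()                 # score of the predicted class
score.backward()

weights = gradients["maps"].mean(dim=(2, 3), keepdim=True)      # (1, C, 1, 1) channel weights
cam = torch.relu((weights * activations["maps"]).sum(dim=1))    # (1, H', W') saliency map
cam = cam / cam.max().clamp(min=1e-8)         # normalize for overlay on the patch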

Reference 265 studies different methods for explaining graph-based predictive models of histopathology images and proposes metrics using pathologically measurable concepts to rank different methods that generate explanations. Reference 34 shows how attention-level heat maps, when overlaid onto patches, highlight metastatic regions and individual tumor cells. Reference 60 studies the improvement of multimodal integration using WSIs and molecular data in a pan-cancer setting, with local- and global-level interpretability used to understand how features correlate with risk. Reference 266 shows that subsets of neural network features capture discriminatory information for subtyping lung carcinomas. Reference 267 develops a variant of MIL that is interpretable by design (via the use of attention maps over patches); the authors show that the inferred attention maps correspond to regions in the WSI referenced by pathologists in clinical assessments. Reference 268 studies the low-dimensional organization of features captured in CNNs trained on histopathology data using nonlinear dimensionality reduction tools such as t-distributed stochastic neighbor embedding (t-SNE). While the field of explainable ML has made great strides, there is no single gold-standard technique that guarantees the identification of the correct features that the model is relying on. Interpretability in computational histopathology must be combined with clinical expertise to ensure that there is a clinical or biological rationale behind the model's prediction. The problem of interpretability becomes harder for multimodal models, where information from multiple modalities is combined in opaque models to form a final prediction; developing reliable methods for practitioners to understand the predictions of multimodal models remains an open challenge.

Different methods of interpretability generate different types of explanations, and the utility of these explanations is often largely dependent on the context in which a predictive model is deployed. One promising line of work involves an ethnographic analysis of the workflow in which clinicians collaborate with algorithmic systems to perform diagnoses, to determine the circumstances under which each means of interpretability stands to provide the greatest utility to clinicians. One promising methodological avenue of future work consists of marrying methods from interpretability with the burgeoning field of causal inference269 to learn interpretations that are based on causal relationships in the data when such relationships are statistically identifiable.270-272 To our knowledge, these methods have yet to make their way into the computational histopathology literature. Such approaches may improve confidence that a model's interpretation is correctly characterizing the biological pathways linking WSI features and diagnostic outcomes, which—beyond improving trust in our predictive models—may improve our scientific understanding of the biological mechanisms relating observed WSI features with clinical outcomes.

4.4 On the rise of large generative models

The discovery of scaling laws273 for transformer-based models of natural language text has ushered in a new era of LLMs. Models in natural language processing were previously bespoke, with a single model trained on a dataset to solve a specific task. By scaling models to hundreds of billions of parameters, researchers found that LLMs exhibit the ability to solve different natural language tasks with little to no supervision. In parallel, large-scale diffusion models274 have democratized the generation of high-resolution image data from text prompts as short as a single sentence. The ramifications of this technology are only just being explored in the context of medicine,275 but the next half decade will inevitably see these models find utility in CPATH.

5 DISCUSSION

To change patient care, a good ML model alone is insufficient. Equally important is the smooth integration of the model into the clinical workflow, an endeavor that intersects computational histopathology with human-computer interaction. Indeed, creating reliable software tools for pathologists and oncologists would require a rethink of how hospital infrastructure is organized. As hospitals and clinics move toward an entirely digitized pathology workflow, there is an opportunity to create new assistive clinical decision support tools using computational histopathology. This will require the implementation of high-throughput storage for digitized histopathology slides, fast interoperability with hospital electronic medical record systems, and (local or cloud-based) high-performance compute to run ML models in real time.

In summary, the increasing digitization of pathology workflows, alongside the rapid pace of advances in ML, holds promise for accelerating scientific discovery and for the creation of assistive tools for oncologists across a variety of cancers. As the field moves from research to translation and deployment, there is a need to recognize the ultimate end-uses of predictive systems within the clinical workflow and translate the technical requirements a system must satisfy into research challenges. The clinical translation of these tools and technologies will require pathologists, oncologists, computer scientists, hospital administrators, and regulatory agencies to collaborate and develop an environment where clinicians can utilize such tools safely and effectively.

ACKNOWLEDGMENTS

The authors thank Richard J. Chen for many helpful discussions, comments on the manuscript, and help framing the introduction.

FUNDING INFORMATION

This study was supported by the AI Chair Award, Canadian Institute for Advanced Research and the Health Systems Impact Fellowship, Canadian Institutes of Health Research.
