Quantitative Analysis of Histological Staining Using Color Deconvolution

Author(s)	Diana Chiang Jurado
Reviewers

Overview
Questions:

How can I quantify the percentage of stained area in histological images?

How does color deconvolution separate individual stain components from brightfield microscopy images?

How can I apply this workflow to IHC stained tissue sections?

Objectives:

Apply color deconvolution to separate stain channels in histological images

Extract and isolate the stain channel of interest (e.g. DAB)

Apply automatic thresholding to distinguish stained from unstained regions

Calculate the percentage of positively stained area relative to total tissue area

Interpret quantitative staining results across multiple images

Requirements:

Introduction to Galaxy Analyses

tutorial Hands-on: FAIR Bioimage Metadata

tutorial Hands-on: REMBI - Recommended Metadata for Biological Images – metadata guidelines for bioimaging data

Time estimation: 3 hours

Supporting Materials:

Datasets

Workflows

FAQs

Published: Jun 15, 2026

Last modification: Jun 21, 2026

License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT

purl PURL: https://gxy.io/GTN:T00581

version Revision: 2

Manually scoring histological staining across dozens of images is time-consuming and subjective. Two researchers looking at the same slide may reach different conclusions about the amount of staining. Computational automated quantification solves this problem: it applies the same criteria to every image, produces a numeric result, and scales to large datasets without additional effort.

This tutorial walks you through a Galaxy workflow that quantifies stained area in brightfield histological images, from a raw microscopy image to a final table of percentages ready for statistical analysis. The workflow also includes an optional step that detects individual stained regions and exports them as polygon ROIs, which can be uploaded to image management platforms like OMERO for visual validation, annotation, and collaborative review.

The approach is built around color deconvolution, a technique that mathematically separates overlapping stain signals so you can measure each one independently.

In this tutorial, you will work with IHC (Immunohistochemistry), detecting CD11b-positive myeloid cells using a DAB chromogen.

Comment: Workflow applicability

This tutorial uses IHC (CD11b/DAB) images as the working example. The same workflow applies directly to Masson’s Trichrome (MT) staining for collagen quantification. The only difference is selecting the channel index that corresponds to the aniline blue instead of DAB in Step 2. MT support will be added as an extended version of this tutorial in the future.

Agenda

In this tutorial, we will cover:

Background: What Is Color Deconvolution and Why Do We Need It?

The Dataset: Cardiac Tissue After Myocardial Infarction

Data Upload

Step 1 — Normalization

Step 2 — Color Deconvolution

Step 3 — Split Channels and Extract the Stain of Interest

Step 4 — Capture Total Image Area

Step 5 — Threshold the Stain Channel

Step 6 — Generate ROIs for Visual Validation

Step 7 — Extract Quantitative Features

Step 8 — Compile Results and Calculate Percent Stained Area

Merge the Feature Results

Add the Total Area Column

Calculate Percent Stained Area

Conclusion

Background: What Is Color Deconvolution and Why Do We Need It?

When two stains are applied to the same tissue section, for example, hematoxylin (blue) and DAB (brown) in IHC, their colors overlap in the RGB image. This means that one cannot simply threshold on “brownness” or “blueness” because the RGB channels mix all stain signals together.

Color deconvolution solves this by using a stain matrix: a set of vectors that describe how much each stain absorbs light in the red, green, and blue channels. The algorithm inverts this matrix to compute the optical density of each stain independently at every pixel, producing one grayscale image per stain component. Higher pixel values in the output correspond to stronger staining.

In this tutorial, we use the H-E-DAB (HED) preset:

Channel	Index	IHC interpretation
Channel 1	0	Hematoxylin (nuclei, counterstain)
Channel 2	1	DAB — stain of interest
Channel 3	2	Residual

A stain vector describes how much a given stain absorbs light in each of the three RGB color channels. For example, DAB absorbs strongly in the blue channel and weakly in the red channel, while hematoxylin absorbs more in the red and green channels. These values are determined empirically from pure stain reference images or taken from published standards. The color deconvolution algorithm uses these vectors to solve a system of linear equations and separate the mixed stain signals pixel by pixel.

For most standard IHC images with a clean DAB + hematoxylin combination, the HED preset is sufficient. However, when staining is uneven, the signal is weak, or there is significant spectral overlap, a data-driven approach may give better results.

Non-negative Matrix Factorization (NMF) learns the stain components directly from the image data, without relying on predefined stain vectors. This makes it more flexible when staining deviates from standard reference spectra, for example, due to differences in staining protocols, tissue processing, or scanner calibration. To use NMF, select Non-negative matrix factorization as the transformation type. Since NMF components are not labeled by stain name, you will need to visually inspect the output channels to identify which one corresponds to your stain of interest.

The Dataset: Cardiac Tissue After Myocardial Infarction

The images in this tutorial come from a study (Rettkowski et al. 2025) investigating the role of 4-oxo retinoic acid (4-oxo RA) in maintaining hematopoietic stem cell dormancy after myocardial infarction (MI). Understanding this biological context helps interpret what the numbers mean.

The experimental model: Mouse hearts were subjected to LAD coronary artery ligation to induce MI. Animals were treated with 4-oxo RA or with the vehicle solution (DSMO) after MI. Serial cardiac sections were stained to assess the post-infarction immune response.

IHC for CD11b detects myeloid leukocytes (monocytes, macrophages, neutrophils), a readout of local inflammation. Higher CD11b-positive area = more immune cell infiltration after MI.

The hypothesis in the study is that 4-oxo RA reduces immune cell mobilization from bone marrow, leading to less inflammatory infiltration (reduced presence of CD11b+ cells positive stained regions) that would lead to less fibrosis and prevent adverse cardiac remodeling concluding on preserved cardiac function.

What you are working with: In the original study, regions of interest (ROIs) were manually selected from whole-slide images to focus on specific anatomical zones (Infarct, Border, Remote) and exclude tissue artifacts. For this tutorial, we will keep it simple and focus on analyzing the whole-slide image and process the data as a list of collection allowing you to focus entirely in understanding the analysis and quantification workflow.

Here is an example of what the raw input images look like:

Example IHC image showing CD11b staining in brown (DAB) with hematoxylin counterstain in blue. The brown signal marks myeloid cell infiltration in cardiac tissue post-MI. — **Figure 1**: whole-slide image IHC

Example Masson's Trichrome image showing collagen in blue (aniline blue), muscle in red (Biebrich scarlet), and nuclei in dark purple. Blue area reflects fibrotic remodeling. — **Figure 2**: Example input: Masson's Trichrome staining in cardiac tissue after myocardial infarction.

Comment: Sample data vs. full dataset

The images provided here are a representative subset of the full dataset from Rettkowski et al. 2025. In practice, workflows like this one are run across large image batches spanning multiple experiments, animals, conditions, and anatomical zones. The workflow you will run here is similar to what was applied at scale in the original study. The Masson’s Trichrome image is shown for reference. MT quantification follows the same workflow and will be covered in a future version of this tutorial.

Data Upload

Hands On: Upload your images
Create a new history and name it something meaningful, e.g. Histology Stain Quantification
Import the images from Zenodo or from the shared data library (GTN - Material → imaging → Quantitative Analysis of Histological Staining Using Color Deconvolution):

Tx (treated)

Vehicle (DMSO or untreated)
https://zenodo.org/records/20629365/files/Tx_Sample1.tiff
https://zenodo.org/records/20629365/files/Tx_Sample2.tiff
https://zenodo.org/records/20629365/files/Tx_Sample3.tiff
https://zenodo.org/records/20629365/files/Tx_Sample4.tiff
https://zenodo.org/records/20629365/files/Tx_Sample5.tiff
https://zenodo.org/records/20629365/files/Tx_Sample6.tiff
https://zenodo.org/records/20629365/files/Tx_Sample7.tiff
https://zenodo.org/records/20629365/files/Tx_Sample8.tiff
https://zenodo.org/records/20629365/files/Tx_Sample9.tiff
https://zenodo.org/records/20629365/files/Tx_Sample10.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample1.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample2.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample3.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample4.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample5.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample6.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample7.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample8.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample9.tiff
https://zenodo.org/records/20629365/files/Vehicle_Sample10.tiff
Copy the link location

Click galaxy-upload Upload at the top of the activity panel

Select galaxy-wf-edit Paste/Fetch Data

Paste the link(s) into the text field

Press Start

Close the window

As an alternative to uploading the data from a URL or your computer, the files may also have been made available from a shared data library:

Go into Libraries (left panel)

Navigate to the correct folder as indicated by your instructor.

On most Galaxies tutorial data will be provided in a folder named GTN - Material –> Topic Name -> Tutorial Name.

Select the desired files

Click on Add to History galaxy-dropdown near the top and select as Datasets from the dropdown menu

In the pop-up window, choose

“Select history”: the history you want to import the data to (or create a new one)

Click on Import
Rename the datasets with descriptive names if needed (e.g. IHC_TXsample1.tiff or IHC_Vehiclesample1.tiff). In total, there should be 20 TIFF images, 10 for each group.

Check that the datatype is tiff

Click on the galaxy-pencil pencil icon for the dataset to edit its attributes

In the central panel, click galaxy-chart-select-data Datatypes tab on the top

In the galaxy-chart-select-data Assign Datatype, select tiff from “New Type” dropdown

Tip: you can start typing the datatype into the field to filter the dropdown menu

Click the Save button

Organize your images into a dataset collection. Collections allow the workflow to process all images in a single run.

Click on galaxy-selector Select Items at the top of the history panel

Check all the datasets in your history you would like to include

Click n of N selected and choose Advanced Build List

You are in collection building wizard. Choose Flat List and click ‘Next’ button at the right bottom corner.

Double clcik on the file names to edit. For example, remove file extensions or common prefix/suffixes to cleanup the names.

Enter a name for your collection

Click Build to build your collection

Click on the checkmark icon at the top of your history again
Comment: Image requirements

Images must be brightfield RGB microscopy images in TIFF format. Fluorescence images are not compatible with color deconvolution. Avoid images with strong artifacts, out-of-focus regions, or uneven illumination, as these reduce quantification accuracy. Therefore, it is strongly recommended that you inspect your images one-by-one before proceeding to the pre-processing steps.

The workflow associated with this tutorial requires a dataset collection as input. The steps described below are explained individually for clarity, but in Galaxy, all images are processed together in a single batch run. For this reason, make sure your images are named descriptively before building the collection — the final results table will use those names to identify each sample.

Click on galaxy-selector Select Items at the top of the history panel

Check all the datasets in your history you would like to include

Click n of N selected and choose Advanced Build List

You are in collection building wizard. Choose Flat List and click ‘Next’ button at the right bottom corner.

Double clcik on the file names to edit. For example, remove file extensions or common prefix/suffixes to cleanup the names.

Enter a name for your collection

Click Build to build your collection

Click on the checkmark icon at the top of your history again

Step 1 — Normalization

Variations in staining intensity across images due to differences in staining batches, slide preparation, or scanner settings, can affect the consistency of downstream quantification. To reduce this variability, we apply histogram equalization as a preprocessing step before color deconvolution.

We use CLAHE (Contrast Limited Adaptive Histogram Equalization), which enhances local contrast without amplifying noise, making stain signals more consistent across the image batch.

Comment: About stain normalization

More advanced stain normalization methods, such as Macenko or Reinhard normalization, can standardize color appearance across slides by transferring the stain profile of a reference image to all others. These methods are not currently available as Galaxy tools, but we are working on adding them. If you have access to them in the meantime (e.g. via Python or QuPath), applying them before this workflow may further improve consistency across batches.

Hands On: Apply histogram equalization

Perform histogram equalization ( Galaxy version 0.25.2+galaxy0) with the following parameters:

param-collection “Input image”: your image collection

“Histogram equalization algorithm”: CLAHE

Your normalized output should look similar to this example. Notice how the contrast is more balanced compared to the raw input, with stain signals appearing more uniform across the tissue:

Example output after CLAHE histogram equalization. — **Figure 3**: Example output after CLAHE normalization.

Step 2 — Color Deconvolution

This step separates the mixed stain signals in your brightfield image into individual channels. We use the H-E-DAB (HED) color space for this dataset. Note that the staining in these images does not follow a typical H-DAB pattern, using the H-DAB option would yield inaccurate results. If your own images present standard H-DAB coloring, you can use the H-DAB deconvolution option instead.

Hands On: Run color deconvolution

Perform color deconvolution or transformation ( Galaxy version 0.9+galaxy0) with the following parameters:

param-collection “Input image”: your image collection

“Transformation type”: Deconvolve RGB into Hematoxylin + Eosin + DAB

Comment: Output

Each input image produces one multi-channel TIFF. Each channel is a grayscale image where brighter pixels indicate higher optical density (stronger staining) for that component. You will extract the relevant channel in the next step.

The figure below shows what to expect after deconvolution for an IHC image. The DAB channel (right) clearly isolates the brown signal from the blue hematoxylin counterstain:

Color deconvolution output for IHC. — **Figure 4**: Color deconvolution output for IHC (CD11b/DAB).

Step 3 — Split Channels and Extract the Stain of Interest

The deconvolution output is a multi-channel TIFF containing one channel per stain component. You need to split it into individual single-channel images and then select the one that corresponds to your stain of interest.

Hands On: Split the multi-channel image

Split image along axes ( Galaxy version 2.3.5+galaxy0) with the following parameters:

param-collection “Image to split”: output of Perform color deconvolution or transformation tool

This produces a collection of three single-channel grayscale images — one per stain component. Now extract the DAB channel:

Hands On: Extract the DAB channel (IHC)

Extract dataset with the following parameters:

param-collection “Input List”: output of Split image along axes tool

“How should a dataset be selected?”: Select by index

“Element index”: 1

Comment: Why index 1?

The split collection is zero-indexed. In the HED preset, Channel 2 (DAB) corresponds to index 1.

Isolated DAB channel after splitting the deconvolution output. The grayscale image shows the CD11b/DAB signal separated from the other stain components. — **Figure 5**: Isolated DAB channel (index 1) after color deconvolution and split.

Question

What does a high pixel intensity value mean in the deconvolved DAB channel?

What index would you use to extract the hematoxylin channel from an HED deconvolution?

Higher pixel values in the deconvolved channel represent stronger staining (higher optical density of that dye at that location). Darker brown regions in the original IHC image become bright pixels in the DAB channel.

Hematoxylin is Channel 1, so you would use index 0.

Step 4 — Capture Total Image Area

To calculate the percentage of stained area, you need two numbers: how many pixels are stained, and how many pixels make up the whole tissue image. The stained pixels will be measured later from the thresholded image. The workflow performs this step automatically, but here we will go through it together to understand what information is being extracted and why image dimensions matter for computing the total area.

Hands On: Get image dimensions from the original image

Show image info ( Galaxy version 5.7.1+galaxy1) with the following parameters:

param-collection “Input Image”: your original image collection (the same collection you provided as input to color deconvolution)

Comment: Why use the original image here?

We extract dimensions from the raw input image (not the deconvolved output) because the original image faithfully represents the full tissue area captured by the microscope. This ensures the total area denominator is correct regardless of any processing applied downstream.

Select to extract the image width:

param-collection “Select lines from”: output of Show image info tool

“the pattern”: Width =

Select to extract the image height:

param-collection “Select lines from”: output of Show image info tool

“the pattern”: Height =

Text transformation ( Galaxy version 9.5+galaxy3) to isolate the width value:

param-collection “File to process”: output of Select (Width) tool

“SED Program”: s/.*= //

Comment: What does this expression do?

The image info tool returns lines like Width = 1024. The sed expression s/.*= // strips everything up to and including = , leaving just the number. This is necessary so it can be used in arithmetic downstream.

Text transformation ( Galaxy version 9.5+galaxy3) to isolate the height value:

param-collection “File to process”: output of Select (Height) tool

“SED Program”: s/.*= //

Paste to combine width and height side by side:

param-collection “Paste”: output of Text transformation (width) tool

param-collection “and”: output of Text transformation (height) tool

Compute ( Galaxy version 2.1+galaxy0) to calculate total pixel area (width × height):

param-collection “Input file”: output of Paste tool

“Input has a header line with column names?”: No

In “Expressions” → “Insert Expressions”:

“Add expression”: c1 * c2

“If an expression cannot be computed for a row”: Fail the entire tool run

Question

An image is 1024 × 768 pixels. What is the total pixel area, and why does it matter?

Total pixel area = 1024 × 768 = 786,432 pixels. This is the denominator in the percentage formula: (stained pixels / total pixels) × 100. Without it you can count stained pixels in absolute terms but cannot compare meaningfully across images of different sizes.

Step 5 — Threshold the Stain Channel

Now you will convert the extracted grayscale channel into a binary mask: pixels are classified as either stained (value = 1) or unstained (value = 0). This is the foundation for measuring stained area, and a very important step.

Hands On: Apply Otsu thresholding

Threshold image ( Galaxy version 0.25.2+galaxy0) with the following parameters:

param-collection “Input image”: output of Extract dataset tool (your extracted stain channel)

“Thresholding method”: Globally adaptive / Otsu

Otsu’s method automatically finds the threshold that best separates the two pixel populations (stained and unstained) by minimizing the variance within each group. Because it adapts to each image’s intensity distribution, you do not need to set a manual value per image, making the results more consistent across a large batch. Other thresholding methods are also available in the tool, so feel free to select the one that best fits your images. Additionally, if you would like to restrict the threshold for positive pixel detection, you can adjust the offset value slightly. A value between 0.0 and 0.01 is a good starting point.

Thresholding output for IHC. Binary Otsu mask where white pixels represent DAB-positive area. — **Figure 6**: Otsu thresholding output for IHC (CD11b/DAB).

Step 6 — Generate ROIs for Visual Validation

Before moving to quantification, it is good practice to visually verify that the thresholded mask captures what you expect. This step detects stained regions in the binary mask and generates polygon ROIs around them. In IHC images like ours, the CD11b-positive cells are small and scattered across the tissue, so the resulting ROIs appear as small point-like outlines that can be difficult to spot in the Galaxy image viewer at full scale. Zooming in on the outline image will help you inspect them more clearly. For a richer validation experience, the ROI files can be uploaded to OMERO and overlaid directly on the original images. At full scale (left), the yellow outlines mark the overall distribution of detected DAB-positive regions. Zooming in (right) makes individual ROIs much easier to identify, as shown below.

IHC image with detected ROI polygons overlaid in yellow in OMERO. Left: full image scale showing the overall distribution of CD11b-positive detections. Right: zoomed view showing individual ROIs corresponding to small DAB-positive cells scattered across the cardiac tissue. — **Figure 7**: ROI overlay in OMERO at full scale and zoomed view for visual validation of detected stained regions.

Hands On: Detect stained regions and generate ROIs

Analyze particles ( Galaxy version 20240614+galaxy0) with the following parameters:

param-collection “Select image”: output of Threshold image tool

“Black background”: Yes

“Size (pixel^2)”: 2-Infinity

“Show”: Outlines

“Export particles outlines coordinates”: Yes

Comment: What does the size filter do?

Setting a minimum particle size of 2 pixels excludes very small specks that are likely noise or staining artifacts rather than real signal. Adjust this value depending on the scale and resolution of your images.

This produces two outputs per image: an outline image showing detected particles, and a tabular file with ROI coordinates. The table lists each detected region as a polygon with its corresponding pixel coordinates, label, and timepoint/z-slice index, as shown below.

Example ROI coordinate table output from Analyze Particles, showing polygon shapes with pixel coordinates for each detected stained region. — **Figure 8**: ROI coordinate table output from Analyze Particles.

Use the outline image to quickly check whether the detected regions match the staining you see in the original image before proceeding to quantification.

Outline image showing detected DAB-positive particles drawn over the binary mask. — **Figure 9**: Outline image showing detected particles from Analyze Particles.

If your facility uses OMERO for image management, the ROI files generated here can be uploaded alongside the original whole-slide images to overlay and visually validate your threshold results at scale. For a full walkthrough of how to work with OMERO in Galaxy, see the Overview of the Galaxy OMERO-suite tutorial.

Step 7 — Extract Quantitative Features

With a validated binary mask, you can now measure the stained area. The feature extraction tool measures properties of the labeled regions in the mask, using the grayscale stain channel for intensity information.

Hands On: Measure stained pixel area and intensity

Extract image features ( Galaxy version 0.25.2+galaxy1) with the following parameters:

param-file “Label map”: output of Threshold image tool

“Features to compute”: Use the intensity image to compute additional features

param-file “Intensity image”: output of Extract dataset tool

“Available features”: Label from the label map, Area, Filled area, Mean intensity

Comment: What do these features mean?

Label from the label map — the pixel class (1 = stained region)

Area — the number of stained pixels (this is what you will use for the percentage)

Filled area — area including any internal holes in the stained region

Mean intensity — average pixel intensity within the stained region, which can complement the area measurement as a proxy for staining strength

Additional features such as perimeter, centroid, or eccentricity are also available and can be selected depending on your analysis needs.

Step 8 — Compile Results and Calculate Percent Stained Area

You now have, for each image: the stained pixel area (from feature extraction) and the total pixel area (from image dimensions). This final section merges all per-sample results into a single table and calculates the percentage.

Merge the Feature Results

Hands On: Collapse per-image results into one table

Extract dataset to get the first dataset from the feature collection (used to extract the header row):

param-collection “Input List”: output of Extract image features tool

“How should a dataset be selected?”: The first dataset

Select first to isolate the header row:

“Select first”: 1

param-collection “from”: output of Extract dataset tool

Select last to extract the data row from each image’s feature file:

“Select last”: 1

param-collection “from”: output of Extract image features tool

Extract element identifiers ( Galaxy version 0.0.2) to get sample file names:

param-collection “Dataset collection”: output of Extract image features tool

Collapse Collection ( Galaxy version 5.1.0) to merge all data rows into one file:

param-collection “Collection of files to collapse”: output of Select last tool

Create text file ( Galaxy version 9.5+galaxy3) to create a sample_id column header:

“Line”: sample_id

Paste to combine the sample_id header with the feature header:

param-file “Paste”: output of Create text file (sample_id) tool

param-file “and”: output of Select first (feature header) tool

Paste to combine sample names with their corresponding data rows:

param-file “Paste”: output of Extract element identifiers tool

param-file “and”: output of Collapse Collection tool

Concatenate multiple datasets or collections to build the full table with header:

param-file “Concatenate Dataset”: output of Paste (header row) tool

In “Dataset” → “Insert Dataset”:

param-file “Select”: output of Paste (data rows) tool

Add the Total Area Column

Hands On: Merge the total area values

Cut to extract the total area column (column 3) from the computed area file:

“Cut columns”: c3

param-collection “From”: output of Compute (width × height) tool

Collapse Collection ( Galaxy version 5.1.0) to merge the total area values across samples:

param-collection “Collection of files to collapse”: output of Cut tool

Create text file ( Galaxy version 9.5+galaxy3) to create a total_area column header.

“Line”: total_area

Concatenate datasets ( Galaxy version 9.5+galaxy3) to prepend the header to the total area values:

param-file “Datasets to concatenate”: output of Create text file (total_area) tool

In “Dataset” → “Insert Dataset”:

param-file “Select”: output of Collapse Collection (total area) tool

Paste to join the feature results table with the total area column:

param-file “Paste”: output of Concatenate multiple datasets or collections tool

param-file “and”: output of Concatenate datasets tool

Calculate Percent Stained Area

You now have a single table with all the values you need. The final step divides the stained area by the total area and multiplies by 100.

Hands On: Compute percent stained area

Compute ( Galaxy version 2.1+galaxy0) with the following parameters:

param-file “Input file”: output of Paste (full table) tool

“Input has a header line with column names?”: Yes

In “Expressions” → “Insert Expressions”:

“Add expression”: c4 / c6 * 100

“The new column name”: percent_area

“If an expression cannot be computed for a row”: Fail the entire tool run

Comment: Understanding the formula

c4 = stained pixel area (from feature extraction); c6 = total pixel area (width × height). Dividing and multiplying by 100 expresses the result as a percentage. If your column order differs, verify the column numbers by inspecting the merged table before running this step.

Your final output is a TSV table with one row per sample, containing: sample_id, label, area, area_filled, mean_intensity, total_area, and percent_area. This file is ready for downstream statistical analysis:

sample_id	label	mean_intensity	area	area_filled	total_area	percent_area
Tx_Sample1	255	0.682	3653.0	3686.0	5065984	0.072
Tx_Sample2	255	0.734	6574.0	6593.0	5065984	0.130
Tx_Sample3	255	0.768	1396.0	1398.0	5065984	0.028
Tx_Sample4	255	0.688	7184.0	7228.0	5065984	0.142
Tx_Sample5	255	0.678	18245.0	18371.0	5065984	0.360
Tx_Sample6	255	0.610	31943.0	32352.0	5065984	0.631
Tx_Sample7	255	0.639	34089.0	34435.0	5065984	0.673
Tx_Sample8	255	0.755	5271.0	5282.0	5065984	0.104
Tx_Sample9	255	0.763	2350.0	2356.0	5065984	0.046
Tx_Sample10	255	0.763	2099.0	2100.0	5065984	0.041
Vehicle_Sample1	255	0.637	55764.0	58010.0	5065984	1.101
Vehicle_Sample2	255	0.621	28170.0	28328.0	5065984	0.556
Vehicle_Sample3	255	0.643	107793.0	110878.0	5065984	2.128
Vehicle_Sample4	255	0.652	40870.0	41626.0	5065984	0.807
Vehicle_Sample5	255	0.534	40760.0	41651.0	5065984	0.805
Vehicle_Sample6	255	0.575	26149.0	26844.0	5065984	0.516
Vehicle_Sample7	255	0.642	2571.0	2582.0	5065984	0.051
Vehicle_Sample8	255	0.650	16919.0	17185.0	5065984	0.334
Vehicle_Sample9	255	0.674	29399.0	29698.0	5065984	0.580
Vehicle_Sample10	255	0.708	27675.0	28060.0	5065984	0.546

Question

Looking at the table, what is the percent stained area for Tx_Sample6, and how do you calculate it from the values provided?

Looking at the results table, which group shows a higher CD11b-positive area — Tx or Vehicle? What does this suggest biologically?

Vehicle_Sample3 has a notably higher percent area (2.128%) compared to the other Vehicle samples. How might this affect the interpretation of the group comparison?

Tx_Sample6 has an area of 31,943 pixels and a total area of 5,065,984 pixels. The calculation is: (31,943 / 5,065,984) × 100 ≈ 0.631%. This is the highest value in the Tx group, but still well below most Vehicle samples, which range from 0.334% to 2.128%.

The Vehicle (DMSO) group shows consistently higher CD11b-positive area, with a mean of ~0.74% compared to ~0.22% in the Tx group. This suggests that 4-oxo RA treatment reduced myeloid cell infiltration into the infarcted myocardium, consistent with the hypothesis in Rettkowski et al. 2025 that the treatment suppresses immune cell mobilization from bone marrow after MI. Importantly, it is not possible to know only through histology which of these myeloid cells are infiltrated ones and which are local, as both are present after infarction, but we can clearly see that there is a reduced percentage of stain in the treated group.

Vehicle_Sample3 is an outlier at 2.128%, more than double the next highest Vehicle value. While the overall trend still holds. All but one Vehicle sample exceeds the Tx group mean, this variability reflects the biological heterogeneity typical of in vivo infarction models. It is important to test for outliers of the final data results and report variability transparently and consider robust statistical tests when comparing groups with unequal variance, such as the Welch t-test.

Conclusion

You have now built and run a complete image analysis workflow that takes raw histological images and produces a quantitative, reproducible measure of staining coverage. Here is a summary of what each step contributes:

Step	What it does	Why it matters
Normalization	Applies CLAHE histogram equalization to each image	Reduces variability in staining intensity across the batch
Color deconvolution	Separates mixed stain signals into individual channels	Isolates DAB from the hematoxylin counterstain
Channel extraction	Selects the DAB channel (index 1)	Targets the CD11b signal of interest
Image dimensions	Captures total pixel area from the original image	Provides the denominator for the percentage
Otsu thresholding	Converts the stain channel into a binary stained/unstained mask	Automated, consistent across images
Analyze particles	Generates ROIs around detected stained regions	Enables visual validation of the threshold
Feature extraction	Measures area and intensity of stained pixels	Produces the numerator for the percentage
Final table + compute	Merges results and calculates percent stained area	Delivers a ready-to-analyze summary

The CD11b-positive area quantified from IHC images reflects myeloid cell infiltration in infarcted cardiac tissue — an objective, reproducible measure that can be compared across experimental groups and applied consistently at scale, as demonstrated in Rettkowski et al. 2025.

Comment: Extending this workflow to other staining types

The same workflow applies to Masson’s Trichrome and other HED-compatible stains. To quantify collagen with MT, simply change the channel index in Step 2 from 1 (DAB) to 2 (aniline blue). No other steps need to change.

Overview of the complete stain quantification workflow from raw image to percent stained area table. — **Figure 10**: Workflow overview: from raw histological image to percent stained area.

You've Finished the Tutorial

Key points

Color deconvolution mathematically separates RGB images into individual stain components based on known absorption spectra

The H-E-DAB (HED) preset is used for IHC; channel selection determines the stain of interest

Otsu thresholding provides an automated, reproducible way to distinguish stained from unstained pixels

The percent stained area is calculated as (stained pixels / total tissue pixels) × 100

This workflow accepts a collection of images for batch quantification across multiple samples

Frequently Asked Questions

Have questions about this tutorial? Have a look at the available FAQ pages and support channels

Useful literature

Further information, including links to documentation and original publications, regarding the tools, analysis techniques and the interpretation of results described in this tutorial can be found here.

References

Rettkowski, J., M. C. Romero-Mulero, I. Singh, C. Wadle, J. Wrobel et al., 2025 Modulation of bone marrow haematopoietic stem cell activity as a therapeutic strategy after myocardial infarction: a preclinical study. Nature Cell Biology 27: 591–604. 10.1038/s41556-025-01639-4

Feedback

Did you use this material as an instructor? Feel free to give us feedback on how it went.
Did you use this material as a learner or student? Click the form below to leave feedback.

Citing this Tutorial

Diana Chiang Jurado, Quantitative Analysis of Histological Staining Using Color Deconvolution (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/imaging/tutorials/stain-quantification-color-deconvolution/tutorial.html Online; accessed TODAY
Hiltemann, Saskia, Rasche, Helena et al., 2023 Galaxy Training: A Powerful Framework for Teaching! PLOS Computational Biology 10.1371/journal.pcbi.1010752
Batut et al., 2018 Community-Driven Data Analysis Training for Biology Cell Systems 10.1016/j.cels.2018.05.012

@misc{imaging-stain-quantification-color-deconvolution,
author = "Diana Chiang Jurado",
	title = "Quantitative Analysis of Histological Staining Using Color Deconvolution (Galaxy Training Materials)",
	year = "",
	month = "",
	day = "",
	url = "\url{https://training.galaxyproject.org/training-material/topics/imaging/tutorials/stain-quantification-color-deconvolution/tutorial.html}",
	note = "[Online; accessed TODAY]"
}
@article{Hiltemann_2023,
	doi = {10.1371/journal.pcbi.1010752},
	url = {https://doi.org/10.1371%2Fjournal.pcbi.1010752},
	year = 2023,
	month = {jan},
	publisher = {Public Library of Science ({PLoS})},
	volume = {19},
	number = {1},
	pages = {e1010752},
	author = {Saskia Hiltemann and Helena Rasche and Simon Gladman and Hans-Rudolf Hotz and Delphine Larivi{\`{e}}re and Daniel Blankenberg and Pratik D. Jagtap and Thomas Wollmann and Anthony Bretaudeau and Nadia Gou{\'{e}} and Timothy J. Griffin and Coline Royaux and Yvan Le Bras and Subina Mehta and Anna Syme and Frederik Coppens and Bert Droesbeke and Nicola Soranzo and Wendi Bacon and Fotis Psomopoulos and Crist{\'{o}}bal Gallardo-Alba and John Davis and Melanie Christine Föll and Matthias Fahrner and Maria A. Doyle and Beatriz Serrano-Solano and Anne Claire Fouilloux and Peter van Heusden and Wolfgang Maier and Dave Clements and Florian Heyl and Björn Grüning and B{\'{e}}r{\'{e}}nice Batut and},
	editor = {Francis Ouellette},
	title = {Galaxy Training: A powerful framework for teaching!},
	journal = {PLoS Comput Biol}
}

                   

Funding

These individuals or organisations provided funding support for the development of this resource

NFDI4Bioimage

NFDI4BIOIMAGE is a nationwide community of researchers, imaging specialists and data experts who work with biological image data. We support anyone who generates, analyses or manages microscopy and bioimaging data. Our aim is to make imaging data easier to find, access, reuse and share by providing practical tools, clear guidance and hands on training. We collaborate closely with laboratories and research communities across Germany to understand their needs and develop solutions that help with real everyday challenges. Whether you are building new workflows, exploring advanced analysis methods or improving your data management, NFDI4BIOIMAGE offers support, expertise and a welcoming network.

UFR

Congratulations on successfully completing this tutorial!

You can use Ephemeris's shed-tools install command to install the tools used in this tutorial.

shed-tools install [-g GALAXY] [-a API_KEY] -t <(curl https://training.galaxyproject.org/training-material/api/topics/imaging/tutorials/stain-quantification-color-deconvolution/tutorial.json | jq .admin_install_yaml -r)

Alternatively you can copy and paste the following YAML

---
install_tool_dependencies: true
install_repository_dependencies: true
install_resolver_dependencies: true
tools:
- name: text_processing
  owner: bgruening
  revisions: ab83aa685821
  tool_panel_section_label: Text Manipulation
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: text_processing
  owner: bgruening
  revisions: ab83aa685821
  tool_panel_section_label: Text Manipulation
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: text_processing
  owner: bgruening
  revisions: ab83aa685821
  tool_panel_section_label: Text Manipulation
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: column_maker
  owner: devteam
  revisions: 61f9ddbc63ca
  tool_panel_section_label: Text Manipulation
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: 2d_auto_threshold
  owner: imgteam
  revisions: 2ee04d2ebdcf
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: 2d_feature_extraction
  owner: imgteam
  revisions: 519fad2c552a
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: 2d_histogram_equalization
  owner: imgteam
  revisions: 99e0ef91ea5e
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: color_deconvolution
  owner: imgteam
  revisions: 0cbdf78fee14
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: image_info
  owner: imgteam
  revisions: f8b4eada923c
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: image_info
  owner: imgteam
  revisions: e28851b24032
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: imagej2_analyze_particles_binary
  owner: imgteam
  revisions: 862af85a50ec
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: imagej2_analyze_particles_binary
  owner: imgteam
  revisions: 0601a9056642
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: split_image
  owner: imgteam
  revisions: 390943df8a35
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: split_image
  owner: imgteam
  revisions: 7191fd16988f
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: collection_element_identifiers
  owner: iuc
  revisions: d3c07d270a50
  tool_panel_section_label: Collection Operations
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: collection_element_identifiers
  owner: iuc
  revisions: 3e27acfa4830
  tool_panel_section_label: Collection Operations
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: collapse_collections
  owner: nml
  revisions: 90981f86000f
  tool_panel_section_label: Collection Operations
  tool_shed_url: https://toolshed.g2.bx.psu.edu/

No feedback has been recieved yet for this training. Be the first one by filling in the feedback form.