WSIReader¶
tiatoolbox.wsicore.wsireader.WSIReader
- class WSIReader(input_img, mpp=None, power=None)[source]¶
Base whole slide image (WSI) reader class.
This class defines functions for reading pixel data and metadata from whole slide image (WSI) files.
- input_img¶
Input path to WSI file.
- Type
- Parameters
input_img (
strorpathlib.Pathornumpy.ndarray) – Input path to WSI.mpp (
tupleorlistorNone, optional) – The MPP of the WSI. If not provided, the MPP is approximated from the objective power.power (
floatorNone, optional) – The objective power of the WSI. If not provided, the power is approximated from the MPP.
- Return type
None
Methods
Converts resolution value between different units.
Find optimal parameters for reading a rect at a given resolution.
Return an appropriate
WSIReaderobject.Read a region of the whole slide image within given bounds.
Read a region of the whole slide image at a location and size.
Read a region of the whole slide image (OpenSlide format args).
Generate image tiles from whole slide images.
Return the size of WSI at requested resolution.
Read the whole slide image thumbnail (1.25x by default).
Create a tissue mask and wrap it in a VirtualWSIReader.
Attributes
WSI metadata property.
- convert_resolution_units(input_res, input_unit, output_unit=None)[source]¶
Converts resolution value between different units.
This function accepts a resolution and its units in the input and converts it to all other units (‘mpp’, ‘power’, ‘baseline’). To achieve resolution in ‘mpp’ and ‘power’ units in theoutput, WSI meta data should cotain mpp and objective_power information, respectively.
- Parameters
input_res (float) – the resolution which we want to convert to
units. (the other) –
input_unit (str) – the unit of the input resolution (input_res). Acceptable
'mpp' (input_units are) –
'power' –
'baseline' –
'level'. (and) –
output_unit (str) – the desired unit to which we want to convert
are (the input_res. Acceptable values for output_unit) – ‘mpp’,
'power' –
provided (and 'baseline'. If output_unit is not) –
of (all) –
a (the conversions to all of the mentioned units will be returned in) –
dictionary. –
- Returns
either a float which is the converted input_res to the desired output_unit or a dictionary containing the converted input_res to all acceptable units (‘mpp’, ‘power’, ‘baseline’). If there is not enough meta data to calculate a unit (like mpp or power), they will be set to None in the dictionary.
- Return type
output_res (float or dictionary)
- find_read_rect_params(location, size, resolution, units, precision=3)[source]¶
Find optimal parameters for reading a rect at a given resolution.
Reading the image at full baseline resolution and re-sampling to the desired resolution would require a large amount of memory and be very slow. This function checks the other resolutions stored in the WSI’s pyramid of resolutions to find the lowest resolution (smallest level) which is higher resolution (a larger level) than the requested output resolution.
In addition to finding this ‘optimal level’, the scale factor to apply after reading in order to obtain the desired resolution is found along with conversions of the location and size into level and baseline coordinates.
- Parameters
location (tuple(int)) – Location in terms of the baseline image (level 0) resolution.
size (tuple(int)) – Desired output size in pixels (width, height) tuple.
resolution (float) – Desired output resolution.
units (str) – Units of scale, default = “level”. Supported units are: - microns per pixel (‘mpp’) - objective power (‘power’) - pyramid / resolution level (‘level’) - pixels per baseline pixel (“baseline”)
precision (int, optional) – Decimal places to use when finding optimal scale. See
find_optimal_level_and_downsample()for more.
- Returns
Parameters for reading the requested region -
int- Optimal read level -tuple- Read location in level coordinates- Return type
- property info: tiatoolbox.wsicore.wsimeta.WSIMeta¶
WSI metadata property.
This property is cached and only generated on the first call.
- Returns
An object containing normalized slide metadata
- Return type
- static open(input_img, mpp=None, power=None)[source]¶
Return an appropriate
WSIReaderobject.- Parameters
(str (input_img) – obj:WSIReader): Input to create a WSI object from. Supported types of input are: str and pathlib.Path which point to the location on the disk where image is stored,
numpy.ndarrayin which the input image in the form of numpy array (HxWxC) is stored, or :obj:WSIReader which is an already created tiatoolbox WSI handler. In the latter case, the function directly passes the input_imge to the output.pathlib.Path – obj:WSIReader): Input to create a WSI object from. Supported types of input are: str and pathlib.Path which point to the location on the disk where image is stored,
numpy.ndarrayin which the input image in the form of numpy array (HxWxC) is stored, or :obj:WSIReader which is an already created tiatoolbox WSI handler. In the latter case, the function directly passes the input_imge to the output.input_img (Union[str, pathlib.Path, numpy.ndarray]) –
mpp (Optional[Tuple[numbers.Number, numbers.Number]]) –
power (Optional[numbers.Number]) –
- Return type
- :param
numpy.ndarray: obj:WSIReader): Input to create a WSI object from. Supported types of input are: str and pathlib.Path which point to the location on the disk where image is stored,
numpy.ndarrayin which the input image in the form of numpy array (HxWxC) is stored, or :obj:WSIReader which is an already created tiatoolbox WSI handler. In the latter case, the function directly passes the input_imge to the output.
- Parameters
or – obj:WSIReader): Input to create a WSI object from. Supported types of input are: str and pathlib.Path which point to the location on the disk where image is stored,
numpy.ndarrayin which the input image in the form of numpy array (HxWxC) is stored, or :obj:WSIReader which is an already created tiatoolbox WSI handler. In the latter case, the function directly passes the input_imge to the output.mpp (tuple) – (x, y) tuple of the MPP in the units of the input image.
power (float) – Objective power of the input image.
input_img (Union[str, pathlib.Path, numpy.ndarray]) –
- Returns
an object with base
WSIReaderas base class.- Return type
Examples
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> wsi = get_wsireader(input_img="./sample.svs")
- read_bounds(bounds, resolution=0, units='level', interpolation='optimise', pad_mode='constant', pad_constant_values=0, coord_space='baseline', **kwargs)[source]¶
Read a region of the whole slide image within given bounds.
Bounds are in terms of the baseline image (level 0 / maximum resolution).
Reads can be performed at different resolutions by supplying a pair of arguments for the resolution and the units of resolution. If meta data does not specify mpp or objective_power then baseline units should be selected with resolution 1.0
The output image size may be different to the width and height of the bounds as the resolution will affect this. To read a region with a fixed output image size see
read_rect().- Parameters
bounds (tuple(int)) – By default, this is a tuple of (start_x, start_y, end_x, end_y) i.e. (left, top, right, bottom) of the region in baseline reference frame. However, with coord_space=”resolution”, the bound is expected to be at the requested resolution system.
resolution (int or float or tuple(float)) – resolution at which to read the image, default = 0. Either a single number or a sequence of two numbers for x and y are valid. This value is in terms of the corresponding units. For example: resolution=0.5 and units=”mpp” will read the slide at 0.5 microns per-pixel, and resolution=3, units=”level” will read at level at pyramid level / resolution layer 3.
units (str) – the units of resolution, default = “level”. Supported units are: microns per pixel (mpp), objective power (power), pyramid / resolution level (level), pixels per baseline pixel (baseline).
interpolation (str) – Method to use when resampling the output image. Possible values are “linear”, “cubic”, “lanczos”, “area”, and “optimse”. Defaults to ‘optimise’ which will use cubic interpolation for upscaling and area interpolation for downscaling to avoid moiré patterns.
pad_mode (str) – Method to use when padding at the edges of the image. Defaults to ‘constant’. See
numpy.pad()for available modes.coord_space (str) – default to “baseline”, this is a flag to indicate if the input bounds is in the baseline coordinate system (“baseline”) or is in the requested resolution system (“resolution”).
**kwargs (dict) – Extra key-word arguments for reader specific parameters. Currently only used by
VirtualWSIReader. See class docstrings for more information.pad_constant_values (Union[numbers.Number, Iterable[Tuple[numbers.Number, numbers.Number]]]) –
- Returns
array of size MxNx3 M=end_h-start_h, N=end_w-start_w
- Return type
Examples
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> from matplotlib import pyplot as plt >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> # Read a region at level 0 (baseline / full resolution) >>> bounds = [1000, 2000, 2000, 3000] >>> img = wsi.read_bounds(bounds) >>> plt.imshow(img) >>> # This could also be written more verbosely as follows >>> img = wsi.read_bounds( ... bounds, ... resolution=0, ... units="level", ... ) >>> plt.imshow(img)
Note: The field of view remains the same as resolution is varied when using
read_bounds().This is because the bounds are in the baseline (level 0) reference frame. Therefore, varying the resolution does not change what is visible within the output image.
If the WSI does not have a resolution layer corresponding exactly to the requested resolution (shown above in white with a dashed outline), a larger resolution is downscaled to achieve the correct requested output resolution.
If the requested resolution is higher than the baseline (maximum resultion of the image), then bicubic interpolation is applied to the output image.
- read_rect(location, size, resolution=0, units='level', interpolation='optimise', pad_mode='constant', pad_constant_values=0, coord_space='baseline', **kwargs)[source]¶
Read a region of the whole slide image at a location and size.
Location is in terms of the baseline image (level 0 / maximum resolution), and size is the output image size.
Reads can be performed at different resolutions by supplying a pair of arguments for the resolution and the units of resolution. If meta data does not specify mpp or objective_power then baseline units should be selected with resolution 1.0
The field of view varies with resolution. For a fixed field of view see
read_bounds().- Parameters
location (tuple(int)) – (x, y) tuple giving the top left pixel in the baseline (level 0) reference frame.
size (tuple(int)) – (width, height) tuple giving the desired output image size.
resolution (int or float or tuple(float)) – resolution at which to read the image, default = 0. Either a single number or a sequence of two numbers for x and y are valid. This value is in terms of the corresponding units. For example: resolution=0.5 and units=”mpp” will read the slide at 0.5 microns per-pixel, and resolution=3, units=”level” will read at level at pyramid level / resolution layer 3.
units (str) – the units of resolution, default = “level”. Supported units are: microns per pixel (mpp), objective power (power), pyramid / resolution level (level), pixels per baseline pixel (baseline).
interpolation (str) – Method to use when resampling the output image. Possible values are “linear”, “cubic”, “lanczos”, “area”, and “optimse”. Defaults to ‘optimise’ which will use cubic interpolation for upscaling and area interpolation for downscaling to avoid moiré patterns.
pad_mode (str) – Method to use when padding at the edges of the image. Defaults to ‘constant’. See
numpy.pad()for available modes.coord_space (str) – default to “baseline”, this is a flag to indicate if the input bounds is in the baseline coordinate system (“baseline”) or is in the requested resolution system (“resolution”).
**kwargs (dict) – Extra key-word arguments for reader specific parameters. Currently only used by VirtualWSIReader. See class docstrings for more information.
pad_constant_values (Union[numbers.Number, Iterable[Tuple[numbers.Number, numbers.Number]]]) –
- Returns
array of size MxNx3 M=size[0], N=size[1]
- Return type
Example
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> # Load a WSI image >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> location = (0, 0) >>> size = (256, 256) >>> # Read a region at level 0 (baseline / full resolution) >>> img = wsi.read_rect(location, size) >>> # Read a region at 0.5 microns per pixel (mpp) >>> img = wsi.read_rect(location, size, 0.5, "mpp") >>> # This could also be written more verbosely as follows >>> img = wsi.read_rect( ... location, ... size, ... resolution=(0.5, 0.5), ... units="mpp", ... )
Note: The field of view varies with resolution when using
read_rect().As the location is in the baseline reference frame but the size (width and height) is the output image size, the field of view therefore changes as resolution changes.
If the WSI does not have a resolution layer corresponding exactly to the requested resolution (shown above in white with a dashed outline), a larger resolution is downscaled to achieve the correct requested output resolution.
If the requested resolution is higher than the baseline (maximum resultion of the image), then bicubic interpolation is applied to the output image.
When reading between the levels stored in the WSI, the coordinates of the requested region are projected to the next highest resolution. This resolution is then decoded and downsampled to produced the desired output. This is a major source of variability in the time take to perform a read operation. Reads which require reading a large region before downsampling will be significantly slower than reading at a fixed level.
Examples
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> # Load a WSI image >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> location = (0, 0) >>> size = (256, 256) >>> # The resolution can be different in x and y, e.g. >>> img = wsi.read_rect( ... location, ... size, ... resolution=(0.5, 0.75), ... units="mpp", ... ) >>> # Several units can be used including: objective power, >>> # microns per pixel, pyramid/resolution level, and >>> # fraction of baseline. >>> # E.g. Read a region at an objective power of 10x >>> img = wsi.read_rect( ... location, ... size, ... resolution=10, ... units="power", ... ) >>> # Read a region at pyramid / resolution level 1 >>> img = wsi.read_rect( ... location, ... size, ... resolution=1, ... units="level", ... ) >>> # Read at a fractional level, this will linearly >>> # interpolate the downsampling factor between levels. >>> # E.g. if levels 0 and 1 have a downsampling of 1x and >>> # 2x of baseline, then level 0.5 will correspond to a >>> # downsampling factor 1.5x of baseline. >>> img = wsi.read_rect( ... location, ... size, ... resolution=0.5, ... units="level", ... ) >>> # Read a region at half of the full / baseline >>> # resolution. >>> img = wsi.read_rect( ... location, ... size, ... resolution=0.5, ... units="baseline", ... ) >>> # Read at a higher resolution than the baseline >>> # (interpolation applied to output) >>> img = wsi.read_rect( ... location, ... size, ... resolution=1.25, ... units="baseline", ... ) >>> # Assuming the image has a native mpp of 0.5, >>> # interpolation will be applied here. >>> img = wsi.read_rect( ... location, ... size, ... resolution=0.25, ... units="mpp", ... )
- read_region(location, level, size)[source]¶
Read a region of the whole slide image (OpenSlide format args).
This function is to help with writing code which is backwards compatible with OpenSlide. As such, it has the same arguments.
This internally calls
read_rect()which should be implemented by anyWSIReadersubclass. Therefore, some WSI formats which are not supported by OpenSlide, such as Omnyx JP2 files, may also be readable with the same syntax.
- save_tiles(output_dir, tile_objective_value, tile_read_size, tile_format='.jpg', verbose=True)[source]¶
Generate image tiles from whole slide images.
- Parameters
output_dir (str or pathlib.Path) – Output directory to save the tiles.
tile_objective_value (int) – Objective value at which tile is generated.
tile_format (str) – File format to save image tiles, defaults to “.jpg”.
verbose (bool) – Print output, default to True.
- Return type
None
Examples
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> wsi.save_tiles(output_dir='./dev_test', ... tile_objective_value=10, ... tile_read_size=(2000, 2000))
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> slide_param = wsi.info
- slide_dimensions(resolution, units, precisions=3)[source]¶
Return the size of WSI at requested resolution.
- Parameters
- Returns
shape of WSI in (width, height).
- Return type
Examples
>>> from tiatoolbox.wsicore import wsireader >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> slide_shape = wsi.slide_dimensions(0.55, 'mpp')
- slide_thumbnail(resolution=1.25, units='power')[source]¶
Read the whole slide image thumbnail (1.25x by default).
For more information on resolution and units see
read_rect()- Parameters
- Returns
thumbnail image
- Return type
Examples
>>> from tiatoolbox.wsicore.wsireader import get_wsireader >>> wsi = get_wsireader(input_img="./CMU-1.ndpi") >>> slide_thumbnail = wsi.slide_thumbnail()
- tissue_mask(method='otsu', resolution=1.25, units='power', **masker_kwargs)[source]¶
Create a tissue mask and wrap it in a VirtualWSIReader.
For the morphological method, mpp is used for calculating the scale of the morphological operations. If no mpp is available, objective power is used instead to estimate a good scale. This can be overridden with a custom size, via passing a kernel_size key-word argument in masker_kwargs, see
tissuemask.MorphologicalMaskerfor more.- Parameters
- Return type


