Shift Scale Rotate

If you have ever worked on a Computer Vision project, you might know that using augmentations to diversify the dataset is the best practice. On this page, we will:

Сover the Shift Scale Rotate augmentation;
Check out its parameters;
See how Shift Scale Rotate affects an image;
And check out how to work with Shift Scale Rotate using Python through the Albumentations library.

Let’s jump in.

Shift Scale Rotate explained

To define the term, Shift Scale Rotate is a data augmentation technique used to randomly apply affine transformations such as shift, scale, and rotate.

It is worth mentioning that Shift Scale Rotate requires updating labels (bounding boxes, polygons, etc.) to correspond to an object after the transformation is applied. Fortunately, Hasty has your back and does all the necessary things for you to focus on designing your solution without bothering about routine.

Using Shift Scale Rotate helps Data Scientists increase the variety of points of view on an object in the training set. This approach creates the needed diversity without the need to find and label more data.

Parameters

Shift factor - specifies a specific range from which a random shift factor is picked and used to shift an image both horizontally and vertically;

The shift factor ranges between -1 and 1. The negative shift factor shifts the pixels leftwards in the horizontal direction and upwards in the vertical direction. The positive shift factor shifts the pixels rightwards in the horizontal direction and downwards in the vertical direction. The magnitudes define the shifting degree.

Scale factor - specifies a specific range from which a random scale factor is picked and used to rescale an image;

The scale factor should be between -1 and 1. A negative scale factor zooms out the image, whereas a positive scale factor zooms in the image. The magnitude defines the zooming degree.

Rotation range - specifies a specific range from which a random angle (in degrees) is picked and used to rotate an image;

A negative value specifies a clockwise direction, whereas a positive angle is for a counter-clockwise direction. For example, if your range is [-90, -45], an image will be rotated in the clockwise direction.

Probability of applying transform - defines the likelihood of applying Shift Scale Rotate to an image.

Advanced options

Interpolation - specifies the interpolation algorithm:
- Nearest;
- Linear;
- Cubic;
- Area;
- LANCZOS4.
Extrapolation - specifies the extrapolation algorithm:
- Constant;
- Replicate;
- Reflect;
- Wrap;
- Reflect 101.
Padding value if border mode is Constant and Padding value if order mode is Constant applied or masks - please check out the general Extrapolation page to learn more.

Shift Scale Rotate visualized

Image after Shift Scale Rotate with the default albumentations parameters is applied

Code implementation

  
Hello, thank you for using the code provided by CloudFactory. Please note that some code blocks might not be 100% complete and ready to be run as is. This is done intentionally as we focus on implementing only the most challenging parts that might be tough to pick up from scratch. View our code block as a LEGO block - you can’t use it as a standalone solution, but you can take it and add it to your system to complement it.

      python
      
      import albumentations as albu
from PIL import Image
import numpy as np

transform = albu.augmentations.geometric.transforms.ShiftScaleRotate(shift_limit = 0.0625, scale_limit = 0.1, rotate_limit = 45)
image = np.array(Image.open('/some/random/image.png'))
augmented_image = transform(image=image)['image']

# We have the augmented image in augmented_image.

Learn more about other augmentations …

Boost model performance quickly with AI-powered labeling and 100% QA.

Learn more

Last modified 9d ago

Previous - Computer Vision augmentations

Gaussian Noise

Next - Computer Vision augmentations

Longest max size