Capsule Networks
Last updated
Was this helpful?
Last updated
Was this helpful?
An alternative to pooling in convolutional neural networks.
Sometimes when outputting a smaller representation of an image, spatial information is lost. We can use capsule networks to avoid this.
They detect parts in an object within the context of the spatial information.
Capsule networks are made of parent and child nodes that build up a complete picture of an object.
Capsules are a collection of nodes. Each node contains information about a specific part (width, orientation, colour), and outputs a vector with:
data:image/s3,"s3://crabby-images/16d81/16d811f011ec7038efafbd602e31c7dd4c283a87" alt="Cat face, recognized in different orientations using magnitude and orientation"
Here's a Github repo of a PyTorch implementation.
Magnitude () = the probability that a part exists; a value between 0 and 1.
Orientation () = the state of the part properties