Describe: Self-Similarity in Deep Neural Network Modules for Images and Videos