Learning filter scale and orientation in convolutional neural networks

Título:

Autor personal:

Çam, İlker, author.

PRODUCTION_INFO:

[s.l. : s.n.], 2019.

Descripción física:

xi, 40 leaves : illustrations, tables ; 30 cm + 1 CD-ROM.

Nota general:

Date of approval: 04.02.2019

Síntesis:

Convolutional neural networks have many hyper-parameters such as ﬁlter size, number of ﬁlters, and pooling size, which require manual tuning. Though deep stacked structures are able to create multi-scale and hierarchical representations, manually ﬁxed ﬁlter sizes limit the scale of representations that can be learned in a single convolutional layer. Can we adaptively learn to scale the ﬁlters on training time? Proposed adaptive ﬁlter model can learn the scale and orientation parameters of ﬁlters using backpropagation. Therefore, in a single convolution layer, we can create ﬁlters of diﬀerent scale and orientation that can adapt to small or large features and objects. The proposed model uses a relatively large base size (grid) for ﬁlters. In the grid, a diﬀerentiable function acts as an envelope for the ﬁlters. The envelope function guides eﬀective ﬁlter scale and shape/orientation by masking the ﬁlter weights before the convolution. Therefore, only the weights in the envelope are updated during training. In this work, we employed a multivariate (2D) Gaussian as the envelope function and showed that it can grow, shrink, or rotate by updating its covariance matrix during backpropagation training. We tested the model with its basic settings to show the collaboration of weight matrix with envelope function is possible. A deeper architecture was used to show the performance on deeper and wider networks. We tested the new ﬁlter model on MNIST, MNIST-cluttered, and CIFAR-10 datasets. Compared the results with the networks that used conventional convolution layers. The results demonstrate that the new model can eﬀectively learn and produce ﬁlters of diﬀerent scales and orientations in a single layer. Moreover, the experiments show that the adaptive convolution layers perform equally; or better, especially when data includes objects of varying scale and noisy backgrounds.

Evrişimsel sinir ağlarında ﬁltre boyutu, sayısı ve ortaklama boyutu elle seçilmektedir. Derin katmanlı sinir ağları hiyerarşik çok ölçekli temsiller öğrenebilmesine rağmen, sabit ﬁltre boyutları farklı ölçekteki öğrenilebilecek ﬁltre sayısını sınırlamaktadır. Aynı katmanda farklı ölçeklerde ﬁltreleri eğitim aşamasında öğrenebilen bir mimari olabilir mi? Önerilen ﬁltre modelimizde ﬁltre ölçek ve oryantasyonları geriye yayılım ile öğrenilebilir. Bu şekilde, aynı evrişimsel katmanda farklı ölçek ve oryantasonlarla büyük ve küçük objeleri tanımlayabiliriz. Önerilen model, nispeten büyük ﬁltre (ızgara) boyutlarına sahiptir. Türevi olan bir çevreleyici fonksiyon ile ﬁltrelerin efektif ölçeklerini ve oryantasyonlarını, evrişim işlemine girmeden, katsayı matrislerini maskeleyebiliriz. Bu sayede, sadece çevreleyici fonksiyon içerisindeki katsayılar eğitilecektir. Bu çalışmamızda, çok değişkenli (2 Boyutlu) Gaussian fonksiyonunu çevreleyici fonksiyon olarak kullandık. Kovaryans matrisinin geriye yayılım yöntemiyle eğitilmesiyle, çevreyeliyici fonksiyonun büyüyüp, küçüldüğünü ve dönebildiğini gösterdik. Çevreliyici fonksiyonun eğitilebildiğini ve katsayılarla işbirliğini, modelin en basit haliyle deneyimledik. Derin katmanlardaki performansını, derin ve geniş mimariler üzerinde çalıştırdık ve performansını izledik. Önerilen modeli, MNIST, MNIST-cluttered ve CIFAR-10 veri kümelerinde çaıştırdık ve geleneksel evrişimsel sinir ağ mimarilerindeki çalışma performanslarıyla karşılaştırdık. Sonuçlar, önerdiğimiz modelin, farklı ölçek ve oryanyasyonlarda, aynı katmanda, ﬁltreler öğrenebildiğini gösterdi. Ayrıca, deneylerimiz, adaptifevrişimsel katmanının aynı, özellikle veri kümesinde farklı ölçeklerde obje ve gürültülü arkaplan içeren veri kümelerinde daha iyi çalıştığını gösterdik.

Término de la materia:

Adaptive computing systems.

Convolutions (Mathematics)

Dissertations, Academic.

Autor añadido:

Tek, Boray,

Yıldız, Olcay Taner,

Gürgen, Fikret,

Autor corporativo añadido:

Işık University.

Işık Üniversitesi.

M.S. in Computer Engineering. Thesis.