Nuit Blanche: Parallel Paths for Deep Learning and Signal Processing?

Wednesday, August 27, 2014

This is a follow-up to this thread.

In Machine Learning, deep learning refers to neural networks with many layers whose purpose is to produce features that can eventually be used for classification. In image/signal processing, the generic idea is to acquire a signal and decompose it along a family of (generally) known signals (bases in known/identified spaces). Simple machine learning techniques can then rely on the elements of that decomposition (the features in ML) to achieve the last step of a classification task. Is there a convergence between the two approaches?

As Stephane Mallat [1] pointed out in this panel, an FFT/IFFT is already deep. In fact, any process that is decomposed through several matrix factorizations is also deep, since it requires several iterations of different factorizations, with each factored matrix representing a layer (see Sparse Matrix Factorization: Simple rules for growing neural nets [4] and Provable Bounds for Learning Some Deep Representations [3]).
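To make the "an FFT is already deep" remark concrete, here is a minimal sketch that writes a radix-2 FFT as a product of sparse matrices, one per "layer". It assumes the standard Cooley-Tukey decimation-in-time factorization and a power-of-two length; the function names (`butterfly`, `bit_reversal`, `fft_layers`, `apply_layers`) are made up for illustration, and dense NumPy arrays are used only for readability.

```python
import numpy as np

def butterfly(m):
    """m x m butterfly block [[I, D], [I, -D]] with twiddle factors on D."""
    half = m // 2
    D = np.diag(np.exp(-2j * np.pi * np.arange(half) / m))
    I = np.eye(half)
    return np.block([[I, D], [I, -D]])

def bit_reversal(n):
    """Permutation matrix that reorders the input in bit-reversed index order."""
    bits = n.bit_length() - 1
    idx = [int(format(i, f"0{bits}b")[::-1], 2) for i in range(n)]
    return np.eye(n)[idx]

def fft_layers(n):
    """List of matrices whose product (applied left to right on x) is the DFT.

    n is assumed to be a power of two, n >= 2.
    """
    layers = [bit_reversal(n)]
    m = 2
    while m <= n:
        # Block-diagonal layer: n/m independent butterflies of size m.
        layers.append(np.kron(np.eye(n // m), butterfly(m)))
        m *= 2
    return layers

def apply_layers(layers, x):
    """Feed x through the layers in order, like a (linear) deep network."""
    for L in layers:
        x = L @ x
    return x
```

For a length-n signal this gives one permutation plus log2(n) butterfly layers, each with only two nonzero entries per row, and `apply_layers(fft_layers(n), x)` can be compared against `np.fft.fft(x)`. All layers here are linear, which is where the analogy with neural networks stops.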

But if we explore recent developments in Compressive Sensing, we know that most reconstruction solvers used to take a long time to converge, with convergence measured in hundreds if not thousands of iterations. If every iteration is construed as a layer (as is the case in autoencoders), a very deep network composed of a thousand layers would clearly be a non-publishable offence. Aside from this excessive "depth", some of these solvers rely on linear operations, whereas current neural networks implicitly use nonlinear functionals.

Recently, many things changed in Compressive Sensing with the appearance of Approximate Message Passing algorithms. They are theoretically sound and require only a few iterations (5 to 20) to obtain convergence. Each of these iterations can be expressed as a nonlinear functional of a matrix-vector multiply akin to each layer's computation in neural networks:

(Figure: from Message Passing Algorithms for Compressed Sensing by D. Donoho, A. Maleki, and A. Montanari)

(Figure: from The SwAMP Thing! Sparse Estimation with the Swept Approximated Message-Passing Algorithm, implementation)
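The layer-like structure of an AMP iteration described above can be sketched as follows. This is a minimal illustration, not the code of either cited paper: it assumes a soft-thresholding nonlinearity, a sensing matrix with i.i.d. Gaussian entries of variance 1/m, and a threshold proportional to the estimated residual level. The function names and the `alpha` tuning parameter are assumptions chosen for the example.

```python
import numpy as np

def soft_threshold(u, theta):
    """Componentwise soft thresholding, the nonlinearity of each 'layer'."""
    return np.sign(u) * np.maximum(np.abs(u) - theta, 0.0)

def amp(A, y, n_iter=20, alpha=1.5):
    """Approximate message passing sketch for y = A x with sparse x.

    A     : (m, n) sensing matrix, assumed i.i.d. Gaussian with variance 1/m
    alpha : threshold multiplier (hypothetical tuning knob for this example)
    """
    m, n = A.shape
    x = np.zeros(n)
    z = y.copy()  # residual
    for _ in range(n_iter):
        # Threshold set from an estimate of the effective noise level.
        theta = alpha * np.linalg.norm(z) / np.sqrt(m)
        # One "layer": matrix-vector multiply followed by a nonlinearity.
        x = soft_threshold(x + A.T @ z, theta)
        # Onsager correction: previous residual scaled by (nonzeros in x) / m.
        onsager = z * (np.count_nonzero(x) / m)
        z = y - A @ x + onsager
    return x
```

Each pass through the loop has the shape nonlinearity(matrix times vector), which is the parallel with a neural network layer. The Onsager correction on the residual is the ingredient that separates AMP from plain iterative thresholding. Unlike a trained network, the "weights" `A` and `A.T` are fixed by the measurement process and shared across all layers.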

One could argue that the first layer in Compressive Sensing does not parallel that in Neural Networks. It turns out that a few people [6] are working on what is called one-bit sensing [5], where only the sign of each linear measurement is kept, which is very close in spirit to neural networks.
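As a rough illustration of why one-bit sensing resembles a neural network layer, the sketch below models the acquisition as a linear map followed by a sign nonlinearity. The reconstruction shown is only a crude back-projection estimate chosen for brevity, not the method of [5]. The function names and the assumption of a Gaussian sensing matrix are illustrative.

```python
import numpy as np

def one_bit_layer(A, x):
    """One-bit measurements: a linear map followed by a sign nonlinearity."""
    return np.sign(A @ x)

def backproject_estimate(A, y, k):
    """Crude estimate of the direction of x from its one-bit measurements.

    Back-projects the signs, keeps the k largest entries in magnitude,
    and normalizes to unit length. k is the assumed sparsity level.
    """
    u = A.T @ y
    keep = np.argsort(np.abs(u))[-k:]
    x_hat = np.zeros_like(u)
    x_hat[keep] = u[keep]
    return x_hat / np.linalg.norm(x_hat)
```

Because `np.sign(A @ (c * x))` equals `np.sign(A @ x)` for any positive `c`, the signs alone determine the signal only up to scale, which is why the estimate is returned with unit norm.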

In all, each of these approaches is building deep networks in its own way, using a different vocabulary.

[1] Unsupervised Learning by Deep Scattering Contractions by Xu Chen, Xiuyuan Cheng, Stéphane Mallat

[2] Rong Ge's PhD thesis: Provable Algorithms for Machine Learning Problems

[3] Provable Bounds for Learning Some Deep Representations by Sanjeev Arora, Aditya Bhaskara, Rong Ge, Tengyu Ma

[4] Sparse Matrix Factorization by Behnam Neyshabur, Rina Panigrahy

[5] One-bit compressive sensing with norm estimation

[6] Resource on Quantization