Convolutional Neural Networks

Thread replies: 8
Thread images: 5

Anonymous
Convolutional Neural Networks 2016-03-23 12:39:04 Post No. 7951454
[Report] Image search: [Google]

File: letterd.png (1KB, 163x224px) Image search: [Google]

Convolutional Neural Networks Anonymous 2016-03-23 12:39:04 Post No. 7951454 [Report]

I've worked with basic feed forward neural networks with backpropagation. But I'm having difficulty understanding convolutional neural networks.

I'd like to try making a small character recognition program.

Anonymous 2016-03-23 15:53:58 Post No.7951793
[Report] Image search: [Google]

Anonymous 2016-03-23 15:53:58 Post No.7951793 [Report]

File: convolutional neural network.png (10KB, 383x239px) Image search: [Google]

10KB, 383x239px

Here's a picture I was annoyed by the absence of when I read about the topic. The idea of a convolutional net is to have each neuron only look at the neurons spatially close to it in the layer above it, and for the neurons of each layer only to be of a small-ish number of types which share the same receiving parameters. I think other approaches are more popular for character image recognition now, but apparently you can make a mean go bot with them.

Anonymous 2016-03-23 16:33:42 Post No.7951866
[Report]

Anonymous 2016-03-23 16:33:42 Post No.7951866 [Report]

>>7951793
That's not really how it works.

I'd check this out:
http://cs231n.github.io/convolutional-networks/

Anonymous 2016-03-23 16:42:07 Post No.7951885
[Report] Image search: [Google]

Anonymous 2016-03-23 16:42:07 Post No.7951885 [Report]

File: pic.png (84KB, 779x674px) Image search: [Google]

84KB, 779x674px

>>7951866
Ignoring the boring stuff like sampling it says what I said. I pushed the neuron analogy a little further though. In its language each filter corresponds to a different kind of neuron in my language, and the connecting parameters for each kind of neuron corresponds to the matrix entries in the filter.

Anonymous 2016-03-23 17:00:33 Post No.7951920
[Report]

Anonymous 2016-03-23 17:00:33 Post No.7951920 [Report]

>>7951885
>In its language each filter corresponds to a different kind of neuron in my language
Okay but then what you said earlier
>The idea of a convolutional net is to have each neuron only look at the neurons spatially close to it in the layer above it
is wrong. Each filter/neuron looks at the entire input volume above it, outputting a single slice of the volume to be processed by the following layer.

Anonymous 2016-03-23 17:08:34 Post No.7951933
[Report]

Anonymous 2016-03-23 17:08:34 Post No.7951933 [Report]

>>7951920
I said that filters correspond to different kinds of neurons (as labeled by letters in the picture) not individual neurons. The neurons of each variety of letter together have the effect of making one slice of the output volume, like how each filter makes a slice of the output volume. I swear it's all equivalent.

Anonymous 2016-03-23 19:06:09 Post No.7952125
[Report] Image search: [Google]

Anonymous 2016-03-23 19:06:09 Post No.7952125 [Report]

File: weights.jpg (43KB, 627x248px) Image search: [Google]

43KB, 627x248px

OP here...

>>7951793
What other approaches are more popular for OCR?

>>7951866
Thanks. That source helps quite a bit. I'm still not sure where convolution comes into play though.

Anonymous 2016-03-23 19:25:51 Post No.7952148
[Report] Image search: [Google]

Anonymous 2016-03-23 19:25:51 Post No.7952148 [Report]

File: Convolution_of_spiky_function_with_box2[1].gif (76KB, 468x135px) Image search: [Google]

76KB, 468x135px

>>7952125
My mistake, it seems this is one of the most powerful approaches to ocr.
>convolution
The shift-multiply-add thing that goes on with filters is exactly convolution. It's a multidimensional sort of convolution though.

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible. Read more on this topic here - https://archived.moe/talk/thread/1694/

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/