[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Convolutional Neural Networks

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 8
Thread images: 5

File: letterd.png (1KB, 163x224px) Image search: [Google]
letterd.png
1KB, 163x224px
I've worked with basic feed forward neural networks with backpropagation. But I'm having difficulty understanding convolutional neural networks.

I'd like to try making a small character recognition program.
>>
File: convolutional neural network.png (10KB, 383x239px) Image search: [Google]
convolutional neural network.png
10KB, 383x239px
Here's a picture I was annoyed by the absence of when I read about the topic. The idea of a convolutional net is to have each neuron only look at the neurons spatially close to it in the layer above it, and for the neurons of each layer only to be of a small-ish number of types which share the same receiving parameters. I think other approaches are more popular for character image recognition now, but apparently you can make a mean go bot with them.
>>
>>7951793
That's not really how it works.

I'd check this out:
http://cs231n.github.io/convolutional-networks/
>>
File: pic.png (84KB, 779x674px) Image search: [Google]
pic.png
84KB, 779x674px
>>7951866
Ignoring the boring stuff like sampling it says what I said. I pushed the neuron analogy a little further though. In its language each filter corresponds to a different kind of neuron in my language, and the connecting parameters for each kind of neuron corresponds to the matrix entries in the filter.
>>
>>7951885
>In its language each filter corresponds to a different kind of neuron in my language
Okay but then what you said earlier
>The idea of a convolutional net is to have each neuron only look at the neurons spatially close to it in the layer above it
is wrong. Each filter/neuron looks at the entire input volume above it, outputting a single slice of the volume to be processed by the following layer.
>>
>>7951920
I said that filters correspond to different kinds of neurons (as labeled by letters in the picture) not individual neurons. The neurons of each variety of letter together have the effect of making one slice of the output volume, like how each filter makes a slice of the output volume. I swear it's all equivalent.
>>
File: weights.jpg (43KB, 627x248px) Image search: [Google]
weights.jpg
43KB, 627x248px
OP here...

>>7951793
What other approaches are more popular for OCR?

>>7951866
Thanks. That source helps quite a bit. I'm still not sure where convolution comes into play though.
>>
>>7952125
My mistake, it seems this is one of the most powerful approaches to ocr.
>convolution
The shift-multiply-add thing that goes on with filters is exactly convolution. It's a multidimensional sort of convolution though.
Thread posts: 8
Thread images: 5


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.