
/dlg/ - Deep Learning General



File: acc.png (46KB, 911x622px)
Just getting started?

You need at least the following at introductory levels:
Linear Algebra
Multivariable Calculus
Statistics
Python
>>
>>60707174
Book for beginners:
http://neuralnetworksanddeeplearning.com/index.html

Book for intermediates:
http://www.deeplearningbook.org/
>>
>take NN course
>still confused about the differing details of implementing a neuron vs. a perceptron-based system
>still haven't actually implemented backtracking
>only have a vague understanding of how it works
I think I'll have a go at knocking out a basic single-threaded proper NN at some point soon, NNs are fun
>>
>>60707248
>backtracking
I mean backpropagation, because apparently I'm retarded
>>
>>60707248
Check chapter 2 of the beginner book I linked above
>>
>>60707174
>Linear Algebra
Check

>Multivariable Calculus
How much? Just partial derivatives? I've done calc 1, so I could easily learn that.

>Statistics
Done an introductory statistics course, so check I guess.

>Python
LMAO
M
A
O

I was wondering why people were complaining about machine learning being slow, but now I fucking know why: they're using the slowest fucking language!
>>
>>60707248
This is because NN shills intentionally obscure what big-data ML is, because of the VC money bubble.

It's just a fucking decision tree evolved by brute force instead of inferring the hidden variables in the more inductive way a real human bean would.

The XOR problem is "avoided" by chaining a bunch of those (so-called layers) to the point that you enumerate all the XOR corner cases (i.e. XOR-like problems are implicitly overfit).

Gradient descent and backprop are there to narrow the search space, meaning very simple NNs (e.g. simple captcha breakers) really don't need them. Just like simple chess algorithms don't do any tree pruning, and just brute-force and rank a few steps ahead.
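
To make the layering point concrete, here's a minimal numpy sketch (my own toy illustration, weights picked by hand rather than trained): OR and NAND units stacked under an AND reproduce XOR, which no single hard-threshold layer can.

[code]
import numpy as np

# XOR truth table inputs
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
step = lambda z: (z > 0).astype(int)  # hard threshold activation

# Hidden layer: two units computing OR and NAND (weights chosen by hand)
W1 = np.array([[1.0, -1.0],
               [1.0, -1.0]])
b1 = np.array([-0.5, 1.5])   # OR fires if x1+x2 > 0.5; NAND if -(x1+x2) > -1.5
h = step(X @ W1 + b1)

# Output layer: AND of the two hidden units -> XOR
W2 = np.array([1.0, 1.0])
b2 = -1.5
print(step(h @ W2 + b2))     # [0 1 1 0]
[/code]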
>>
Thank you for making this lovely thread, anon, instead of making an Intel vs AMD thread.

Pls teach the new memes in AI.
>>
>>60707914
A rare moment of someone on /g/ knowing what they're talking about
>>
>>60707860
Python is a good prototyping language. It's there to glue together the efficient parts written in native C/C++/CUDA for ease of use, and with a well-written input pipeline it should never be your bottleneck.

Most big DL libraries have (often exclusive) Python APIs: PyTorch (Facebook), Caffe2 (Facebook), CNTK (M$), TensorFlow (Google) and Theano (Canucks).
>>
>>60707174
>>60707183
Memes, that's all. Not a single working application has been released by any of the authors claiming to be experts in fields like deep learning, AI, etc.
>>
>>60707860
>Done an introductory statistics course, so check I guess.
This generally isn't sufficient if you want a good idea of what's going on. Machine learning is basically applied statistics.

>Lolling over Python
The bottleneck in ML is seldom processing the data and more often storing/acquiring/transferring it in the first place. Also, numpy uses BLAS, which is bretty good for array operations anyway.
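
If you don't believe the BLAS point, here's a rough sketch to try yourself (exact timings are machine-dependent, but the gap is typically several orders of magnitude):

[code]
import time
import numpy as np

n = 200
A = np.random.rand(n, n)
B = np.random.rand(n, n)

# Matrix product as a pure-Python triple loop
t0 = time.perf_counter()
C = [[sum(A[i, k] * B[k, j] for k in range(n)) for j in range(n)] for i in range(n)]
t_py = time.perf_counter() - t0

# Same product via numpy, which dispatches to BLAS
t0 = time.perf_counter()
C2 = A @ B
t_np = time.perf_counter() - t0

print(f"pure Python: {t_py:.2f}s, numpy/BLAS: {t_np:.4f}s")
[/code]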
>>
>>60707955
Demis Hassabis wrote the original RollerCoaster Tycoon.
>>
>>60707955
This is like accusing mathematicians of not being engineers.
>>
>>60708019
No, it's more like writing books on how to be successful when the author is a poor loser.

Mathematical knowledge does not translate to deep learning or AI; otherwise we would have had fully functional AI years ago.
>>
>>60708096
lol Schmidhuber basically invented AI back in 1923
>>
>>60707174
Just getting started?

>You need at least the following at introductory levels:
>Linear Algebra
Done
>Multivariable Calculus
Done
>Statistics
Done
>Python
Why? I use MATLAB
>>
>>60708186
See
>>60707948
>>60707961
TL;DR: Everyone uses Python, and for most frameworks it's effectively the only API available.
>>
>>60708096
>No, it's more like writing books on how to be successful when the author is a poor loser.
Why?

Coming up with a very effective algorithm doesn't necessarily mean you have an effective use case for it yet.

Case in point: Bayesian optimisation has been around since the 1970s, but it only recently found its real niche, owing to the lack of computing power available back then.
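
For the curious, here's a minimal sketch of the Bayesian optimisation loop using sklearn's GP with an expected-improvement rule; the 1-D objective and search range here are made up for illustration, and a real run would target an expensive model's validation loss instead:

[code]
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor

def objective(x):                       # stand-in for an expensive evaluation
    return np.sin(3 * x) + 0.1 * x ** 2

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(3, 1))     # a few initial random evaluations
y = objective(X).ravel()

grid = np.linspace(-3, 3, 500).reshape(-1, 1)
for _ in range(15):
    gp = GaussianProcessRegressor(normalize_y=True).fit(X, y)
    mu, sigma = gp.predict(grid, return_std=True)
    best = y.min()
    # Expected improvement (minimization form)
    z = (best - mu) / np.maximum(sigma, 1e-9)
    ei = (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)
    x_next = grid[np.argmax(ei)].reshape(1, 1)
    X = np.vstack([X, x_next])
    y = np.append(y, objective(x_next).ravel())

print("best x:", X[np.argmin(y)], "f:", y.min())
[/code]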
>>
>>60707860
>How much? Just partial derivatives? I've done calc 1, so I could easily learn that.

Partial derivatives are the most obvious and commonly used way of dealing with minmax in mainstream frameworks, but there are drastically different approaches to ML too, which are conceptually similar in engineering terms but somewhat different mathematically.

The opposite end of this spectrum would be binary NNs, where minmax is performed by a SAT solver (basically a partial MQ polynomial over GF(2)).

General MQ polynomial math of all kinds - real, integer, field, group - can all be used, because the only thing a training algorithm needs is to "approximate the future input trend for a given output trend", and a lot of math subfields exhibit and deal with this property, not just the real/complex-field partial derivatives used in mainstream engineering.
>>
>>60707174
You should probably add probability theory up there too, OP
>>
File: b49.jpg (36KB, 680x680px)
> muh ReLU: reinventing LP constraints
> muh CNNs: reinventing tunable filter banks
> muh RNNs: reinventing tunable IIR filters

Thank you, based Hinton zombies, for reinventing optimization and signal processing in the third millennium.
>>
>>60708186
You don't HAVE to use python but it will save you a lot of boilerplate.
>>
>>60707961
>Machine learning is basically applied statistics.
I disagree. It looks more like applied calculus to me.
>>
>tfw should be learning AI in the next 1-2 years at uni

Hopefully I can learn it early and well enough to become an early adopter.
>>
>>60708490
Look on the bright side: all this "NN-ready" hardware coming out of this fad can be used as pretty fine DSPs.
>>
>>60708572
It's too late. Successful machine learning researchers are identified in elementary school machine learning competitions. Only the most creative, innovative, and gifted students are selected. If you were never aware of the process, then it means that you failed in the secret initial qualifiers, and weren't even close to earning a place in the program. This process may sound harsh, but it would simply be cruel to try to train someone in the art of machine learning if they don't possess the raw talent.
>>
>>60708347
Are there any clear advantages to using methods other than partial derivatives?
>>
File: whynot.jpg (27KB, 350x350px)
>>60708572
Why should humans bother? Why not just teach an AI how to tune ML models if it's too hard for bones and flesh? Model parameter selection is already black-magic alchemy at this point.

I can't wait for the firmware for misogynistic killer robots either, anon.
>>
>>60708636
No quasi-Newton or stochastic optimization method will measure up to minibatch SGD on all but toy problems like MNIST.
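
For reference, the minibatch SGD being praised here is just this loop; a bare-bones numpy sketch on made-up least-squares data:

[code]
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 20))
true_w = rng.normal(size=20)
y = X @ true_w + 0.1 * rng.normal(size=10_000)

w = np.zeros(20)
lr, batch = 0.05, 64
for epoch in range(5):
    idx = rng.permutation(len(X))        # reshuffle each epoch
    for start in range(0, len(X), batch):
        b = idx[start:start + batch]
        # Gradient of the mean squared error on this minibatch
        grad = 2 * X[b].T @ (X[b] @ w - y[b]) / len(b)
        w -= lr * grad

print("error:", np.linalg.norm(w - true_w))
[/code]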
>>
>>60708675
There are already techniques that attempt to "learn" the correct parameters.
>>
>>60708675
Large-scale hyperparameter optimization is still pretty nascent; the Bayesian optimization mentioned in >>60708247 is one such approach.
>>
>>60708675
Also, when it takes a week to evaluate the function (loss on the validation set vs. your parameters) even once, you can bet human insight still beats automatic parameter selection.
>>
>>60708636
There are tradeoffs. The disadvantage is that those approaches are much more difficult to reason about - how do you minmax in a chaotic ring?

There are ways, but they're not obvious. For example, you can define minmax not in terms of a real function's derivative, but in terms of population representation in a cellular automaton, or as a group operator in a chaotic group, or simply as sets of on/off bits (binary-weight NNs).

The advantage is often much better performance (especially in the case of binary NNs). There's also the possibility of expressing chaotic states as a partial statistical component of input/output pad sets, which linear functions simply can't do.

If you want to see some PoC code out there, try https://github.com/allenai/XNOR-Net for example
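
The core trick there, as a toy numpy sketch (my own illustration, not the actual XNOR-Net code): with ±1 weights and activations packed into machine words, a dot product collapses to an XNOR plus a popcount.

[code]
import numpy as np

def pack(bits):
    """Pack a ±1 vector into a Python int, +1 -> bit set."""
    word = 0
    for i, b in enumerate(bits):
        if b > 0:
            word |= 1 << i
    return word

n = 64
rng = np.random.default_rng(1)
x = rng.choice([-1, 1], n)
w = rng.choice([-1, 1], n)

# Float reference dot product
ref = int(x @ w)

# Binary version: XNOR counts matching bits; dot = matches - mismatches
xb, wb = pack(x), pack(w)
matches = bin(~(xb ^ wb) & ((1 << n) - 1)).count("1")
dot = 2 * matches - n
print(ref, dot)  # identical
[/code]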
>>
>>60708554
I forgot I was in the deep learning general, actually, in which case I would be inclined to agree.
>>
File: traprogrammer.jpg (115KB, 1000x900px)
Reminder that wearing your programming socks makes you a better deep learning researcher.
>>
>>60708906
Wait, what's the difference between "normal" machine learning and deep learning that makes deep learning more like calculus?
>>
File: xaxa (2).png (25KB, 424x213px)
ML isn't a trap, as long as it's cute.
>>
>>60709011
Buzz term. But it refers to multi-layer networks, as opposed to simple single-layer NNs.
>>
>>60709046
>But it refers to multi-layer networks, as opposed to simple single layer NNs.
I know that, but how does it make it more like calculus?
Can't you apply gradient descent to single layer nets?
>>
File: maxresdefault.jpg (191KB, 1920x1080px)
lmao why bother with all that crap when I can just learn electronics repair and resolder macbooks for mad cash 24/7
>>
>>60709066
In a single-layer NN it's called the delta rule; backprop simply generalizes it incrementally over the layers. But in both cases you're approximating inputs for some outputs, which makes it "calculus", I guess.
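
For completeness, the delta rule itself is a one-liner; a minimal numpy sketch on a made-up linear target:

[code]
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
t = X @ np.array([2.0, -1.0, 0.5])      # targets from a made-up linear rule

w = np.zeros(3)
lr = 0.01
for _ in range(50):
    for x, target in zip(X, t):
        o = w @ x                        # output of the single unit
        w += lr * (target - o) * x       # delta rule: slide weights by (T - O) * input
print(w)                                 # approaches [2, -1, 0.5]
[/code]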
>>
>>60707257

It's simpler than it's made out to be.
https://mattmazur.com/2015/03/17/a-step-by-step-backpropagation-example/
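
It really is; the whole algorithm for a tiny sigmoid net fits in a screenful. Here's a numpy sketch in the same spirit as that post (a 2-4-1 net trained on XOR; with this few units, convergence can depend on the random init):

[code]
import numpy as np

sig = lambda z: 1 / (1 + np.exp(-z))

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], float)
t = np.array([[0], [1], [1], [0]], float)       # XOR targets

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)   # 4 hidden units
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)

for _ in range(10_000):
    # Forward pass
    h = sig(X @ W1 + b1)
    o = sig(h @ W2 + b2)
    # Backward pass: chain rule, layer by layer (squared error)
    d_o = (o - t) * o * (1 - o)                 # dE/dz at the output
    d_h = (d_o @ W2.T) * h * (1 - h)            # propagate back through W2
    W2 -= 0.5 * h.T @ d_o;  b2 -= 0.5 * d_o.sum(0)
    W1 -= 0.5 * X.T @ d_h;  b1 -= 0.5 * d_h.sum(0)

print(o.round(3).ravel())                       # should approach [0, 1, 1, 0]
[/code]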
>>
>>60708619
This is absolute BS
>>
>>60709632
Then answer me this: why were Demis Hassabis and CERN's Director General invited to Bilderberg meetings?
>>
>>60707860
Python is only there to control the general flow of the program. Most of the work in machine learning consists of calculations over huge matrices, and those operations are usually implemented in C, which you can conveniently call from your Python program.

Machine learning is slow because the underlying computation is huge, not because of the glue language.
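
The mechanism, as a minimal sketch (assuming a Unix-ish system where ctypes can find libm; numpy's actual bindings are compiled extension modules, but the idea is the same):

[code]
import ctypes
import ctypes.util

# Locate and load the C math library, then call its cos() directly
path = ctypes.util.find_library("m")    # e.g. "libm.so.6" on Linux
libm = ctypes.CDLL(path)
libm.cos.argtypes = [ctypes.c_double]
libm.cos.restype = ctypes.c_double

print(libm.cos(0.0))                    # 1.0, computed in C
[/code]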
>>
File: math_trench.jpg (403KB, 1100x3300px)
>>
>>60708554
But machine learning is actually a field of statistics.
>>
>>60709721
> Why are two people at the top of their field invited to a meeting for people at the top of their respective fields?

You do not *need* to be a child prodigy to make significant contributions to machine learning, or anything else for that matter. Of course people will be interested in the vision of the pioneering minds, but to claim you'll never make a contribution because you're not Jürgen Schmidhuber is ridiculous.

Yes, there are people who fall into this category, and yes, there are sudden paradigm shifts occasionally, but the vast majority of scientific and/or mathematical advancement is one tiny enhancement or observation at a time, built out of a network of contributions and feedback from the community.
>>
>>60710112
For people who want to start yapping about the last group: all those problems are obviously intractable. The joke is that an advanced AI capable of "reasoning" about such problems is intractable too.

Or humans truly are too dumb to deal with them, and "intractable" simply reflects our ignorance and limited capacity.
>>
>>60708619
Le reddit ML troll
>>
File: never everr.png (568KB, 600x580px)
>>60707174
>Linear Algebra
>Multivariable Calculus
>Statistics
>Python

L O L
O
L

How to machine """LEARNING""":
import sklearn
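
To be fair, the meme is only a slight exaggeration; a sketch using the iris data bundled with sklearn:

[code]
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = RandomForestClassifier().fit(X_tr, y_tr)   # "machine learning"
print(clf.score(X_te, y_te))                     # ~0.97 accuracy
[/code]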
>>
>>60710112
>made by a highschooler
>>
>>60708207
>what is R
>>
>>60709066
autodiff
>>
>>60710112
> one time pad decryption
>>
File: .jpg (59KB, 600x637px)
>suddenly realize neural networks are meme trash and solve my problem using actual statistics in 1/1000th the resources and 1/100th the development time

It's an enlightened feel.
I'm better than all of you.
>>
>>60707914
I mean, I get that much; I've done a module on general machine learning using Weka before. It's just that I have an incomplete understanding of the actual process of backpropagation and weight sliding. For my project I essentially resorted to individually sliding the weights by the T-O delta * learning rate, which yielded better results but clearly isn't how things are supposed to be done.

>>60707279
I'll read it later senpai, thanks

>>60709616
I just need to get around to implementing it, but it's a bit late in the day now
>>
>>60707174
>Linear Algebra
>Multivariable Calculus
>Statistics
>Python

That doesn't seem too bad
>>
>>60714039
>individually sliding the weights by the T-O delta * learning rate, which yielded better results but clearly isn't how things are supposed to be done

This is a pretty close approximation of the delta rule you'd use for a simple NN anyway.

>>60713235
Not quite. The power of ML still is that it is fairly general and "fuzzy", while more powerful algorithms, symbolic logic, PCA, SVM ... work only on well defined problems.

You're right that a lot of ML tasks are now thrown at ANNs simply because it's cheap to do so in terms of development cost, and hardware cost is neglible in comparison. It's time to market bloat like any other - often, ANN is to expert system like electron is to desktop apllications.
>>
File: 9PVle5o.jpg (283KB, 900x1200px)
I'm interested in neural networks, so I read karpathy.github.io/neuralnets/ and implemented a basic network to classify iris species (https://archive.ics.uci.edu/ml/datasets/iris) so far. I want to implement my own library for neural networks, but I haven't found any good info on implementation architectures / types of implementations. Any good resources (or good code to study)?
>>
>>60714213
Yeah, but it's not really the right way to do it, though.
>>
>>60709119
The mark of the true grifter. Some of us are in this scientific field out of genuine interest.
>>
>>60714309
For starter toys, I'd recommend:
https://github.com/NathanEpstein/Pavlov.js - reinforcement learning / Markov decision processes, not strictly NN
https://github.com/karpathy/convnetjs - a small, self-contained CNN

And to get insight into how the heavy-duty frameworks tick, read:

https://github.com/BVLC/caffe

Caffe is a "vanilla" framework. It doesn't try to scale to a bazillion GPUs or offer a nice end-user UX and whatnot; instead, it has a very compact, readable implementation and documentation, so one can get an idea of how things are actually done - how all the solvers and layers are put together, as well as the implementation of each module.
>>
>>60714682
Thanks
>>
>>60707174
>try to implement CNNs myself
>fprop: fucking easy
>bprop: should be easy too
>wait a minute, no math I've read covers any nontrivial case (padding, stride)
>finally find a normal paper on it
>try to look at easy-to-read libraries -- they're all fucking wrong in their implementation of backprop
>end up copying Caffe's approach
Like and subscribe
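
One sanity check that would have exposed those broken libraries: compare the analytic backprop against numerical gradients. A sketch for a padded, strided 1D convolution (my own toy, not Caffe's code):

[code]
import numpy as np

def conv1d(x, w, stride=2, pad=1):
    """Forward pass of a padded, strided 1D convolution (correlation)."""
    xp = np.pad(x, pad)
    return np.array([xp[i:i + len(w)] @ w
                     for i in range(0, len(xp) - len(w) + 1, stride)])

rng = np.random.default_rng(0)
x, w = rng.normal(size=8), rng.normal(size=3)
g = rng.normal(size=conv1d(x, w).shape)       # pretend upstream gradient
loss = lambda w_: np.sum(conv1d(x, w_) * g)   # scalar proxy loss

# Numerical dL/dw via central differences
eps = 1e-6
num = np.array([(loss(w + eps * e) - loss(w - eps * e)) / (2 * eps)
                for e in np.eye(3)])

# Analytic dL/dw: each weight tap correlates the padded input
# with the upstream gradient at the strided offsets (stride=2, pad=1)
xp = np.pad(x, 1)
ana = np.array([sum(g[j] * xp[2 * j + i] for j in range(len(g)))
                for i in range(3)])
print(np.allclose(num, ana))                  # True if the bprop math is right
[/code]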
>>
>>60707174
What are some good beginner-level projects for machine learning?
>>
>>60707914
wow someone who gets it on /g/