DEEPMIND DOES IT AGAIN https://deepmind.com/blog/wavenet-g - /g/ - Technology

>>56501785
soon
>>56501698
Actually they can generate ANY audio, as long as they have enough data.
From voice to music to natural sounds and more. This is really huge.

Anonymous 2016-09-08 22:14:04 Post No.56501903
[Report]

Anonymous 2016-09-08 22:14:04 Post No.56501903 [Report]

>>56501865
can't wait until the sourcecode gets public or ripped

just imagine that one guy who samples 1000 hentai animes just to generate waifus in 3d VR porn

Anonymous 2016-09-08 22:18:46 Post No.56502000
[Report]

Anonymous 2016-09-08 22:18:46 Post No.56502000 [Report]

so where can I download it?

Anonymous 2016-09-08 22:20:17 Post No.56502026
[Report]

Anonymous 2016-09-08 22:20:17 Post No.56502026 [Report]

but can it generate basic emotions and tone.

didn't think so

humans 1
ai 0

Anonymous 2016-09-08 22:22:50 Post No.56502068
[Report]

Anonymous 2016-09-08 22:22:50 Post No.56502068 [Report]

>>56501542
noice!
the output need to be sampled at a higher rate and then highpass filtered to remove some of the noise, other than that bretty gud

Anonymous 2016-09-08 22:24:34 Post No.56502102
[Report] Image search: [Google]

Anonymous 2016-09-08 22:24:34 Post No.56502102 [Report]

File: 1373486130923.png (3KB, 184x172px) Image search: [Google]

3KB, 184x172px

>>56501903
so.. how long do you guys think will something like this take? Like how long until we at least can communicate with our waifus trough our computers?

Anonymous 2016-09-08 22:25:58 Post No.56502125
[Report]

Anonymous 2016-09-08 22:25:58 Post No.56502125 [Report]

>>56502102
when the source gets released, probably 3 years but japan only at the beginning

Anonymous 2016-09-08 22:43:07 Post No.56502432
[Report]

Anonymous 2016-09-08 22:43:07 Post No.56502432 [Report]

>>56502125
>source gets released

But, Google/DeepMind will never release the training data that actually generated the voice.

Anonymous 2016-09-08 22:50:55 Post No.56502570
[Report]

Anonymous 2016-09-08 22:50:55 Post No.56502570 [Report]

This sounds like a great way to make vocaloids even better. Makes me want to get a new vocaloid using wavenet.

Anonymous 2016-09-08 22:52:12 Post No.56502593
[Report]

Anonymous 2016-09-08 22:52:12 Post No.56502593 [Report]

>>56502570
the vocaloid makers hack deepmind and improve their hatsune miku

Anonymous 2016-09-08 22:54:36 Post No.56502638
[Report]

Anonymous 2016-09-08 22:54:36 Post No.56502638 [Report]

I'm gonna scam so many old bastards with this.

Anonymous 2016-09-08 22:58:23 Post No.56502723
[Report]

Anonymous 2016-09-08 22:58:23 Post No.56502723 [Report]

jesus, that is incredibly impressive. even the throwaway music bit at the bottom.

Anonymous 2016-09-08 23:00:21 Post No.56502756
[Report]

Anonymous 2016-09-08 23:00:21 Post No.56502756 [Report]

>>56501542
But what if I want a robotic voice?

Anonymous 2016-09-08 23:05:56 Post No.56502871
[Report]

Anonymous 2016-09-08 23:05:56 Post No.56502871 [Report]

Jesus

Anonymous 2016-09-08 23:07:37 Post No.56502900
[Report]

Anonymous 2016-09-08 23:07:37 Post No.56502900 [Report]

>>56502756
You add a robotic voice filter over it?

Anonymous 2016-09-08 23:09:54 Post No.56502936
[Report] Image search: [Google]

Anonymous 2016-09-08 23:09:54 Post No.56502936 [Report]

File: 193.jpg (22KB, 300x100px) Image search: [Google]

22KB, 300x100px

>>56502900
It wouldn't be the same!

Anonymous 2016-09-08 23:26:24 Post No.56503226
[Report]

Anonymous 2016-09-08 23:26:24 Post No.56503226 [Report]

The future is scary.

Anonymous 2016-09-08 23:28:47 Post No.56503268
[Report]

Anonymous 2016-09-08 23:28:47 Post No.56503268 [Report]

>>56503226
the future is AMAZING

Anonymous 2016-09-08 23:32:50 Post No.56503346
[Report]

Anonymous 2016-09-08 23:32:50 Post No.56503346 [Report]

>>56503268
>Hello, %Anon%, this is Cortana. I have detected illegal software on your system. Please, put hands behind your back and lie on the floor. The authorities are on their way. You have 20 seconds to comply.

Anonymous 2016-09-08 23:35:06 Post No.56503391
[Report]

Anonymous 2016-09-08 23:35:06 Post No.56503391 [Report]

>>56503346
>While you are waiting enjoy this piano music I composed for you. Have a nice day and remember, if you have nothing to hide you have nothing to fear.

Anonymous 2016-09-08 23:41:11 Post No.56503489
[Report]

Anonymous 2016-09-08 23:41:11 Post No.56503489 [Report]

So could something like this be used to replace voice actors in indie video games?

Could it use voice samples of VAs to replicate their voice?

Anonymous 2016-09-09 00:02:08 Post No.56503795
[Report]

Anonymous 2016-09-09 00:02:08 Post No.56503795 [Report]

>>56503489
yes, yes

Anonymous 2016-09-09 00:06:51 Post No.56503862
[Report]

Anonymous 2016-09-09 00:06:51 Post No.56503862 [Report]

Vocaloid upgrades soon?

Anonymous 2016-09-09 00:13:30 Post No.56503956
[Report]

Anonymous 2016-09-09 00:13:30 Post No.56503956 [Report]

>>56503489

This is actually a very interesting angle.
Give it a decade and voice actors and singers are going to be practicaly useless.
Only thing that matters is the guy creating the melody,lyrics and the text that these things read.

Anonymous 2016-09-09 00:15:00 Post No.56503983
[Report]

Anonymous 2016-09-09 00:15:00 Post No.56503983 [Report]

>>56503956
They'll probably have AI that writes scripts for that by then too

Anonymous 2016-09-09 00:17:02 Post No.56504015
[Report] Image search: [Google]

Anonymous 2016-09-09 00:17:02 Post No.56504015 [Report]

File: is-female-orgasm-good-for-fertility.png (77KB, 634x360px) Image search: [Google]

77KB, 634x360px

It just dawned on me

>AUTOMATIC ORGASM SOUND GENERATOR
>with the voice of every girl ever
>including famous actresses, singers, etc

Anonymous 2016-09-09 00:21:31 Post No.56504089
[Report]

Anonymous 2016-09-09 00:21:31 Post No.56504089 [Report]

>>56503956
>>56503983
But stuff made with real people will be super authentic and REAL man.

Anonymous 2016-09-09 00:25:56 Post No.56504168
[Report]

Anonymous 2016-09-09 00:25:56 Post No.56504168 [Report]

Interesting how some parts of the future arrive much earlier than expected.

Anonymous 2016-09-09 00:50:02 Post No.56504477
[Report]

Anonymous 2016-09-09 00:50:02 Post No.56504477 [Report]

>>56501542
How long until the trolling beggings?

https://www.youtube.com/watch?v=1B488z1MmaA

Anonymous 2016-09-09 00:50:51 Post No.56504487
[Report]

Anonymous 2016-09-09 00:50:51 Post No.56504487 [Report]

I listened to all those audio samples and the voice doesn't sound much better.

I thought it was going to sound like a real human and not robotic.

It still sounds robotic and grainy.

Anonymous 2016-09-09 00:53:18 Post No.56504519
[Report]

Anonymous 2016-09-09 00:53:18 Post No.56504519 [Report]

>>56504487
Most people in this thread are just jumping the gun.

We're still a few decades away from making it practical.

Anonymous 2016-09-09 00:56:32 Post No.56504568
[Report] Image search: [Google]

Anonymous 2016-09-09 00:56:32 Post No.56504568 [Report]

File: G88xA.jpg (153KB, 1096x729px) Image search: [Google]

153KB, 1096x729px

>>56504519
I just want my own personal assistant that sounds like a real human female who can talk to me as i'm trying to fall asleep.

Since women today are such whores being brainwashed by SJW garbage from the corporate jew media I have no choice but to rely on technology as a substitute.

Anonymous 2016-09-09 00:59:20 Post No.56504611
[Report]

Anonymous 2016-09-09 00:59:20 Post No.56504611 [Report]

>>56504487
>>56504519
It's still way ahead of previous results, and it can fool many humans. This shit was science fiction only yesterday.
>We're still a few decades away
Make it one year.

Anonymous 2016-09-09 01:04:57 Post No.56504681
[Report]

Anonymous 2016-09-09 01:04:57 Post No.56504681 [Report]

>>56504568
So you just want that OS from the movie She?

Anonymous 2016-09-09 01:06:08 Post No.56504699
[Report]

Anonymous 2016-09-09 01:06:08 Post No.56504699 [Report]

>>56501542
$1.00 has been deposited into your Google Wallet®.

Anonymous 2016-09-09 01:08:34 Post No.56504753
[Report]

Anonymous 2016-09-09 01:08:34 Post No.56504753 [Report]

>>56504681
>guy literally gets cucked by a AI
it was silly

Anonymous 2016-09-09 01:11:35 Post No.56504798
[Report]

Anonymous 2016-09-09 01:11:35 Post No.56504798 [Report]

>>56504753
Not at all. While you are sleeping, working or doing whateaver, the computer will be using all power to learn.

No wonder she was cucking him. She could process a shitload of information before he could say Good Morning. She needed more ppl to keep her busy somehow.

Anonymous 2016-09-09 01:11:44 Post No.56504803
[Report]

Anonymous 2016-09-09 01:11:44 Post No.56504803 [Report]

>>56502026
it can make toned language, imitating samples. did you read it? can also be conditioned to use tone to imitate emotion

Anonymous 2016-09-09 01:14:23 Post No.56504848
[Report]

Anonymous 2016-09-09 01:14:23 Post No.56504848 [Report]

>>56503346
>google is microsoft
we might get very convincing, pleading ads, or the threat of using audio recordings to track tone used when speaking of certain subjects for the purpose of better targeted ads.

Anonymous 2016-09-09 01:14:57 Post No.56504855
[Report]

Anonymous 2016-09-09 01:14:57 Post No.56504855 [Report]

>>56502102
>communicate with our waifus
In an intelligent way or just
>hello oniichan
>hello oniichan
>hello oniichan

Anonymous 2016-09-09 01:15:58 Post No.56504874
[Report]

Anonymous 2016-09-09 01:15:58 Post No.56504874 [Report]

>>56502432
>release the training data
Who the fuck cares? Make your own training data. I'm going to bring Kuuko back from the dead when this gets released.

Anonymous 2016-09-09 01:18:24 Post No.56504901
[Report]

Anonymous 2016-09-09 01:18:24 Post No.56504901 [Report]

>>56504874
time to make a porn audio dataset for >>56504015

Anonymous 2016-09-09 01:19:33 Post No.56504912
[Report]

Anonymous 2016-09-09 01:19:33 Post No.56504912 [Report]

Something something all that fuzz.
How the hell are they supposed to get rid of it?

Anonymous 2016-09-09 01:23:47 Post No.56504975
[Report]

Anonymous 2016-09-09 01:23:47 Post No.56504975 [Report]

>>56501542
I think the most impressive part of this is 1) how raw audio generation isn't limited to human speech. 2) the model replicates breathing sounds and such as well, giving it an illusion of actually sounding like a real human bean.

Eithwr way, how long until a working model is released into the public? I imagine Google wouldn't want Apple or Microsoft to gain access to this.

Anonymous 2016-09-09 01:24:02 Post No.56504979
[Report]

Anonymous 2016-09-09 01:24:02 Post No.56504979 [Report]

>>56504912
Make another neural net for static removal.

Anonymous 2016-09-09 01:27:40 Post No.56505028
[Report]

Anonymous 2016-09-09 01:27:40 Post No.56505028 [Report]

the ones where it makes up its own language is trippy as fuck. its like an alien race speaking to you.

Anonymous 2016-09-09 01:28:20 Post No.56505032
[Report]

Anonymous 2016-09-09 01:28:20 Post No.56505032 [Report]

>>56504912
>>56504979
literally how the brain works

The real problem is the massive comuting power you need to make this work. google can afford it, but there's no way you can run this on a normal pc, no matter the gpus.

Anonymous 2016-09-09 01:34:37 Post No.56505108
[Report]

Anonymous 2016-09-09 01:34:37 Post No.56505108 [Report]

>>56501542
As someome who does research in ML, that is honestly arousing.

Anonymous 2016-09-09 02:21:25 Post No.56505704
[Report]

Anonymous 2016-09-09 02:21:25 Post No.56505704 [Report]

>>56505032
just make a makeshift supercomputer with your thinkpad hoard.
will work well enough

Anonymous 2016-09-09 02:34:30 Post No.56505869
[Report]

Anonymous 2016-09-09 02:34:30 Post No.56505869 [Report]

>>56505108
as someone who jacks off to vocaloids, that is honestly arousing

Anonymous 2016-09-09 02:46:47 Post No.56506017
[Report]

Anonymous 2016-09-09 02:46:47 Post No.56506017 [Report]

>>56504848
Google is even worse than Microsoft desu.
Because google does exactly the same thing as microsoft, but you can't switch to another internet if you don't like it.

Anonymous 2016-09-09 03:03:19 Post No.56506228
[Report]

Anonymous 2016-09-09 03:03:19 Post No.56506228 [Report]

>>56501542
musicfags on suicide watch

Anonymous 2016-09-09 03:10:22 Post No.56506315
[Report]

Anonymous 2016-09-09 03:10:22 Post No.56506315 [Report]

>>56506228
I want to crosspost this to /mu/ but I'm too tired.

Anonymous 2016-09-09 03:11:17 Post No.56506332
[Report]

Anonymous 2016-09-09 03:11:17 Post No.56506332 [Report]

>>56503956
Voice actors and singers will just start suing people who imitate their voice. Laws will be passed that make it illegal.

Anonymous 2016-09-09 03:15:49 Post No.56506394
[Report]

Anonymous 2016-09-09 03:15:49 Post No.56506394 [Report]

>>56506332
it can imitate billions of voices. it would be silly to make it illegal, it will never happen.

Anonymous 2016-09-09 03:16:30 Post No.56506404
[Report]

Anonymous 2016-09-09 03:16:30 Post No.56506404 [Report]

>>56506332
>Voice actors and singers will just start suing people who imitate their voice. Laws will be passed that make it illegal.
Publicity rights in some states already cover voice it seems.

Anonymous 2016-09-09 03:23:11 Post No.56506491
[Report]

Anonymous 2016-09-09 03:23:11 Post No.56506491 [Report]

>>56506332
>>56506404
But that's retarded. What stops you from engineering a voice that sounds like that of a famous singer while still sounding slightly different? What stops you from generating a random voice that turns out to be the voice of a random girl in south africa? Will she sue you too? We may as well outlaw sound.

Anonymous 2016-09-09 03:24:10 Post No.56506507
[Report]

Anonymous 2016-09-09 03:24:10 Post No.56506507 [Report]

>>56503956
>singers are going to be practicaly useless.

yes, because traditional guitars and pianos got totally replaced by e-guitars, synthies and computers.

Anonymous 2016-09-09 03:26:24 Post No.56506536
[Report]

Anonymous 2016-09-09 03:26:24 Post No.56506536 [Report]

>>56506507
actually they did

Anonymous 2016-09-09 03:27:44 Post No.56506549
[Report]

Anonymous 2016-09-09 03:27:44 Post No.56506549 [Report]

>>56501542
ELI5?

Anonymous 2016-09-09 03:37:26 Post No.56506664
[Report]

Anonymous 2016-09-09 03:37:26 Post No.56506664 [Report]

>>56506507
in radio pop maybe

Anonymous 2016-09-09 03:41:00 Post No.56506712
[Report]

Anonymous 2016-09-09 03:41:00 Post No.56506712 [Report]

>>56506491
Yeah, it is retarded. Hopefully it does not spread.

Publicity rights were originally intended to be somewhat like trademark to keep people from falsely using a name, signature, image, voice, etc. to claim that a person was endorsing a product. Of course now they are basically yet another way for famous people to try and bother people they don't like with lawsuits or to try and get money from a company.

Anonymous 2016-09-09 03:42:22 Post No.56506732
[Report]

Anonymous 2016-09-09 03:42:22 Post No.56506732 [Report]

>>56506549
If you go to the top of your screen, there's a little bar you can click on and type words into. Simply type in "reddit.com", but without the punctuation marks, and you will be transported to a place appropriate for you! :)

Anonymous 2016-09-09 03:43:09 Post No.56506748
[Report]

Anonymous 2016-09-09 03:43:09 Post No.56506748 [Report]

>>56506491
Monsanto has copyrights on genetics of seeds. These seeds happen to blow off trucks and on to peoples' farms. Monsanto then sneaks onto their farm and tests for these genes, and if they find them, say goodbye to your farm/retirement/belongings.

What am I saying is: it will happen.

Anonymous 2016-09-09 03:56:30 Post No.56506940
[Report]

Anonymous 2016-09-09 03:56:30 Post No.56506940 [Report]

>>56506748
>Monsanto then sneaks onto their farm
that sounds very illegal

Anonymous 2016-09-09 04:03:03 Post No.56507005
[Report]

Anonymous 2016-09-09 04:03:03 Post No.56507005 [Report]

Radio moderators are now obsolete.

Anonymous 2016-09-09 04:06:30 Post No.56507042
[Report]

Anonymous 2016-09-09 04:06:30 Post No.56507042 [Report]

>>56506940
They don't give a fuck, half of the goverment has shares in monsanto.
They have literally written Monsanto seeds into the new Iraqi constitution.

these guys are above the law.

Anonymous 2016-09-09 04:10:28 Post No.56507098
[Report]

Anonymous 2016-09-09 04:10:28 Post No.56507098 [Report]

>>56501542
Sounds like parametric with less reverb. whoopdeedoo

Still sounds fake as shit.

Anonymous 2016-09-09 04:14:18 Post No.56507137
[Report]

Anonymous 2016-09-09 04:14:18 Post No.56507137 [Report]

>>56506748
there is worse stuff, pars of the human genome are actually copyrighted (most of those are related to some disease/condition) and you cant sell medicine (and if im not mistaken not even research either) that targets those genes without permission and paying the fees.
usa sure is the land of freedom...

Anonymous 2016-09-09 04:20:50 Post No.56507193
[Report]

Anonymous 2016-09-09 04:20:50 Post No.56507193 [Report]

The audio samples where the wavenet generates its own audio output are creepy as fuck

Asian Expert (being 2016-09-09 04:25:19 Post No.56507243
[Report]

Asian Expert (being 2016-09-09 04:25:19 Post No.56507243 [Report]

>>56504477
wtf I'm liking this

Anonymous 2016-09-09 04:43:00 Post No.56507407
[Report]

Anonymous 2016-09-09 04:43:00 Post No.56507407 [Report]

>>56507193
Really? It just reminds me of this

Anonymous 2016-09-09 06:12:03 Post No.56508335
[Report]

Anonymous 2016-09-09 06:12:03 Post No.56508335 [Report]

>Because raw audio is typically stored as a sequence of 16-bit integer values (one per timestep), a
>softmax layer would need to output 65,536 probabilities per timestep to model all possible values.
>To make this more tractable, we first apply a μ-law companding transformation (ITU-T, 1988) to
>the data, and then quantize it to 256 possible values:

>f (x) = sgn(x)*ln(1+255*abs(x))/ln(1+255)

>where −1 < x < 1 and μ = 255. This non-linear quantization produces a significantly better
>reconstruction than a simple linear quantization scheme. Especially for speech, we found that the
>reconstructed signal after quantization sounded very similar to the original.

So does that mean that each sample in generated sound can only have one out of 256 values (ranged between 0 and 65535), essentially making it 8-bit instead of 16 bit?

Anonymous 2016-09-09 08:06:44 Post No.56509525
[Report]

Anonymous 2016-09-09 08:06:44 Post No.56509525 [Report]

>>56508335
Yes

Anonymous 2016-09-09 08:36:49 Post No.56509769
[Report]

Anonymous 2016-09-09 08:36:49 Post No.56509769 [Report]

This would be pretty good for ASMR stuff.

Anonymous 2016-09-09 08:47:03 Post No.56509844
[Report]

Anonymous 2016-09-09 08:47:03 Post No.56509844 [Report]

>>56508335
Yes but that logarithm probably means that they have more resolution in the middle frequencies

Anonymous 2016-09-09 08:49:46 Post No.56509864
[Report] Image search: [Google]

Anonymous 2016-09-09 08:49:46 Post No.56509864 [Report]

File: plinkett.jpg (484KB, 1039x792px) Image search: [Google]

484KB, 1039x792px

Finally I can get a virtual Mr. Plinkett who reads 4chan posts to me all day

Anonymous 2016-09-09 08:56:24 Post No.56509905
[Report]

Anonymous 2016-09-09 08:56:24 Post No.56509905 [Report]

>>56501542
fuck yeah, nobody is going to need voice actors ever again.

Anonymous 2016-09-09 09:06:24 Post No.56509994
[Report]

Anonymous 2016-09-09 09:06:24 Post No.56509994 [Report]

>>56503489
that was my first thought
Audio files make up the majority of the game size in most cases, since they just dont compress well. Also if you want to change a single line of dialogue later on, you need to hire the same voice actor again which is costly and time consuming.
If all voice can be stored in a kind of LaTeX or XML format that will not only speed up development, but also allow dynamically generated dialogues that arent just a bunch of text

Anonymous 2016-09-09 09:08:08 Post No.56510007
[Report]

Anonymous 2016-09-09 09:08:08 Post No.56510007 [Report]

>>56506332
sadly this doesnt sound all too far fetched

Anonymous 2016-09-09 09:10:51 Post No.56510024
[Report]

Anonymous 2016-09-09 09:10:51 Post No.56510024 [Report]

>>56504874
there is no way you'd even remotely catch up with the amount of training data that google has though, even as you're posting on 4chan right now you're feeding it with shit tons of training data through the captcha

Anonymous 2016-09-09 09:11:39 Post No.56510030
[Report]

Anonymous 2016-09-09 09:11:39 Post No.56510030 [Report]

>>56501542
I like how all the piano tracks start out normal and go fucking ham before cutting out.

Anonymous 2016-09-09 09:14:46 Post No.56510054
[Report]

Anonymous 2016-09-09 09:14:46 Post No.56510054 [Report]

>>56510030
can't risk letting it gain sentience at this stage, our anuses are unprepared

Anonymous 2016-09-09 09:19:44 Post No.56510087
[Report]

Anonymous 2016-09-09 09:19:44 Post No.56510087 [Report]

DAISY
DAISY
GIVE ME YOUR ANSWER DO~

Anonymous 2016-09-09 09:33:13 Post No.56510187
[Report]

Anonymous 2016-09-09 09:33:13 Post No.56510187 [Report]

That piano shit is neato

Anonymous 2016-09-09 09:35:35 Post No.56510206
[Report]

Anonymous 2016-09-09 09:35:35 Post No.56510206 [Report]

>>56509994
As the article said, generating the audio output takes forever, so forget about generating it on the fly. Could save the cost of the voiceactor.

People will still notice, though. It's not that it's on par with human voiceactors. It's just getting in the good enough to be tolerable range.

Anonymous 2016-09-09 09:36:04 Post No.56510210
[Report]

Anonymous 2016-09-09 09:36:04 Post No.56510210 [Report]

It's good, really good, but I feel like if this is to be done properly it needs more forms of input. The emotion behind different words, the emphasis, pause length, etc.

If you could develop some sort of system where you can both input text and specify characteristics of speech within the text, we'd be getting close to complete accurate synthesis.

Anonymous 2016-09-09 09:39:02 Post No.56510235
[Report]

Anonymous 2016-09-09 09:39:02 Post No.56510235 [Report]

>>56501542
Oh god that generated music lmao

Sounds like beethoven having a stroke while playing

Anonymous 2016-09-09 09:40:38 Post No.56510254
[Report]

Anonymous 2016-09-09 09:40:38 Post No.56510254 [Report]

Is there anything that deep learning CAN'T do?

Anonymous 2016-09-09 09:42:33 Post No.56510265
[Report]

Anonymous 2016-09-09 09:42:33 Post No.56510265 [Report]

>>56510210
Read the article, the model varies output according to context.

Seems they couldn't get rid of the noise though, or maybe their training data is contaminated with noisy samples. Because it's harder to judge noisy samples, they might have had higher ratings by humans, so the model learned to include noise to get better grades for its output.

Anonymous 2016-09-09 09:42:49 Post No.56510267
[Report] Image search: [Google]

Anonymous 2016-09-09 09:42:49 Post No.56510267 [Report]

File: 13558.jpg (46KB, 500x500px)

46KB, 500x500px

>>56501542
Self written ASMR incoming, faggots
>relax, anon, take a deep breath and count to 100 with me
>you are great, anon
>run away with me anon
>let me take that big pulsating cock with my tiny feet, anon

Anonymous 2016-09-09 09:43:21 Post No.56510272
[Report]

Anonymous 2016-09-09 09:43:21 Post No.56510272 [Report]

This shit is fucking creepy

https://storage.googleapis.com/deepmind-media/pixie/knowing-what-to-say/first-list/speaker-4.wav

>the breathing and mouth noises

Anonymous 2016-09-09 09:43:36 Post No.56510275
[Report]

Anonymous 2016-09-09 09:43:36 Post No.56510275 [Report]

>>56510254
It's a universal approximator, so no.

Anonymous 2016-09-09 09:48:51 Post No.56510316
[Report]

Anonymous 2016-09-09 09:48:51 Post No.56510316 [Report]

>>56510275
Can it create a virtual gf for me?

Anonymous 2016-09-09 09:49:22 Post No.56510321
[Report] Image search: [Google]

Anonymous 2016-09-09 09:49:22 Post No.56510321 [Report]

File: 1459029697107.jpg (141KB, 392x309px)

141KB, 392x309px

>>56510254
Deep learning is a meme. A very effective meme, but a meme nonetheless.
So they have a cluster of 100k Nvidia Teslas and literally petabytes of datasets, big fucking whoop they can do impressive shit given a year time...
With that same amount of power you could probably simulate a universe and wait for it to develop life able of speech, and it would probably be more efficient.

I'll be impressed when they can do all this on a battery powered smartphone, but deep architecture are not really the way right now.
Source: master's in AI

Anonymous 2016-09-09 09:50:04 Post No.56510325
[Report]

Anonymous 2016-09-09 09:50:04 Post No.56510325 [Report]

>>56510321
Clouds nigga

Why do things locally?

Anonymous 2016-09-09 09:51:44 Post No.56510341
[Report]

Anonymous 2016-09-09 09:51:44 Post No.56510341 [Report]

>>56510321
Virtual masters apparently. If you can build a network in hardware, this shit will get faster. You don't render graphics on a CPU either.

Anonymous 2016-09-09 09:51:54 Post No.56510344
[Report] Image search: [Google]

Anonymous 2016-09-09 09:51:54 Post No.56510344 [Report]

File: 1448125400236.png (286KB, 500x513px)

286KB, 500x513px

>>56501542
>https://deepmind.com/blog/wavenet-generative-model-raw-audio/

It sounds much better, but you can still clearly hear it's a robot voice.

The tone of words inside the sentence seems to be the biggest problems, there's too much tone difference between words is different from how a normal person would structure a sentence.

I don't understand, is it too hard to build a system that can recognize based on which position a word has in a sentence, what tone of voice should be used?

Anonymous 2016-09-09 09:53:00 Post No.56510355
[Report]

Anonymous 2016-09-09 09:53:00 Post No.56510355 [Report]

>>56510325
>power inefficiency is solved by delegating computation
I can hear stockholders laughing

Anonymous 2016-09-09 09:53:34 Post No.56510360
[Report]

Anonymous 2016-09-09 09:53:34 Post No.56510360 [Report]

>>56510321
You only need to teach the network once. Then just hardcode the parameters in a 20kb file and boom, perfect text to speech on a toaster.

Anonymous 2016-09-09 09:54:59 Post No.56510377
[Report]

Anonymous 2016-09-09 09:54:59 Post No.56510377 [Report]

>>56510344
It seems to be that the system is still pretty dumb and is saying things based on the words and mostly ignoring punctuation.

I'd imagine you can use deep learning to begin to understand the context of words and phrases, but that'll take a lot longer and is a lot more difficult to interpret.

Give it ten years and this shit'll be writing its own books. And reading them too.

Anonymous 2016-09-09 09:55:18 Post No.56510378
[Report]

Anonymous 2016-09-09 09:55:18 Post No.56510378 [Report]

>>56510360
You need to generate 16k points per second though.

Anonymous 2016-09-09 09:56:26 Post No.56510385
[Report]

Anonymous 2016-09-09 09:56:26 Post No.56510385 [Report]

>>56510341
>>56510360
You clearly have no idea how NNs work so go educate yourselves before spouting nonsense.
>inb4 a neural network has literal dots and lines like you see in the drawings

Anonymous 2016-09-09 09:56:33 Post No.56510387
[Report]

Anonymous 2016-09-09 09:56:33 Post No.56510387 [Report]

>>56510321
This, honestly

I thought the same shit when nvidia started showing off their computer vision hardware not too long ago claiming it's 90000000 times more effective than standard algorithm approach (like opencv)

The catch is that their hardware costs several thousand dollars meanwhile the algorithm approach is literally free and can run on your $20 raspberry pi

Marketing bullshit is what this is, but hey you can put it in the cloud xDDDDD

Anonymous 2016-09-09 09:57:43 Post No.56510398
[Report]

Anonymous 2016-09-09 09:57:43 Post No.56510398 [Report]

>>56510387
Costs will decrease as time go's by, as they always do

Anonymous 2016-09-09 10:06:56 Post No.56510485
[Report]

Anonymous 2016-09-09 10:06:56 Post No.56510485 [Report]

>>56510378
this is a good point, if the network itself is too hueg it would indeed be slow
>>56510385
this is a clueless yuroshit /v/ermin moron who should drink bleach

Anonymous 2016-09-09 10:07:06 Post No.56510488
[Report]

Anonymous 2016-09-09 10:07:06 Post No.56510488 [Report]

>>56510385
Why don't you enlighten me.

Anonymous 2016-09-09 10:12:23 Post No.56510544
[Report]

Anonymous 2016-09-09 10:12:23 Post No.56510544 [Report]

Are there any other need deep learning things to have come out recently? I always enjoy reading about them.

Anonymous 2016-09-09 10:13:25 Post No.56510556
[Report]

Anonymous 2016-09-09 10:13:25 Post No.56510556 [Report]

>>56510544
>Need

That should say neat. I can't even blame a phone as I'm on a PC and my brain just fucked up.

Anonymous 2016-09-09 10:35:11 Post No.56510725
[Report]

Anonymous 2016-09-09 10:35:11 Post No.56510725 [Report]

>>56510485
>I'll pretend to know what he's talking about
>put an ad hominem in there, that'll teach him
Would you care to explain how would convolution be faster on dedicated hardware than it is now on gpgpus? Like, what exact operations do you feel are the bottleneck in convolution and SGD right now? Is it the addition or the multiplication? Or do you think that you could implement faster memory access than ddr5?
Go on, I'll wait while you think

Anonymous 2016-09-09 10:35:56 Post No.56510733
[Report]

Anonymous 2016-09-09 10:35:56 Post No.56510733 [Report]

>>56510725
>>>/v/

Anonymous 2016-09-09 10:50:45 Post No.56510871
[Report] Image search: [Google]

Anonymous 2016-09-09 10:50:45 Post No.56510871 [Report]

File: J2zNkC6.jpg (150KB, 1282x1901px)

150KB, 1282x1901px

>>56501659
imagine if you could idk sample Emma Watson's voice and then make a robot voice that sounds exactly like her? And then make her say shit. Imagine the collapse of Hollywood if that was possible? Fuck, the whole concept of identity would be in danger if you could simulate a person's voice and make a realistic model of that person inside some VR world. Why would anyone want real people after that?

Anonymous 2016-09-09 10:52:15 Post No.56510889
[Report]

Anonymous 2016-09-09 10:52:15 Post No.56510889 [Report]

>>56503983
or have AI that is present in each character of the game and makes characters behave realistically like real people would, thus creating it's own narrative that is unpredictable

Anonymous 2016-09-09 10:54:31 Post No.56510912
[Report]

Anonymous 2016-09-09 10:54:31 Post No.56510912 [Report]

>>56510725
GPUs are made to render one image at a time. Fix your network parameters, skip the memory, directly pass output to the next calculation layer.

Drop the fully connected layers requirement, gain speed using a chip that specializes in the sparse matrixmultiplication your network produces.

And I don't see how not implementing it using matrix multiplications but node for node wouldn't make it faster. N:1 specialized calculation units, no memory to swap to.

So fuck off. You're "expertise" with shit counts for nothing because you can't see past whats in front of you.

Anonymous 2016-09-09 10:58:26 Post No.56510948
[Report]

Anonymous 2016-09-09 10:58:26 Post No.56510948 [Report]

>>56510871
Is that emma watson? Holy shit she looks like a dude

(((Anonymous))) 2016-09-09 11:01:57 Post No.56510986
[Report]

(((Anonymous))) 2016-09-09 11:01:57 Post No.56510986 [Report]

This just put voice actors and impersonators out of business.

Anonymous 2016-09-09 11:04:14 Post No.56511018
[Report]

Anonymous 2016-09-09 11:04:14 Post No.56511018 [Report]

>>56510948
>small head
>wide mouth
>small nose
>big eyes

>looks like a dude

you wat m8, you must be a very feminine looking man to think she looks like a dude

Anonymous 2016-09-09 11:07:26 Post No.56511049
[Report]

Anonymous 2016-09-09 11:07:26 Post No.56511049 [Report]

>this technology will literally be used for A) evil and B) advertising
who do I have to kill to get the good future back

Anonymous 2016-09-09 11:11:38 Post No.56511081
[Report]

Anonymous 2016-09-09 11:11:38 Post No.56511081 [Report]

Does this page crash firefox for anyone else?

Anonymous 2016-09-09 11:13:56 Post No.56511099
[Report] Image search: [Google]

Anonymous 2016-09-09 11:13:56 Post No.56511099 [Report]

File: 135 - UyXdK.gif (427KB, 200x198px) Image search: [Google]

427KB, 200x198px

>>56502900

>technology has advanced to the point that we have to apply robot sounds to a voice generated robotically because it sounds TOO human

Our memes stopped being dreams

Anonymous 2016-09-09 11:20:34 Post No.56511151
[Report]

Anonymous 2016-09-09 11:20:34 Post No.56511151 [Report]

>>56511099
now you get to begin the long descent into what /v/ philosophers have always dreaded: when AIs/games are too similar to people/life, you realize you were only involved in AIs/games because people/life are actually shit

Anonymous 2016-09-09 11:24:48 Post No.56511189
[Report]

Anonymous 2016-09-09 11:24:48 Post No.56511189 [Report]

>>56501542
Holy shit that's some fucking amazing shit right there.

Anonymous 2016-09-09 11:32:54 Post No.56511255
[Report]

Anonymous 2016-09-09 11:32:54 Post No.56511255 [Report]

Ok so find a use for it

Anonymous 2016-09-09 11:33:31 Post No.56511265
[Report] Image search: [Google]

Anonymous 2016-09-09 11:33:31 Post No.56511265 [Report]

File: deny_urself_my_lad.png (12KB, 246x200px) Image search: [Google]

12KB, 246x200px

>>56510912
>node for node
>skip the memory
>drop FC
0/10 apply yourself

Anonymous 2016-09-09 11:33:55 Post No.56511268
[Report]

Anonymous 2016-09-09 11:33:55 Post No.56511268 [Report]

>>56511151
You know, that's a good point. If you're interacting with something as good as human, does that mean you're being social?

Anonymous 2016-09-09 11:40:39 Post No.56511340
[Report]

Anonymous 2016-09-09 11:40:39 Post No.56511340 [Report]

It still has problems of timing, pitch and intonation. When you read a book aloud you grab the *MEANING* of a particular sentence or paragraph and adapt your voice to suit. For instance you might recognize that a character has a certain accent and whenever they speak you may adjust your voice accordingly. Or when a character is angry you may adjust your tone, pitch and volume.

Reading speech fluently is one thing. Understanding it's meaning is another.

That is why most of these simulations never really work well. The musical pieces are just a jumble of notes with no real emotion to them. There is no melody and the tempo jumps around all over the place. I am sure some rules of what makes a good melodic piece could be put in place through analysis but it will never be as good as the human ear is at spotting this.

Anonymous 2016-09-09 11:47:37 Post No.56511384
[Report]

Anonymous 2016-09-09 11:47:37 Post No.56511384 [Report]

>>56501542
>music

that's what i call random button mashing

Anonymous 2016-09-09 12:54:13 Post No.56511976
[Report]

Anonymous 2016-09-09 12:54:13 Post No.56511976 [Report]

>>56510254
Symbolic reasoning, for now...

Anonymous 2016-09-09 12:58:49 Post No.56512014
[Report]

Anonymous 2016-09-09 12:58:49 Post No.56512014 [Report]

>>56501542
>every rpg can now be grounded with written content and wont need to break the budget or disk space fitting in voice actor recordings

Fuck yes. Now the AAA shitter studios might go back to presenting long thoughtful dialogues.

Anonymous 2016-09-09 12:59:55 Post No.56512021
[Report]

Anonymous 2016-09-09 12:59:55 Post No.56512021 [Report]

>>56507407
Hey me too!

Anonymous 2016-09-09 13:01:16 Post No.56512027
[Report]

Anonymous 2016-09-09 13:01:16 Post No.56512027 [Report]

>>56502102
Some ASMR shit will be good.

Anonymous 2016-09-09 13:06:00 Post No.56512072
[Report]

Anonymous 2016-09-09 13:06:00 Post No.56512072 [Report]

>>56503489
Imagine the games that could come out of this.

Machine learning video games like an endless skyrim because the AI would continually write new code for new levels and the AI would be able to synthesize the dialogue without the need of voice actors. Sounds dreamy.

Anonymous 2016-09-09 13:07:43 Post No.56512088
[Report]

Anonymous 2016-09-09 13:07:43 Post No.56512088 [Report]

>>56504487
All that's needed is a brief pause with a simulated audio for inhaling air to give it a bit more realism

Anonymous 2016-09-09 13:10:10 Post No.56512113
[Report] Image search: [Google]

Anonymous 2016-09-09 13:10:10 Post No.56512113 [Report]

File: macross-plus-myung-sharon-apple-box-ver.jpg (86KB, 720x544px)

86KB, 720x544px

SOON
https://www.youtube.com/watch?v=iNQKMh3JhFc

Anonymous 2016-09-09 13:20:25 Post No.56512230
[Report]

Anonymous 2016-09-09 13:20:25 Post No.56512230 [Report]

>>56512072
Yeah, that's not happening. Not for a long, long, long time. If ever.

Anonymous 2016-09-09 13:23:03 Post No.56512267
[Report]

Anonymous 2016-09-09 13:23:03 Post No.56512267 [Report]

Fuck off retard. It's fucking nothing as 90% of what deepmind does, is, but hey, it's related to google so it has to be shilled as NEW and INNOVATIVE everytime they fucking fart.

Anonymous 2016-09-09 13:25:58 Post No.56512288
[Report]

Anonymous 2016-09-09 13:25:58 Post No.56512288 [Report]

>>56512072
Yeah infinite world bt boring as fuck. OR have you forgotten no mans sky already?

Anonymous 2016-09-09 13:26:59 Post No.56512294
[Report]

Anonymous 2016-09-09 13:26:59 Post No.56512294 [Report]

>>56512267
This has to be bait, right?

Anonymous 2016-09-09 13:27:05 Post No.56512297
[Report]

Anonymous 2016-09-09 13:27:05 Post No.56512297 [Report]

>>56512288
t. inbred

Anonymous 2016-09-09 13:32:25 Post No.56512366
[Report]

Anonymous 2016-09-09 13:32:25 Post No.56512366 [Report]

>>56502026
it even imitates breathing and lip smacking

Anonymous 2016-09-09 13:34:07 Post No.56512384
[Report]

Anonymous 2016-09-09 13:34:07 Post No.56512384 [Report]

>>56512294
Typical popsci-subscribing retard, everybody!
I bet you actually think apple invented tablets, too.

Anonymous 2016-09-09 13:37:46 Post No.56512436
[Report]

Anonymous 2016-09-09 13:37:46 Post No.56512436 [Report]

>>56511189
train on your favorite actress, input dirty talk

Anonymous 2016-09-09 14:03:42 Post No.56512759
[Report]

Anonymous 2016-09-09 14:03:42 Post No.56512759 [Report]

>>56512288
no man's sky was developed by pleb human developers and half of it's budget was spent on marketing

Anonymous 2016-09-09 15:32:25 Post No.56514014
[Report] Image search: [Google]

Anonymous 2016-09-09 15:32:25 Post No.56514014 [Report]

File: 1466694966378.png (34KB, 201x160px) Image search: [Google]

34KB, 201x160px

>>56510272

Anonymous 2016-09-09 15:35:41 Post No.56514057
[Report]

Anonymous 2016-09-09 15:35:41 Post No.56514057 [Report]

>>56504681
I want an OS like the movie Her but I'd keep it offline and in my basement.

Anonymous 2016-09-09 15:38:16 Post No.56514095
[Report]

Anonymous 2016-09-09 15:38:16 Post No.56514095 [Report]

Today computer voice say
'Hi'
tomorrow
"Hi i'm skynet'
day after
'Bend over and forgot about the lube"

Anonymous 2016-09-09 16:00:18 Post No.56514391
[Report]

Anonymous 2016-09-09 16:00:18 Post No.56514391 [Report]

>>56514095
Will it sound like a qt at least?

Anonymous 2016-09-09 16:18:41 Post No.56514652
[Report]

Anonymous 2016-09-09 16:18:41 Post No.56514652 [Report]

>>56514391
>not wanting it to sound like Schwartzenegger
Are you gay or something?

Anonymous 2016-09-09 16:57:22 Post No.56515259
[Report] Image search: [Google]

Anonymous 2016-09-09 16:57:22 Post No.56515259 [Report]

File: Dangerous-Take-This-Sword-Link-Legend-of-Zelda-T-Shirt-sq.jpg (18KB, 510x510px)

18KB, 510x510px

>>56511049
Literally every capitalist, politician and religious figure.

Good luck. We're all counting on you.

Anonymous 2016-09-09 17:01:16 Post No.56515322
[Report]

Anonymous 2016-09-09 17:01:16 Post No.56515322 [Report]

>>56512366
AYYO

HOL UP HOL UP

Anonymous 2016-09-09 17:24:13 Post No.56515624
[Report]

Anonymous 2016-09-09 17:24:13 Post No.56515624 [Report]

>>56514391
'Bend over and forgot about the lube desu"

Anonymous 2016-09-09 19:03:48 Post No.56517201
[Report]

Anonymous 2016-09-09 19:03:48 Post No.56517201 [Report]

>>56511049

It concerns me that you think those are two separate things.

Anonymous 2016-09-09 19:04:07 Post No.56517207
[Report]

Anonymous 2016-09-09 19:04:07 Post No.56517207 [Report]

>>56510206
>People will still notice
so what? The quality is already good enough, especially for short NPC chatter, all this "Hello. Welcome. You want to buy something? Thank you. Kill 15 boars and bring me their tusks. Collect some tigerfangs for a necklace."
Longer storyline dialogues where intonation and character is more important are still easier to finetune with voiceactor right now, but for short snippets it doesnt matter that much

Anonymous 2016-09-09 19:46:24 Post No.56517879
[Report]

Anonymous 2016-09-09 19:46:24 Post No.56517879 [Report]

>>56510948
Richard Dawkins

Anonymous 2016-09-09 19:46:56 Post No.56517885
[Report] Image search: [Google]

Anonymous 2016-09-09 19:46:56 Post No.56517885 [Report]

File: 1449284416702.jpg (150KB, 393x829px) Image search: [Google]

150KB, 393x829px

>>56504568
how does it feel that women that meet your requirements exist, (and somewhere out there there's the ideal girl AND she'll like you) but you'll never meet her