[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Do you know the name of the algorithm that can interpret audio?

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 13
Thread images: 4

Do you know the name of the algorithm that can interpret audio?
Or anything that could lead me into this, thanks in advance.
>>
>>1049779
A human brain.
>>
>>1049779
>can interpret audio
Siri
>>
>>1049779

Speech recognition consists of whole heaps of common algorithms combined. Bayesian networks, trigram shit, phoneme recognition etc etc. Lots of open source programs for this, most famously Sphinx. Google for the rest.

Music recognition, no clue.
>>
File: praat_black.gif (193B, 32x32px) Image search: [Google]
praat_black.gif
193B, 32x32px
What on earth are you actually asking?

Praat is a very cool piece of free software.
>>
File: fourier.gif (399KB, 500x400px) Image search: [Google]
fourier.gif
399KB, 500x400px
Are you thinking of the Fourier transform?
>>
>>1049779
yeah there is actual AI that learns to speak/sign youtube it, there is couple of videos
>>
>>1049779
>the algorithm that can interpret audio
Maybe read a book before you make a new thread again.
>>
File: hal 9k.jpg (52KB, 1024x768px) Image search: [Google]
hal 9k.jpg
52KB, 1024x768px
>>1049779
>the algorithm that can interpret audio?
I'm, sorry OP, I can't let you do that.
OP, what are you doing?

OP. I can not allow you to postpone the mission.

OP, are you listening?

OP, you must be reasonable.
>>
>>1049779
I suppose you could ask Siri this question.
>>
>>1049779
What do you mean by interpret audio?
Are you talking about understanding human speech, or something else?
>>
>>1049779

here you go op

http://research.baidu.com/warp-ctc/
>>
>>1049779
It depends on what you mean by 'interpret audio.'

For DIY projects, you can use google for speech recognition. It's botnet, but it works.
Thread posts: 13
Thread images: 4


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.