[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Scatter plot (correlation?)

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 17
Thread images: 1

File: scatter_plot.png (17KB, 560x420px) Image search: [Google]
scatter_plot.png
17KB, 560x420px
Hey anons,

simple question for you.
I have multiple scatter plots and I want to know which of the plots are the most similar.

Right now I test be just the mean distance of multiple "test points" but there has to be something better.
>>
bumpy the bump
>>
>>8530024
If they are the same functional form, you could just try comparing their regression equations. Maybe you could take the difference of their slopes, intercepts, etc
>>
Isn't there some kind of equation I can put some (10000) sample points in and calculate a difference factor?
Like ...for n from 1 to 10000 do sum(|f(n)| - |f2(n)|)/10000. This is what I do right now but there has to be some "better" way to do it.
>>
I'm doing this by using a computer program so comparing slopes, intercepts, etc. isn't a option here.
>>
>>8530177
What program
>>
A program I'm implementing to compare the scatter plots. I want to compare lots of them
>>
>>8530197
I don't know an exact formula for this type of thing. Maybe you could create some type of index to measure the similarities between various descriptive statistics of each dataset, so how similar are the variances, standard deviations, etc of each graph. Cause after all if you have the scatter plots you should have access to the data.
>>
sure ... I have all the data
>>
I think I can just improve by using the squared distance like
mean squared distance (1 -> n) = sum((|f(x)| - |g(x)|)^2) / n . So less distance is way better than a little bit more distance.
>>
>>8530024
You can something called a correlation in Python it'll return a correlation factor
>>
>>8530272
https://en.m.wikipedia.org/wiki/Cross-correlation
Numpy has a function as does Scipy
https://docs.scipy.org/doc/numpy/reference/generated/numpy.correlate.html
>>
Cross-correlation looks very promising.
thx anon
>>
only problem with cross correlation is that the functions need to be integrable ... which is quite impossible for my scatter plots.
I might go for some Taylor series to approximate my scatter plots but that might be very very hard.
>>
Correlation is 0
>>
>>8530336
it can be used, and it is in fact used for discrete functions as well. Z-transformation maybe?
>>
>>8530336
There is discrete cross correlation, also if applicable I would fit a curve to the scatter plot and cross correlate that. If not dicrete cross correlation will work assuming you have a reasonable amount of data points
Thread posts: 17
Thread images: 1


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.