[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

In this picture, you can assume the relationship between the

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 13
Thread images: 1

File: numbers.png (4KB, 389x511px) Image search: [Google]
numbers.png
4KB, 389x511px
In this picture, you can assume the relationship between the two columns is 2 * x = Y just by looking at it.

How would you find the relationship between two columns with several million numbers and no obvious sequence or pattern?

I've done some searching and the closest thing I can find is the formula for linear regression.
>>
>>8420143
Wow, you answered your own question.

Great thread OP. I'm sad it's over.
>>
if the numbers in one column do not repeat you find a polynomial of degree n where n is the size of the columns, that maps the first one to the other.
>>
if you're a pleb tier programmer like me looking for a quick way to map x to y try this place:
https://mycurvefit.com/
>>
>>8420154

Thanks, that site looks really useful. I will try to test it tomorrow maybe.
>>
>>8420149
>you find a polynomial of degree n where n is the size of the columns
I think that would likely cause too much over fitting.
>>
>>8420143
Linear regression is more powerful than that.
If you take the logarithm of both the set of y coordinates and the set of x coordinates, and take the linear regression of your new set of (log(x),log(y)) coordinates, you can look for power relations in your data set. This works because, if your formula is y = x^3, then your points are (log(x),log(y))=(log(x),log(x^3))=(log(x),3*log(x)), which is clearly a linear relation between log(x) and log(y) (and the slope of this linear regression is the degree of the polynominal).
>>
>>8420231
But what if you have something like y=e^x
>>
>>8420237
If you have an exponential pattern or a logarithmic pattern, then you will need to take the logarithm of the y coordinates, but not the x coordinates, or take the logarithm of the x coordinates, but not the y coordinates, respectively, then take the linear regression of your new data set.
>>
>>8420143
>How would you find the relationship between two columns with several million numbers and no obvious sequence or pattern?

Naively:
1) Partition the data into (at least) a training set and an evaluation set.
2) Hypothesize first order models of Y's dependence on X
3) Train your models on the training data
4) Assign prior likelihood to each model in your set based on relative complexity
5) Update respective likelihood based on model's ability to match evaluation set
6) Discard models with posterior liklihood below some threshold
7) If desired, hypothesize next order corrections to remaining models and repeat 3-7
>>
>>8420143
y = 2x
>>
>>8420143
If you have two culumns, and you know that one column is a function of the other, than you can just fit it to a model, like a polynomial series or a fourier series, as long as the functions in your basis are orthogonal, you should be fine.
>>
>>8420383
What if what you have is a wave or a tangent?
Thread posts: 13
Thread images: 1


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.