[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Statistics

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 25
Thread images: 5

File: Graph1.png (18KB, 684x498px) Image search: [Google]
Graph1.png
18KB, 684x498px
Dear Gentlmen,

I am currently wiriting on my master thesis and have a major problem concerning my OLS regression.
The results are contrary to the scientific literature and I think it is because the variables have no lineare relation. Maybe someone knows how to transform the following graphs to get a more linear relationship.
>>
File: Graph2.png (18KB, 684x498px) Image search: [Google]
Graph2.png
18KB, 684x498px
>>
File: Graph3.png (21KB, 684x498px) Image search: [Google]
Graph3.png
21KB, 684x498px
>>
>>8715795
Well try exponential
>>
>>8715795
You've got heteroskedacity
>>
Take out outliers and keep influential data points?
Unless you crop up the y axis, your data does not seem to have much correlation/relation.
Calculate the correlation coefficient and you can see how strong that relationship is.
I can assume you have the raw data right?
>>
how about sounding less like a sperg u tryhard loser lmoa :^)

idk what "the literature" in your field is but you could like not use linear regression, make a log transform or be better at collecting data, pick one
>>
I cannot take the outliers out and yes I have the raw data. I tried log and exp but did not work out either. I am only familiar with ols and logit, but I have no clue for further processing
>>
>>8715837
echoing this sentiment. your data seems to have different populations within it.
>>
yes they are 3 different products. the problem is only if the linar relation is strong enough and I dont think so. Sadly, I cannot figur out a transformation.
But thanks to any advice from you guys
>>
>>8715795
>The results are contrary to the scientific literature
That's good enough anon, economy is not a science after all.
>>
OP you'd probably perform better at whatever it is you're doing if you could SPEAK FUCKING ENGLISH
>>
that´s probably true :-)
>>
For the first two, try recoding both variables as their logs, sometimes that can make nonlinear relationships linear

As for the last one, I'm not sure how you can fix that shit
>>
Where exactly is the problem? If the data doesn't fit the previous literature, it just doesn't.

Present your findings, say that previous research has different conclusions, talk about your methodological limitations (sampling problems, bias etc.). There is no problem if you manage your results right.

That being said, your second graph looks like it has a curve-like relationship. Try nonlinear regression models, might help.
>>
try using a linear mixed model that has an estimator incorporating shrinkage
>>
>>8715795
what's your end goal? Is it to illustrate the relationship between these two variables or to produce a graph that looks like a figure in a textbook on linear regression?
>>
>>8715795

those labels make me angry.
you can fit a linear model to EVERYTHING the RMS-Error can be huge though.
did you expect the "relative price difference" to rise in your attributes? if so your data might just be shit or different to the data in the "literature".

also try taking absolute values of relative differences
>>
stop using OLS
>>
>>8717445
or if you insist on using least squares, transform the X's so they have a linear relationship with the Y. People have said heteroskedasticity might be happening, however if you are only willing to use OLS you can't really do anything about that. If you do want to tackle the potential heteroskedasticity you'll have to generate wieghts by regressing your residuals against your predictors, and use those weights to re-estimate the regression (google weighted least squares).
>>
>>8717465
This guy right here. This is the guy.
>>
>>8715795
This might be a good resource for correcting the issue with your data. I've used this SPSS code before.

https://www.docdroid.net/B1kZTpJ/heteroscedasticity-testing-and-correcting-in-spss.pdf.html
>>
If all you're concerned about is whether there is a relationship and aren't specifically concerned about specific parameter estimates you could try ranking data and then doing regression.
>>
File: 1470904645625.jpg (261KB, 1057x1200px) Image search: [Google]
1470904645625.jpg
261KB, 1057x1200px
>>8719050
>SPSS
>>
File: IMG_1790.png (188KB, 700x674px) Image search: [Google]
IMG_1790.png
188KB, 700x674px
>>8719281
I've actually learned how to do extensive coding in SPSS
Thread posts: 25
Thread images: 5


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.