[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

Statistics for curve-fit data

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 12
Thread images: 2

File: Capture3.png (42KB, 746x422px) Image search: [Google]
Capture3.png
42KB, 746x422px
I need some help with statistical analysis on some data from raman spectroscopy.

I need a statistical value that shows the "goodness of fit" between the collected values and the curve-fit values. Would it be chi-squared test, or population variance where mu is taken to be the curve-fit values? My adviser said to find the R-squared value but she must not know what she's talking about since I would have to have a linear or power law function in order to do so.

Please help.

Column A is the x-axis values, column B is the actual intensity values collected and column C is the curve-fit values corresponding to B.
>>
>>9150552
That depends on how your fitted curve is constructed. Could you shed any light on that?
>>
you're right, there is no r-squared value for nonlinear data.

there really isn't a simple way to do this unfortunately. there ARE various tests you can do to demonstrate difference from a given distribution, but similarity is a tough thing to demonstrate
>>
File: IMG_2639.jpg (2MB, 4032x3024px)
IMG_2639.jpg
2MB, 4032x3024px
>>9150571
Parameters for the curve-fit:
Center - 489.4
Width - 40.4
Height - 1453.2
% Gaussian - 84.5
Area - 67085.4

Would it be better if the curve-fit is 100% Gaussian or Lorentzian? The problem with that is the the fit does not come out as well
>>
>>9150573
Is there any meaningful test I can do?

I need to show that the curve-fit remains suitable across a number of collected spectra
>>
>>9150592
honestly, the question is kinda fucked anyway - you fit a curve, and then you're asking how well you fit the curve. that's not really a useful thing to ask, because by definition you were trying to fit the curve as well as you could.

my gut feeling is you'll need something like a KS test or other related tests - https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test
>>
>>9150585
So basically you aren't fitting just any curve, but some mix of probability distributions.
What >>9150601 said could be useful.
As a note, the question still makes sense, somehow or other you are trying to fit the data as well as you can, the question is how well that turned out to be.
>>
>>9150601
Yeah, that's how I feel about this too. My adviser is asking me to do this, but I think she is confused.

I suppose she is looking for confirmation that the data is not noisy, but after baseline substraction, it seems pretty obvious that it isn't.
>>
>>9150612
Yes, I think what >>9150601 is pointing out is that the curve-fit graph is itself proof. I'm curve-fitting all the all spectra individually, but suppose I optimized my collection parameters and got cleaner data. How would I go about proving quantitatively that the data is cleaner?
>>
>>9150642
it wouldn't be a perfect solution, but maybe you could use a maximum likelihood estimator to show that the new curve fit to the new data is better than the old curve fit to the new data?

you're still not producing any kind of p-value or r2-value, but you can at least do better than eyeballing it
>>
>>9150552
Use something like the Kullback-Leibler divergence or conditional entropy. Something like [math]-\frac{1}{N} \sum N(x) \log p(x) - H(p)[/math] where N(x) is the number of occurrences of x, or the intensity of x, in the sample set (with sum N), p is the distribution you're fitting the data to and H(p) is its entropy.
For a normal distribution, you can simplify this expression using
[eqn]-\log p(x) = \frac{(x-\mu)^2}{2 \sigma^2} + \frac{1}{2} \log (2 \pi \sigma^2)[/eqn]
to give
[eqn]-\frac{1}{N} \sum N(x) \log p(x) - H(p) = \frac{s^2+(m-\mu)^2}{2 \sigma^2} + \frac{1}{2} \log (2 \pi \sigma^2) - \frac{1}{2} \log (2 \pi e \sigma^2) = \frac{s^2+(m-\mu)^2}{2 \sigma^2} + \frac{1}{2},[/eqn]
where m and s^2 are the sample mean and (biased) variance, and the sample expectation of the square error from mu follows from the bias-variance decomposition. If you add H(p) to this value, you get an expression with minimum at the MLE fit for the data.
>>
>>9151501
Thank you for your response!

Bumping so I can refer to it later
Thread posts: 12
Thread images: 2


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.