Democratic Underground Latest Greatest Lobby Journals Search Options Help Login
Google

Given data, does applying probability and statistics necessarily yield reliable scientific results?

Printer-friendly format Printer-friendly format
Printer-friendly format Email this thread to a friend
Printer-friendly format Bookmark this thread
Home » Discuss » Topic Forums » Science Donate to DU
 
Boojatta Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 08:46 AM
Original message
Given data, does applying probability and statistics necessarily yield reliable scientific results?
Alternatively, are there different methodologies, principles, rules, or tools of probability and statistics for different kinds of data?

For example, if some data is associated with not merely arbitrary correlations, but with actual causation, then is there any scientifically acceptable motivation to give special treatment to the data?
Refresh | 0 Recommendations Printer Friendly | Permalink | Reply | Top
orpupilofnature57 Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 08:48 AM
Response to Original message
1. I put my trust in a pot smoking star gazer a long time ago ,and it never fails.
Edited on Sat Oct-24-09 08:49 AM by orpupilofnature57
Printer Friendly | Permalink | Reply | Top
 
rucky Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 08:59 AM
Response to Reply #1
2. Bookmarked!
thanks
Printer Friendly | Permalink | Reply | Top
 
phasma ex machina Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 11:34 AM
Response to Reply #1
5. Spin more than one hypothesis - don't simply run with the first idea that caught your fancy.
A former coworker of mine always gave more weight to the second, following, counterintuitive idea.
Printer Friendly | Permalink | Reply | Top
 
orpupilofnature57 Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 12:49 PM
Response to Reply #5
6. Exactly !! people are to anxious for that Eureka moment ,unfortunately
in science it's Dangerous.
Printer Friendly | Permalink | Reply | Top
 
hlthe2b Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 09:11 AM
Response to Original message
3. Well, there is an old adage... "garbage in, garbage out..."
Which means, the analysis is only as good as the data and the methods used to collect the data.

That said, causation is not proven by correlation alone, no matter how statistically strong the findings. At least in epidemiology studies, it is inferred from the consistency of multiple studies, biologic plausibility of the association (often from animal studies), dose response when present (likelihood of outcome increases with increasing dose or exposure) and the strength of the statistical findings....

And yes, there are different methodologies, principles, rules and statistical tools for the analysis of different kinds of data-- enough to keep countless statisticians, methodological epidemiologists, computer analysts, and mathematicians busy for countless careers.
Printer Friendly | Permalink | Reply | Top
 
Jim__ Donating Member (1000+ posts) Send PM | Profile | Ignore Sat Oct-24-09 11:03 AM
Response to Original message
4. Speaking to probability and statistics in general and not to specific scientific uses, ...
Edited on Sat Oct-24-09 11:09 AM by Jim__
I'd say probability is very reliable, statistics, depending on what you're looking for, is more difficult.

Probability is about predicting sample populations from a known population. A simple example of a probability problem, if you have a box containing 30 red balls, 20 green balls and 50 yellow balls, and you randomly select 10 balls from this population, what is the probability that your sample will contain 3 red, 2 green and 5 white balls. A computation of the probability will usually come very close to an actual count from a large number of tests.

Statistics is about going the other way. Same example, given that you have picked 10 balls from a box containing a mixture of 100 red, green, and yellow balls, and you picked 3 red, 2 green, and 5 yellow, what is the probabilty that the percentage of red, green and blue balls is 30%, 20%, and 50%.

When they sample voters to give estimates of what election results would be if they were held today, that's statisitics.

As to the question about correlations, when I was doing statistics, and I found "curious" correlations, I investigated them to find out what the actual relationship was. For instance if 2 variables had a .35 correlation, how were they actually related. My experience (testing specific types of data) was that there was usually a subset of the data for which these 2 variables were strongly correlated (say above .9). Then, investigatign these subsets could tell you a lot about how your system performed under certain conditiions (the conditions of this particular subset). By checking various weakly correlated pairs, you could learn to predict and react to adverse events, either early, or act to prevent them.
Printer Friendly | Permalink | Reply | Top
 
Boojatta Donating Member (1000+ posts) Send PM | Profile | Ignore Mon Jun-14-10 07:48 PM
Response to Original message
7. Kick
Printer Friendly | Permalink | Reply | Top
 
Boojatta Donating Member (1000+ posts) Send PM | Profile | Ignore Wed Mar-02-11 07:07 PM
Response to Original message
8. Kick because the concept of causation is worth discussing
Printer Friendly | Permalink | Reply | Top
 
dimbear Donating Member (1000+ posts) Send PM | Profile | Ignore Wed Mar-02-11 11:04 PM
Response to Original message
9. Obviously not, and here is the why. Statisticians have only a few
tools in their bag, all of which depend on the assumption that reality conforms to a specific distribution, usually the Gaussian distribution. That one gets picked because its easy to calculate. (If not that specifically, there are a very few others.) Statisticians pick one and run with it and never look back.

Printer Friendly | Permalink | Reply | Top
 
DetlefK Donating Member (449 posts) Send PM | Profile | Ignore Thu Mar-03-11 06:55 AM
Response to Original message
10. How do you know whether your data deserves special treatment?
Every data is always treated the same:
1. You bin it into a probability density function.
2. Theory: From the measurement mechanism you have to derive the pdf your data SHOULD have. (binomial, Breit-Wigner, Lorenz, Chi², Poisson, t-, Gamma, Gaussian or in some very rare cases some very exotic and difficult ones)
3. If your data fits to the pdf, your measurement is of good quality. If not, it's bad quality.

All data is created equal. But then discrimination kicks in... And your only award is, you gonna get used. ;-)
Printer Friendly | Permalink | Reply | Top
 
DU AdBot (1000+ posts) Click to send private message to this author Click to view 
this author's profile Click to add 
this author to your buddy list Click to add 
this author to your Ignore list Sat Jan 04th 2025, 05:38 PM
Response to Original message
Advertisements [?]
 Top

Home » Discuss » Topic Forums » Science Donate to DU

Powered by DCForum+ Version 1.1 Copyright 1997-2002 DCScripts.com
Software has been extensively modified by the DU administrators


Important Notices: By participating on this discussion board, visitors agree to abide by the rules outlined on our Rules page. Messages posted on the Democratic Underground Discussion Forums are the opinions of the individuals who post them, and do not necessarily represent the opinions of Democratic Underground, LLC.

Home  |  Discussion Forums  |  Journals |  Store  |  Donate

About DU  |  Contact Us  |  Privacy Policy

Got a message for Democratic Underground? Click here to send us a message.

© 2001 - 2011 Democratic Underground, LLC