Why it is important has been explained in detail in a number of posts, by me and other posters, and also by publicagenda, the American Statistical Association and the National Science Association, which is where the links are coming from. They all say the same thing: you have to have data collected in a proper fashion, or else the results you're going to get will be flawed.
Sure... what makes you think that the current approach is totally wrong and useless? I didn't even say the percentages are correct/precise by and large... but I believe that by understanding where the shortfalls are, we can have better intuition about the numbers. We can also improve our poll next time. So what if the data are only valid for the B3D community? They are still meaningful for me (keeping in mind the assumptions). If we truly want to measure the failure rate, we need to change the poll to measure per-box/SKU failures rather than per-user breakdown experience. Since we didn't, the poll is already "gimped".
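To make the per-box versus per-user point concrete, here is a minimal sketch in Python. The votes are invented for illustration (they are not B3D poll data), and the function names are hypothetical. Each voter lists every console they have owned, with True meaning that unit failed.

```python
# Hypothetical votes: one list per voter, one entry per console owned.
# True = that unit failed, False = that unit still works.
voters = [
    [True, True, False],  # two dead units, third one still works
    [False],
    [False],
    [True, False],
]


def per_user_rate(votes):
    """Share of voters who have had at least one failure."""
    return sum(any(units) for units in votes) / len(votes)


def per_console_rate(votes):
    """Share of individual consoles that failed."""
    consoles = [failed for units in votes for failed in units]
    return sum(consoles) / len(consoles)


if __name__ == "__main__":
    print("per user:   ", per_user_rate(voters))
    print("per console:", per_console_rate(voters))
```

With these made-up votes, half of the voters report a failure while three of the seven consoles failed, so the two questions give different numbers from the same answers.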
It's irrelevant what you are polling. If you do not collect data in a proper fashion, and especially if your sample is based on volunteers, the results are going to be flawed because of personal interest (a person with a broken console is more interested in reporting this than a person with a working one, for example) and biases.
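The volunteer effect described above can be sketched with a small simulation in Python. Every number here is an assumption made up for illustration: a 10% true failure rate, a 30% chance that an owner of a broken console votes, and a 10% chance that an owner of a working one does. None of these figures come from the B3D poll or from any real data, and the function names are hypothetical.

```python
import random


def expected_poll_rate(true_failure_rate, p_vote_if_broken, p_vote_if_working):
    """Failure share a voluntary poll would report, on average."""
    broken_votes = true_failure_rate * p_vote_if_broken
    working_votes = (1 - true_failure_rate) * p_vote_if_working
    return broken_votes / (broken_votes + working_votes)


def simulate_poll(n_owners=100_000, true_failure_rate=0.10,
                  p_vote_if_broken=0.30, p_vote_if_working=0.10, seed=0):
    """Simulate owners who each decide on their own whether to vote."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    votes_broken = 0
    votes_working = 0
    for _ in range(n_owners):
        broken = rng.random() < true_failure_rate
        p_vote = p_vote_if_broken if broken else p_vote_if_working
        if rng.random() < p_vote:
            if broken:
                votes_broken += 1
            else:
                votes_working += 1
    return votes_broken / (votes_broken + votes_working)


if __name__ == "__main__":
    print("expected poll rate: ", expected_poll_rate(0.10, 0.30, 0.10))
    print("simulated poll rate:", simulate_poll())
```

Under these assumed numbers the poll reports roughly 25% failures even though the assumed true rate is 10%. If both groups were equally likely to vote, the poll rate would match the true rate.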
As I mentioned, this behaviour applies equally to PS3 and Wii. So it's indicative in that manner... like how you acknowledged that 360 reliability sucked compared to PS3 and Wii. That's all. Do I think the failure rate is 30%? I don't know enough and am not interested in commenting on it.
What part of the difference between people voluntarily seeking out this topic and voting, and people being randomly selected to participate in a poll, don't you understand? Tell me and I will answer.
The B3D pool can just represent one of the random sample pools. If additional data is collected for the Xbox 360 populations, you can actually qualify the data some more before using them.
It has already been explained in the prior posts, and it's also pointed out by the linked and quoted articles from the National Science and Statistics Associations: the fact alone that this is a voluntary poll is going to skew the results.
Whether or not you think it will affect the poll result is not something I'm going to bother spending more time explaining, as you're asking the exact same question regardless of the answer.
This has already been explained in detail several times.
The sample population should be Xbox 360s... not consumer populations. If we ask the voters for their 360s' date of purchase/manufacture, we can actually profile and reason about the consoles by qualifying 360s that are spread across a wide range of purchase dates. I don't think I'm violating whatever scientific article you cited. The poll is just one of the ways to gather raw data. You can still qualify them.
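As a rough sketch of what qualifying by purchase date could look like, the Python below groups votes by purchase year and reports a failure rate per group. The response records and the function name are hypothetical, invented only to show the mechanics. It only profiles the votes that were cast and says nothing by itself about who chose to vote.

```python
from collections import defaultdict

# Hypothetical votes: (year of purchase, did the console fail?)
responses = [
    (2005, True), (2005, True), (2005, False),
    (2006, True), (2006, False), (2006, False), (2006, False),
    (2007, False), (2007, False),
]


def failure_rate_by_year(votes):
    """Return {purchase_year: share of consoles from that year that failed}."""
    counts = defaultdict(lambda: [0, 0])  # year -> [failed, total]
    for year, failed in votes:
        counts[year][0] += int(failed)
        counts[year][1] += 1
    return {year: failed / total
            for year, (failed, total) in sorted(counts.items())}


if __name__ == "__main__":
    for year, rate in failure_rate_by_year(responses).items():
        print(year, round(rate, 2))
```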
Really? YOU think polling has some advantages? What experience do you have with statistics, if you don't mind me asking?
Took a few courses... worked on one or two market research projects (I wrote the code, the [chair] marketing professor did the formulas). Helped a few shopping malls organize and analyze their customer data... before bringing in the big guns.
Also saw so-called market research companies fudging results... and hiring participants who do not fit the survey profile for $50... just to finish the survey quicker and move on to another project.
Like I said, there are advantages and disadvantages to the various mechanisms. As long as you are aware of the caveats, and you know what you want... you spend less (more) to get the information quickly (or carefully).
It's not a matter of how hard it is to filter out overlapping data, it's a matter of having data that is random enough to be accurate (sounds kinda weird, huh?) and that also fits certain characteristics (already explained in earlier posts) if you're going to combine sample data. It is crazy hard.
You can still qualify the data after collecting them.
The rest of your article just regurgitates the same basic idea, but I think you're oversimplifying the data collection process. We usually need to cleanse and qualify the data before use. This is assuming we have time in real life